Spanish NewsCorpus: Multi-tagged corpus for authorship analysis in Spanish

The project compiles Spanish-language news articles from digital media sites and categorizes them along three dimensions: variety of Spanish, author, and author’s gender. The collection was carried out semi-automatically with a web crawler developed for this purpose.
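To make the three-way tagging concrete, here is a minimal sketch of how one tagged article could be represented and summarized. The record layout, the field names (`variety`, `author`, `author_gender`), and the example label values are assumptions made for illustration; they are not taken from the corpus or its distribution format.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class NewsRecord:
    """One news article carrying the three tags named in the description.

    Field names and label values are hypothetical, not the corpus schema.
    """
    text: str
    variety: str        # variety of Spanish (label set is assumed)
    author: str         # author identifier (format is assumed)
    author_gender: str  # author's gender label (label set is assumed)


def tag_counts(records, tag):
    """Count how many records carry each value of the given tag."""
    return Counter(getattr(r, tag) for r in records)


# Made-up records, only to exercise the structure.
records = [
    NewsRecord("Texto de ejemplo uno.", "es-MX", "author_01", "female"),
    NewsRecord("Texto de ejemplo dos.", "es-AR", "author_02", "male"),
    NewsRecord("Texto de ejemplo tres.", "es-MX", "author_01", "female"),
]

print(tag_counts(records, "variety"))
```

Keeping the three tags as separate fields on each record means the same articles can serve any of the three classification tasks by selecting a different field as the target.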
