Resultados totales (Incluyendo duplicados): 4
Encontrada(s) 1 página(s)
CORA.Repositori de Dades de Recerca
doi:10.34810/data313
Dataset. 2013

TRL MALT PARSER MODULE FOR SPANISH

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
The file espmalt-1.0.mco contains a single malt configuration for parsing Spanish text with MaltParser. The parser presupposes that the input is in CoNLL-X format and tagged with the part-of-speech tags of FreeLing tagger.

Proyecto: //
DOI: https://doi.org/10.34810/data313
CORA.Repositori de Dades de Recerca
doi:10.34810/data313
HANDLE: https://doi.org/10.34810/data313
CORA.Repositori de Dades de Recerca
doi:10.34810/data313
PMID: https://doi.org/10.34810/data313
CORA.Repositori de Dades de Recerca
doi:10.34810/data313
Ver en: https://doi.org/10.34810/data313
CORA.Repositori de Dades de Recerca
doi:10.34810/data313

CORA.Repositori de Dades de Recerca
doi:10.34810/data314
Dataset. 2013

TIBIDABO TREEBANK AND IULA SPANISH LSP TREEBANK TRAIN AND TEST PARTITIONS

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
  • Marimon, Montserrat
This package contains a partition of the Iula Spanish LSP Treebank into train and test sets to perform Machine Learning experiments. In that way the same partitions can be used by different researchers and their results can be directly compared. In this package we also deliver the Tibidabo Treebank (Marimon 2010) which contains a set of sentences extracted from Ancora corpus annotated in the same way than the Iula Treebank. Tibidabo Treebank is a very good test set for models trained with Iula Spanish LSP Treebank since the sentences that form it from a very different domain than those of the Iula Spanish LSP Treebank.

Proyecto: //
DOI: https://doi.org/10.34810/data314
CORA.Repositori de Dades de Recerca
doi:10.34810/data314
HANDLE: https://doi.org/10.34810/data314
CORA.Repositori de Dades de Recerca
doi:10.34810/data314
PMID: https://doi.org/10.34810/data314
CORA.Repositori de Dades de Recerca
doi:10.34810/data314
Ver en: https://doi.org/10.34810/data314
CORA.Repositori de Dades de Recerca
doi:10.34810/data314

CORA.Repositori de Dades de Recerca
doi:10.34810/data375
Dataset. 2013

PANACEA ENGLISH AUTOMATICALLY ACQUIRED LEXICON FOR ENV DOMAIN: SUBCATEGORIZATION FRAMES AND LEXICAL SEMANTIC CLASSES FOR NOUNS

  • University of Cambridge. Department of Theoretical and Applied Linguistics
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is a domain-specific lexicon for English for environment (ENV) domain. This lexicon contain both, subcategorization frames for verbs and lexical semantic classes for nouns. This lexicon has been automatically created using PANACEA webservices using crawled data. The crawled data was obtained crawling web pages that were automatically detected to be in the English language and were automatically classified as relevant to the ENV domain. Data collection took place in the summer of 2011.

Proyecto: //
DOI: https://doi.org/10.34810/data375
CORA.Repositori de Dades de Recerca
doi:10.34810/data375
HANDLE: https://doi.org/10.34810/data375
CORA.Repositori de Dades de Recerca
doi:10.34810/data375
PMID: https://doi.org/10.34810/data375
CORA.Repositori de Dades de Recerca
doi:10.34810/data375
Ver en: https://doi.org/10.34810/data375
CORA.Repositori de Dades de Recerca
doi:10.34810/data375

CORA.Repositori de Dades de Recerca
doi:10.34810/data378
Dataset. 2013

PANACEA ENGLISH AUTOMATICALLY ACQUIRED LEXICON FOR LAB DOMAIN: SUBCATEGORIZATION FRAMES AND LEXICAL SEMANTIC CLASSES FOR NOUNS

  • University of Cambridge. Department of Theoretical and Applied Linguistics
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is a domain-specific lexicon for English for labour (LAB) domain. This lexicon contain both, subcategorization frames for verbs and lexical semantic classes for nouns. This lexicon has been automatically created using PANACEA webservices using crawled data. The crawled data was obtained crawling web pages that were automatically detected to be in the English language and were automatically classified as relevant to the LAB domain. Data collection took place in the summer of 2011.

Proyecto: //
DOI: https://doi.org/10.34810/data378
CORA.Repositori de Dades de Recerca
doi:10.34810/data378
HANDLE: https://doi.org/10.34810/data378
CORA.Repositori de Dades de Recerca
doi:10.34810/data378
PMID: https://doi.org/10.34810/data378
CORA.Repositori de Dades de Recerca
doi:10.34810/data378
Ver en: https://doi.org/10.34810/data378
CORA.Repositori de Dades de Recerca
doi:10.34810/data378

Buscador avanzado