Total results (including duplicates): 4
1 page(s) found
CORA.Repositori de Dades de Recerca
doi:10.34810/data313
Dataset. 2013
TRL MALT PARSER MODULE FOR SPANISH
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
The file espmalt-1.0.mco contains a single malt configuration for parsing Spanish text with MaltParser. The parser expects its input to be in CoNLL-X format and tagged with the part-of-speech tags of the FreeLing tagger.
CORA.Repositori de Dades de Recerca
doi:10.34810/data314
Dataset. 2013
TIBIDABO TREEBANK AND IULA SPANISH LSP TREEBANK TRAIN AND TEST PARTITIONS
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
- Marimon, Montserrat
This package contains a partition of the IULA Spanish LSP Treebank into train and test sets for machine learning experiments, so that different researchers can use the same partitions and compare their results directly. The package also includes the Tibidabo Treebank (Marimon 2010), a set of sentences extracted from the AnCora corpus and annotated in the same way as the IULA Treebank. The Tibidabo Treebank is a very good test set for models trained on the IULA Spanish LSP Treebank, since its sentences come from a very different domain.
CORA.Repositori de Dades de Recerca
doi:10.34810/data375
Dataset. 2013
PANACEA ENGLISH AUTOMATICALLY ACQUIRED LEXICON FOR ENV DOMAIN: SUBCATEGORIZATION FRAMES AND LEXICAL SEMANTIC CLASSES FOR NOUNS
- University of Cambridge. Department of Theoretical and Applied Linguistics
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is a domain-specific English lexicon for the environment (ENV) domain. It contains both subcategorization frames for verbs and lexical semantic classes for nouns. The lexicon was created automatically with PANACEA web services from crawled data. The data was obtained by crawling web pages that were automatically detected to be in English and automatically classified as relevant to the ENV domain. Data collection took place in the summer of 2011.
CORA.Repositori de Dades de Recerca
doi:10.34810/data378
Dataset. 2013
PANACEA ENGLISH AUTOMATICALLY ACQUIRED LEXICON FOR LAB DOMAIN: SUBCATEGORIZATION FRAMES AND LEXICAL SEMANTIC CLASSES FOR NOUNS
- University of Cambridge. Department of Theoretical and Applied Linguistics
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is a domain-specific English lexicon for the labour (LAB) domain. It contains both subcategorization frames for verbs and lexical semantic classes for nouns. The lexicon was created automatically with PANACEA web services from crawled data. The data was obtained by crawling web pages that were automatically detected to be in English and automatically classified as relevant to the LAB domain. Data collection took place in the summer of 2011.