Buscador | Buscador

Resultados totales (Incluyendo duplicados): 4
Encontrada(s) 1 página(s)

CORA.Repositori de Dades de Recerca

doi:10.34810/data313

Dataset. 2013

TRL MALT PARSER MODULE FOR SPANISH

Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)

The file espmalt-1.0.mco contains a single malt configuration for parsing Spanish text with MaltParser. The parser presupposes that the input is in CoNLL-X format and tagged with the part-of-speech tags of FreeLing tagger.

Proyecto: //

DOI: https://doi.org/10.34810/data313

CORA.Repositori de Dades de Recerca

doi:10.34810/data313

HANDLE: https://doi.org/10.34810/data313

CORA.Repositori de Dades de Recerca

doi:10.34810/data313

PMID: https://doi.org/10.34810/data313

CORA.Repositori de Dades de Recerca

doi:10.34810/data313

Ver en: https://doi.org/10.34810/data313

CORA.Repositori de Dades de Recerca

doi:10.34810/data313

CORA.Repositori de Dades de Recerca

doi:10.34810/data314

Dataset. 2013

TIBIDABO TREEBANK AND IULA SPANISH LSP TREEBANK TRAIN AND TEST PARTITIONS

Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
Marimon, Montserrat

This package contains a partition of the Iula Spanish LSP Treebank into train and test sets to perform Machine Learning experiments. In that way the same partitions can be used by different researchers and their results can be directly compared. In this package we also deliver the Tibidabo Treebank (Marimon 2010) which contains a set of sentences extracted from Ancora corpus annotated in the same way than the Iula Treebank. Tibidabo Treebank is a very good test set for models trained with Iula Spanish LSP Treebank since the sentences that form it from a very different domain than those of the Iula Spanish LSP Treebank.

Proyecto: //

DOI: https://doi.org/10.34810/data314

CORA.Repositori de Dades de Recerca

doi:10.34810/data314

HANDLE: https://doi.org/10.34810/data314

CORA.Repositori de Dades de Recerca

doi:10.34810/data314

PMID: https://doi.org/10.34810/data314

CORA.Repositori de Dades de Recerca

doi:10.34810/data314

Ver en: https://doi.org/10.34810/data314

CORA.Repositori de Dades de Recerca

doi:10.34810/data314

CORA.Repositori de Dades de Recerca

doi:10.34810/data375

Dataset. 2013

PANACEA ENGLISH AUTOMATICALLY ACQUIRED LEXICON FOR ENV DOMAIN: SUBCATEGORIZATION FRAMES AND LEXICAL SEMANTIC CLASSES FOR NOUNS

University of Cambridge. Department of Theoretical and Applied Linguistics
Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)

This is a domain-specific lexicon for English for environment (ENV) domain. This lexicon contain both, subcategorization frames for verbs and lexical semantic classes for nouns. This lexicon has been automatically created using PANACEA webservices using crawled data. The crawled data was obtained crawling web pages that were automatically detected to be in the English language and were automatically classified as relevant to the ENV domain. Data collection took place in the summer of 2011.

Proyecto: //

DOI: https://doi.org/10.34810/data375

CORA.Repositori de Dades de Recerca

doi:10.34810/data375

HANDLE: https://doi.org/10.34810/data375

CORA.Repositori de Dades de Recerca

doi:10.34810/data375

PMID: https://doi.org/10.34810/data375

CORA.Repositori de Dades de Recerca

doi:10.34810/data375

Ver en: https://doi.org/10.34810/data375

CORA.Repositori de Dades de Recerca

doi:10.34810/data375

CORA.Repositori de Dades de Recerca

doi:10.34810/data378

Dataset. 2013

PANACEA ENGLISH AUTOMATICALLY ACQUIRED LEXICON FOR LAB DOMAIN: SUBCATEGORIZATION FRAMES AND LEXICAL SEMANTIC CLASSES FOR NOUNS

University of Cambridge. Department of Theoretical and Applied Linguistics
Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)

This is a domain-specific lexicon for English for labour (LAB) domain. This lexicon contain both, subcategorization frames for verbs and lexical semantic classes for nouns. This lexicon has been automatically created using PANACEA webservices using crawled data. The crawled data was obtained crawling web pages that were automatically detected to be in the English language and were automatically classified as relevant to the LAB domain. Data collection took place in the summer of 2011.

Proyecto: //

DOI: https://doi.org/10.34810/data378

CORA.Repositori de Dades de Recerca

doi:10.34810/data378

HANDLE: https://doi.org/10.34810/data378

CORA.Repositori de Dades de Recerca

doi:10.34810/data378

PMID: https://doi.org/10.34810/data378

CORA.Repositori de Dades de Recerca

doi:10.34810/data378

Ver en: https://doi.org/10.34810/data378

CORA.Repositori de Dades de Recerca