Resultados totales (Incluyendo duplicados): 17
Encontrada(s) 2 página(s)
Encontrada(s) 2 página(s)
CORA.Repositori de Dades de Recerca
doi:10.34810/data265
Dataset. 2012
IULA PENN TREEBANK
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This treebank consists of a number of Spanish and English sentences that has been manually annotated with syntactical information. The sentences have been choosed from the Penn TreeBank corpus, a resource containing texts from Wall Street Journal and originally compiled by the University of Pennsylvania./nIt contains 805 sentences that have been human translated to Spanish. The original English and the translated Spanish sentences share the same identification number. Sentences in both languages have been processed using the DELPH-IN environment (http://www.delph-in.net/).
Proyecto: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data268
Dataset. 2012
IULA SPANISH-ENGLISH TECHNICAL CORPUS
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
The corpus consists of a number of specialized texts (Law, Economics, Medicine, Environment and Computer Science domains) available in both Spanish and English languages. This LSP corpus has been compiled with articles from specialized Publications, PhD theses, etc./nIt contains about a total of about 2,1 M words in 127 documents in each language.
Proyecto: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data269
Dataset. 2012
ENGLISH-GALICIAN CLUVI DICTIONARY
- Universidade de Vigo. Grupo de investigación TALG
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the English-Galician CLUVI Dictionary developed under the direction of Xavier Gómez Guinovart (2005-2012) from parallel texts in the CLUVI Corpus of the University of Vigo.
Proyecto: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data279
Dataset. 2011
ENGLISH-CATALAN LMF APERTIUM BILINGUAL DICTIONARY
- Universitat d'Alacant. Grup Transducens
- Breen, Paul
- O'Regan, Jimmy
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the Apertium bilingual dictionary for English and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair and linguistic data for a growing number of language pairs.
Proyecto: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data287
Dataset. 2012
TERMOTECA
- Universidade de Vigo. Grupo de investigación TALG
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This lexical resource is the LMF version of the Termoteca, a multilingual terminological database based on the monolingual and parallel speciality texts collected in the corpora of the University of Vigo, namely in the CLUVI Corpus and in the Galician Technical Corpus.
Proyecto: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data292
Dataset. 2023
ENGLISH-GALICIAN LMF APERTIUM BILINGUAL DICTIONARY
- imaxin|software
- Universitat Politècnica de Catalunya
- Universitat d'Alacant. Grup Transducens
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the Apertium bilingual dictionary for English and Galician languages. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair and linguistic data for a growing number of language pairs.
Proyecto: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data293
Dataset. 2012
ENGLISH-SPANISH LMF APERTIUM BILINGUAL DICTIONARY
- Universitat d'Alacant. Grup Transducens
- Universitat Politècnica de Catalunya
- O'Regan, Jimmy
- Breen, Paul
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the Apertium bilingual dictionary for English and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair and linguistic data for a growing number of language pairs.
Proyecto: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data311
Dataset. 2012
APERTIUM MONOLINGUAL LEXICON TO LMF CONVERTER
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This tool generates the LMF version of Apertium monolingual lexicons. The script takes as input an expanded monolingual Apertium lexicon (generated using: lt-expand apertium.dix > apertium.expanded) and generates the corresponding LMF version. In the Apertium expended lexicons, the first tag corresponds to the part of speech. The rest of tags (all enclosed in angle brackets) encode additional information depending on the lemma and PoS tag. Run "perl ApertiumMonolingual2LMF.pl --help" to get more information.
Proyecto: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data313
Dataset. 2013
TRL MALT PARSER MODULE FOR SPANISH
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
The file espmalt-1.0.mco contains a single malt configuration for parsing Spanish text with MaltParser. The parser presupposes that the input is in CoNLL-X format and tagged with the part-of-speech tags of FreeLing tagger.
Proyecto: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data314
Dataset. 2013
TIBIDABO TREEBANK AND IULA SPANISH LSP TREEBANK TRAIN AND TEST PARTITIONS
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
- Marimon, Montserrat
This package contains a partition of the Iula Spanish LSP Treebank into train and test sets to perform Machine Learning experiments. In that way the same partitions can be used by different researchers and their results can be directly compared. In this package we also deliver the Tibidabo Treebank (Marimon 2010) which contains a set of sentences extracted from Ancora corpus annotated in the same way than the Iula Treebank. Tibidabo Treebank is a very good test set for models trained with Iula Spanish LSP Treebank since the sentences that form it from a very different domain than those of the Iula Spanish LSP Treebank.
Proyecto: //
Advanced search