Total results (including duplicates): 17
2 page(s) found
CORA.Repositori de Dades de Recerca
doi:10.34810/data265
Dataset. 2012

IULA PENN TREEBANK

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This treebank consists of Spanish and English sentences that have been manually annotated with syntactic information. The sentences were chosen from the Penn Treebank corpus, a resource containing texts from the Wall Street Journal, originally compiled by the University of Pennsylvania.
It contains 805 sentences that have been translated into Spanish by human translators. The original English sentences and their Spanish translations share the same identification number. Sentences in both languages have been processed using the DELPH-IN environment (http://www.delph-in.net/).

DOI: https://doi.org/10.34810/data265

CORA.Repositori de Dades de Recerca
doi:10.34810/data268
Dataset. 2012

IULA SPANISH-ENGLISH TECHNICAL CORPUS

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
The corpus consists of specialized texts (Law, Economics, Medicine, Environment and Computer Science domains) available in both Spanish and English. This LSP corpus has been compiled from articles in specialized publications, PhD theses, etc.
It contains a total of about 2.1 million words in 127 documents in each language.

DOI: https://doi.org/10.34810/data268

CORA.Repositori de Dades de Recerca
doi:10.34810/data269
Dataset. 2012

ENGLISH-GALICIAN CLUVI DICTIONARY

  • Universidade de Vigo. Grupo de investigación TALG
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the English-Galician CLUVI Dictionary developed under the direction of Xavier Gómez Guinovart (2005-2012) from parallel texts in the CLUVI Corpus of the University of Vigo.

DOI: https://doi.org/10.34810/data269

CORA.Repositori de Dades de Recerca
doi:10.34810/data279
Dataset. 2011

ENGLISH-CATALAN LMF APERTIUM BILINGUAL DICTIONARY

  • Universitat d'Alacant. Grup Transducens
  • Breen, Paul
  • O'Regan, Jimmy
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the Apertium bilingual dictionary for English and Catalan. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but later expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair; and linguistic data for a growing number of language pairs.
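The mapping described above (one source LexicalEntry, one target LexicalEntry and one SenseAxis per bilingual correspondence) can be illustrated with a minimal Python sketch. Only the element names LexicalEntry and SenseAxis come from the description; the surrounding layout (Lexicon, Lemma, Sense, feat), the attribute names, the identifiers and the function names are assumptions made for illustration and may not match the files in this dataset.

```python
import xml.etree.ElementTree as ET


def add_entry(lexicon, entry_id, lemma, pos):
    """Append one LexicalEntry (with a single Sense) and return the sense id.

    The feat/Lemma/Sense layout is an assumed, simplified LMF-style structure.
    """
    entry = ET.SubElement(lexicon, "LexicalEntry", id=entry_id)
    ET.SubElement(entry, "feat", att="partOfSpeech", val=pos)
    lemma_el = ET.SubElement(entry, "Lemma")
    ET.SubElement(lemma_el, "feat", att="writtenForm", val=lemma)
    sense_id = entry_id + "_s1"
    ET.SubElement(entry, "Sense", id=sense_id)
    return sense_id


def build_lmf(pairs, src_lang="en", tgt_lang="ca"):
    """Build a toy bilingual resource from (source, target, pos) triples."""
    root = ET.Element("LexicalResource")
    src = ET.SubElement(root, "Lexicon")
    ET.SubElement(src, "feat", att="language", val=src_lang)
    tgt = ET.SubElement(root, "Lexicon")
    ET.SubElement(tgt, "feat", att="language", val=tgt_lang)
    for i, (s_lemma, t_lemma, pos) in enumerate(pairs, start=1):
        # One monolingual entry per side of the correspondence...
        s_sense = add_entry(src, f"{src_lang}_{i}", s_lemma, pos)
        t_sense = add_entry(tgt, f"{tgt_lang}_{i}", t_lemma, pos)
        # ...plus one SenseAxis linking the two senses (attribute name assumed).
        ET.SubElement(root, "SenseAxis", id=f"axis_{i}",
                      senses=f"{s_sense} {t_sense}")
    return root


if __name__ == "__main__":
    lmf = build_lmf([("house", "casa", "n")])
    print(ET.tostring(lmf, encoding="unicode"))
```

Keeping the monolingual entries separate from the linking element means each lexicon can be reused on its own, which is presumably why the conversion generates both.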

DOI: https://doi.org/10.34810/data279

CORA.Repositori de Dades de Recerca
doi:10.34810/data287
Dataset. 2012

TERMOTECA

  • Universidade de Vigo. Grupo de investigación TALG
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This lexical resource is the LMF version of the Termoteca, a multilingual terminological database based on the monolingual and parallel speciality texts collected in the corpora of the University of Vigo, namely in the CLUVI Corpus and in the Galician Technical Corpus.

DOI: https://doi.org/10.34810/data287

CORA.Repositori de Dades de Recerca
doi:10.34810/data292
Dataset. 2023

ENGLISH-GALICIAN LMF APERTIUM BILINGUAL DICTIONARY

  • imaxin|software
  • Universitat Politècnica de Catalunya
  • Universitat d'Alacant. Grup Transducens
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the Apertium bilingual dictionary for English and Galician. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but later expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair; and linguistic data for a growing number of language pairs.

DOI: https://doi.org/10.34810/data292

CORA.Repositori de Dades de Recerca
doi:10.34810/data293
Dataset. 2012

ENGLISH-SPANISH LMF APERTIUM BILINGUAL DICTIONARY

  • Universitat d'Alacant. Grup Transducens
  • Universitat Politècnica de Catalunya
  • O'Regan, Jimmy
  • Breen, Paul
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the Apertium bilingual dictionary for English and Spanish. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but later expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair; and linguistic data for a growing number of language pairs.

DOI: https://doi.org/10.34810/data293

CORA.Repositori de Dades de Recerca
doi:10.34810/data311
Dataset. 2012

APERTIUM MONOLINGUAL LEXICON TO LMF CONVERTER

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This tool generates the LMF version of Apertium monolingual lexicons. The script takes as input an expanded monolingual Apertium lexicon (generated using: lt-expand apertium.dix > apertium.expanded) and generates the corresponding LMF version. In the Apertium expanded lexicons, the first tag corresponds to the part of speech. The remaining tags (all enclosed in angle brackets) encode additional information that depends on the lemma and PoS tag. Run "perl ApertiumMonolingual2LMF.pl --help" to get more information.
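To make the tag convention concrete, here is a minimal Python sketch that reads one line of an expanded lexicon and separates the part of speech from the remaining tags. It is not the Perl converter itself. It assumes lines of the form surface:lemma<pos><tag>..., optionally with a ":>:" or ":<:" direction marker, and the function name and the returned dictionary keys are hypothetical.

```python
import re

# Matches the content of each <...> tag in an analysis string.
TAG_RE = re.compile(r"<([^<>]+)>")


def parse_expanded_line(line):
    """Split one expanded-lexicon line into surface form, lemma, PoS and features.

    Assumed input shape: 'surface:lemma<pos><tag>...' (simplified; lemmas with
    material after the tags are not handled).
    """
    surface, _, analysis = line.strip().partition(":")
    # Drop an optional direction marker such as ':>:' or ':<:'.
    if analysis.startswith((">:", "<:")):
        analysis = analysis[2:]
    lemma = analysis.split("<", 1)[0]
    tags = TAG_RE.findall(analysis)
    if not tags:
        raise ValueError(f"no tags found in line: {line!r}")
    # The first tag is the part of speech; the rest are additional features.
    return {"surface": surface, "lemma": lemma, "pos": tags[0], "features": tags[1:]}


if __name__ == "__main__":
    print(parse_expanded_line("cats:cat<n><pl>"))
```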

DOI: https://doi.org/10.34810/data311

CORA.Repositori de Dades de Recerca
doi:10.34810/data313
Dataset. 2013

TRL MALT PARSER MODULE FOR SPANISH

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
The file espmalt-1.0.mco contains a single MaltParser configuration for parsing Spanish text. The parser presupposes that the input is in CoNLL-X format and tagged with the part-of-speech tags of the FreeLing tagger.
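As a rough illustration of the expected input, the following Python sketch writes already tagged tokens as CoNLL-X rows (ten tab-separated columns, one token per line, a blank line after each sentence). The example tags are FreeLing-style and purely illustrative. Using the first character of the tag as the coarse tag, and leaving FEATS, HEAD and DEPREL as "_" for unparsed input, are assumptions that may not match what this particular model expects.

```python
# Column order of the CoNLL-X format.
CONLLX_FIELDS = ("ID", "FORM", "LEMMA", "CPOSTAG", "POSTAG",
                 "FEATS", "HEAD", "DEPREL", "PHEAD", "PDEPREL")


def to_conllx(sentence):
    """Render a list of (form, lemma, tag) triples as one CoNLL-X sentence."""
    rows = []
    for i, (form, lemma, tag) in enumerate(sentence, start=1):
        coarse = tag[0]  # assumption: first character encodes the coarse category
        # Columns still to be filled by the parser are left as '_'.
        fields = [str(i), form, lemma, coarse, tag, "_", "_", "_", "_", "_"]
        assert len(fields) == len(CONLLX_FIELDS)
        rows.append("\t".join(fields))
    # A blank line terminates the sentence.
    return "\n".join(rows) + "\n\n"


if __name__ == "__main__":
    tagged = [("El", "el", "DA0MS0"),
              ("gato", "gato", "NCMS000"),
              ("duerme", "dormir", "VMIP3S0")]
    print(to_conllx(tagged), end="")
```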

DOI: https://doi.org/10.34810/data313

CORA.Repositori de Dades de Recerca
doi:10.34810/data314
Dataset. 2013

TIBIDABO TREEBANK AND IULA SPANISH LSP TREEBANK TRAIN AND TEST PARTITIONS

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
  • Marimon, Montserrat
This package contains a partition of the IULA Spanish LSP Treebank into train and test sets for machine learning experiments. This way, different researchers can use the same partitions and compare their results directly. The package also includes the Tibidabo Treebank (Marimon 2010), a set of sentences extracted from the AnCora corpus and annotated in the same way as the IULA Treebank. The Tibidabo Treebank is a very good test set for models trained on the IULA Spanish LSP Treebank, since its sentences come from a very different domain than those of the IULA Spanish LSP Treebank.

DOI: https://doi.org/10.34810/data314
