Total results (including duplicates): 7
1 page found
CORA.Repositori de Dades de Recerca
doi:10.34810/data277
Dataset. 2011

BASQUE LMF APERTIUM DICTIONARY

  • Prompsit Language Engineering, S.L
  • Universitat d'Alacant. Grup Transducens
  • Euskal Herriko Unibertsitatea. IXA Taldea
  • Universitat Politècnica de Catalunya. TALP Research Center
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the Basque Apertium dictionary. Monolingual dictionaries for Spanish, Catalan, Galician and Basque have been generated from the Apertium expanded lexicons of the es-ca (for both Spanish and Catalan), es-gl (for Galician) and eu-es (for Basque) language pairs. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides a language-independent machine translation engine, tools to manage the linguistic data necessary to build a machine translation system for a given language pair, and linguistic data for a growing number of language pairs.

DOI: https://doi.org/10.34810/data277

CORA.Repositori de Dades de Recerca
doi:10.34810/data278
Dataset. 2011

BASQUE-SPANISH LMF APERTIUM BILINGUAL DICTIONARY

  • Prompsit Language Engineering, S.L
  • Universitat d'Alacant. Grup Transducens
  • Euskal Herriko Unibertsitatea. IXA Taldea
  • Universitat Politècnica de Catalunya. TALP Research Center
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the Apertium bilingual dictionary for Basque and Spanish. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides a language-independent machine translation engine, tools to manage the linguistic data necessary to build a machine translation system for a given language pair, and linguistic data for a growing number of language pairs.
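
The mapping described above (one source LexicalEntry, one target LexicalEntry and one SenseAxis per bilingual correspondence) can be sketched roughly as follows. This is only an illustration, not the procedure used to build the dataset: the sample entries, the function names `side`, `lexical_entry` and `bidix_to_lmf`, the sense identifier scheme and the simplified LMF layout are all assumptions; only the `<e><p><l>…</l><r>…</r></p></e>` shape of Apertium dix entries is taken as given.

```python
import xml.etree.ElementTree as ET

# Hypothetical two-entry fragment in the Apertium bilingual dix layout.
BIDIX = """<dictionary><section id="main" type="standard">
<e><p><l>etxe<s n="n"/></l><r>casa<s n="n"/><s n="f"/></r></p></e>
<e><p><l>ur<s n="n"/></l><r>agua<s n="n"/><s n="f"/></r></p></e>
</section></dictionary>"""


def side(elem):
    """Return (lemma, first tag) for an <l> or <r> element.

    Simplification: multiword lemmas (which use <b/> for blanks) are not handled.
    """
    lemma = (elem.text or "").strip()
    tags = [s.get("n") for s in elem.findall("s")]
    return lemma, (tags[0] if tags else "")


def lexical_entry(lexicon, lang, lemma, pos, index):
    """Append a simplified LexicalEntry to a Lexicon and return its sense id."""
    sense_id = f"{lang}_{index}"  # assumed id scheme, for illustration only
    entry = ET.SubElement(lexicon, "LexicalEntry")
    ET.SubElement(entry, "feat", att="partOfSpeech", val=pos)
    lemma_el = ET.SubElement(entry, "Lemma")
    ET.SubElement(lemma_el, "feat", att="writtenForm", val=lemma)
    ET.SubElement(entry, "Sense", id=sense_id)
    return sense_id


def bidix_to_lmf(bidix_xml, src="eu", tgt="es"):
    """Build one Lexicon per language plus one SenseAxis per correspondence."""
    root = ET.fromstring(bidix_xml)
    resource = ET.Element("LexicalResource")
    lexicons = {}
    for lang in (src, tgt):
        lexicons[lang] = ET.SubElement(resource, "Lexicon")
        ET.SubElement(lexicons[lang], "feat", att="language", val=lang)
    for i, pair in enumerate(root.iter("p")):
        left, right = pair.find("l"), pair.find("r")
        if left is None or right is None:
            continue  # skip malformed pairs
        src_id = lexical_entry(lexicons[src], src, *side(left), i)
        tgt_id = lexical_entry(lexicons[tgt], tgt, *side(right), i)
        # The SenseAxis links the two monolingual senses.
        ET.SubElement(resource, "SenseAxis", id=f"axis_{i}",
                      senses=f"{src_id} {tgt_id}")
    return resource


if __name__ == "__main__":
    print(ET.tostring(bidix_to_lmf(BIDIX), encoding="unicode"))
```

The published files may use different identifiers, features and element nesting; the sketch only shows why each bilingual correspondence yields three elements.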

DOI: https://doi.org/10.34810/data278

CORA.Repositori de Dades de Recerca
doi:10.34810/data281
Dataset. 2011

GRAF VERSION OF THE BASQUE DEPENDENCY TREEBANK

  • Aldezabal, Izaskun
  • Aranzabe, Maxux
  • Arriola, Jose Mari
  • Atutxa, Aitziber
  • Díaz de Ilarraza, Arantza
  • Estarrona, Ainara
  • Fernandez, Kike
  • Iruskieta, Mikel
  • Uria, Larraitz
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
  • Euskal Herriko Unibertsitatea. IXA Taldea
This is the stand-off GrAF version of the Basque Dependency Treebank (BDT). It is the Reference Corpus for the Processing of Basque (EPEC) annotated at the syntactic level. EPEC is a 300,000-word corpus of standard written journal texts that aims to be a training corpus for the development and improvement of several Natural Language Processing tools. It has been manually tagged at different levels: morphology, partial syntax and semantics. This is the stand-off GrAF version of the Constituent Basque Treebank.

DOI: https://doi.org/10.34810/data281

CORA.Repositori de Dades de Recerca
doi:10.34810/data321
Dataset. 2023

BASQUE-ENGLISH LMF APERTIUM BILINGUAL DICTIONARY

  • Euskal Herriko Unibertsitatea. IXA Taldea
  • Prompsit Language Engineering, S.L
  • Universitat d'Alacant. Grup Transducens
  • Universitat Politècnica de Catalunya. TALP Research Center
  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)

DOI: https://doi.org/10.34810/data321

CORA.Repositori de Dades de Recerca
doi:10.34810/data324
Dataset. 2023

BASQUE WORDNET-LMF

  • Euskal Herriko Unibertsitatea. IXA Taldea

DOI: https://doi.org/10.34810/data324

CORA.Repositori de Dades de Recerca
doi:10.34810/data398
Dataset. 2022

MULTIBOOKED_CORPORA [RESEARCH DATA]

  • Barnes, Jeremy
We release two corpora of hotel reviews annotated for aspect-level sentiment analysis in Catalan and Basque. We also include scripts which allow the conversion to sentence-level annotations and provide benchmarks for opinion holder, target, and expression extraction based on conditional random fields.

DOI: https://doi.org/10.34810/data398

CORA.Repositori de Dades de Recerca
doi:10.34810/data781
Dataset. 2023

NOMENCLATURE OF "DOCTORAL THESIS SUPERVISOR" IN 47 LANGUAGES

  • Guarimata Salinas, Génesis María
  • Jiménez López, M. Dolores
  • Carvajal, Joan Josep
The dataset covers the nomenclature of thesis supervisors at a global level. It comprises 55 distinct labels originating from 116 countries and spanning 47 different languages. The data were collected from university websites, official institutions, and doctoral thesis repositories.

DOI: https://doi.org/10.34810/data781
