Dataset.
2023
Occitan-Catalan LMF Apertium Bilingual dictionary
CORA.Repositori de Dades de Recerca
doi:10.34810/data290
CORA.Repositori de Dades de Recerca
- Universitat d'Alacant. Grup Transducens
- Prompsit Language Engineering, S.L
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is the LMF version of the Apertium bilingual dictionary for Occitan and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair and linguistic data for a growing number of language pairs.