Total results (including duplicates): 45402
Found 4541 page(s)
CORA.Repositori de Dades de Recerca
doi:10.34810/data377
Dataset. 2023
PANACEA LABOUR MULTI WORD ITALIAN LEXICON
- Consiglio Nazionale delle Ricerche. Istituto di Linguistica Computazionale "Antonio Zampolli"
The Labour MW Italian Lexicon is a lexicon of noun-noun multiword expressions automatically extracted from a 70-million-word web-crawled corpus in the labour law domain. The lexicon was produced at CNR-ILC, Pisa, Italy, as an outcome of the PANACEA EU-FP7 funded project under Grant Agreement 248064 (http://www.panacea-lr.eu).
Project: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data378
Dataset. 2013
PANACEA ENGLISH AUTOMATICALLY ACQUIRED LEXICON FOR LAB DOMAIN: SUBCATEGORIZATION FRAMES AND LEXICAL SEMANTIC CLASSES FOR NOUNS
- University of Cambridge. Department of Theoretical and Applied Linguistics
- Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is a domain-specific English lexicon for the labour (LAB) domain. It contains both subcategorization frames for verbs and lexical semantic classes for nouns. The lexicon was created automatically with PANACEA web services from crawled data. The data was obtained by crawling web pages that were automatically detected to be in English and automatically classified as relevant to the LAB domain. Data collection took place in the summer of 2011.
Project: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data379
Dataset. 2023
PANACEA ITALIAN V-SUBCAT REPUBBLICA LEXICON (LANGUAGE DEPENDENT EXTRACTOR)
- Consiglio Nazionale delle Ricerche. Istituto di Linguistica Computazionale "Antonio Zampolli"
Project: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data37
Dataset. 2021
BIOACTIVE COMPOSITION OF WHOLE FOOD BARLEYS GRAINS AND THEIR PERLING FRACTIONS
- Martínez Subirà, Mariona
- Romero Fabregat, Mª Paz
- Macià i Puig, Ma Alba
- Puig, Eva
- Romagosa Clariana, Ignacio
- Moralejo Vidal, Mª Angeles
The data in this dataset cover the protein, ash, β-glucan, arabinoxylan, tocol, and phenolic compound contents and the antioxidant capacity of whole grains of three food barley genotypes grown in Spain and of seven pearling fractions from each genotype.
Project: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data380
Dataset. 2023
PANACEA ITALIAN V-SUBCAT REPUBBLICA LEXICON (LANGUAGE INDEPENDENT EXTRACTOR)
- Consiglio Nazionale delle Ricerche. Istituto di Linguistica Computazionale "Antonio Zampolli"
This is a lexicon of verb subcategorisation frames (SCFs) automatically extracted from a 300-million-word newspaper corpus using language-independent SCF acquisition software. The lexicon was produced at CNR-ILC, Pisa, Italy, as an outcome of the PANACEA EU-FP7 funded project under Grant Agreement 248064 (http://www.panacea-lr.eu).
Project: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data381
Dataset. 2013
PANACEA ENVIRONMENT AND REPUBBLICA MERGED ITALIAN LEXICON
- Consiglio Nazionale delle Ricerche. Istituto di Linguistica Computazionale "Antonio Zampolli"
The Italian PANACEA_rep_env_merged.lmf.xml is an SCF lexicon obtained by merging two automatically extracted lexicons: a domain (environment) SCF lexicon, PANACEA_SCF_IT_environment.lmf.xml, and a general-domain Italian SCF lexicon, repubblica.scf_extracted.lmf. The lexicon was produced at CNR-ILC, Pisa, Italy, as an outcome of the PANACEA EU-FP7 funded project under Grant Agreement 248064 (http://www.panacea-lr.eu).
Project: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data382
Dataset. 2023
PANACEA ENVIRONMENT MULTI WORD ITALIAN LEXICON
- Consiglio Nazionale delle Ricerche. Istituto di Linguistica Computazionale "Antonio Zampolli"
The Environment MW Italian Lexicon is a lexicon of noun-noun multiword expressions automatically extracted from a 36-million-word web-crawled corpus in the environmental domain. The lexicon is distributed as an XML file of about 15 MB. It contains 15,491 LexicalEntries, of which 10,000 have entryType = Multiword.
Project: //
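The entry counts quoted for the Environment MW Italian Lexicon could be checked with a short script along these lines. This is only a sketch: the record does not show the file's layout, so the element names (`LexicalEntry`, `feat`), the `att`/`val` attribute convention, and the inline sample document are assumptions modelled on common LMF-style XML, not the actual schema of the distributed file.

```python
import xml.etree.ElementTree as ET

# Hypothetical miniature LMF-style document; the real lexicon's layout may differ.
SAMPLE = """<LexicalResource>
  <Lexicon>
    <LexicalEntry id="e1">
      <feat att="entryType" val="Multiword"/>
    </LexicalEntry>
    <LexicalEntry id="e2">
      <feat att="entryType" val="Word"/>
    </LexicalEntry>
    <LexicalEntry id="e3">
      <feat att="entryType" val="Multiword"/>
    </LexicalEntry>
  </Lexicon>
</LexicalResource>"""


def count_entries(root):
    """Return (total LexicalEntry count, count whose entryType feat is Multiword)."""
    total = 0
    multiword = 0
    for entry in root.iter("LexicalEntry"):
        total += 1
        # Assumed convention: features are stored as <feat att="..." val="..."/>.
        for feat in entry.findall("feat"):
            if feat.get("att") == "entryType" and feat.get("val") == "Multiword":
                multiword += 1
                break
    return total, multiword


root = ET.fromstring(SAMPLE)
total, multiword = count_entries(root)
print(total, multiword)
```

For a file of about 15 MB, `ET.parse(path).getroot()` would normally be sufficient in place of `ET.fromstring(SAMPLE)`. If the real file declares an XML namespace, the tag names would need the namespace prefix.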
CORA.Repositori de Dades de Recerca
doi:10.34810/data383
Dataset. 2013
PANACEA ENVIRONMENT AND PAROLE MERGED ITALIAN LEXICON
- Consiglio Nazionale delle Ricerche. Istituto di Linguistica Computazionale "Antonio Zampolli"
The Italian PAROLE_env_merged.lmf.xml is an SCF lexicon obtained by merging two automatically extracted lexicons: a domain (environment) lexicon, PANACEA_SCF_IT_environment.lmf.xml, and the Italian SCF lexicon PAROLE_lmf_subcat_ita.lmf.xml, generated from the PAROLE SIMPLE Lexicon. The lexicon was produced at CNR-ILC, Pisa, Italy, as an outcome of the PANACEA EU-FP7 funded project under Grant Agreement 248064 (http://www.panacea-lr.eu).
Project: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data384
Dataset. 2023
MATE PARSER MODULE FOR SPANISH
- Vivaldi, J. (Jorge), 1952-
- Padró, Muntsa
- Marimon, Montserrat
- Bel Rafecas, Núria
In this package we include the following: logonFinal20130315_4matetools361.model; parse_ESCAsentences_mate.sh; freeling_spaMate.sh; toconll2006.py; prueba.txt (test file: 4 sentences for testing); prueba.conll.conll06 (output in CoNLL06); prueba.conll.conll09 (output in CoNLL09); anna-3.61.jar, the version of mate parser used to generate the output. And also: metadata_mateparser_module_ES.xml; metadata_mateparser_module_ES.xml-Authors.txt; readme-espmate.txt
Project: //
CORA.Repositori de Dades de Recerca
doi:10.34810/data385
Dataset. 2022
THE COMBINED EFFECT OF COMPETITIVENESS AND THE NUMBER OF PARTIES: ADDRESSING THE DETERMINANTS OF CORRUPTION IN SMD COUNTRIES
- Simón, Pablo
- Guinjoan Cesena, Marc
Contains data on the explanatory factors that account for differences in the perception of corruption in single-member district (SMD) countries.
Project: //