Resultados totales (Incluyendo duplicados): 34368
Encontrada(s) 3437 página(s)
CORA.Repositori de Dades de Recerca
doi:10.34810/data336
Dataset. 2023

PANACEA ENVIRONMENT BILINGUAL GLOSSARY FRENCH-TO-ENGLISH

  • Linguatec GmbH
-

Proyecto: //
DOI: https://doi.org/10.34810/data336
CORA.Repositori de Dades de Recerca
doi:10.34810/data336
HANDLE: https://doi.org/10.34810/data336
CORA.Repositori de Dades de Recerca
doi:10.34810/data336
PMID: https://doi.org/10.34810/data336
CORA.Repositori de Dades de Recerca
doi:10.34810/data336
Ver en: https://doi.org/10.34810/data336
CORA.Repositori de Dades de Recerca
doi:10.34810/data336

CORA.Repositori de Dades de Recerca
doi:10.34810/data337
Dataset. 2023

PANACEA SPANISH AUTOMATICALLY ACQUIRED LEXICON FOR ENV DOMAIN: SUBCATEGORIZATION FRAMES (V-SUBCAT)

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is a domain-specific lexicon for Spanish subcategorization frames for environment (ENV) domain. This lexicon has been automatically created using the PANACEA web service named tpc_subcat_inductive (http://registry.elda.org/services/223) and the crawled data for this domain and language, previously annotated with Spanish Malt Parser web service (http://registry.elda.org/services/249).

Proyecto: //
DOI: https://doi.org/10.34810/data337
CORA.Repositori de Dades de Recerca
doi:10.34810/data337
HANDLE: https://doi.org/10.34810/data337
CORA.Repositori de Dades de Recerca
doi:10.34810/data337
PMID: https://doi.org/10.34810/data337
CORA.Repositori de Dades de Recerca
doi:10.34810/data337
Ver en: https://doi.org/10.34810/data337
CORA.Repositori de Dades de Recerca
doi:10.34810/data337

CORA.Repositori de Dades de Recerca
doi:10.34810/data338
Dataset. 2012

PANACEA ENGLISH GOLD STANDARD FOR LEXICAL SEMANTIC CLASSIFICATION

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
We present a set of English gold-standards for different noun classes created in PANACEA to train and test automatic classifiers. To create these gold-standards we used we the data from the SemEval 2007 workshop Task 07: Coarse Grained English All-Words (Navigli et al., 2007). The words used in this task were first automatically tagged with an automatic clustering method (Navigli, 2006) using senses based on the WordNet sense inventory and later manually validated by expert lexicographers. For our experiments, we extracted all of the words from this inventory that contained as their first sense a sense that corresponded to the lexical semantic classes, i.e. “people” in the case of the class HUMAN. These gold-standards were created in the context of PANACEA http://www.panacea-lr.eu), an EU-FP7 Funded Project under Grant Agreement 248064.

Proyecto: //
DOI: https://doi.org/10.34810/data338
CORA.Repositori de Dades de Recerca
doi:10.34810/data338
HANDLE: https://doi.org/10.34810/data338
CORA.Repositori de Dades de Recerca
doi:10.34810/data338
PMID: https://doi.org/10.34810/data338
CORA.Repositori de Dades de Recerca
doi:10.34810/data338
Ver en: https://doi.org/10.34810/data338
CORA.Repositori de Dades de Recerca
doi:10.34810/data338

CORA.Repositori de Dades de Recerca
doi:10.34810/data339
Dataset. 2023

PANACEA SPANISH V-SUBCAT GOLD STANDARD LEXICON LAB DOMAIN

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
-

Proyecto: //
DOI: https://doi.org/10.34810/data339
CORA.Repositori de Dades de Recerca
doi:10.34810/data339
HANDLE: https://doi.org/10.34810/data339
CORA.Repositori de Dades de Recerca
doi:10.34810/data339
PMID: https://doi.org/10.34810/data339
CORA.Repositori de Dades de Recerca
doi:10.34810/data339
Ver en: https://doi.org/10.34810/data339
CORA.Repositori de Dades de Recerca
doi:10.34810/data339

CORA.Repositori de Dades de Recerca
doi:10.34810/data340
Dataset. 2023

PANACEA SPANISH V-SUBCAT GOLD STANDARD LEXICON ENV DOMAIN

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
This is a domain-specific gold-standard for Spanish subcategorization frames, in the case, for environment (ENV) domain. This gold-standard was manually developed, choosing a set of 30 verbs and 200 senteces for each verb. For each sentence, the SCFs present for the studied verb were manually annotated. The sentences were selected from crawled Web pages that were automatically detected to be in the Spanish language and were automatically classified as relevant to the ENV domain. Data collection took place in the summer of 2011. This gold-standard was created in the context of PANACEA (http://www.panacea-lr.eu), an EU-FP7 Funded Project under Grant Agreement 248064.

Proyecto: //
DOI: https://doi.org/10.34810/data340
CORA.Repositori de Dades de Recerca
doi:10.34810/data340
HANDLE: https://doi.org/10.34810/data340
CORA.Repositori de Dades de Recerca
doi:10.34810/data340
PMID: https://doi.org/10.34810/data340
CORA.Repositori de Dades de Recerca
doi:10.34810/data340
Ver en: https://doi.org/10.34810/data340
CORA.Repositori de Dades de Recerca
doi:10.34810/data340

CORA.Repositori de Dades de Recerca
doi:10.34810/data34
Dataset. 2021

TRIACYLGLYCEROL PROFILING DATA OF OLIVE OIL SAMPLES OBTAINED BY FLOW INJECTION ANALYSIS-HEATED ELECTROSPRAY IONISATION-HIGH RESOLUTION MASS SPECTROMETRY (FIA-HESI-HRMS)

  • Quintanilla-Casas, Beatriz
  • Strocchi, Giulia
  • Bustamante Alonso, Julen
  • Torres-Cobos, Berta
  • Guardiola Ibarz, Francesc
  • Moreda, Wenceslao
  • Martínez-Rivas, José Manuel
  • Valli, Enrico
  • Bendini, Alessandra
  • Gallina Toschi, Tullia
  • Tres Oliver, Alba
  • Vichi, Stefania
Data base containing data on the triacylclgyerol profile of genuine and adulterated olive oils obtained by Flow Injection Analysis-Heated Electrospray Ionisation-High Resolution Mass Spectrometry (FIA-HESI-HRMS). Data (total ion chromatograms) were obtained and used by Quintanilla-Casas et al. (Food Control, 2021, 123: 107851). Data correspond to relative abundances obtained for each mass signal that agreed with single positive charged sodium molecular ions and that could match with a triacylglycerol elemental formula (mass tolerance error set at 5ppm; molecular formulae calculation performed with Xcalibur 4.1. (Thermo Fisher Scientific, Bremen, Germany). This data base includes samples used in the study that were obtained by the OLEUM consortium. Samples included genuine olive oils, as well as Taylor-made adulterations blending genuine olive oils with other vegetable oils as adulterants (sunflower oli; soy oil; high oleic sunflower oil; avocado oil (refined and virgin); hazelnut oil (refined and virgin)). Blind samples are also included in the data base. Partial Least Squares-Discriminant Analysis (PLS-DA) was applied to these data to develop two independent binary models that discriminated between (i) genuine oils and blends with ≥2% of HL adulterants, and (ii) genuine oils and blends with ≥5% of HO adulterants, as described in Quintanilla-Casas et al., 2021.

Proyecto: //
DOI: https://doi.org/10.34810/data34
CORA.Repositori de Dades de Recerca
doi:10.34810/data34
HANDLE: https://doi.org/10.34810/data34
CORA.Repositori de Dades de Recerca
doi:10.34810/data34
PMID: https://doi.org/10.34810/data34
CORA.Repositori de Dades de Recerca
doi:10.34810/data34
Ver en: https://doi.org/10.34810/data34
CORA.Repositori de Dades de Recerca
doi:10.34810/data34

CORA.Repositori de Dades de Recerca
doi:10.34810/data341
Dataset. 2012

PANACEA LABOUR LEGISLATION CORPUS N-GRAMS EN (ENGLISH)

  • Dublin City University. School of Computing
This data set contains English word n-grams and English word/tag/lemma n-grams in the "labour Legislation" (LAB) domain. N-grams are accompanied by their observed frequency counts. The length of the n-grams ranges from unigrams (single words) to five-grams. The data were collected in the context of PANACEA (http://www.panacea-lr.eu), an EU-FP7 Funded Project under Grant Agreement 248064. The n-gram counts were generated from crawled Web pages that were automatically detected to be in the English language and were automatically classified as relevant to the LAB domain. The LAB domain collection used consisted of approximately 46.4 million tokens.Data collection took place in the summer of 2011.

Proyecto: //
DOI: https://doi.org/10.34810/data341
CORA.Repositori de Dades de Recerca
doi:10.34810/data341
HANDLE: https://doi.org/10.34810/data341
CORA.Repositori de Dades de Recerca
doi:10.34810/data341
PMID: https://doi.org/10.34810/data341
CORA.Repositori de Dades de Recerca
doi:10.34810/data341
Ver en: https://doi.org/10.34810/data341
CORA.Repositori de Dades de Recerca
doi:10.34810/data341

CORA.Repositori de Dades de Recerca
doi:10.34810/data342
Dataset. 2023

PANACEA ANNOTATED DEPENDENCY SPANISH ENVIRONMENT CORPUS VERSION 2

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
-

Proyecto: //
DOI: https://doi.org/10.34810/data342
CORA.Repositori de Dades de Recerca
doi:10.34810/data342
HANDLE: https://doi.org/10.34810/data342
CORA.Repositori de Dades de Recerca
doi:10.34810/data342
PMID: https://doi.org/10.34810/data342
CORA.Repositori de Dades de Recerca
doi:10.34810/data342
Ver en: https://doi.org/10.34810/data342
CORA.Repositori de Dades de Recerca
doi:10.34810/data342

CORA.Repositori de Dades de Recerca
doi:10.34810/data343
Dataset. 2023

PANACEA LABOUR LEGISLATION CORPUS N-GRAMS ES (SPANISH)

  • Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
-

Proyecto: //
DOI: https://doi.org/10.34810/data343
CORA.Repositori de Dades de Recerca
doi:10.34810/data343
HANDLE: https://doi.org/10.34810/data343
CORA.Repositori de Dades de Recerca
doi:10.34810/data343
PMID: https://doi.org/10.34810/data343
CORA.Repositori de Dades de Recerca
doi:10.34810/data343
Ver en: https://doi.org/10.34810/data343
CORA.Repositori de Dades de Recerca
doi:10.34810/data343

CORA.Repositori de Dades de Recerca
doi:10.34810/data344
Dataset. 2012

PANACEA ANNOTATED DEPENDENCY GREEK LABOUR LEGISLATION CORPUS VERSION 2

  • Institute for Language and Speech Processing / Athena Research Center
PANACEA Annotated Greek Labour Legislation Corpus Version 2 consists of Greek texts in the Labour Legislation (LAB) domain that were collected and automatically annotated in the framework of PANACEA (http://www.panacea-lr.eu), an EU-FP7 Funded Project under Grant Agreement 248064. The texts were crawled web pages that were automatically detected to be in the Greek language and were automatically classified as relevant to the LAB domain. Data collection took place in the summer of 2011. The automatically assigned annotations deal with sentence and token segmentation, POS and lemma, dependency relations and named entities.

Proyecto: //
DOI: https://doi.org/10.34810/data344
CORA.Repositori de Dades de Recerca
doi:10.34810/data344
HANDLE: https://doi.org/10.34810/data344
CORA.Repositori de Dades de Recerca
doi:10.34810/data344
PMID: https://doi.org/10.34810/data344
CORA.Repositori de Dades de Recerca
doi:10.34810/data344
Ver en: https://doi.org/10.34810/data344
CORA.Repositori de Dades de Recerca
doi:10.34810/data344

Buscador avanzado