Dataset.

CT-EBM-SP - Corpus of Clinical Trials for Evidence-Based-Medicine in Spanish

Digital.CSIC. Repositorio Institucional del CSIC
oai:digital.csic.es:10261/285045
  • Campillos-Llanos, Leonardo
  • Valverde Mateos, Ana
  • Capllonch Carrión, Adrián
  • Moreno Sandoval, Antonio
A collection of 1200 texts (292 173 tokens) about clinical trial studies and clinical trial announcements in Spanish:
  • 500 abstracts from journals published under a Creative Commons license, e.g. available in PubMed or the Scientific Electronic Library Online (SciELO).
  • 700 clinical trial announcements published in the European Clinical Trials Register and Repositorio Español de Estudios Clínicos.
Texts were annotated with entities from the Unified Medical Language System semantic groups: anatomy (ANAT), pharmacological and chemical substances (CHEM), pathologies (DISO), and lab tests, diagnostic or therapeutic procedures (PROC). In total, 46 699 entities were annotated (13.98% are nested entities). 10% of the corpus was doubly annotated, and inter-annotator agreement (IAA) achieved a mean F-measure of 85.65% (±4.79, strict match) and a mean F-measure of 93.94% (±3.31, relaxed match).

European Commission: InterTalentum - Programme for Post-Doctoral Talent Attraction to CEI UAM+CSIC (713366)
Peer reviewed
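The strict and relaxed agreement figures quoted in the description can be illustrated with a minimal sketch. The record does not state the exact matching criteria, so the definitions here are assumptions: a strict match requires identical offsets and label, and a relaxed match requires overlapping offsets with the same label. The `Entity` type, the function names, and the sample annotations are hypothetical and do not come from the corpus.

```python
from typing import Callable, List, NamedTuple


class Entity(NamedTuple):
    start: int  # character offset where the span begins
    end: int    # character offset where the span ends (exclusive)
    label: str  # semantic group, e.g. "CHEM"


def strict_match(a: Entity, b: Entity) -> bool:
    # Assumed definition: same offsets and same label.
    return a == b


def relaxed_match(a: Entity, b: Entity) -> bool:
    # Assumed definition: same label and overlapping character spans.
    return a.label == b.label and a.start < b.end and b.start < a.end


def f_measure(ann_a: List[Entity], ann_b: List[Entity],
              match: Callable[[Entity, Entity], bool]) -> float:
    # Treat annotator A as the reference and annotator B as the response.
    if not ann_a or not ann_b:
        return 0.0
    precision = sum(any(match(a, b) for a in ann_a) for b in ann_b) / len(ann_b)
    recall = sum(any(match(a, b) for b in ann_b) for a in ann_a) / len(ann_a)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Invented annotations for one text, one list per annotator.
ann_a = [Entity(0, 11, "CHEM"), Entity(20, 28, "DISO"), Entity(35, 42, "PROC")]
ann_b = [Entity(0, 11, "CHEM"), Entity(20, 25, "DISO"), Entity(50, 55, "ANAT")]

strict_f = f_measure(ann_a, ann_b, strict_match)
relaxed_f = f_measure(ann_a, ann_b, relaxed_match)
```

With these invented annotations, only the CHEM span agrees under the strict criterion, while the DISO span also counts under the relaxed one, so the relaxed score is the higher of the two, mirroring the direction of the difference reported for the corpus.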
 


HANDLE: http://hdl.handle.net/10261/285045
 

Dataset. 2021


Project: EC/H2020/713366


