Subject: Computer and Information Science
Data Type: Dataset
Subject: Medicine, Health and Life Sciences
Keyword Term: text mining
1 to 5 of 5 Results
28 oct. 2023
Rasamoelina, Harena; Veerapa-Mangroo, Lovena Preeyadarshini; Bedja, Said Ahmed; Roche, Mathieu, 2023, "Mots-clés pour PADI-web mis en place dans l'Océan Indien", https://doi.org/10.18167/DVN1/E7WMAO, CIRAD Dataverse, V1
Au cours d’un atelier de travail entre le CIRAD et la COI (Commission de l'océan Indien) organisé en octobre 2023 à Ebène (Mauritius) et adossé au projet MOOD, une liste de mots-clés a été produite. Ce lexique est lié à trois maladies à surveiller (Leptospirose, Dengue, Influenza... |
6 janv. 2023
Roche, Mathieu, 2023, "News dealing with 'Xylella fastidiosa' collected with PADI-web", https://doi.org/10.18167/DVN1/NGA1JI, CIRAD Dataverse, V1
Due to the increasing number of new and reemerging pests resulting from intensification, globalisation and climate change, monitoring of plant health is crucial. In this context, the PADI-web pipeline was implemented for plant disease surveillance. This dataset presents data coll... |
7 août 2022
Valentin Sarah; De Waele Valérie; Vilain Aline; Arsevska Elena; Lancelot Renaud; Roche Mathieu, 2019, "Annotation of epidemiological information in animal disease-related news articles: guidelines and manually labelled corpus", https://doi.org/10.18167/DVN1/YGAKNB, CIRAD Dataverse, V3, UNF:6:H+qzG30RSQ4fWYYA2UBwEQ== [fileUNF]
This dataset contains two files: (i) An annotated corpus ("epi_info_corpus‧xlsx") containing 486 manually annotated sentences extracted from 32 animal disease-related news articles. These news articles were obtained from the database of an event-based biosurveillance system dedic... |
21 déc. 2020
Roche, Mathieu, 2020, "COVID-19 and media dataset: Mining textual data according periods and countries (UK, Spain, France)", https://doi.org/10.18167/DVN1/ZUA8MF, CIRAD Dataverse, V2
These datasets contain a set of news articles in English, French and Spanish extracted from Medisys (i‧e. advanced search) according the following criteria: (1) Keywords (at least): COVID-19, ncov2019, cov2019, coronavirus; (2) Keywords (all words): masque (French), mask (English... |
18 déc. 2017
Rabatel, Julien; Arsevska, Elena; de Goër de Hervé, Jocelyn; Falala, Sylvain; Lancelot, Renaud; Roche, Mathieu, 2017, "PADI-web corpus: news manually labeled", https://doi.org/10.18167/DVN1/KMTIFG, CIRAD Dataverse, V2
This dataset contains a set of news articles in English related to animal disease outbreaks, that have been used to evaluate and train the information extraction module of the PADI-web system (http://epia.clermont.inra.fr/vsi). It is composed of 532 articles (in JSON), with infor... |