Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

1 to 10 of 14 Results
28 oct. 2023
Rasamoelina, Harena; Veerapa-Mangroo, Lovena Preeyadarshini; Bedja, Said Ahmed; Roche, Mathieu, 2023, "Mots-clés pour PADI-web mis en place dans l'Océan Indien", https://doi.org/10.18167/DVN1/E7WMAO, CIRAD Dataverse, V1
Au cours d’un atelier de travail entre le CIRAD et la COI (Commission de l'océan Indien) organisé en octobre 2023 à Ebène (Mauritius) et adossé au projet MOOD, une liste de mots-clés a été produite. Ce lexique est lié à trois maladies à surveiller (Leptospirose, Dengue, Influenza...
26 avr. 2023
Menya, Edmond; Interdonato, Roberto; Owuor, Dickson; Roche, Mathieu, 2023, "PADI-web corpus used for the EpidBioELECTRA approach", https://doi.org/10.18167/DVN1/WD1UC2, CIRAD Dataverse, V1, UNF:6:yAzQEeampF5r1vKlkaDRVA== [fileUNF]
This dataset contains a set of news articles in English related to animal disease outbreaks, that have been used to train and evaluate EpidBioELECTRA epidemiological classifier and explainer. It is composed of 70,707 articles in csv format found in several folders (relevant folde...
6 janv. 2023
Roche, Mathieu, 2023, "News dealing with 'Xylella fastidiosa' collected with PADI-web", https://doi.org/10.18167/DVN1/NGA1JI, CIRAD Dataverse, V1
Due to the increasing number of new and reemerging pests resulting from intensification, globalisation and climate change, monitoring of plant health is crucial. In this context, the PADI-web pipeline was implemented for plant disease surveillance. This dataset presents data coll...
7 août 2022
Valentin Sarah; De Waele Valérie; Vilain Aline; Arsevska Elena; Lancelot Renaud; Roche Mathieu, 2019, "Annotation of epidemiological information in animal disease-related news articles: guidelines and manually labelled corpus", https://doi.org/10.18167/DVN1/YGAKNB, CIRAD Dataverse, V3, UNF:6:H+qzG30RSQ4fWYYA2UBwEQ== [fileUNF]
This dataset contains two files: (i) An annotated corpus ("epi_info_corpus‧xlsx") containing 486 manually annotated sentences extracted from 32 animal disease-related news articles. These news articles were obtained from the database of an event-based biosurveillance system dedic...
3 janv. 2022
Lentschat, Martin, 2022, "TRANSMAT n-Ary relations", https://doi.org/10.18167/DVN1/1BBJBQ, CIRAD Dataverse, V1, UNF:6:xTM1cMCmnS2wfmB5Fr5slg== [fileUNF]
This dataset presents a Gold Standard of data annotated on documents from the Science Direct website. The relations are related to permeability n-Ary relations, as defined in the TRANSMAT Ontology (a href="https://ico.iate.inra.fr/atWeb/">https://ico.iate.inra.fr/atWeb/, https://...
10 févr. 2021
Lentschat, Martin; Buche, Patrice; Menut, Luc, 2020, "TRANSMAT Gold Standard", https://doi.org/10.18167/DVN1/U7HK8J, CIRAD Dataverse, V3, UNF:6:D0te91j6BiKD23wqCty2/A== [fileUNF]
This dataset presents a Gold Standard of data annotated on documents from the Science Direct website. The entities annotated are the ones related to permeability n-Ary relations, as defined in the TRANSMAT Ontology (https://ico.iate.inra.fr/atWeb/, https://doi.org/10.15454/NK24ID...
21 déc. 2020
Roche, Mathieu, 2020, "COVID-19 and media dataset: Mining textual data according periods and countries (UK, Spain, France)", https://doi.org/10.18167/DVN1/ZUA8MF, CIRAD Dataverse, V2
These datasets contain a set of news articles in English, French and Spanish extracted from Medisys (i‧e. advanced search) according the following criteria: (1) Keywords (at least): COVID-19, ncov2019, cov2019, coronavirus; (2) Keywords (all words): masque (French), mask (English...
17 avr. 2020
Roche, Mathieu; Helmer, Thierry; Martin, Pierre; Chaminuka, Petronella; Dimitriou, Ioannis; Csorba, Adam; Lindsten, Agneta; Lundén, Tomas; Van Boheemen, Peter, 2020, "LEAP4FNSSA (WP3 - KMS): Terminology for KEOPS", https://doi.org/10.18167/DVN1/GQ8DPL, CIRAD Dataverse, V1, UNF:6:q+wkvuEpP3iSokRJHqL1qA== [fileUNF]
In order to highlight terminology to integrate in the LEAP4FNSSA KMS (Knowledge Management System) called KEOPS (Knowledge ExtractOr Pipeline System), a dedicated terminology to the LEAP4FNSSA project has been extracted with a text-mining tool (BioTex) from a corpus dealing with...
21 août 2018
Bonin, Muriel; Roche, Mathieu, 2018, "Corpus 'Controverses sur l’épandage aérien en Guadeloupe'", https://doi.org/10.18167/DVN1/LSGN42, CIRAD Dataverse, V1
THEME : Controverses concernant des traitements aériens contre la cercosporiose des bananiers (car contestation citoyenne et succession d’interdiction/dérogation fruit d’un rapport de force entre société civile et producteurs de banane) en Guadeloupe. CORPUS : Corpus en français...
18 déc. 2017
Rabatel, Julien; Arsevska, Elena; de Goër de Hervé, Jocelyn; Falala, Sylvain; Lancelot, Renaud; Roche, Mathieu, 2017, "PADI-web corpus: news manually labeled", https://doi.org/10.18167/DVN1/KMTIFG, CIRAD Dataverse, V2
This dataset contains a set of news articles in English related to animal disease outbreaks, that have been used to evaluate and train the information extraction module of the PADI-web system (http://epia.clermont.inra.fr/vsi). It is composed of 532 articles (in JSON), with infor...
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.