1 to 4 of 4 Results
7 August 2022
Valentin Sarah; De Waele Valérie; Vilain Aline; Arsevska Elena; Lancelot Renaud; Roche Mathieu, 2019, "Annotation of epidemiological information in animal disease-related news articles: guidelines and manually labelled corpus", https://doi.org/10.18167/DVN1/YGAKNB, CIRAD Dataverse, V3, UNF:6:H+qzG30RSQ4fWYYA2UBwEQ==
This dataset contains two files: (i) An annotated corpus ("epi_info_corpus.xlsx") containing 486 manually annotated sentences extracted from 32 animal disease-related news articles. These news articles were obtained from the database of an event-based biosurveillance system dedic...
21 December 2020
Roche, Mathieu, 2020, "COVID-19 and media dataset: Mining textual data according periods and countries (UK, Spain, France)", https://doi.org/10.18167/DVN1/ZUA8MF, CIRAD Dataverse, V2
These datasets contain a set of news articles in English, French and Spanish extracted from Medisys (i.e. advanced search) according to the following criteria: (1) Keywords (at least): COVID-19, ncov2019, cov2019, coronavirus; (2) Keywords (all words): masque (French), mask (English...
21 August 2018
Bonin, Muriel; Roche, Mathieu, 2018, "Corpus 'Controverses sur l’épandage aérien en Guadeloupe'", https://doi.org/10.18167/DVN1/LSGN42, CIRAD Dataverse, V1
THEME: Controversies over aerial treatments against banana leaf spot disease (cercosporiose) in Guadeloupe (controversial because of citizen protest and a succession of bans and exemptions, the outcome of a power struggle between civil society and banana producers). CORPUS: Corpus in French...
8 September 2017
Zenasni, Sarah; Kergosien, Eric; Roche, Mathieu; Teisseire, Maguelonne, 2017, "A corpus of 1000 authentic SMS in French with spatial labels", https://doi.org/10.18167/DVN1/0ZGJRC, CIRAD Dataverse, V2
Extract of 1000 authentic French SMS from a corpus of more than 88000 SMS (http://88milsms.huma-num.fr/). Spatial entities are tagged (with label). First, an automatic labelling approach based on text-mining techniques is applied in order to obtain the first corpus ("corpus1_auto...