18 Dec 2017
Rabatel, Julien; Arsevska, Elena; de Goër de Hervé, Jocelyn; Falala, Sylvain; Lancelot, Renaud; Roche, Mathieu, 2017, "PADI-web corpus: news manually labeled", https://doi.org/10.18167/DVN1/KMTIFG, CIRAD Dataverse, V2
This dataset contains a set of news articles in English related to animal disease outbreaks that have been used to evaluate and train the information extraction module of the PADI-web system (http://epia.clermont.inra.fr/vsi). It is composed of 532 articles (in JSON), with infor... |
6 Nov 2017
Zenasni, Sarah; Kergosien, Eric; Roche, Mathieu; Teisseire, Maguelonne, 2017, "Dic-ES : Liste d'entités spatiales en français", https://doi.org/10.18167/DVN1/LPY080, CIRAD Dataverse, V1
The "dic-ES" dictionary is a set of place names compiled from lists provided by (1) the Montpellier metropolitan area (streets, districts, etc.); (2) the European Metropolis of Lille; (3) the names of countries and the capital of each country. The dictionary also contains a... |
19 Sep 2017
Roche, Mathieu; Teisseire, Maguelonne; Shrivastava, Gaurav, 2017, "Valorcarn-TETIS: Terms extracted with Biotex", https://doi.org/10.18167/DVN1/PGQGQL, CIRAD Dataverse, V1
Text-Mining: Terms extracted with Biotex tool (http://tubo.lirmm.fr/biotex) from "Valorcarn Corpus" (http://dx.doi.org/10.18167/DVN1/7YTQGQ). -- Valorcarn Project (2015-2017) [project supported by GloFoodS program (INRA-Cirad)]. Topic: Mining of scientific documents for identific... |
19 Sep 2017
Roche, Mathieu; Teisseire, Maguelonne; Shrivastava, Gaurav, 2017, "Valorcarn-TETIS: Terms extracted with Rake", https://doi.org/10.18167/DVN1/YGYL3W, CIRAD Dataverse, V1
Text-Mining: Terms extracted with Rake tool (https://github.com/aneesha/RAKE) from "Valorcarn Corpus" (http://dx.doi.org/10.18167/DVN1/7YTQGQ). Valorcarn Project (2015-2017) [project supported by GloFoodS program (INRA-Cirad)]. Mining of scientific documents for identification of... |
19 Sep 2017
Roche, Mathieu; Teisseire, Maguelonne; Shrivastava, Gaurav, 2017, "Valorcarn-TETIS: Fusion of terms extracted with Biotex and Fastr", https://doi.org/10.18167/DVN1/CFBIYD, CIRAD Dataverse, V1
Text-Mining: Fusion of terms extracted with Biotex and Fastr from "Valorcarn Corpus" (http://dx.doi.org/10.18167/DVN1/7YTQGQ). -- Valorcarn Project (2015-2017) [project supported by GloFoodS program (INRA-Cirad)]. Topic: Mining of scientific documents for identification of proces... |
19 Sep 2017
Roche, Mathieu; Teisseire, Maguelonne; Shrivastava, Gaurav, 2017, "Valorcarn-TETIS: Variations of terms extracted with Fastr (driven extraction)", https://doi.org/10.18167/DVN1/LPBHWP, CIRAD Dataverse, V1
Text mining: Extraction of term variations. Input: (1) list of terms, (2) corpus ("Valorcarn Corpus" - http://dx.doi.org/10.18167/DVN1/7YTQGQ). For instance, with "biltong samples", we obtain "biltong spice sample", "samples to produce biltong", etc. -- Valorcarn Pro... |
19 Sep 2017
Roche, Mathieu; Teisseire, Maguelonne; Shrivastava, Gaurav, 2017, "Valorcarn-TETIS: Semantic groups of terms", https://doi.org/10.18167/DVN1/0WEHKT, CIRAD Dataverse, V1
Text-Mining: The extracted terms are gathered according to the head (first and last words), e.g. (1) food consumption / food pathogen / food preservation, (2) spoiled biltong / venison biltong / wet biltong, and so forth. -- Valorcarn Project (2015-2017) [project supported by GloFoo... |
19 Sep 2017
Roche, Mathieu; Teisseire, Maguelonne; Shrivastava, Gaurav, 2017, "Valorcarn-TETIS: Candidates for OTR (Ontological and Terminological Resource)", https://doi.org/10.18167/DVN1/KNFAGG, CIRAD Dataverse, V1
Text Mining: The different terms extracted by text-mining approaches are candidates for an OTR (Ontological and Terminological Resource) associated with the Valorcarn Project. -- Valorcarn Project (2015-2017) [project supported by GloFoodS program (INRA-Cirad)]. Topic: Mining of scient... |
8 Sep 2017
Zenasni, Sarah; Kergosien, Eric; Roche, Mathieu; Teisseire, Maguelonne, 2017, "A corpus of 1000 authentic SMS in French with spatial labels", https://doi.org/10.18167/DVN1/0ZGJRC, CIRAD Dataverse, V2
Extract of 1000 authentic French SMS from a corpus of more than 88000 SMS (http://88milsms.huma-num.fr/). Spatial entities are tagged (with label). First, an automatic labelling approach based on text-mining techniques is applied in order to obtain the first corpus ("corpus1_auto... |
5 Sep 2017
Roche, Mathieu; Teisseire, Maguelonne; Shrivastava, Gaurav, 2017, "Valorcarn-TETIS: Terms extracted with Fastr (free extraction)", https://doi.org/10.18167/DVN1/FC2YXC, CIRAD Dataverse, V1
Text-Mining: Terms extracted with FASTR tool (free extraction) from "Valorcarn Corpus" (http://dx.doi.org/10.18167/DVN1/7YTQGQ). -- Valorcarn Project (2015-2017) [project supported by GloFoodS program (INRA-Cirad)]. Topic: Mining of scientific documents for identification of proc... |