[Defended thesis] Sarah Valentin: Extraction and aggregation of information from multi-source data for international monitoring of infectious animal diseases

Sarah defended her thesis on Thursday 24 September at 3:30 pm at the Maison de la Télédétection (room Silva/Saltus/Ager).

Sarah’s PhD is #DigitAg-labeled

Extraction and aggregation of information from multi-source data for international monitoring of infectious animal diseases

  • Start Date: September 2017
  • University: MUSE Montpellier University of Excellence
  • PhD School: I2S (Information, Structures, Systèmes) Montpellier
  • Field(s): Computer Science, Epidemiology
  • Co-supervisors: Mathieu Roche (Cirad, UMR TETIS), Renaud Lancelot (Cirad, UMR ASTRE)
  • Funding: Cirad
  • #DigitAg: Labeled PhD – Challenge 8 – Axis 4 & 5

Keywords: epidemic intelligence, animal health, unstructured data, web


Monitoring animal health worldwide, especially the early detection of outbreaks of emerging and exotic pathogens, is one way to prevent the introduction of infectious diseases into France. In that context, the French epidemic intelligence team for international monitoring of animal health has created a tool dedicated to the automatic surveillance of electronic media. This tool is based on a text mining approach that detects, collects, classifies and extracts information from unstructured textual data in media reports available on the Web. In addition to methodological improvements (spatial uncertainty, automatic classification), the thesis focuses on the identification, combination and qualification of information and indicators from multi-source data to detect the emergence of animal diseases.


Contact: sarah.valentin [AT] cirad.fr – 07 81 89 28 50

Networks: LinkedIn, ResearchGate

Communications / Publications

Valentin S., Arsevska E., Mercier A., Falala S., Rabatel J., Lancelot R., Roche M. (2020) PADI-web: An Event-Based Surveillance System for Detecting, Classifying and Processing Online News, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

R. Goel, S. Valentin, A. Delaforge, S. Fadloun, A. Sallaberry, M. Roche, P. Poncelet (2020) EpidNews: Extracting, exploring and annotating news for monitoring animal diseases, Journal of Computer Languages

Rohan Goel, Samiha Fadloun, Sarah Valentin, Arnaud Sallaberry, Mathieu Roche, Pascal Poncelet (2018). EpidNews: An Epidemiological News Explorer for Monitoring Animal Diseases. VINCI ’18 Proceedings of the 11th International Symposium on Visual Information Communication and Interaction, Växjö, Sweden — August 13 – 15, 2018, pp 1-8 – https://doi.org/10.1145/3231622.3231624

Elena Arsevska, Sarah Valentin, Julien Rabatel, Jocelyn de Goër de Hervé, Sylvain Falala, Renaud Lancelot, Mathieu Roche (2018). Web monitoring of emerging animal infectious diseases integrated in the French Animal Health Epidemic Intelligence System. PLOS One – https://doi.org/10.1371/journal.pone.0199960

Sarah Valentin, Renaud Lancelot, Mathieu Roche (2018). Surveillance automatisée de sources non-officielles pour la veille sanitaire internationale : évaluation rétrospective sur la fièvre aphteuse en Afrique. LIRIMA Workshop “Agriculture numérique en Afrique”, Montpellier, France, 4 April 2018 – http://agritrop.cirad.fr/587790/

Sarah Valentin, Sylvain Falala, Jocelyn De Goër, Elena Arsevska, Julien Rabatel, Mathieu Roche, Renaud Lancelot (2018). An automatic animal diseases surveillance system based on textual media analysis. InnovSur 2018: ‘INNOVATION in Health Surveillance’ International Forum, Montpellier, France, 16 May 2018 (poster) – http://agritrop.cirad.fr/588023/