PhD position F/M: Ontologies for research processes

Montbonnot-Saint-Martin

Fixed-term contract (CDD)

INRIA

Published on 22 May

Job description

The job description below is in English.

Contract type: fixed-term contract (CDD)

Required degree level: Master's degree (Bac+5) or equivalent

Position: PhD student

Desired experience level: recent graduate

About the centre or functional department

The Centre Inria de l’Université Grenoble Alpes brings together almost 450 people in 26 research teams and 9 research support departments.

Staff are based on three campuses in Grenoble and work in close collaboration with other research and higher education institutions (Université Grenoble Alpes, CNRS, CEA, INRAE, …), as well as with key economic players in the area.

The Centre Inria de l’Université Grenoble Alpes is active in the fields of high-performance computing, verification and embedded systems, modeling of the environment at multiple levels, and data science and artificial intelligence. The centre is a top-level scientific institute with an extensive network of international collaborations in Europe and the rest of the world.

Assignment

Descriptions of experimental and simulation settings are key to the interpretation and reproducibility of scientific results. However, these settings are not currently described in a way that makes them automatically exploitable. We aim to define representations of scientific processes that enable their querying, analysis, comparison and reproduction.

Research reliability relies partly on data recording and communication. Although data is important, it is no less important to record and publish the processes that led to the production of this data. The data may be collected through confirmatory experiments, simulations or evaluations. To be useful, process descriptions must refer to many facets of the process, such as hypotheses, code and model, parameters, and measures collected.

Recording such processes in a relatively formal way brings many opportunities:

1. Reproducibility: automatic process rerun and data re-analysis;
2. Repurposability: production of new processes by modifying the description [Werner et al., 2024];
3. Presentation: automatic generation of process reports;
4. Collection: aggregating experiment descriptions for retrieving, querying and comparing them [Euzenat, 2022]. Ideally, it will be possible to generate a meta-analysis on a specific topic from a set of descriptions.

This contributes to the objective of making research data Findable, Accessible, Interoperable and Reusable, i.e. FAIR [Wilkinson et al., 2016].

We aim to develop formal descriptions of research processes that enable this. The goal of this thesis proposal is to design, develop and evaluate descriptions expressed with relevant 'semantic' technologies.

Defining ontologies for research process description, building on existing generic ontologies and field ontologies, should help answer such queries.

For that purpose, it will be necessary to identify generic experiment life cycles, covering the design, execution and analysis stages of experiments, and to leverage semantic technologies (RDF, OWL, SPARQL). Semantic models (ontologies) that match a general notion of experiment will have to be designed. They can take inspiration from and build on existing generic ontologies:

1. frbr: for documenting work of thought;
2. prov: for recording the provenance of resources;
3. researchobjects: for describing research artefacts;
4. i-adopt: for describing scientific variables;
5. etc.

and existing efforts:

1. ontologies for representing experiment protocols [Giraldo et al., 2017];
2. ODD for describing agent-based simulations [Grimm et al., 2020];
3. the COMSES computational model library [Rollins et al., 2014];
4. etc.
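
As a rough illustration of what such a description could look like, the sketch below records two hypothetical simulation runs as subject-predicate-object triples and queries them in plain Python. The prov:used and prov:wasGeneratedBy properties come from the PROV ontology; every ex: identifier, the helper functions and the parameter values are assumptions made up for this example. In practice, the triples would be expressed in RDF and queried with SPARQL.

```python
from typing import Optional

Triple = tuple[str, str, str]

# Hypothetical description of two simulation runs. The "prov:" terms are
# PROV-O properties; all "ex:" identifiers and values are invented.
DESCRIPTION: list[Triple] = [
    ("ex:run42", "rdf:type", "prov:Activity"),
    ("ex:run42", "prov:used", "ex:mobilityModel-v1"),
    ("ex:run42", "prov:used", "ex:params-A"),
    ("ex:params-A", "ex:populationSize", "1000"),
    ("ex:results42", "prov:wasGeneratedBy", "ex:run42"),
    ("ex:run43", "rdf:type", "prov:Activity"),
    ("ex:run43", "prov:used", "ex:mobilityModel-v1"),
    ("ex:run43", "prov:used", "ex:params-B"),
    ("ex:params-B", "ex:populationSize", "5000"),
    ("ex:results43", "prov:wasGeneratedBy", "ex:run43"),
]


def match(triples: list[Triple],
          s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> list[Triple]:
    """Return the triples matching a pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def inputs_of(triples: list[Triple], result: str) -> set[str]:
    """Collect everything used by the activity that generated `result`."""
    inputs: set[str] = set()
    for _, _, activity in match(triples, s=result, p="prov:wasGeneratedBy"):
        for _, _, used in match(triples, s=activity, p="prov:used"):
            inputs.add(used)
    return inputs


def runs_using(triples: list[Triple], resource: str) -> list[str]:
    """List, in sorted order, the activities that used `resource`."""
    return sorted(s for s, _, _ in match(triples, p="prov:used", o=resource))


# Retrieval: which runs relied on a given model?
shared_model_runs = runs_using(DESCRIPTION, "ex:mobilityModel-v1")

# Comparison: which inputs differ between the two result sets?
differing_inputs = (inputs_of(DESCRIPTION, "ex:results42")
                    ^ inputs_of(DESCRIPTION, "ex:results43"))
```

Here shared_model_runs lists both runs, and differing_inputs contains only the two parameter sets, which is the kind of retrieval and comparison a collection of experiment descriptions is meant to support.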

Simulation of various (human) processes in a territory digital twin will constitute a central use case for the project. Digital twins may be used to measure or to simulate phenomena occurring in the actual artifact; these are called experiments here. Such assessments must be properly indexed to guarantee their highest usability, i.e. so that they can be retrieved according to various criteria. In the specific case of digital twins, the descriptions can also be used to compare the effects of simulations with what happens in the actual process.

However, the technologies developed should be sufficiently general to apply to other use cases. We plan to consider other fields, such as machine learning for weather forecasting and experiments in cultural evolution [Bourahla et al., 2021].

The proposal relies on ontological modelling but it would benefit from epistemological thinking about the nature and role of scientific processes.

References:

[Bourahla et al., 2021] Yasser Bourahla, Manuel Atencia, Jérôme Euzenat, Knowledge improvement and diversity under interaction-driven adaptation of learned ontologies, Proc. 20th AAMAS, London (UK), pp. 242-250, 2021

[Giraldo et al., 2017] Olga Giraldo, Alexander Garcia, Federico Lopez, Oscar Corcho, Using semantics for representing experimental protocols, Journal of Biomedical Semantics 8, 52, 2017

[Rollins et al., 2014] Nathan Rollins, Michael Barton, Sean Bergin, Marco Janssen, Allen Lee, A Computational Model Library for publishing model documentation and code, Environmental modelling and software 61.4:59–64, 2014

[Wilkinson et al., 2016] Mark Wilkinson, Michel Dumontier, IJsbrand Jan Aalbersberg, Gabrielle Appleton, Myles Axton, Arie Baak, Niklas Blomberg, Jan-Willem Boiten, Luiz Bonino da Silva Santos, Philip Bourne, Jildau Bouwman, Anthony Brookes, Tim Clark, Mercè Crosas, Ingrid Dillo, Olivier Dumon, Scott Edmunds, Chris Evelo, Richard Finkers, Alejandra Gonzalez-Beltran, Alasdair Gray, Paul Groth, Carole Goble, Jeffrey Grethe, Jaap Heringa, Peter A.C ’t Hoen, Rob Hooft, Tobias Kuhn, Ruben Kok, Joost Kok, Scott Lusher, Maryann Martone, Albert Mons, Abel Packer, Bengt Persson, Philippe Rocca-Serra, Marco Roos, Rene van Schaik, Susanna-Assunta Sansone, Erik Schultes, Thierry Sengstag, Ted Slater, George Strawn, Morris Swertz, Mark Thompson, Johan van der Lei, Erik van Mulligen, Jan Velterop, Andra Waagmeester, Peter Wittenburg, Katherine Wolstencroft, Jun Zhao, Barend Mons, The FAIR Guiding Principles for scientific data management and stewardship, Scientific data 3:160018, 2016

Experiment repository:

Main activities

Doctoral school: (Jerome:Euzenat#inria:fr) or Cássia Trojahn dos Santos (Cassia:Trojahn-dos-Santos#univ-grenoble-alpes.fr) and a colleague from the. The employer will be INRIA; the candidate will be subject to ZRR clearance.

Group: The work will be carried out in the (near Grenoble, France), a main computer science research lab, in a stimulating research environment.

Skills

Qualification: Master or equivalent in computer science.

Desired skills:

1. Curiosity and openness.
2. Interaction with other researchers.
3. Ability to work autonomously.
4. Interest in epistemology or the methodology of sciences.
5. Innovative mindset.

Further info:

Benefits

1. Subsidized meals
2. Partial reimbursement of public transport costs
3. Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
4. Possibility of teleworking and flexible organization of working hours
5. Professional equipment available (videoconferencing, loan of computer equipment, etc.)
6. Social, cultural and sports events and activities
7. Access to vocational training
8. Social security coverage, subject to conditions

Remuneration

Gross salary: 2300 euros per month
