A propos d'Inria
Inria est l'institut national de recherche dédié aux sciences et technologies du numérique. Il emploie 2600 personnes. Ses 215 équipes-projets agiles, en général communes avec des partenaires académiques, impliquent plus de 3900 scientifiques pour relever les défis du numérique, souvent à l'interface d'autres disciplines. L'institut fait appel à de nombreux talents dans plus d'une quarantaine de métiers différents. 900 personnels d'appui à la recherche et à l'innovation contribuent à faire émerger et grandir des projets scientifiques ou entrepreneuriaux qui impactent le monde. Inria travaille avec de nombreuses entreprises et a accompagné la création de plus de 200 start-up. L'institut s'eorce ainsi de répondre aux enjeux de la transformation numérique de la science, de la société et de l'économie. Post-Doctoral Research Visit F/M Safe AI Planning and Reinforcement learning using Formal Methods
Le descriptif de l'offre ci-dessous est en Anglais
Type de contrat : CDD
Contrat renouvelable : Oui
Niveau de diplôme exigé : Thèse ou équivalent
Fonction : Post-Doctorant
Niveau d'expérience souhaité : Jusqu'à 3 ans
A propos du centre ou de la direction fonctionnelle
The Inria Centre at Rennes University is one of Inria's nine centres and has more than thirty research teams. The Inria Centre is a major and recognized player in the field of digital sciences. It is at the heart of a rich R&D and innovation ecosystem: highly innovative PMEs, large industrial groups, competitiveness clusters, research and higher education players, laboratories of excellence, technological research institute, etc.
Contexte et atouts du poste
The post-doc position is part of a collaboration between Inria and Mitsubishi Electric R&D Centre Europe (MERCE) within the FRAIME project on artificial intelligence and formal methods. The project explores, on the one hand, how Formal Methods can provide guarantees on AI systems, and on the other hand how AI can help Formal Methods to be more efficient and easier to use by practitioners. The vision is to intertwine Formal Methods and AI to efficiently design safe systems.
This is a postdoctoral position in the fields of AI planning, reinforcement learning (RL), and formal methods. The position is initially funded for 12 months, but it is further extensible to at least another year.
While this is an academic position based at Inria Rennes, the candidate will collaborate with researchers from both Inria and MERCE, thus benefiting from both academic and industrial research environments.
The work will be done in collaboration with Nathalie Bertrand and Ocan Sankur (Inria DEVINE team https://devine.inria.fr/) and Benoît Boyer (MERCE).
Mission confiée
The main objective is to develop safe planning and reinforcement learning algorithms with various degrees of confidence for variants of Markov decision processes.
More precisely, we will develop algorithms for multi-environment MDPs, partially observable MDPs, and their variants and apply these in appropriate applications provided by MERCE.
We will focus on developing practical solutions for these formalisms. Some possibilities are to develop solutions based on dynamic programming over finite horizon, or using mathematical solvers, or adapting reinforcement learning algorithms to the desired context. Furthermore, the candidate can also study theoretical properties of the developed algorithms such as their complexity, optimality, and measures such as the regret.
These algorithms are expected to be validated experimentally on appropriate case studies.
The overall objective is to contribute to the state of the art of planning and RL algorithms with strong safety guarantees.
References:
- Sun et al. Online MDP with Prototypes Information: A Robust Adaptive Approach. AAAI 2025.
- Royer et al. Multiple-environment markov decision processes: Efficient analysis and applications. ICAPS 2020.
- Chatterjee et al. The Value Problem for Multiple-Environment MDPs with Parity Objective. ICALP 2025.
Principales activités
This is a fully academic post-doc position. The candidate is expected to conduct research and work on applications in collaboration with other researchers,write papers, and present their research in conferences.
Compétences
- PhD in computer science
- Background in probability, Markov chains, MDPs
Knowledge about reinforcement learning and planning are a plus but not necessary for candidates with a strong theoretical background on MDPs.
- Good level of English
- Good communication and reporting skills
- An interest in collaborative work
Avantages
- Subsidized meals
- Partial reimbursement of public transport costs
- Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
- Possibility of teleworking (after 6 months of employment) and flexible organization of working hours
- Professional equipment available (videoconferencing, loan of computer equipment, etc.)
- Social, cultural and sports events and activities
- Access to vocational training
Rémunération
Monthly gross salary amounting to 2788 euros
En cliquant sur "JE DÉPOSE MON CV", vous acceptez nos CGU et déclarez avoir pris connaissance de la politique de protection des données du site jobijoba.com.