Kevin Riou
PhD in Computer Vision - Embodied AI - Imitation learning
Exploring Entrepreneurship

Nantes Université

Location: Nantes, France
Education | Experience | Publications | Services

Email: kevin.riou97@gmail.com
[Google Scholar] [Linkedin] [Research Lab]

About Me

My interest in embodied AI began in 2018 during my M.Sc. (engineering school), where I proposed and led a project on a fruit-harvesting robot. Working with Prof. Patrick Le Callet, we developed few-shot learning models to detect new fruits from minimal examples. While the computer vision worked well, manually programming a robot to crop each new fruit variety proved a major bottleneck. This challenge led me to focus on imitation learning for my PhD—a promising approach to bypass the need for task-specific programming in robotics.

We therefore defined a PhD project with Prof. Patrick Le Callet and Dr. Kevin Subrin, respectively my main and co-supervisors. The project focused on enabling robots to understand human video demonstrations and subsequently reproducing demonstrated tasks within their own action space, even in new environments. This is what we called "Embodiment and Environment Agnostic Imitation Learning for robots".

While we were building my PhD project and searching for fundings, from 2020 to 2021, I worked at Capacités on tactile exploration strategies learnt by reinforcement learning. The ultimate goal was to develop a mine sweeping robot that could explore and recognize objects burried in the ground using tactile sensors. It was a great opportunity to apprehend the challenges associated with embodied AI and real world robotics applications.

In october 2021, I started my PhD at Nantes Université in the LS2N lab. I defended this PhD in January 2025 🎥 [Recording Link] 🎥. This project involved 3D Human Pose Estimation, Action Recognition, 0-shot object detection/segmentation with large vision-and-language models, and diverse imitation learning strategies.

Below are teasers of some of the solutions we developed during my PhD:

Multi-View 3D Human Pose Estimation
Scene Understanding
Imitation Learning


What's next?
I am currently exploring entrepreneurship opportunities to transfer my skills and knowledge to create innovative solutions that address unmet needs, bridging the gap between cutting-edge research and practical real-world applications.

Education

  • 2021 - 2025     Ph.D. in Nantes Université, LS2N.
  • 2017 - 2020     Master of Engineering in Electronics and Digital Technologies in Nantes Université.
  • 2015 - 2017     Preparatory Classes in Math-Physics in Lycée Kerichen, Brest.

Experience

Publications

3D Human Pose Estimation:

Geometric Consistency-Guaranteed Spatio-Temporal Transformer for Unsupervised Multi-View 3D Pose Estimation
Kaiwen Dong, Kevin Riou, Jingwen Zhu, Andreas Pastor, Kevin Subrin, Yu Zhou, Xiao Yun, Yanjing Sun, Patrick Le Callet
IEEE Transactions on Instrumentation and Measurement (TIM), 2024.
Keywords: 3D human pose estimation, Multi-view-and-temporal transformer, Unsupervised learning, Scene pose estimation.
[PDF] [ Bibtex]

Evaluating 3d human pose estimation in occluded multi-sensor scenarios: dataset and annotation approach
Kevin Riou, Kaiwen Dong, Yujie Huang, Kevin Subrin, Patrick Le Callet, Yanjing Sun
IEEE International Conference on Image Processing (ICIP), 2024.
Keywords: Multi-sensor data acquisition, 3D Human Pose Annotation (3DHPE) tool, 3DHPE under occlusions. .
[PDF] [ Bibtex]


Action recognition:

Behavioral Recognition of Skeletal Data Based on Targeted Dual Fusion Strategy
Xiao Yun, Chenglong Xu, Kevin Riou, Kaiwen Dong, Yanjing Sun, Song Li, Kevin Subrin, Patrick Le Callet
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2024.
Keywords: Video understanding, Activity recognition, Graph convolutional networks, Multi-modalities fusion.
[PDF] [ Bibtex]


Imitation Learning:

Vision Foundation Models for an embodiment and environment agnostic scene representation for robotic manipulation
Kevin Riou, Kevin Subrin, Patrick Le Callet
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), on Brain over Brawn Workshop (BoB)(https://bob-workshop. github. io/) , 2024.
Keywords: Imitation learning, Large vision-language-model (0-shot detection), Diffusion policy, RGB-D Human pose estimation.
[PDF] [ Bibtex]

From Temporal-evolving to Spatial-fixing: A Keypoints-based Learning Paradigm for Visual Robotic Manipulation
Kevin Riou, Kaiwen Dong, Kevin Subrin, Yanjing Sun, Patrick Le Callet
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023.
Keywords: Imitation learning, Keypoints-based bahavior cloning, Multi-view transformer.
[PDF] [ Bibtex]


Reinforcement learning based tactile exploration:

Reinforcement Learning Based Tactile Sensing for Active point cloud Acquisition, Recognition and Localization
Kevin Riou, Kaiwen Dong, Kevin Subrin, Patrick Le Callet
IEEE Journal of Selected Topics in Signal Processinge (JSTSP), 2024.
Keywords: Reinforcement learning, Tactile exploration, Point-cloud processing/classification.
[PDF] [ Bibtex]

Reinforcement Learning Based Point-Cloud Acquisition and Recognition Using Exploration-Classification Reward Combination
Kevin Riou, Kevin Subrin, Patrick Le Callet
IEEE International Conference on Multimedia and Expo (ICME), 2022.
Keywords: Reinforcement learning, Tactile exploration, Point-cloud processing/classification.
[PDF] [ Bibtex]

Seeing by haptic glance: Reinforcement learning based 3d object recognition
Kevin Riou, Suiyi Ling, Guillaume Gallot, Patrick Le Callet
IEEE International Conference on Image Processing (ICIP), 2021.
Keywords: Reinforcement learning, Tactile exploration, Point-cloud processing/classification.
[PDF] [ Bibtex]


Others:

Kinetic particles: from human pose estimation to an immersive and interactive piece of art questionning thought-movement relationships
Mickael Lafontaine, Julie Cloarec-Michaud, Kevin Riou, Yujie Huang, Kaiwen Dong, Patrick Le Callet
ACM International Conference on Interactive Media Experiences (IMX), 2023.
Keywords: Interactive digital art, 3D human pose estimation.
🏆 Runner-up demo award 🏆
[PDF] [ Bibtex]

Multi-layer perceptron for network intrusion detection: From a study on two recent data sets to deployment on automotive processor
Arnaud Rosay, Kevin Riou, Florent Carlier, Pascal Leroux
Annals of Telecommunications, 2022.
Keywords: Network (Ethernet) intrusion detection, Real-time Ethernet flow processing, Embedded AI .
[PDF] [ Bibtex]

Few-shot object detection in real life: case study on auto-harvest
Kevin Riou, Jingwen Zhu, Suiyi Ling, Mathis Piquet, Vincent Truffault, Patrick Le Callet
IEEE International Workshop on Multimedia Signal Processing (MMSP), 2020.
Keywords: Few-shot object detection, Data augmentation, Fruit detection dataset.
[PDF] [ Bibtex]

Services

  • Reviewer - IEEE International Conference on Multimedia and Expo (ICME)
  • Reviewer - IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
  • Reviewer - IEEE International Conference on Image Processing (ICIP)
  • Reviewer - European conference on signal processing (EUSIPCO)
  • Student volunteer (organization) - ACM International Conference on Interactive Media Experiences (IMX) 2023