|
Richard Li
I am a PhD student at MIT CSAIL, where I work on robotics and machine learning.
Email
|
|
Projects
My research interest lies in finding a scalable path to robotics foundation models. I believe this will require solving cross-embodiment learning—especially transferring knowledge from human videos —as well as developing reinforcement learning methods that scale to large, real-world robotics datasets.
|
|
|
What Matters When Cotraining Robot Manipulation Policies on Everyday Human Videos?
Richard Li, Aditya Prakash, Andrew Wen, Saurabh Gupta, Yilun Du, Pulkit Agrawal
In submission
project page  / 
arXiv
Hand pose quality and embodiment-specific network specialization are key to enabling transfer from everyday videos to robots.
|
|
|
Prompt-Driven Exploration for VLA Policies
Sunshine Jiang,
John Marangola,
David Zhang,
Raghuram Kowdeed,
Ruiyang Luo,
Nitish Dashora,
Richard Li,
Pulkit Agrawal,
Zhang-Wei Hong
ICRA 2026 Workshop on VLA Pipelines for Real Robots
project page  / 
paper
Exploring in language space: a VLM iteratively refines task prompts from rollout observations, enabling VLA policies to bootstrap reinforcement learning from zero-reward starts.
|
|
|
Bimanual 3D Hand Motion and Articulation Forecasting in Everyday Images
Aditya Prakash,
Richard Li,
David Forsyth,
Saurabh Gupta
2025
project page  / 
code
Forecasting bimanual 3D hand motion and articulation from a single image using a diffusion-based lifting and forecasting pipeline.
|
|
|
Stable Object Reorientation using Contact Plane Registration
Richard Li,
Carlos Esteves,
Ameesh Makadia,
Pulkit Agrawal
ICRA 2022
project page  / 
video  / 
code
Predicting contact points with a CVAE and plane segmentation improves object generalization and handles multimodality.
|
|
|
Contact-Aware Lyapunov Controller Design via Alternating Optimization
Richard Li,
Timur Garipov
2022
paper  / 
video
Synthesizing Lyapunov controllers through contact with alternating optimization.
|
|
|
Vision-Based Proprioceptive Sensing for Soft Robotic Fingers
Richard Li,
Annan Zhang
2021
paper
Vision-based fingertip pose estimation with internal camera and CNN pose estimator.
|
|
|
Towards Practical Multi-object Manipulation using Relational Reinforcement Learning
Richard Li,
Allan Jabri,
Trevor Darrell,
Pulkit Agrawal
ICRA 2020, ICML 2020: Bridge Between Perception and Reasoning Workshop
project page  / 
video  / 
code
Multi-object, long-horizon manipulation can be autonomously learned using a curriculum and graph neural network architecture.
|
|