Yiming Dou

Yiming Dou 窦铱明

I'm a third-year Ph.D. student at Cornell Tech, advised by Prof. Andrew Owens.

I received my M.S.E. from the University of Michigan in 2025, before transferring to Cornell. Prior to that, I received my B.Eng. and B.Ec. from Shanghai Jiao Tong University in 2023, with an honors degree from Zhiyuan College. During my undergraduate, I was fortunate to work with Prof. Ruohan Gao and Prof. Jiajun Wu at Stanford. I also worked closely with Prof. Yong-Lu Li and Prof. Cewu Lu at SJTU.

I mainly work on multisensory learning and robotic manipulation.

Email · Google Scholar · Github · Twitter · WeChat

News

08/2025: 🥳 Our paper "Cross-Sensor Touch Generation" is selected as an oral presentation at CoRL 2025!

08/2025: 🎉 One paper accepted to CoRL 2025! See you in Seoul!

02/2025: 🎉 One paper accepted to CVPR 2025! See you in Nashville!

01/2025: 🎉 Two papers accepted to ICRA 2025! See you in Atlanta!

09/2024: 🥳 Selected as an Outstanding Reviewer for ECCV 2024!

02/2024: 🎉 Three papers accepted to CVPR 2024! See you in Seattle!

Research Interests

Humans perceive the world with multiple senses, based on which we establish abstract concepts to understand it. From the concepts we develop logical reasoning ability, and thus creating brilliant achievements. Inspired by this, my dream is to design human-like multisensory intelligent systems, which can be divided into four specific problems:

Multimodal Perception: how to perceive and model the multimodal physical world.

Concept Learning: how to abstract the perceived information into high-level concepts.

Reasoning: how to perform causal reasoning on the basis of concepts.

Robot Learning: how to enable robots to actively interact with the real-world environments and humans.

Publications ( / )

(* indicates equal contribution)

	Cross-Sensor Touch Generation Samanta Rodriguez, Yiming Dou, Miquel Oller, Andrew Owens, Nima Fazeli CoRL 2025 (Oral**) paper We learn to translate touch signals captured from one touch sensor to another, which allows us to transfer object manipulation policies between sensors.
	Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes Yiming Dou, Wonseok Oh, Yuqing Luo, Antonio Loquercio, Andrew Owens CVPR 2025 paper · project page We make 3D scene reconstruction interactive by predicting the sounds of human hands physically interacting with the scene.
	Tactile-Augmented Radiance Fields Yiming Dou, Fengyu Yang, Yi Liu, Antonio Loquercio, Andrew Owens CVPR 2024 paper · project page · code We present a visuo-tactile 3D scene representation that can estimate the visual and tactile signals for a given 3D position within the scene.
	The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects Ruohan Gao, Yiming Dou*, Hao Li, Tanmay Agarwal, Jeannette Bohg, Yunzhu Li, Li Fei-Fei, Jiajun Wu CVPR 2023 paper · project page · code · interactive demo · video We introduce a benchmark suite for multisensory object-centric learning with sight, sound, and touch. We also introduce a dataset including the multisensory measurements for real-world objects

Teaching

Graduate Student Instructor (GSI), EECS 442: Computer Vision, Fall 2023

Service

Conference reviewer: ICRA (2025), ICLR (2025, 2026), AAAI (2025), ECCV (2024), CVPR (2023, 2024, 2025), ICCV (2023, 2025), ACMMM (2025)

Journal reviewer: IEEE RA-L (2025)

Experience

	Cornell Tech 2025.08 ~ Present New York, U.S. Ph.D. Student in Computer Science Advisor: Prof. Andrew Owens
	University of Michigan 2023.08 ~ 2025.08 Ann Arbor, U.S. M.S.E. in Computer Science and Engineering Advisor: Prof. Andrew Owens Completed first two years of Ph.D. before transferring to Cornell
	Stanford University 2022.03 ~ 2023.04 Stanford, U.S. Visiting Research Intern Supervisor: Prof. Ruohan Gao, Prof. Jiajun Wu and Prof. Fei-Fei Li
	Shanghai Jiao Tong University 2019.09 ~ 2023.06 Shanghai, China B.Eng. (Honors) in Computer Science and Technology B.Ec. (Minor) in Economics Member of Zhiyuan Honors Program Supervisor: Prof. Cewu Lu and Prof. Yong-Lu Li

Selected Honors

Outstanding Reviewer, ECCV 2024

Zhiyuan Scholarship (top 30 students), SJTU, 2023

Outstanding Graduate, SJTU, 2023

Zhiyuan Honors Scholarship (top 5%), SJTU, 2019-2022

Misc.

As a person working on building multisensory systems, I also enjoy being a multisensory embodied agent outside of work:

👁 Photography: I've been learning to take photos since I was 7 years old, and have been fortunate to capture some impressive moments along the way. See some of them here!

👂 Classical music: I love listening to classical music, especially those from the Viennese Classic period to the Romantic period. (alphabetical order)

💪 Tennis: Despite having been playing for 2+ years, I still regard myself as a beginner -- probably around NTRP level 3.0? -- but I really enjoy it and look forward to getting better!