Yiming Dou 窦铱明

I'm a third-year Ph.D. student at Cornell Tech, advised by Prof. Andrew Owens.

I received my M.S.E. from the University of Michigan in 2025, before transferring to Cornell. Prior to that, I received my B.Eng. and B.Ec. from Shanghai Jiao Tong University in 2023, with an honors degree from Zhiyuan College. During my undergraduate, I was fortunate to work with Prof. Ruohan Gao and Prof. Jiajun Wu at Stanford. I also worked closely with Prof. Yong-Lu Li and Prof. Cewu Lu at SJTU.

My research interests mainly lie in multimodal perception, reasoning and robot learning.

Email  ·  Google Scholar  ·  Github  ·  Twitter  ·  WeChat

profile photo
News

  • 08/2025: 🥳 Our paper "Cross-Sensor Touch Generation" is selected as an oral presentation at CoRL 2025!
  • 08/2025: 🎉 One paper accepted to CoRL 2025! See you in Seoul!
  • 02/2025: 🎉 One paper accepted to CVPR 2025! See you in Nashville!
  • 01/2025: 🎉 Two papers accepted to ICRA 2025! See you in Atlanta!
  • 09/2024: 🥳 Honored to be selected as Outstanding Reviewer for ECCV 2024!
  • 02/2024: 🎉 Three papers accepted to CVPR 2024! See you in Seattle!
  • 06/2023: 🥳 Graduated from SJTU with honors degree!
  • 02/2023: 🎉 One paper accepted to CVPR 2023!
  • Research Interests

    Humans perceive the world with multiple senses, based on which we establish abstract concepts to understand it. From the concepts we develop logical reasoning ability, and thus creating brilliant achievements. Inspired by this, my dream is to design human-like multisensory intelligent systems, which can be divided into four specific problems:

  • Multimodal Perception: how to perceive and model the multimodal physical world.
  • Concept Learning: how to abstract the perceived information into high-level concepts.
  • Reasoning: how to perform causal reasoning on the basis of concepts.
  • Robot Learning: how to enable robots to actively interact with the real-world environments and humans.
  • Publications ( / )

    (* indicates equal contribution)

    Cross-Sensor Touch Generation
    Samanta Rodriguez*, Yiming Dou*, Miquel Oller, Andrew Owens, Nima Fazeli
    CoRL 2025 (Oral)
    Coming soon!

    We learn to translate touch signals captured from one touch sensor to another, which allows us to transfer object manipulation policies between sensors.

    Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes
    Yiming Dou, Wonseok Oh, Yuqing Luo, Antonio Loquercio, Andrew Owens
    CVPR 2025
    paper · project page

    We make 3D scene reconstruction interactive by predicting the sounds of human hands physically interacting with the scene.

    Tactile-Augmented Radiance Fields
    Yiming Dou, Fengyu Yang, Yi Liu, Antonio Loquercio, Andrew Owens
    CVPR 2024
    paper · project page · code

    We present a visuo-tactile 3D scene representation that can estimate the visual and tactile signals for a given 3D position within the scene.

    The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects
    Ruohan Gao*, Yiming Dou*, Hao Li*, Tanmay Agarwal, Jeannette Bohg, Yunzhu Li, Li Fei-Fei, Jiajun Wu
    CVPR 2023
    paper · project page · code · interactive demo · video

    We introduce a benchmark suite for multisensory object-centric learning with sight, sound, and touch. We also introduce a dataset including the multisensory measurements for real-world objects

    Teaching

  • Graduate Student Instructor (GSI), EECS 442: Computer Vision, Fall 2023
  • Service

  • Conference reviewer: ICRA 2025, ICLR 2025, AAAI 2025, ECCV 2024, CVPR 2023-2025, ICCV 2023-2025, ACMMM 2025, ACMMM Asia 2025
  • Journal reviewer: IEEE RA-L
  • Experience
    Cornell Tech
    2025.08 ~ Present
    New York, U.S.
    Ph.D. Student in Computer Science
    Advisor: Prof. Andrew Owens
    University of Michigan
    2023.08 ~ 2025.08
    Ann Arbor, U.S.
    M.S.E. in Computer Science and Engineering
    Advisor: Prof. Andrew Owens
    Completed first two years of Ph.D. before transferring to Cornell
    Stanford University
    2022.03 ~ 2023.04
    Stanford, U.S.
    Visiting Research Intern
    Supervisor: Prof. Ruohan Gao, Prof. Jiajun Wu and Prof. Fei-Fei Li
    Shanghai Jiao Tong University
    2019.09 ~ 2023.06
    Shanghai, China
    B.Eng. (Honors) in Computer Science and Technology
    B.Ec. (Minor) in Economics
    Member of Zhiyuan Honors Program
    Supervisor: Prof. Cewu Lu and Prof. Yong-Lu Li
    Selected Honors

  • Outstanding Reviewer, ECCV 2024
  • Zhiyuan Scholarship (top 30 students), SJTU, 2023
  • Outstanding Graduate, SJTU, 2023
  • Academic Excellence Scholarship (top 10%), SJTU, 2022
  • Zhanjiajun Scholarship (six winners at SJTU), SJTU, 2022
  • Meritorious Winner (top 7%), MCM, 2022
  • Merit Student Award (top 5%), SJTU, 2021
  • Zhiyuan Honors Scholarship (top 5%), SJTU, 2019-2022
  • Misc.

    As a person working on building multisensory systems, I also enjoy being a multisensory embodied agent outside of work:

  • 👁 Photography: I'm always grateful to have grown up in a family that loves photography (my parents own a collection of various cameras released from 1999 to 2020, and they are all functioning!). I've been learning to take photos since I was 7 years old, and have been fortunate to capture some impressive moments along the way. See some of them here!
  • 👂 Classical music: I love listening to classical music, especially those from the Viennese Classic period to the Romantic period. (alphabetical order)
  • 💪 Tennis: Despite having been playing for 2+ years, I still regard myself as a beginner -- probably around NTRP level 3.0? -- but I really enjoy it and look forward to getting better!
  • 🦵 Running: I usually go running in the afternoon after work and it helps me regain my mental clarity. My short-term goal is to complete a half marathon!
  • Template from Jon Barron's website
    Last updated on Aug 11, 2025