I’m Baoqi Pei(裴宝琦). I am a third-year Ph.D. student at College of Computer Science and Technology, Zhejiang University, supervised by Prof. Fei Wu and Prof Yu Qiao, and I work closely with Yifei Huang. Prior to this, I got my Bachelor’s degree from Beihang University in 2023.
My research interest includes general video understanding, egocentric vision perception and multimodal large language models.
🔥 News
- 2025.09: Two papers EgoThinker and Egoexobench were accepted to NeurIPS 2025.
- 2025.06: Our Vinci was accepted to IMMUT 2025.
- 2025.06: Our CoQo was accepted to IJCV.
- 2024.12: 3 papers EgoHOD, EgoExo-Gen and CG-Bench were accepted to ICLR 2025.
- 2024.07: Our EgoVideo won 7 championships in EgoVis Challenge at CVPR 2024 Workshop.
- 2024.06: Internvideo2 was accepted to ECCV 2024.
- 2024.01: EgoexoLearn was accepted to CVPR 2024.
📝 Publications

EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT
Baoqi Pei, Yifei Huang, Jilan Xu, Yuping He, Guo Chen, et al.
- A framework which equips MLLMs with strong egocentric reasoning via EgoRe-5M dataset, spatio-temporal chain-of-thought supervision and a two-stage training stage.

EgoHOD: Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
Baoqi Pei, Yifei Huang, Jilan Xu, Guo Chen, et al.
- An egocentric video-language pretrained model that learns fine-grained egocentric video representations by modeling hand-object dynamics.

Vinci: A real-time embodied smart assistant based on egocentric vision-language model
Yifei Huang*, Jilan Xu*, Baoqi Pei*, Lijin Yang, MingFang Zhang, Yuping He, Guo Chen, et al.
- A real-time egocentric wearable assistant to assist users with daily tasks, including scene understanding, grounding, summarization, and future planning.

CoQo: Guiding Audio-Visual Question Answering with Collective Question Reasoning
Baoqi Pei, Yifei Huang, Guo Chen, Jilan Xu, et al.
- A multimodal model to parse AVQA task with a Question Guided Transformer and Collective Question-Answering Training strategy.

Internvideo2: Scaling foundation models for multimodal video understanding
Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Jilan Xu, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang
- A foundation model for video / text / audio understanding, achieving SOTA over several benchmarks.
- Egoexobench: A benchmark for first-and third-person view video understanding in mllms, Yuping He, Yifei Huang, Guo Chen, Baoqi Pei, et al. NeurIPS 2025
- Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision, Yuping He, Yifei Huang, Guo Chen, Lidong Lu, Baoqi Pei, et al. Arxiv 2025
- X-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos, Jilan Xu, Yifei Huang, Baoqi Pei, Junlin Hou, Qingqiu Li, Guo Chen, et al. ICLR 2025
- Cg-bench: Clue-grounded question answering benchmark for long video understanding, Guo Chen, Yicheng Liu, Yifei Huang, Yuping He, Baoqi Pei, Jilan Xu, Yali Wang, Tong Lu, Limin Wang. ICLR 2025
- Egovideo: Exploring egocentric foundation model and downstream adaptation, Baoqi Pei, Guo Chen, Jilan Xu, Yuping He, Yicheng Liu, et al. EgoVis Challenge
- Video mamba suite: State space model as a versatile alternative for video understanding, Guo Chen*, Yifei Huang*, Jilan Xu*, Baoqi Pei*, Zhe Chen, Zhiqi Li, Jiahao Wang, Kunchang Li, Tong Lu, Limin Wang. Arxiv 2024
- Egoexolearn: A dataset for bridging asynchronous ego-and exo-centric view of procedural activities in real world, Yifei Huang, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, et al. Arxiv 2024
📖 Educations
- 2023.09 - Present, Ph.D. in College of Computer Science and Technology, Zhejiang University.
- 2019.09 - 2023.06, B.Sc. in College of Computer Science, BeiHang University.
🎖 Honors and Awards
- Winner of the 7 tracks in the 1st EgoVis Workshop @ CVPR 2024
- Distinguished Paper Award in Egovis 2023/2024
- Outstanding Graduate Student in Zhejiang University, 2025
- Outstanding Student Scholarship in Beihang University, 2021