I’m Baoqi Pei(裴宝琦). I am a third-year Ph.D. student at College of Computer Science and Technology, Zhejiang University, supervised by Prof. Fei Wu and Prof Yu Qiao, and I work closely with Yifei Huang. Prior to this, I got my Bachelor’s degree from Beihang University in 2023.

My research interest includes general video understanding, egocentric vision perception and multimodal large language models.

🔥 News

2025.09: Two papers EgoThinker and Egoexobench were accepted to NeurIPS 2025.
2025.06: Our Vinci was accepted to IMMUT 2025.
2025.06: Our CoQo was accepted to IJCV.
2024.12: 3 papers EgoHOD, EgoExo-Gen and CG-Bench were accepted to ICLR 2025.
2024.07: Our EgoVideo won 7 championships in EgoVis Challenge at CVPR 2024 Workshop.
2024.06: Internvideo2 was accepted to ECCV 2024.
2024.01: EgoexoLearn was accepted to CVPR 2024.

📝 Publications

NeurIPS 2025

EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT

Baoqi Pei, Yifei Huang, Jilan Xu, Yuping He, Guo Chen, et al.

[Paper] [Code] [Data]

A framework which equips MLLMs with strong egocentric reasoning via EgoRe-5M dataset, spatio-temporal chain-of-thought supervision and a two-stage training stage.

ICLR 2025

EgoHOD: Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

Baoqi Pei, Yifei Huang, Jilan Xu, Guo Chen, et al.

[Paper] [Code] [Data]

An egocentric video-language pretrained model that learns fine-grained egocentric video representations by modeling hand-object dynamics.

IMMUT 2025

Vinci: A real-time embodied smart assistant based on egocentric vision-language model

Yifei Huang*, Jilan Xu*, Baoqi Pei*, Lijin Yang, MingFang Zhang, Yuping He, Guo Chen, et al.

[Paper] [Code]

A real-time egocentric wearable assistant to assist users with daily tasks, including scene understanding, grounding, summarization, and future planning.

IJCV 2025

CoQo: Guiding Audio-Visual Question Answering with Collective Question Reasoning

Baoqi Pei, Yifei Huang, Guo Chen, Jilan Xu, et al.

[Paper]

A multimodal model to parse AVQA task with a Question Guided Transformer and Collective Question-Answering Training strategy.

ECCV 2024

Internvideo2: Scaling foundation models for multimodal video understanding

Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Jilan Xu, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang

[Paper] [Code]

A foundation model for video / text / audio understanding, achieving SOTA over several benchmarks.

Egoexobench: A benchmark for first-and third-person view video understanding in mllms, Yuping He, Yifei Huang, Guo Chen, Baoqi Pei, et al. NeurIPS 2025
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision, Yuping He, Yifei Huang, Guo Chen, Lidong Lu, Baoqi Pei, et al. Arxiv 2025
X-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos, Jilan Xu, Yifei Huang, Baoqi Pei, Junlin Hou, Qingqiu Li, Guo Chen, et al. ICLR 2025
Cg-bench: Clue-grounded question answering benchmark for long video understanding, Guo Chen, Yicheng Liu, Yifei Huang, Yuping He, Baoqi Pei, Jilan Xu, Yali Wang, Tong Lu, Limin Wang. ICLR 2025
Egovideo: Exploring egocentric foundation model and downstream adaptation, Baoqi Pei, Guo Chen, Jilan Xu, Yuping He, Yicheng Liu, et al. EgoVis Challenge
Video mamba suite: State space model as a versatile alternative for video understanding, Guo Chen*, Yifei Huang*, Jilan Xu*, Baoqi Pei*, Zhe Chen, Zhiqi Li, Jiahao Wang, Kunchang Li, Tong Lu, Limin Wang. Arxiv 2024
Egoexolearn: A dataset for bridging asynchronous ego-and exo-centric view of procedural activities in real world, Yifei Huang, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, et al. Arxiv 2024

📖 Educations

2023.09 - Present, Ph.D. in College of Computer Science and Technology, Zhejiang University.
2019.09 - 2023.06, B.Sc. in College of Computer Science, BeiHang University.

🎖 Honors and Awards

Winner of the 7 tracks in the 1st EgoVis Workshop @ CVPR 2024
Distinguished Paper Award in Egovis 2023/2024
Outstanding Graduate Student in Zhejiang University, 2025
Outstanding Student Scholarship in Beihang University, 2021