Xiaoshuai Zhang is a Research Scientist at sudoAI, where he is a founding member and research lead of the foundation model group. His current focus is on scaling up embodied AI data to support faster and more efficient VLA and world model training. He received his Ph.D. in CS from UC San Diego in 2024 (w/ Prof. Hao Su) and a B.S. in CS from Peking University in 2019. He has worked or interned at Apple, Adobe, Google, and Hillbot, and was a visiting scholar at Stanford (w/ Prof. Leonidas Guibas). Xiaoshuai's research spans embodied AI models, sim2real, 3D vision, foundation models, and graphics.
For publications, visit Google Scholar. Feel free to connect if you are interested in collaborating.
Area Chair: CVPR '25, '26
Reviewer: CVPR, ICCV, ECCV, ICLR, ICML, ICRA, IROS, ACM MM, NeurIPS, SIGGRAPH Asia, TPAMI, IJCV, TVCG, RA-L, T-RO
Xiaoshuai enjoys photography in his free time, especially film photography. He usually shoots with a Hasselblad 500C/M, Leica M6, Fujifilm GA645, and Sony A7IV.
He's also an avid outdoor enthusiast who enjoys camping, hiking, and backpacking. He has summited Mt. Whitney and plans to tackle Mt. Shasta and Mt. Rainier next. He also likes off-roading with his Rivian "Dash".
What can you infer from these images? Can you guess the camera, lens, film stock, location, and time? I have a special gift for educated guesses.
More Pieces of Me
Nice to meet you! Feel free to connect.