๐Ÿ“„ My first work in Vision-Language-Action models is now on arXiv!

Dec 27, 2025ยท
Khoa Vo
Khoa Vo
ยท 1 min read

“Clutter-Robust Vision-Language-Action Models through Object-Centric and Geometry Grounding” is now available on arXiv. This paper is currently under review and under submission at IEEE Transactions on Robotics (T-RO).

Authors: Khoa Vo, Taisei Hanyu, Yuki Ikebe, Trong Thang Pham, Nhat Chung, Minh Nhat Vu, Duy Nguyen Ho Minh, Anh Nguyen, Anthony Gunderman, Chase Rainwater, Ngan Le


Links: