๐Ÿ“„ My first work in Vision-Language-Action models is now on arXiv!

Dec 27, 2025ยท
Khoa Vo
Khoa Vo
ยท 1 min read

“Clutter-Resistant Vision-Language-Action Models through Object-Centric and Geometry Grounding”, my first work in Vision-Language-Action models, is now available on arXiv. This paper is currently under submission at IEEE Transactions on Robotics (T-RO).

Authors: Khoa Vo, Taisei Hanyu, Yuki Ikebe, Trong Thang Pham, Nhat Chung, Minh Nhat Vu, Duy Nguyen Ho Minh, Anh Nguyen, Anthony Gunderman, Chase Rainwater, Ngan Le


Links: