My first work in Vision-Language-Action models is now on arXiv!
Dec 27, 2025 · 1 min read
Khoa Vo
“Clutter-Resistant Vision-Language-Action Models through Object-Centric and Geometry Grounding”, my first work in Vision-Language-Action models, is now available on arXiv. The paper has been submitted to IEEE Transactions on Robotics (T-RO).
Authors: Khoa Vo, Taisei Hanyu, Yuki Ikebe, Trong Thang Pham, Nhat Chung, Minh Nhat Vu, Duy Nguyen Ho Minh, Anh Nguyen, Anthony Gunderman, Chase Rainwater, Ngan Le
Links: