๐ My first work in Vision-Language-Action models is now on arXiv!
Dec 27, 2025ยท
ยท
1 min read
Khoa Vo
“Clutter-Robust Vision-Language-Action Models through Object-Centric and Geometry Grounding” is now available on arXiv. This paper is currently under review and under submission at IEEE Transactions on Robotics (T-RO).
Authors: Khoa Vo, Taisei Hanyu, Yuki Ikebe, Trong Thang Pham, Nhat Chung, Minh Nhat Vu, Duy Nguyen Ho Minh, Anh Nguyen, Anthony Gunderman, Chase Rainwater, Ngan Le
Links: