Recent News

🎉 One paper accepted at ECCV 2026!

Gripper-aware Vision Language Action Models

Jun 24, 2026

📄 Our recent work in VLA models for non-Markovian tasks is published on arXiv!

CodeGraphVLP: Code-as-Planner Meets Semantic-Graph State for Non-Markovian Vision-Language-Action Models

Apr 24, 2026

🎉 One paper accepted at CVPR 2026!

SemLT3D: Semantic-Guided Expert Distillation for Camera-only Long-Tailed 3D Object Detection

Feb 20, 2026

🎉 One paper accepted at ICRA 2026!

SlotVLA: Towards Modeling of Object-Relation Representations in Robotic Manipulation

Jan 31, 2026

📄 My first work in Vision-Language-Action models is now on arXiv!

Clutter-Robust Vision-Language-Action Models through Object-Centric and Geometry Grounding

Dec 27, 2025

🎉 One paper accepted at AAAI 2026!

Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective

Nov 3, 2025

My Advisor Awarded NSF CAREER Grant

Dr. Ngan Le has received a prestigious 2025 NSF Faculty Early Career Development (CAREER) Award

Jul 10, 2025

🎉 One paper accepted at ICCV 2025!

CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling

Jun 26, 2025

🎓 PhD Dissertation Defense Successfully Completed

The Doctor is in, officially Dr. Khoa Vo! 💼👨‍🎓

Nov 14, 2024

🎉 One paper accepted at NeurIPS 2024 !

HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model

Sep 25, 2024