HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model

Oct 10, 2024 · Khoa Vo, Thinh Phan, Kashu Yamazaki, Minh Tran, Ngan Le

Publication

NeurIPS (2024)

Last updated on Oct 10, 2024

Video Language Model · NeurIPS (2024)

© 2026 Me. This work is licensed under CC BY NC ND 4.0
