Selected Publications

For a complete list of publication, please view my Google Scholar.

Khoa Vo , Taisei Hanyu , Yuki Ikebe , Trong Thang Pham , Nhat Chung , Minh Nhat Vu , Duy Nguyen Ho Minh , Anh Nguyen , Anthony Gunderman , Chase Rainwater , Ngan Le

arXiv preprint (2025), under submission at IEEE Transactions on Robotics (T-RO)

Clutter-Robust Vision-Language-Action Models through Object-Centric and Geometry Grounding

Kashu Yamazaki* , Khoa Vo* , Sang Truong , Bhiksha Raj , Ngan Le

AAAI (2023) Oral Session

VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

Khoa Vo , Sang Truong , Kashu Yamazaki , Bhiksha Raj , Minh-Triet Tran , Ngan Le

International Journal Computer Vision (2023)

AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation

Kashu Yamazaki , Sang Truong , Khoa Vo , Michael Kidd , Chase Rainwater , Khoa Luu , Ngan Le

ICIP (2022)

VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

Minh Tran , Khoa Vo , Kashu Yamazaki , Arthur Fernandes , Michael Kidd , Ngan Le

BMVC (2022)

AISFormer: Amodal Instance Segmentation with Transformer

Khoa Vo , Kashu Yamazaki , Hieu Hoang , Minh-Triet Tran , Ngan Le

Meta Learning With Medical Imaging and Health Informatics Applications

Book Chapter: Neural Architecture Search for Medical Image Applications

Kashu Yamazaki , Khoa Vo , Darshan Bulsara , Ngan Le

Brain Sciences (2022) Best Review Paper Award

Spiking Neural Networks and Their Applications: A Review

Khoa Vo , Hyekang Joo , Kashu Yamazaki , Sang Truong , Kris Kitani , Minh-Triet Tran , Ngan Le

BMVC (2021) Oral Session

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation