Khoa Vo

Dept. of CSCE, University of Arkansas, Fayetteville, AR, USA.

I am a PhD student at the University of Arkansas in Fayetteville, advised by Asst. Prof. Ngan Le. I work on problems at the intersection of Computer Vision and Deep Learning, and I am also very interested in bridging vision and language understanding.

Before I joined UArk, I was an MSc student in the Department of Computer Science, Ho Chi Minh University of Science, Vietnam, advised by Assoc. Prof. Minh-Triet Tran. During this program, I was fortunate to complete a 6-month internship under the supervision of Prof. Akihiro Sugimoto. My Master’s thesis addresses the temporal action proposal problem in videos with a newly proposed method that learns to adaptively attend to the main actors.

Earlier, I received my Bachelor’s degree in the Honors Program of Ho Chi Minh University of Science under the supervision of Prof. Minh-Triet Tran. In my final thesis, I studied the problem of image captioning and proposed a novel approach that improves caption quality over the previous state of the art. The thesis led to two publications, one at an international conference and one in an international journal, and received the highest prize in the Nation-wide Awards for Student Scientific Research, Eureka.

News

Sep 25, 2024 One first-authored paper has been accepted to NeurIPS 2024.
Sep 20, 2024 One co-authored paper has been accepted to ACCV 2024.
Jan 15, 2024 One co-authored paper has been accepted to ICRA 2024.
Nov 28, 2023 One co-authored paper has been accepted to IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024.
Sep 15, 2023 I gave a talk at the Southeast Symposium on Contemporary Engineering Topics, Arkansas Engineering Forum, held in Little Rock, Arkansas.
Jul 19, 2023 One co-authored paper has been accepted to International Conference on Image Processing (ICIP) 2023.
Apr 7, 2023 One first-authored paper has been accepted to The 5th Precognition Workshop at CVPR 2023.
Nov 19, 2022 One co-authored paper has been accepted to The 37th AAAI Conference on Artificial Intelligence.
Oct 2, 2022 One first-authored manuscript has been accepted to International Journal of Computer Vision.
Oct 2, 2022 One co-authored paper has been accepted to British Machine Vision Conference 2022.

Selected Publications

  1. AAAI
    VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
    Kashu Yamazaki, Khoa Vo, Sang Truong, Bhiksha Raj, and Ngan Le
    In AAAI Conference on Artificial Intelligence 2023
  2. IJCV
    AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
    Khoa Vo, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, and Ngan Le
    International Journal of Computer Vision Jan 2023
  3. BMVC
    Oral Session
    AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation
    Khoa Vo, Hyekang Joo, Kashu Yamazaki, Sang Truong, Kris Kitani, Minh-Triet Tran, and Ngan Le
    In Proceedings of the British Machine Vision Conference (BMVC) Jan 2021
  4. App. Sci.
    A Smart System for Text-Lifelog Generation from Wearable Cameras in Smart Environment Using Concept-Augmented Image Captioning with Modified Beam Search Strategy
    Khoa Vo, Quoc-An Luong, Duy-Tam Nguyen, Mai-Khiem Tran, and Minh-Triet Tran
    Applied Sciences Jan 2019
  5. IEEE Access
    ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation
    Khoa Vo, Kashu Yamazaki, Sang Truong, Minh-Triet Tran, Akihiro Sugimoto, and Ngan Le
    IEEE Access Jan 2021