Khoa Vo

Dept. of CSCE, University of Arkansas, Fayetteville, AR, USA.


I am a PhD student at the University of Arkansas in Fayetteville, advised by Asst. Prof. Ngan Le. I work on problems at the intersection of Computer Vision and Deep Learning, and I am also very interested in bridging vision and language understanding.

Before I joined UArk, I was an MSc student in the Department of Computer Science, Ho Chi Minh University of Science, Vietnam, advised by Assoc. Prof. Minh-Triet Tran. During this program, I was fortunate to have a 6-month internship under the supervision of Prof. Akihiro Sugimoto. My Master’s thesis addresses the temporal action proposal problem in videos with a newly proposed method that learns to adaptively attend to the main actors.

Earlier, I received my Bachelor's degree in the Honors Program of Ho Chi Minh University of Science under the supervision of Prof. Minh-Triet Tran. In my final thesis, I studied the problem of image captioning and proposed a novel approach that improves caption quality over the previous state of the art. The thesis led to two publications, in an international conference and an international journal, and won the highest prize in the Nation-wide Awards for Student Scientific Research, Eureka.

News

Nov 28, 2023 One co-authored paper has been accepted to IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024.
Sep 15, 2023 I gave a talk at the Southeast Symposium on Contemporary Engineering Topics, Arkansas Engineering Forum, held in Little Rock, Arkansas.
Jul 19, 2023 One co-authored paper has been accepted to International Conference on Image Processing (ICIP) 2023.
Apr 7, 2023 One first-authored paper has been accepted to The 5th Precognition Workshop at CVPR 2023.
Nov 19, 2022 One co-authored paper has been accepted to The 37th AAAI Conference on Artificial Intelligence.
Oct 2, 2022 One first-authored manuscript has been accepted to International Journal of Computer Vision.
Oct 2, 2022 One co-authored paper has been accepted to British Machine Vision Conference 2022.
Aug 17, 2022 I was honored to receive the Graduate Fellowship for Spring 2023 from the Rodger S. Kline Endowed Chair, CSCE Department, University of Arkansas.
Aug 17, 2022 I was honored to receive the W.R. Thomas Endowed Graduate Fellowship for Fall 2022.
Jun 20, 2022 Our paper has been accepted to International Conference on Image Processing (ICIP) 2022.

Selected Publications

  1. AAAI
    VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
    Kashu Yamazaki, Khoa Vo, Sang Truong, Bhiksha Raj, and Ngan Le
    In AAAI Conference on Artificial Intelligence 2023
  2. IJCV
    AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
    Khoa Vo, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, and Ngan Le
    International Journal of Computer Vision Jan 2023
  3. BMVC
    Oral Session
    AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation
    Khoa Vo, Hyekang Joo, Kashu Yamazaki, Sang Truong, Kris Kitani, Minh-Triet Tran, and Ngan Le
    In Proceedings of the British Machine Vision Conference (BMVC) Jan 2021
  4. App. Sci.
    A Smart System for Text-Lifelog Generation from Wearable Cameras in Smart Environment Using Concept-Augmented Image Captioning with Modified Beam Search Strategy
    Khoa Vo, Quoc-An Luong, Duy-Tam Nguyen, Mai-Khiem Tran, and Minh-Triet Tran
    Applied Sciences Jan 2019
  5. IEEE Access
    ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation
    Khoa Vo, Kashu Yamazaki, Sang Truong, Minh-Triet Tran, Akihiro Sugimoto, and Ngan Le
    IEEE Access Jan 2021