HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model

Oct 10, 2024
Khoa Vo, Thinh Phan, Kashu Yamazaki, Minh Tran, Ngan Le
Publication
NeurIPS (2024)