VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Oct 16, 2022 · Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le
Publication: ICIP (2022)
Last updated on Oct 16, 2022