Publications (Google Scholar)


2023

ICCV'23          

LASO: Language-guided Affordance Segmentation on 3D Object
Yicong Li, Na Zhao, Junbin Xiao, Chun Feng, Xiang Wang, Tat-seng Chua

ICCV'23          

Discovering Spatio-Temporal Rationales for Video Question Answering
Yicong Li, Junbin Xiao, Chun Feng, Xiang Wang, Tat-seng Chua

TPAMI'23          

Transformer-Empowered Invariant Grounding for Video Question Answering
Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-seng Chua

ACM MM'23          

Redundancy-aware Transformer for Video Question Answering
Yicong Li, Xun Yang, An Zhang, Chun Feng, Xiang Wang, Tat-seng Chua


2022

CVPR'22          

Invariant Grounding for Video Question Answering [Oral Presentation & Best Paper Finalist]
Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-seng Chua

ACM MM'22          

Equivariant and Invariant Grounding for Video Question Answering
Yicong Li, Xiang Wang, Junbin Xiao, Tat-seng Chua


2021

ACM MM'21          

Interventional Video Relation Detection
Yicong Li, Xun Yang, Xindi Shang, Tat-seng Chua

ACM MM'21          

Video Visual Relation Detection via Iterative Inference
Xindi Shang, Yicong Li, Junbin Xiao, Wei Ji, Tat-seng Chua