Publications (Google Scholar)
2023
| ICCV'23 | LASO: Language-guided Affordance Segmentation on 3D Object Yicong Li, Na Zhao, Junbin Xiao, Chun Feng, Xiang Wang, Tat-seng Chua | 
| ICCV'23 | Discovering Spatio-Temporal Rationales for Video Question Answering Yicong Li, Junbin Xiao, Chun Feng, Xiang Wang, Tat-seng Chua | 
| TPAMI'23 | Transformer-Empowered Invariant Grounding for Video Question Answering Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-seng Chua | 
| ACM MM'23 | Redundancy-aware Transformer for Video Question Answering Yicong Li, Xun Yang, An Zhang, Chun Feng, Xiang Wang, Tat-seng Chua | 
2022
| CVPR'22 | Invariant grounding for video question answering [Oral Presentation & Best Paper Finalist] Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-seng Chua | 
| ACM MM'22 | Equivariant and invariant grounding for video question answering Yicong Li, Xiang Wang, Junbin Xiao, Tat-seng Chua | 
2021
| ACM MM'21 | Interventional video relation detection Yicong Li, Xun Yang, Xindi Shang, Tat-seng Chua | 
| ACM MM'21 | Video visual relation detection via iterative inference Xindi Shang, Yicong Li, Junbin Xiao, Wei Ji, Tat-seng Chua |