Publications (Google Scholar)
2023
ICCV'23 |
LASO: Language-guided Affordance Segmentation on 3D Object Yicong Li, Na Zhao, Junbin Xiao, Chun Feng, Xiang Wang, Tat-seng Chua |
ICCV'23 |
Discovering Spatio-Temporal Rationales for Video Question Answering Yicong Li, Junbin Xiao, Chun Feng, Xiang Wang, Tat-seng Chua |
TPAMI'23 |
Transformer-Empowered Invariant Grounding for Video Question Answering Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-seng Chua |
ACM MM'23 |
Redundancy-aware Transformer for Video Question Answering Yicong Li, Xun Yang, An Zhang, Chun Feng, Xiang Wang, Tat-seng Chua |
2022
CVPR'22 |
Invariant grounding for video question answering [Oral Presentation & Best Paper Finalist] Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-seng Chua |
ACM MM'22 |
Equivariant and invariant grounding for video question answering Yicong Li, Xiang Wang, Junbin Xiao, Tat-seng Chua |
2021
ACM MM'21 |
Interventional video relation detection Yicong Li, Xun Yang, Xindi Shang, Tat-seng Chua |
ACM MM'21 |
Video visual relation detection via iterative inference Xindi Shang, Yicong Li, Jinbin Xiao, Wei Ji, Tat-seng Chua |