I devote myself to fine-grained video understanding and its interaction with natural languages, with a special focus on VideoQA and Video Grounding. The techniques emphasize video structural representation learning, large language models, robustness and interpretability. I am looking forward to working with Research Interns/Assistants/Visiting Students. (If you are interested in my research, please contact to junbin at comp dot nus dot edu dot sg. Students with CV/NLP/MultiModal background are preferred.)
News
Invited to be reviewer in CVPR'24 and ICLR'24.
Two papers are accepted to T-PAMI'23 and ACM MM'23 respectively.
Two papers are accepted to T-PAMI'23 and ICCV'23 respectively.
Invited to serve as PC Member in AAAI'24.
Invited to be reviewer in NeurIPS'23 dataset and benchmark track.
Invited to be reviewer in ACM MM'23.
Successfully defensed my Ph.D.
Thesis: Visual Relation Driven Video Question Answering. Supervisor: Tat-Seng Chua. Committee: Prof. Mohan Kankanhalli, Prof. Roger Zimmermann. Chair: Prof. Terence Sim
Invited to be reviewer in ICCV'23.
Invited to be reviewer in CVPR'23.
Receive the Dean's Graduate Research Excellence Award.
Invited to serve as PC member in AAAI'23.
One VideoQA paper wins the Best Paper FinalList in CVPR'22.
Featured Publications
Others
Reviewer for Conference: ICCV'23, CVPR'23, AAAI'23, ICASSP'23, ECCV'22, ACM MM'19&20&23
Reviewer for Journal: TMM, TNNLS, ToMM, NeurComputing, JVCIR.