About Me


I am a PhD student in the College of Engineering, Computing and Cybernetics (CECC) at the Australian National University (ANU). I am also a research student at the Australian Centre for Robotic Vision (ACRV) and the Vision-Ask-Answer-Act Lab (V3A).

I am under the supervision of Prof. Stephen Gould (ANU), Prof. Qi Wu (UoA) and Prof. Lexing Xie (ANU).

Currently, I am doing a research internship at the Creative Intelligence Lab, Adobe Research.

Prior to that, in Novโ€™2018, I received my bachelor degree of engineering in mechatronic systems with a first-class honours in the College of Engineering and Computer Science at ANU. In 2018, I was also a part-time research student at the Data61, CSIRO, working on human pose and shape visualization.

I have a broad research interests in computer vision, natural language processing and robotics. Currently, my main research focus is on Text/Image to 3D Shape Generation, and Embodied Vision-and-Language.

My latest works include developing generalizable large 3D models for reconstruction, large-scale training for langauge-guided navigation agents, and using LLMs in embodied AI tasks.


News

2023.02.19 โ€ƒ Join Adobe Research again! Working on Text-to-3D Generation, a totally unfamiliar topic to me! ๐Ÿ˜Š๐Ÿ”ฅ๐Ÿ”ฅ

2022.12.29 โ€ƒ Paper HOP+: History-Enhanced and Order-Aware Pre-Training for Vision-and-Language Navigation by Yanyuan Qiao, Yuankai Qi, Zheng Yu, Peng Wang, Qi Wu and myself has been accepted by TPAMI! Congrats Yanyuan!!! ๐Ÿ˜€๐Ÿ˜€๐Ÿ˜€

2022.06.19

  • Attending CVPR2022 in person!!! Finally meeting so many great researchers! I have learned so much!!! โค๏ธโค๏ธโค๏ธ
  • Congrats to Zun Wang, Dong An and Team JoyBoy for winning the 1st Place in the Room-Across-Room (RxR) Habitat Challenge 2022!!! ๐Ÿ˜†โšกโšก

2022.06.15 โ€ƒ Visiting Professor Eric Xin Wang and the ERIC Lab at the University of California Santa Cruz! It was amazing to learn from so many young researchers! ๐Ÿ˜„

2022.05.10 โ€ƒ Invited talk by the NLP Lab at the Fudan University, really enjoyed chatting with everyone! ๐Ÿ˜„

2022.03.28 โ€ƒ My VLN project has been selected to be a part of the NVIDIA Academic Hardware Grant Program! ๐Ÿ˜† Thank you so much NVIDIA for the A100 GPU grant!!! ๐Ÿ˜ญ๐Ÿ˜ญ๐Ÿ˜ญ

2022.03.14 โ€ƒ I have started a research internship at the Creative Intelligence Lab in Adobe Research in San Jose, California, US!!! ๐Ÿ˜†๐Ÿ˜†๐Ÿ˜†

2022.03.02

  • Our paper Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation has been accepted to CVPR 2022! ๐Ÿ˜Š I am so happy to share lots of thoughts about VLN in this paper! See you guys in New Orleans! โค๏ธ
  • Paper HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation by Yanyuan Qiao, Yuankai Qi, Peng Wang, Qi Wu and myself has been accepted to CVPR 2022! Congrats Yanyuan on the first paper in her PhD! ๐Ÿ˜€

2021.08.17 โ€ƒ Paper The Road To Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation by Yuankai Qi, Zizheng Pan, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu and myself has been accepted to ICCV 2021! ๐Ÿ˜€

2021.04.10 โ€ƒ Paper Learning Structure-Aware Semantic Segmentation with Image-Level Supervision by Jiawei Liu, Dr. Jing Zhang, Prof. Nick Barnes and myself, has been accepted to IJCNN 2021! Congrats Jiawei on his first paper in computer vision! ๐Ÿ˜€

2021.03.16 โ€ƒ Our Thinking-VLN repo is online! Come to enjoy our immature ideas and share your thoughts! Just for FUN thinking!

2021.03.06 โ€ƒ Our paper A Recurrent Vision-and-Language BERT for Navigation has been accepted to CVPR 2021 as an Oral paper with 3 strong accepts! ๐Ÿ˜†๐Ÿ˜†๐Ÿ˜†

2020.10.05 โ€ƒ I gave a guest lecture in the Deep Learning Course at ANU (ENGN8536) about Vision and Language Research! My first lecture at Uni! Nervous and Fun! ๐Ÿ˜€

2020.09.26 โ€ƒ Our paper Language and Visual Entity Relationship Graph for Agent Navigation has been accepted to NeurIPS 2020! ๐Ÿ˜€

2020.09.15 โ€ƒ Our paper Sub-Instruction Aware Vision-and-Language Navigation has been accepted to EMNLP 2020! My first paper! ๐Ÿ˜Š


Research

Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Yicong Hong, Zun Wang, Qi Wu, Stephen Gould
Conference on Computer Vision and Pattern Recognition (CVPR), 2022

A Recurrent Vision-and-Language BERT for Navigation
Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen Gould
Conference on Computer Vision and Pattern Recognition (CVPR), 2021

Language and Visual Entity Relationship Graph for Agent Navigation
Yicong Hong, Cristian Rodriguez-Opazo, Yuankai Qi, Qi Wu, Stephen Gould
Conference on Neural Information Processing Systems (NeurIPS), 2020

Sub-Instruction Aware Vision-and-Language Navigation
Yicong Hong, Cristian Rodriguez-Opazo, Qi Wu, Stephen Gould
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
Dong An, Zun Wang, Yangguang Li, Yi Wang, Yicong Hong, Yan Huang, Liang Wang, Jing Shao
Room-Across-Room (RxR) Habitat Challenge (CVPR Embodied AI Workshop), 2022

HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation
Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu
Conference on Computer Vision and Pattern Recognition (CVPR), 2022

The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu
International Conference on Computer Vision Systems (ICCV), 2021