About Me


I am a graduated Ph.D. from the College of Engineering, Computing and Cybernetics (CECC) at the Australian National University (ANU). I was also a previous research student at the Australian Centre for Robotic Vision (ACRV@ANU). I was advised by Prof. Stephen Gould (ANU), Prof. Qi Wu (UoA) and Prof. Lexing Xie (ANU).

Currently, I am a free researcher looking for a full-time research scientist job (Embodied AI, Vision-Language, Generative models in 3D/Videos) ๐Ÿ˜‰. If you have any opportunities for me, please let me know!

I have a broad research interest in computer vision, natural language processing, and robotics. My latest works (also my current research focus) include developing generalizable large models for 3D reconstruction, large-scale training for language-guided navigation agents, and using LLMs in embodied AI tasks.


News

2023.07.14 โ€ƒ Our papers Scaling Data Generation in Vision-and-Language Navigation and Learning Navigational Visual Representations with Semantic Map Supervision have been accepted to ICCV 2023! The projects were completed/initialized during my first internship at Adobe! It was my great pleasure to work on them with my friends around the world (@ZunWang, @JialuLi, @HaoTan)! ๐Ÿ˜€๐Ÿ˜Šโค๏ธ Thank heaps OpenGVLab@Shanghai AI Laboratory for the great support! โญ๐Ÿ™Œ

2023.02.19 โ€ƒ Join Adobe Research again (intern)! Working on Text-to-3D Generation and Single-Image-to-3D Reconstruction, totally unfamiliar topics to me! ๐Ÿ˜Š๐Ÿ”ฅ๐Ÿ”ฅ

2022.12.29 โ€ƒ Paper HOP+: History-Enhanced and Order-Aware Pre-Training for Vision-and-Language Navigation by Yanyuan Qiao, Yuankai Qi, Zheng Yu, Peng Wang, Qi Wu and myself has been accepted by TPAMI! Congrats Yanyuan!!! ๐Ÿ˜€๐Ÿ˜€๐Ÿ˜€

2022.06.19

  • Attending CVPR2022 in person!!! Finally meeting so many great researchers! I have learned so much!!! โค๏ธโค๏ธโค๏ธ
  • Congrats to Zun Wang, Dong An and Team JoyBoy for winning the 1st Place in the Room-Across-Room (RxR) Habitat Challenge 2022!!! ๐Ÿ˜†โšกโšก

2022.06.15 โ€ƒ Visiting Professor Eric Xin Wang and the ERIC Lab at the University of California Santa Cruz! It was amazing to learn from so many young researchers! ๐Ÿ˜„

2022.05.10 โ€ƒ Invited talk by the NLP Lab at the Fudan University, really enjoyed chatting with everyone! ๐Ÿ˜„

2022.03.28 โ€ƒ My VLN project has been selected to be a part of the NVIDIA Academic Hardware Grant Program! ๐Ÿ˜† Thank you so much NVIDIA for the A100 GPU grant!!! ๐Ÿ˜ญ๐Ÿ˜ญ๐Ÿ˜ญ

2022.03.14 โ€ƒ I have started a research internship at the Creative Intelligence Lab in Adobe Research in San Jose, California, US!!! ๐Ÿ˜†๐Ÿ˜†๐Ÿ˜†

2022.03.02

  • Our paper Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation has been accepted to CVPR 2022! ๐Ÿ˜Š I am so happy to share lots of thoughts about VLN in this paper! See you guys in New Orleans! โค๏ธ
  • Paper HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation by Yanyuan Qiao, Yuankai Qi, Peng Wang, Qi Wu and myself has been accepted to CVPR 2022! Congrats Yanyuan on the first paper in her PhD! ๐Ÿ˜€

2021.08.17 โ€ƒ Paper The Road To Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation by Yuankai Qi, Zizheng Pan, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu and myself has been accepted to ICCV 2021! ๐Ÿ˜€

2021.04.10 โ€ƒ Paper Learning Structure-Aware Semantic Segmentation with Image-Level Supervision by Jiawei Liu, Dr. Jing Zhang, Prof. Nick Barnes and myself, has been accepted to IJCNN 2021! Congrats Jiawei on his first paper in computer vision! ๐Ÿ˜€

2021.03.16 โ€ƒ Our Thinking-VLN repo is online! Come to enjoy our immature ideas and share your thoughts! Just for FUN thinking!

2021.03.06 โ€ƒ Our paper A Recurrent Vision-and-Language BERT for Navigation has been accepted to CVPR 2021 as an Oral paper with 3 strong accepts! ๐Ÿ˜†๐Ÿ˜†๐Ÿ˜†

2020.10.05 โ€ƒ I gave a guest lecture in the Deep Learning Course at ANU (ENGN8536) about Vision and Language Research! My first lecture at Uni! Nervous and Fun! ๐Ÿ˜€

2020.09.26 โ€ƒ Our paper Language and Visual Entity Relationship Graph for Agent Navigation has been accepted to NeurIPS 2020! ๐Ÿ˜€

2020.09.15 โ€ƒ Our paper Sub-Instruction Aware Vision-and-Language Navigation has been accepted to EMNLP 2020! My first paper! ๐Ÿ˜Š


Research

Scaling Data Generation in Vision-and-Language Navigation
Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao
International Conference on Computer Vision (ICCV), 2023

Learning Navigational Visual Representations with Semantic Map Supervision
Yicong Hong, Yang Zhou, Ruiyi Zhang, Franck Dernoncourt, Trung Bui, Stephen Gould, Hao Tan
International Conference on Computer Vision (ICCV), 2023

Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Yicong Hong, Zun Wang, Qi Wu, Stephen Gould
Conference on Computer Vision and Pattern Recognition (CVPR), 2022

A Recurrent Vision-and-Language BERT for Navigation
Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen Gould
Conference on Computer Vision and Pattern Recognition (CVPR), 2021

Language and Visual Entity Relationship Graph for Agent Navigation
Yicong Hong, Cristian Rodriguez-Opazo, Yuankai Qi, Qi Wu, Stephen Gould
Conference on Neural Information Processing Systems (NeurIPS), 2020

Sub-Instruction Aware Vision-and-Language Navigation
Yicong Hong, Cristian Rodriguez-Opazo, Qi Wu, Stephen Gould
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
Dong An, Zun Wang, Yangguang Li, Yi Wang, Yicong Hong, Yan Huang, Liang Wang, Jing Shao
Room-Across-Room (RxR) Habitat Challenge (CVPR Embodied AI Workshop), 2022

HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation
Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu
Conference on Computer Vision and Pattern Recognition (CVPR), 2022

The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu
International Conference on Computer Vision Systems (ICCV), 2021