I am currently a Ph.D. candidate in the Department of Computer Science at the City University of Hong Kong (CityU) under the supervision of Prof. Jianping Wang (汪建平). My academic journey began at the University of Electronic Science and Technology of China (UESTC), where I earned my B.S. and M.S. degrees in Software Engineering in 2018 and 2021, respectively. During my master’s studies, I was fortunate to work under the guidance of Prof. Fan Zhou (周帆), Prof. Ting Zhong (钟婷), and Prof. Ji Geng (耿技).

Between my master’s and doctoral studies, I gained valuable industry experience as a strategy algorithm engineer in the Department of Smart Office Platform (智能办公平台部) at Baidu (百度), one of China’s leading tech companies. This year-long stint provided me with practical insights into the application of AI in real-world scenarios.

My research interests span a wide range of cutting-edge areas in artificial intelligence, including: Machine Learning, Deep Learning, Autonomous Driving, Meta-Learning, Few-Shot Learning, Continual Learning, Graph Neural Network, Uncertainty, and Calibration. I have published 7 papers at the top international AI conferences and journals with total .

🔥 News

  • 2024.09:  🎉 A paper is accepted by ACM Computing Surveys (CSUR) 2024.
  • 2024.04:  🎉 A paper is accepted by International Joint Conferences on Artificial Intelligence (IJCAI) 2024.
  • 2023.12:  🎉 A paper is accepted by Association for the Advancement of Artificial Intelligence (AAAI) 2024.

📝 Publications

Autonomous Driving

IJCAI 2024
sym

SGDCL: Semantic-Guided Dynamic Correlation Learning for Explainable Autonomous Driving [Project]
Chengtai Cao, Xinhong Chen, Jianping Wang, Qun Song, Rui Tan, and Yung-Hui Li

  • This work introduces SGDCL, a novel approach for explainable autonomous driving. SGDCL addresses critical shortcomings of existing methods via a semantic-guided learning module and a dynamic correlation learning module to learn category-specific features and model their interplay. Furthermore, we propose a novel loss item that leverages fine-grained co-occurrence statistics to regularize model training. Our comprehensive evaluation of two benchmarks demonstrates its effectiveness, surpassing seven state-of-the-art baselines and a large vision-language model. SGDCL improves prediction performance by a large margin and offers interpretable attention scores, enhancing the explainability and transparency of autonomous driving systems.
AAAI 2024
sym

CCTR: Calibrating Trajectory Prediction for Uncertainty-Aware Motion Planning in Autonomous Driving
Chengtai Cao, Xinhong Chen, Jianping Wang, Qun Song, Rui Tan, and Yung-Hui Li

  • This paper presents a novel CCTR framework to address the challenge of proper uncertainty calibration in trajectory prediction models, improving their reliability. CCTR offers a solution by introducing a calibration-oriented regularizer to align predicted variances with ground truth divergence and generating tailor-made temperature scalers for each prediction based on context and historical information. Extensive experiments demonstrate the superiority of CCTR over various baselines in uncertainty estimation and downstream planning tasks, leading to better-calibrated predictions and more trustworthy planning. Moreover, the ablation studies show the effectiveness of each component, with in-depth empirical analysis verifying CCTR’s desirable properties. Future work can exploit more advanced post-processing modules to further improve calibration quality.

Continual Learning

AAAI 2021
sym

Overcoming Catastrophic Forgetting in Graph Neural Networks with Experience Replay
Fan Zhou and Chengtai Cao* (Corresponding Author)

  • In this paper, we propose a dedicated continual learning method for graph neural networks, which is to our best knowledge the first attempt along this line. Specifically, we design a topology-aware weight preserving module which explicitly captures the topological information of graphs and measures the importance of the network’s parameters based on the task-related loss function and the topological information. When learning a new task, changes to the important parameters will be penalized to remember old tasks. Moreover, the proposed approach can be readily extended to arbitrary GNNs. The extensive experiments on both node-level tasks and graph-level one demonstrates the effectiveness and applicability of the proposed continual learning method on the graph domain.

Meta-Learning

INFOCOM 2020
sym

Fast Network Alignment via Graph Meta-learning
Fan Zhou, Chengtai Cao, Goce Trajcevski, Kunpeng Zhang, Ting Zhong, and Ji Geng.

  • In this paper, we recast the network alignment (NA) problem as a one-shot classification problem and presented an effective and efficient meta-learning based model for addressing this task, providing a new perspective. The proposed Meta-NA is a flexible and general framework that can significantly improve the NA accuracy and reduce the computational overheads. As our future work, one immediate extension is to incorporate the auxiliary information for more accurate network alignment. In addition, leveraging Meta-NA we plan to tackle another interesting topic – the network alignment without structural information – e.g., linking the anchor nodes across locationbased social networks with only the footprints of users available.
CIKM 2019
sym

Meta-GNN: on Few-shot Node Classification in Graph Meta-Learning [Project]
Fan Zhou, Chengtai Cao, Kunpeng Zhang, Goce Trajcevski, Ting Zhong, and Ji Geng

  • We have presented a generic graph meta-learning framework for few-shot node classification that leverages meta-learning mechanism to learn better parameter initialization of GNNs. The proposed Meta-GNN model can adapt well to new learning tasks (even new classes) with few labeled samples and significantly improves the performance in the context of few-shot node classification under meta-learning paradigm. Encouraging results have been obtained on three widely used datasets. In our future work, we would like to extend our framework to address more challenging problems such as few-shot graph classification and zero-shot node classification.

Graph Neural Network

ICC 2024
sym

Trajectory-User Linking via Graph Neural Network
Fan Zhou, Shupei Chen, Jin Wu, Chengtai Cao, and Shengming Zhang

  • This paper presented a general graph construction method to generate a check-in graph from massive trajectory data while modeling geographical features associated with users’ checkins and temporal moving intentions. We also proposed an effective and efficient model GNNTUL to address the human mobility discrimination problem by utilizing graph neural networks to capture higher-order spatio-temporal information, as well as the implicit transition patterns between check-ins from the constructed graph. Extensive experiments have been conducted on real-world LBSN data, and the results prove that our method can successfully enhance TUL performance and improve mobility learning efficiency.

Survey

CSUR 2024
sym

A Survey of Mix-based Data Augmentation: Taxonomy, Methods, Applications, and Explainability [Project] Chengtai Cao, Fan Zhou, Yurou Dai, Jianping Wang, and Kunpeng Zhang

  • Data augmentation (DA) is indispensable in modern machine learning and deep neural networks. The basic idea of DA is to construct new training data to improve the model’s generalization by adding slightly disturbed versions of existing data or synthesizing new data. This survey comprehensively reviews a crucial subset of DA techniques, namely Mix-based Data Augmentation (MixDA), which generates novel samples by combining multiple examples. In contrast to traditional DA approaches that operate on single samples or entire datasets, MixDA stands out due to its effectiveness, simplicity, computational efficiency, theoretical foundation, and broad applicability. We begin by introducing a novel taxonomy that categorizes MixDA into Mixup-based, Cutmix-based, and mixture approaches based on a hierarchical perspective of the data mixing operation. Subsequently, we provide an in-depth review of various MixDA techniques, focusing on their underlying motivations. Owing to its versatility, MixDA has penetrated a wide range of applications, which we also thoroughly investigate in this survey. Moreover, we delve into the underlying mechanisms of MixDA’s effectiveness by examining its impact on model generalization and calibration while providing insights into the model’s behavior by analyzing the inherent properties of MixDA. Finally, we recapitulate the critical findings and fundamental challenges of current MixDA studies while outlining the potential directions for future works. Different from previous related surveys that focus on DA approaches in specific domains (e.g., computer vision and natural language processing) or only review a limited subset of MixDA studies, we are the first to provide a systematical survey of MixDA, covering its taxonomy, methodology, application, and explainability. Furthermore, we provide promising directions for researchers interested in this exciting area.

Thesis

Master Thesis
sym

Application of Graph Neural Network Base on Meta-learning (基于元学习的图神经网络应用研究)
Chengtai Cao

  • With the advancement of machine learning and deep learning, graph structure data learning has gained significant attention from researchers. Graph data is prevalent in applications like social networks, citation networks, and biomolecules, leading to various graph learning models such as graph convolutional neural networks. However, existing methods face challenges: poor generalization, difficulty learning from few samples, inability to learn to learn, and low model efficiency. To address these issues, this research proposes two methods: Meta-GNN and Meta-NA. Meta-GNN focuses on few-shot node classification, using meta-learning to achieve good model initialization. Meta-NA approaches network alignment as a few-shot inter-network node classification problem, mapping nodes from multiple graphs into a shared metric space. Experimental results on real-world benchmarks demonstrate that both proposed methods outperform existing baselines in their respective applications.

Others

ESWA 2021
sym

Learning Meta-Knowledge for Few-shot Image Emotion Recognition
Fan, Zhou, Chengtai Cao, Ting Zhong, and Ji Geng.

  • In this research, we propose a generic meta-learning framework for the few-shot image emotion classification, called Meta-IEC, which provides the capability of well adapting or generalizing to new classes that have not been encountered before, and transferring to a new dataset where labels are completely different and only very few labeled examples are available. To capture the uncertainty and ambiguity during meta-testing, we implement a hierarchical Bayesian graphical model to understand latent relationships among various parameters between meta-training and meta-testing. Extensive experiments conducted on three publicly available datasets show that our proposed model outperforms several state-of-the-art baselines ranging from feature-engineering oriented to deep learning based. In addition, the study also demonstrates the impact of various factors on the model performance, such as the Meta-IEC framework, the network architecture, the hyperparameters, and the choice of labels in meta-training and meta-testing.

Patents

  • China Invention Patent, No. CN113095440B: Training Data Generation Method and Causal Effect Heterogeneous Response Difference Estimation Method Based on Meta-Learner (基于元学习者的训练数据生成方法及因果效应异质反应差异估计方法). Fan Zhou, Chengtai Cao, Ting Zhong, and Xovee Xu
  • China Invention Patent, No. CN112613556B: Few-Shot Image Sentiment Classification Method Based on Meta-Learning (基于元学习的少样本图像情感分类方法). Fan Zhou, Chengtai Cao, Ting Zhong, and Tianliang Wang

🎖 Honors and Awards

  • 2021 Excellent Master Thesis, University of Electronic Science and Technology of China (UESTC)
  • 2020 National Scholarship, Ministry of Education of China
  • 2020 First Prize Scholarship for Master Students, University of Electronic Science and Technology of China (UESTC)
  • 2020 Excellent Graduate Student (Master Program), University of Electronic Science and Technology of China (UESTC)
  • 2019 National Scholarship, Ministry of Education of China
  • 2019 First Prize Scholarship for Master Students, University of Electronic Science and Technology of China (UESTC)
  • 2019 Excellent Graduate Student (Master Program), University of Electronic Science and Technology of China (UESTC)
  • 2014 Merit Student, Education Department of Sichuan

📖 Educations

  • 2022 - Present Ph.D. Candidate, City University of Hong Kong (CityU)
  • 2018 - 2021 M.S., University of Electronic Science and Technology of China (UESTC)
  • 2014 - 2018 B.S., University of Electronic Science and Technology of China (UESTC)

💻 Works

  • 2021 - 2022 Strategy Algorithm Engineer, Baidu International Technology Co., Ltd. (Shenzhen)
  • 2017 - 2018 Front-End Developer and Quality Assurance, SAP China Co., Ltd. (Chengdu)

🧑‍🏫 Teaching

  • 2024/25 Semester A Teaching Assistant, CS5222 Computer Networks & Internets (CityU)
  • 2023/24 Semester B Teaching Assistant, CS5491 Artificial Intelligence (CityU)
  • 2023/24 Semester A Teaching Assistant, CS5489 Machine Learning: Algorithms & Applications (CityU)
  • 2022/23 Semester B Teaching Assistant, CS5296 Cloud Computing: Theo & Prac (CityU)
  • 2022/23 Semester A Teaching Assistant, CS3201 Computer Networks (CityU)

🎓 Activities

Reviewer

  • IEEE IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
  • IEEE Transactions on Knowledge and Data Engineering (IEEE TKDE).
  • ACM SIGKDD conference on Knowledge Discovery and Data Mining (ACM SIGKDD).
  • IEEE Transactions on Cybernetics (IEEE Trans. Cyber)
  • International World Wide Web Conference (WWW)
  • Proceedings of the AAAI Conference on Artificial Intelligence (AAAI)
  • Transactions on Machine Learning Research (TMLR)
  • IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
  • Artificial Intelligence and Autonomous Systems (AIAS)

Membership

  • IEEE: Student Member.
  • ACM: Member.

Presentation

  • 2019.11 ACM International Conference on Information and Knowledge Management (CIKM). Beijing, China.
  • 2020.07 IEEE International Conference on Computer Communications (INFOCOM). Virtual.
  • 2021.02 Proceedings of the AAAI Conference on Artificial Intelligence (AAAI). Virtual.
  • 2024.02 Proceedings of the AAAI Conference on Artificial Intelligence (AAAI). Vancouver, Canada. Video
  • 2024.08 International Joint Conference on Artificial Intelligence (IJCAI). Jeju, South Korea.