EGU23-16766, updated on 26 Feb 2023
https://doi.org/10.5194/egusphere-egu23-16766
EGU General Assembly 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Supervised versus semi-supervised urban functional area prediction: uncertainty, robustness and sensitivity

Rui Deng1,2, Yanning Guan1, Danlu Cai1, Tao Yang3, Klaus Fraedrich4, Chunyan Zhang1, Jiakui Tang5,6, Zhouwei Liao7, Zhishou Wei1,2, and Shan Guo1
Rui Deng et al.
  • 1Aerospace Information Research Institute, Chinese Academy of Sciences, China
  • 2University of Chinese Academy of Sciences, Beijing, China
  • 3The School of Architecture, Tsinghua University, Beijing, China
  • 4Max Planck Institute for Meteorology, Hamburg, Germany
  • 5College of Resources and Environment, University of Chinese Academy of Sciences, Beijing, China
  • 6Yanshan earth key zone and surface flux observation and research station, University of Chinese Academy of Sciences, Beijing, China
  • 7Yangtze River Basin Operation Management Center, China Three Gorges, Co., Ltd., Yichang, China

To characterize a community-scale urban functional area using geo-tagged data and available land-use information, several supervised and semi-supervised classification models are presented and evaluated in Hong Kong for comparing their uncertainty, robustness and sensitivity. The following results are noted: (i) As the training set size grows, models’ accuracies are improved, particularly for multi-layer perceptron (MLP) or random forest (RF). The graph convolutional network (GCN) (MLP or RF) model reveals top accuracy when the proportion of training samples is less (greater) than 10% of the total number of functional areas; (ii) With a large amount of training samples, MLP shows the highest prediction accuracy and good performances in cross-validation, but less stability on same training sets; (iii) With a small amount of training samples, GCN provides viable results, by incorporating the auxiliary information provided by the proposed semantic linkages, which is meaningful in real-world predictions; (iv) When the training samples are less than 10%, one should be cautious using MLP to test the optimal epoch for obtaining the best accuracy, due to its model overfitting problem. The above insights could support efficient and scalable urban functional area mapping, even with insufficient land-use information (e.g., covering only ~20% of Beijing in the case study).

How to cite: Deng, R., Guan, Y., Cai, D., Yang, T., Fraedrich, K., Zhang, C., Tang, J., Liao, Z., Wei, Z., and Guo, S.: Supervised versus semi-supervised urban functional area prediction: uncertainty, robustness and sensitivity, EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-16766, https://doi.org/10.5194/egusphere-egu23-16766, 2023.