Challenges and Opportunities for Findable, Accessible, Interoperable and Re-usable Training Dataset
1) to discuss the cutting-edge topics of machine learning training data for the geospatial community;
2) to describe the spatial, temporal and thematic representativeness of TDS and their uncertainties;
3) to focus on sharing and reusability of TDS to increase the adaptation of TDS for geospatial analysis.
This session will focus on the following topics around training datasets:
-How to describe a training dataset to enable efficient re-use in ML/AI applications?
-What are the main characteristics of the training dataset, and what additional information needs to be provided to sufficiently understand the privacy, nature and usability of the dataset?
-Exploring the effect of training data accuracy level, uncertainty of the measurement, labelling procedure used to generate the training data, original data used to create labels, external classification schemes for label semantics, e.g. ontologies or vocabularies;
-What metadata is required, recommended, or optionally provided?
-How to express the quality of a TDS? Is it possible to auto-generate quality indicators?
-Evaluating the effect of training data size, spatial resolution and structure, temporal resolution and currency, the spectral resolution of imagery used for annotation, and annotating accuracy.
-Methods for documenting, storing, evaluating, publishing, and sharing the training datasets;
-Transfer learning and impact of combining various training datasets;
-Open standards and open source training datasets;
-How to enable FAIR (findable, accessible, interoperable and reusable) data principles to be at the heart of future TDS standardization.
16:15–16:20
5-minute convener introduction
16:20–16:30
|
PICO2.1
|
EGU23-14061
|
solicited
|
On-site presentation
16:32–16:34
|
PICO2.3
|
EGU23-16998
|
Virtual presentation
16:36–16:38
|
PICO2.5
|
EGU23-10594
|
Virtual presentation
Automatic labeling of the Training Dataset for Individual and Group Activities Detection
(withdrawn)
16:40–16:42
|
PICO2.7
|
EGU23-12352
|
ECS
|
On-site presentation
16:42–16:44
|
PICO2.8
|
EGU23-7299
|
On-site presentation
16:44–16:46
|
PICO2.9
|
EGU23-888
|
ECS
|
On-site presentation
16:46–16:48
|
PICO2.10
|
EGU23-2203
|
On-site presentation
16:48–16:50
|
PICO2.11
|
EGU23-17570
|
Virtual presentation
16:50–16:52
|
PICO2.12
|
EGU23-10590
|
ECS
|
Virtual presentation
The impact of training dataset on a vision-based smart road sensor to measure the level of flood on the streets
(withdrawn)
16:52–18:00
Interactive presentations at PICO screens