EGU24-18775, updated on 11 Mar 2024
https://doi.org/10.5194/egusphere-egu24-18775
EGU General Assembly 2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

SoilPulse – A software package for semi-automated metadata management and publication

Jan Devátý1, Jonas Lenz2, and Conrad Jackisch3
Jan Devátý et al.
  • 1Czech Technical University in Prague, Faculty of Civil Engineering, Dept. of Landscape Water Conservation, Prague, Czechia (jan.devaty@fsv.cvut.cz)
  • 2IPROconsult Gmbh, Environmental consulting, Dresden
  • 3TU Bergakademie Freiberg, Interdisciplinary Environmental Research Centre

Every model calibration/validation task is as good as the range of data available for the task and using other team’s data can greatly enhance the calibration outcome for wider range of conditions. Lot of data was gathered during the long research history on soil erosion, but the interoperability of this data is in many cases hindered by inhomogeneous data structure of the single datasets, if these are at least available digitally. The analysis and aggregation of existing digital data sets is a complicated task due to vastly heterogeneous field situations, various spatio-temporal scales involved, different experimental setups and equipment, and numerous repository types and structures. Resources often lack sufficient description in metadata making it hard for humans and impossible for computers to fully understand the structure and contents of the data set. The missing common data management and data structure format in soil erosion research can be seen as major drawback, which hampers data reusability and scientific exchange and progress. However, expecting all the research teams to adopt a common data management approach is naïve.

Within the NFDI4Earth pilot SoilPulse (soilpulse.github.io) we aim to develop a software library responsible for handling metadata from existing data sets of various types. The package will contain tools for metadata extraction (if already existing), creation (by parsing the data set and recognizing metadata elements), representing by a common general metadata scheme, storing the resource’s metadata image, and providing tools to query the storage to reach all available data sets fitting particular conditions.

The poster presents a SoilPulse package structure, intended process-flow of interactive dataset registration and recognition, and metadata mining tools overview. As SoilPulse is in active development we highly appreciate comments, hints and impulses to further improve the tool!

How to cite: Devátý, J., Lenz, J., and Jackisch, C.: SoilPulse – A software package for semi-automated metadata management and publication, EGU General Assembly 2024, Vienna, Austria, 14–19 Apr 2024, EGU24-18775, https://doi.org/10.5194/egusphere-egu24-18775, 2024.