ESA EarthCODE 101
Stephan Meißl, Anca Anghelea, Deyan Samardzhiev, Anne Fouilloux
The ESA EarthCODE platform (https://earthcode.esa.int/) is revolutionizing Earth System Science with Open Science principles.
The platform is a Collaborative Open Development Environment created to embed FAIR and Open Science principles into the scientific development process of Earth System Science. By providing tools, expertise, and opportunities, EarthCODE enables science teams to adopt open science practices as part of their everyday workflows. With integrated access to commercial platform services and a wealth of community resources and tools, EarthCODE helps scientists collaborate, innovate, and adhere to open research practices.
Summary:
This hands-on workshop introduces participants to EarthCODE's capabilities, guiding them from searching, finding, and accessing Earth Observation (EO) datasets and workflows to publishing reproducible experiments that can be shared with the wider scientific community. It will equip you with the tools and knowledge to use EarthCODE for your own projects and to contribute to the future of open science.
During this 4-hour, hands-on workshop, participants will:
- Get an introduction to EarthCODE and the future of FAIR and Open Science in Earth Observation
- Learn how data and workflows on EarthCODE are made findable, accessible, interoperable, and reusable
- Create reproducible experiments on EarthCODE’s platforms, with a hands-on example using Euro Data Cube and Pangeo
- Publish data and experiments to EarthCODE
At the end of the workshop, we will take time for discussion and feedback on how to make EarthCODE better for the community.
Workshop Overview:
1. Introduction to EarthCODE and the Future of FAIR and Open Science in Earth Observation
The workshop begins with an introduction to the EarthCODE platform and its role in promoting FAIR principles within the EO community. This section will address the challenges of working with EO data at scale and how EarthCODE provides a robust infrastructure for data discovery, processing, and sharing. Participants will gain insight into the importance of reproducibility in EO research and the ways open science practices are shaping the future of the field.
Topics covered:
- Overview of EarthCODE’s mission and architecture
- Overview of EarthCODE platforms (at present including Euro Data Cube, DeepESDL and Copernicus/OpenEO)
- The importance of FAIR principles in EO data
- Challenges in EO data management and reproducibility
2. Findability, Accessibility, Interoperability, and Reusability of Data and Reproducible Workflows on EarthCODE
This section walks through the EarthCODE Science Catalog. Participants will be guided through methods for efficiently finding and accessing a diverse range of EO research datasets and workflows. The session will highlight the practical application of the STAC and OGC API Records metadata standards.
Hands-on activities:
- Finding and accessing datasets on EarthCODE Science Catalog
- Understanding the STAC and OGC API Records metadata standards
- Re-running experiments and adapting pre-existing workflows to new datasets and research objectives using EarthCODE's cross-platform capabilities
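The catalog search described above can be sketched in a few lines. The example below is a minimal, offline illustration rather than the EarthCODE Science Catalog API: the Items, their identifiers, and the helper names `search_items` and `bboxes_intersect` are all hypothetical. It filters STAC-like Items by bounding box and acquisition time, which is the core of what a STAC search does.

```python
from datetime import datetime, timezone


def parse_time(value):
    """Parse an RFC 3339 timestamp such as '2021-06-01T00:00:00Z'."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def bboxes_intersect(a, b):
    """Return True if two [west, south, east, north] boxes overlap."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def search_items(items, bbox=None, start=None, end=None):
    """Filter STAC-like Items by bounding box and acquisition time."""
    matches = []
    for item in items:
        when = parse_time(item["properties"]["datetime"])
        if bbox is not None and not bboxes_intersect(item["bbox"], bbox):
            continue
        if start is not None and when < start:
            continue
        if end is not None and when > end:
            continue
        matches.append(item)
    return matches


# Hypothetical Items, trimmed to the fields the filter needs.
ITEMS = [
    {"id": "demo-north-sea-2021", "bbox": [0.0, 51.0, 9.0, 58.0],
     "properties": {"datetime": "2021-06-01T00:00:00Z"}},
    {"id": "demo-alps-2019", "bbox": [5.0, 44.0, 16.0, 48.0],
     "properties": {"datetime": "2019-07-15T00:00:00Z"}},
    {"id": "demo-amazon-2021", "bbox": [-70.0, -10.0, -50.0, 0.0],
     "properties": {"datetime": "2021-03-10T00:00:00Z"}},
]

europe = [-10.0, 35.0, 30.0, 60.0]
found = search_items(
    ITEMS, bbox=europe, start=datetime(2021, 1, 1, tzinfo=timezone.utc)
)
print([item["id"] for item in found])
```

Against a live STAC API the same bounding-box and time filters are passed to the server as search parameters instead of being evaluated locally.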
3. Creating FAIR Data and Workflows Using EarthCODE’s Platforms – Hands-On with EDC and Pangeo
EarthCODE’s key strengths are its cross-platform capabilities and integration with platforms such as Euro Data Cube (EDC) and Pangeo. This session will focus on creating and publishing an experiment on EarthCODE, including workflow and data, from start to finish. Participants will work with real-world datasets, apply processing workflows, and learn how to share their results.
We will introduce the Pangeo ecosystem (http://pangeo.io/) and EDC (https://eurodatacube.com/), demonstrating the core concepts of open, reproducible, and scalable Earth science. Participants will explore hands-on tasks, including:
- Accessing data using Pangeo on EDC
- Loading and analysing data with Xarray
- Visualizing data with hvPlot
- Understanding how to scale analysis with Dask
Example datasets will include Copernicus Sentinel satellite data, as well as products from the Atmosphere Monitoring and Climate Change Services. All Python packages used during the training are open-source to ensure accessibility and reusability for all participants.
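To give a feel for what scaling analysis with Dask means, the sketch below mimics the chunk-then-combine pattern using only the Python standard library. It does not use Dask or Xarray, and the function names and the sample values are hypothetical. The data is split into chunks, a partial result is computed per chunk in parallel, and the partial results are combined into the final mean.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(values, chunk_size):
    """Split a sequence into consecutive chunks of at most chunk_size."""
    return [values[i:i + chunk_size] for i in range(0, len(values), chunk_size)]


def partial_stats(chunk):
    """Per-chunk partial result: (sum, count)."""
    return sum(chunk), len(chunk)


def chunked_mean(values, chunk_size):
    """Mean computed from per-chunk partial sums, evaluated in parallel."""
    if not values:
        raise ValueError("cannot compute the mean of an empty sequence")
    chunks = split_into_chunks(values, chunk_size)
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(partial_stats, chunks))
    total = sum(s for s, _ in partials)
    count = sum(n for _, n in partials)
    return total / count


# Hypothetical sample values standing in for one variable of a dataset.
samples = [float(v) for v in range(1, 101)]
print(chunked_mean(samples, chunk_size=16))
```

With Xarray backed by Dask, the same idea is expressed by chunking an array and calling a reduction such as `mean`; the library then schedules the per-chunk work and the combination step.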
Hands-on activities:
- Setting up an EDC Pangeo environment
- Introduction to Pangeo
- Using Pangeo for scalable data processing
4. Publishing Data and Experiments to EarthCODE
This section follows up on the workflows and data developed in the previous session. Participants will learn how to publish their experiments and datasets to EarthCODE, making them available to other researchers. We will cover best practices and guidelines for FAIR and Open Science, as well as common tools that help generate metadata.
Hands-on activities:
- Preparing an experiment for publication to EarthCODE, including suitable data formats
- FAIR and Open Science guidelines for EarthCODE
- Using stactools and validation tools to create and validate metadata
- Publishing datasets and workflows to EarthCODE’s repository
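As an illustration of the metadata step, the sketch below builds a minimal STAC Item as a plain dictionary and checks that the top-level fields required by the STAC Item specification are present. It does not use stactools, and it is not the EarthCODE publishing procedure. The identifier, the asset URL, and the helpers `make_item` and `missing_fields` are hypothetical.

```python
import json
from datetime import datetime, timezone

# Top-level fields of a STAC Item. This check is simplified: it always
# expects "bbox" (the spec requires it when geometry is not null) and a
# "datetime" property (the spec also allows a start/end range instead).
REQUIRED_ITEM_FIELDS = (
    "type", "stac_version", "id", "geometry",
    "bbox", "properties", "links", "assets",
)


def bbox_to_polygon(bbox):
    """Turn [west, south, east, north] into a closed GeoJSON Polygon."""
    west, south, east, north = bbox
    ring = [[west, south], [east, south], [east, north],
            [west, north], [west, south]]
    return {"type": "Polygon", "coordinates": [ring]}


def make_item(item_id, bbox, when, asset_href):
    """Build a minimal STAC Item with a single data asset."""
    timestamp = when.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    return {
        "type": "Feature",
        "stac_version": "1.0.0",
        "id": item_id,
        "geometry": bbox_to_polygon(bbox),
        "bbox": list(bbox),
        "properties": {"datetime": timestamp},
        "links": [],
        "assets": {"data": {"href": asset_href, "roles": ["data"]}},
    }


def missing_fields(item):
    """List the required fields that are absent from an Item."""
    missing = [name for name in REQUIRED_ITEM_FIELDS if name not in item]
    if "datetime" not in item.get("properties", {}):
        missing.append("properties.datetime")
    return missing


# Hypothetical experiment output; the URL is a placeholder.
item = make_item(
    "demo-experiment-output",
    [5.0, 44.0, 16.0, 48.0],
    datetime(2021, 6, 1, tzinfo=timezone.utc),
    "https://example.org/demo/output.zarr",
)
print(json.dumps(item, indent=2))
print(missing_fields(item))
```

A field check like this only catches missing keys. Dedicated validation tools check the full Item against the published JSON Schemas, which is why they remain part of the publishing workflow.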
5. Discussion and Feedback: Making EarthCODE Better for the Community
The final session is an open discussion where participants can share feedback about EarthCODE, including suggestions for improvements and new features. This is an opportunity to help shape the future of EarthCODE.