
Exploring the Frontiers in Earth System Modeling with Machine Learning and Big Data
This workshop brings a small but varied group of geoscientists and climate modelers together with machine learners, statisticians, and representatives of other fields where ML has already had a big impact. Discussions will center on how innovative and efficient ML methods will provide new, innovative and transformative ways of modeling and projecting the Earth system and extracting information from massive data volumes. Of particular importance will be an assessment of the challenges ahead in combining physical knowledge with data-driven methods. This is a challenging and promising research field where a concentrated effort will have a high impact both to advance science and to address topics of critical importance and high relevance for human society. Newly developed methods can open opportunities for innovative new hybrid ML-Earth system models, as well as analyses of models and observations. With enhanced analysis capabilities, we can look forward to revolutionary advancement of Earth system sciences to accelerate scientific understanding, modeling, and projecting climate change.
Detailed Description
Over the last decade, the volume of data from satellite sensors and Earth system models has increased by at least an order of magnitude. The exponential growth in data is threatening to overwhelm the relatively simple tools and methods currently used to analyze the data within the climate research community, and is providing new opportunities to improve Earth system and climate models with machine learning (ML).
ML is a promising and growing path of research that could contribute to substantially enhancing our understanding of the Earth system and reduce uncertainty in climate change projections. While ML is revolutionizing many fields, from molecular science, bioinformatics, and genomics to material sciences, in Earth sciences this revolution is still in its infancy with the potential of ML methods and ideas yet to be fully explored. One reason is that a large body of physical knowledge enabled to develop theories and Earth system models is derived from first principles. This purely physics-driven approach has its limits and there are many ways in which ML can be harnessed to improve our understanding, models, and to predict and project the Earth system from weather to climate scales.
ML excels at modeling highly complex functional relationships. One novel path forward involves ML modules for simulating small-scale processes in Earth system models that cannot be fully resolved and that are currently often heuristically parametrized. Another opportunity is for ML to deliver powerful tools for analyzing high-dimensional datasets which are especially prevalent in the Earth sciences. Additionally, ML, augmented by causal inference methods, allows more information to be extracted from observations and models on how processes interact causally.
This workshop brings a small but varied group of geoscientists and climate modelers together with machine learners, statisticians, and representatives of other fields where ML has already had a big impact. Discussions will center on how innovative and efficient ML methods will provide new, innovative and transformative ways of modeling and projecting the Earth system and extracting information from massive data volumes.
Of particular importance will be an assessment of the challenges ahead in combining physical knowledge with data-driven methods. This is a challenging and promising research field where a concentrated effort will have a high impact both to advance science and to address topics of critical importance and high relevance for human society. Newly developed methods can open opportunities for innovative new hybrid ML-Earth system models, as well as analyses of models and observations. With enhanced analysis capabilities, we can look forward to revolutionary advancement of Earth system sciences to accelerate scientific understanding, modeling, and projecting climate change.
Main Topics of the Workshop
(I) Hybrid modeling and subgrid-scale parametrizations
- ML tasks: Learning from observational data and high-resolution model output, Parameter estimation
- Approaches: Deep Learning (generative models, classification models), Unsupervised Learning, Transfer learning
- Unique Challenges: Incorporating physical constraints, Developing physical regularization techniques, Interpretability, Out-of-sample generalization, Uncertainty propagation
(II) Extreme event detection and forecasting
- ML tasks: Anomaly detection, Pattern Classification, Change detection, Dimension reduction, Uncertainty quantification
- Approaches: Deep Learning (spatial; space-time; sequence-to-sequence models); generative models
- Unique Challenges: Handling large-scale multivariate, spatio-temporal data; Developing parallel algorithms for distributed computing; Physical interpretability; Explainability; Robustness, generalization guarantees
(III) Extracting dynamical causal dependencies
- Analytics Tasks: Combining analyses of climate model experiments with observations; Variable extraction and dimension reduction from spatio-temporal data
- Approaches: Conditional independence-based algorithms; Structural causal modeling and invariant prediction techniques; Causal network comparison metrics
- Unique challenges: High-dimensional nonlinear interdependencies; Uncertainty propagation
(IV) Constraining climate change projections
- Analytics tasks: Dimension reduction; Uncertainty quantification
- Approaches: Causal transfer learning and transportability; Bayesian optimal weighting schemes; Data mining
- Unique challenges: Physical interpretation / plausibility
(V) Cross-cutting challenges for ML method development
- Physical interpretation/plausibility
- Explainability
- Incorporating physical constraints, physics-based regularization techniques
- Robustness, generalization guarantees

Agenda
Expand to see available videos and presentations
10:45 am Session 1. Hybrid Modeling and Subgrid-scale Parametrizations
1:45 pm Session 1 continued. Hybrid modeling and subgrid-scale parametrizations
2:15 pm Adventures in Latent Variable ML Modeling Presented by Mike Pritchard
3:30 pm Breakout Group 3: Hybrid modeling and subgrid-scale parametrizations – Ocean and Cryosphere
3:30 pm Breakout Groups 1-3 on Session 1
3:30 pm Breakout Group 1: Hybrid modeling and subgrid-scale parametrizations – Atmosphere
3:30 pm Breakout Group 2: Hybrid modeling and subgrid-scale parametrizations – Land
9:00 am Session 2. Extreme event detection and forecasting
11:00 am Session 2 continued Extreme event detection and forecasting
11:15 am Machine Learning Applications Across Climate Action Presented by David Rolnick
1:30 pm Session 3 Industry and Public sector perspectives
3:15 pm Session 4. Extracting dynamical causal dependencies
To improve societal and environmental wellbeing, decision makers need climate information that is actionable. Actionable climate information needs to be timely; sufficiently detailed as to describe the region and processes of interest; trustworthy and truthful in acknowledging uncertainties; and policy- or decision-relevant. We have already begun to observe the transformative power of machine learning (ML) to produce actionable knowledge in weather forecasting (where we can conveniently check whether our predictions are correct). How could machine learning do the same, and more, in the context of climate change? This talk will introduce how we make climate predictions, and then describe ways in which machine learning can enable us to make more trustworthy and more detailed climate predictions, at a faster pace. We’ll also explore how machine learning can provide a bridge between climate.
9:00 am Breakout Group 4: Extreme event detection and forecasting
9:00 am Breakout Groups 4-5 on Sessions 2 and 4 Breakout Group 5: Causal dependencies
9:00 am Breakout Groups 4-5 on Sessions 2 and 4 Breakout Group 4: Extreme event detection and forecasting
11:15 am Session 5. Climate Model Evaluation & Constraining uncertainty in climate projections
9:00 am Session 5 continued. Climate Model Evaluation & Constraining uncertainty in climate projections
10:45 am Session 6. ML-based predictions, stochastic modeling and nowcasting
11:45 am Session 7. Cross-cutting challenges for ML method development
2:30 pm Breakout Group 6: Climate Model Evaluation & Constraining uncertainty in climate projections
2:30 pm Breakout Groups 6-8 on Sessions 5-7
2:30 pm Breakout Group 8: Cross-cutting challenges for ML method development
2:30 pm Breakout Group 7: ML-based predictions and nowcasting
9:00 am Session 8. Synthesis & Follow-On Research Needs
10:45 am Working breakout sessions on follow-on activities, outputs
Organizers
Attendees




























The attendee list and participant profiles are regularly updated. For information on participant affiliation at the time of workshop, please refer to the historical roster. If you are aware of updates needed to participant or workshop records, please notify AGCI’s workshops team.