Glossary
Aerosols: Tiny particles or droplets suspended in the atmosphere, often originating from natural sources or human activities, influencing climate by scattering or absorbing sunlight and affecting cloud formation.
Aerosol Precursors: Substances that contribute to the formation of aerosols in the atmosphere.
Biomass Burning Data: Part of Input4MIPs, representing emissions from open biomass burning, used as input for climate models.
Climate Emulation: The development of machine learning models to simulate climate model outputs.
Climate Model: A mathematical representation of the Earth’s climate system used for predicting future climate conditions.
Climate Projection: An estimate of future climate conditions derived from climate model simulations under a given climate scenario.
Climate Scenario: A set of conditions used in climate models to project possible future climate states.
Climate Variables: Parameters such as temperature, precipitation, and wind velocity used in climate models.
ClimateSet: The dataset introduced here, providing climate model outputs and emission inputs for use in large-scale machine learning models.
ClimateSet Data Pipeline: A modular pipeline for retrieving and preprocessing climate model data for ML tasks.
CMIP6 (Coupled Model Intercomparison Project Phase 6): A project that collects climate model outputs from various sources, providing a comprehensive archive for climate-related research.
Dimension Reduction: Techniques to streamline large datasets by reducing the number of variables while retaining essential information.
Downscaling: A process of generating high-resolution climate predictions from lower-resolution climate models.
ESGF (Earth System Grid Federation): A system for managing and distributing climate model data.
Emulation: In the context of ClimateSet, the development of machine learning models that simulate climate model outputs, providing faster predictions of climate variables from input data.
Forcings: External factors influencing the Earth’s energy balance, such as variations in solar radiation, greenhouse gas emissions, aerosols, and land use.
GCMs (Global Climate Models): Complex simulations representing Earth’s climate system, focusing primarily on the atmosphere.
GHG (Greenhouse Gases): Gases such as CO2 and CH4 that trap heat in the Earth’s atmosphere.
Grid: A spatial and temporal framework that divides the Earth’s surface and atmosphere into discrete cells, so that physical and environmental variables can be represented at specific locations and time intervals for simulation and analysis.
HPC (High-Performance Computing) Cluster: A computing cluster designed for tasks requiring substantial processing power, useful for extending ClimateSet with additional climate models.
IPCC (Intergovernmental Panel on Climate Change): An international body assessing climate science.
IPCC Assessment Reports: Comprehensive scientific evaluations of climate change, informing policymakers and based on consensus-building.
Input4MIPs: Datasets collecting future emission trajectories of climate-forcing agents used as input for climate models.
Model Outputs: The diverse information generated by climate models, including climate variables, radiative forcing, sea level change, and more.
Preprocessing: The process of preparing raw climate data for machine learning tasks by handling inconsistencies, syncing parameters, and adjusting resolutions.
Projection Uncertainties: Variability in climate model projections arising from differences in model formulations (inter-model variability) and in initializations (intra-model variability).
RMSE (Root Mean Squared Error): An evaluation metric used to assess the accuracy of climate emulators.
Single Emulators and Super Emulators: ML models trained on a single climate model vs. those trained on a set of climate models for broader applications.
Spatial and Temporal Resolution: The granularity of spatial and temporal dimensions in climate data.
SSP (Shared Socioeconomic Pathways): Scenarios within ScenarioMIP representing different socioeconomic development pathways that influence greenhouse gas emissions.
Weighting of Climate Models: Assigning different weights to climate models so that no model is over- or under-represented.
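
As an illustration of the RMSE entry above, the following is a minimal sketch of the plain, unweighted metric. The glossary does not say how ClimateSet computes RMSE in practice (for example, whether grid cells are weighted by latitude), so the function name `rmse` and the sample values are assumptions made for illustration only.

```python
import math


def rmse(predicted, target):
    """Unweighted root mean squared error between two equal-length sequences."""
    if len(predicted) != len(target):
        raise ValueError("predicted and target must have the same length")
    if len(predicted) == 0:
        raise ValueError("inputs must be non-empty")
    # Square each pointwise error, average the squares, then take the root.
    squared_errors = [(p - t) ** 2 for p, t in zip(predicted, target)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical values: emulator predictions vs. climate model output
# for the same three grid cells (e.g. surface temperature in kelvin).
emulated = [287.0, 288.0, 290.0]
simulated = [287.0, 289.0, 288.0]
print(rmse(emulated, simulated))
```

Because the errors are squared before averaging, a single large deviation raises the score more than several small ones, and a lower value indicates an emulator that tracks the climate model more closely.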