This page will serve as a central hub for information on accessing and interacting with data from the Coupled Model Intercomparison Project Phase 6 (CMIP6) in cloud storage, managed by Pangeo. This data is formatted using Zarr, a cloud-optimized storage format.
The Coupled Model Intercomparison Project (CMIP) is an international collaborative effort to improve the knowledge about climate change and its impacts on the Earth System and on our society. CMIP began in 1995, and is currently in its sixth phase (CMIP6). The CMIP6 data archive consists of data models created across approximately 30 working groups and 1,000 researchers investigating the urgent environmental problem of climate change, and will provide a wealth of information for the next Assessment Report (AR6) of the Intergovernmental Panel on Climate Change (IPCC). As part of Google Cloud Public Datasets and the AWS Open Data Sponsorship Program, this data is now available on Google Cloud and Amazon S3 storage.
First and foremost, a Zarr package is required to interact with the data stores. Listed below are languages with actively developed Zarr packages; bolded languages do not yet have Zarr packages that support the reading of remote data stores:
Additionally, a filesystem package for Google Cloud and/or S3 storage is required for some languages to access the files containing the data stores:
Though optional, a CSV-loading package allows for searching and filtering of the Zarr data stores, which are enumerated in CSV files located at the root of each cloud storage bucket. Python users are encouraged to use xarray, intake, and intake-esm, which facilitate exploration and interaction with the data through the use of Earth System Model (ESM) collection specifications which are also provided at the root of each bucket.
- Cloud data overview
- Accessing data in the cloud
- Pangeo Catalog