Statistical Analysis of Compressed Climate Data
[Note: this Technical Note was updated on 2020-08-17 per the authors' request to correct an error. See the description of the change on page two of the document's front matter.] The data storage burden resulting from large climate model experiments only continues to grow. Lossy data compression methods are required to alleviate this burden, but lossy methods introduce the possibility that key climate variable fields could be altered to the point of affecting scientific conclusions. It is therefore important to develop a detailed understanding of how compressed climate model output differs from the original for different compression algorithms and compression rates. In this work, we evaluate the effects of two leading compression algorithms, sz and zfp, on daily average and monthly maximum temperature data, and daily average precipitation rate data, from a historical run of CESM1 CAM5.2. While both algorithms show promising fidelity with the original model output, detectable artifacts are introduced even at relatively low error tolerances. Examples for temperature data include biases in temperature gradient fields, temporal autocorrelation, and seasonal cycles; precipitation data show, for example, biases in the number of rainy days. We highlight the need for evaluation methods that are sensitive to errors at different spatiotemporal scales and specific to the particular climate variable of interest.
document
http://n2t.net/ark:/85065/d7p84fqd
eng
geoscientificInformation
Text
publication
2016-01-01T00:00:00Z
EARTH SCIENCE > ATMOSPHERE > PRECIPITATION > PRECIPITATION AMOUNT
EARTH SCIENCE > ATMOSPHERE > ATMOSPHERIC TEMPERATURE > SURFACE TEMPERATURE > AIR TEMPERATURE
EARTH SCIENCE SERVICES > MODELS > COUPLED CLIMATE MODELS
EARTH SCIENCE SERVICES > DATA MANAGEMENT/DATA HANDLING > DATA COMPRESSION
EARTH SCIENCE SERVICES > DATA ANALYSIS AND VISUALIZATION > STATISTICAL APPLICATIONS
revision
2021-09-17
publication
2018-08-23T00:00:00Z
Copyright Author(s). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
None
OpenSky Support
UCAR/NCAR - Library
PO Box 3000
Boulder
80307-3000
name: homepage
pointOfContact
OpenSky Support
UCAR/NCAR - Library
PO Box 3000
Boulder
80307-3000
name: homepage
pointOfContact
2023-08-18T18:06:41.092579