Identification

Title

A Statistical Analysis of Lossily Compressed CESM-LENS Data

Abstract

The data storage burden resulting from CESM simulations continues to grow, and lossy data compression methods can alleviate this burden, provided that key climate variables are not altered to the point of affecting scientific conclusions. This dataset was generated to evaluate the effects of two leading lossy compression algorithms, sz and zfp, on daily output data from the CESM-LENS dataset. In particular, it contains daily data for variables TS (surface temperature) and PRECT (precipitation rate) from the historical forcing period (1920-2005) for CESM-LENS ensemble member 30. The provided data has been compressed and reconstructed via two popular compressors: sz 1.4.13 and zfp 0.5.3 with a number of different absolute error tolerances. Errors due to compression can be determined by comparing these reconstructed files to the original CESM-LENS timeseries data, and statistical methods can evaluate the errors at different spatiotemporal scales. While both compression algorithms show promising fidelity with the original output, detectable artifacts are introduced even at relatively tight error tolerances.

Resource type

dataset

Resource locator

https://gdex.ucar.edu/dataset/id/1e1399d0-737b-4e93-b5a1-91f8b00817b1.html

protocol: https

applicationProfile: browser

name: A Statistical Analysis of Lossily Compressed CESM-LENS Data

description: Metadata Link

function: download

Unique resource identifier

code

codeSpace

Dataset language

Spatial reference system

code identifying the spatial reference system

Classification of spatial data and services

Topic category

Keywords

Keyword set

keyword value

Dataset

originating controlled vocabulary

title

DataCite Resource Type

reference date

date type

revision

effective date

2014-10-16

Keyword set

keyword value

EARTH SCIENCE SERVICES > DATA MANAGEMENT/DATA HANDLING > DATA COMPRESSION

EARTH SCIENCE SERVICES > MODELS > ATMOSPHERIC GENERAL CIRCULATION MODELS

originating controlled vocabulary

title

NASA/GCMD Earth Science Keywords

reference date

date type

revision

effective date

2018-03-15

Geographic location

West bounding longitude

-180.0

East bounding longitude

180.0

North bounding latitude

90.0

South bounding latitude

-90.0

Temporal reference

Temporal extent

Begin position

1920-01-01

End position

2005-12-31

Dataset reference date

date type

publication

effective date

2020-03-13

date type

modified

effective date

2020-03-10

Frequency of update

Quality and validity

Lineage

Conformity

Data format

name of format

Hierarchical Data Format File (application/x-hdf)

version of format

Constraints related to access and use

Constraint set

Use constraints

Creative Commons Attribution 4.0 International License.

Limitations on public access

None

Responsible organisations

Responsible party

organisation name

UCAR/NCAR - Computational and Information Systems Laboratory

email address

abaker@ucar.edu

responsible party role

pointOfContact

Metadata on metadata

Metadata point of contact

organisation name

UCAR/NCAR - GDEX

email address

datahelp@ucar.edu

responsible party role

pointOfContact

Metadata date

2022-07-21T11:53:35-06:00

Metadata language

eng; USA