Overview
Providentia’s download mode retrieves modelled and observational data from BSC systems and external sources (CAMS, Zenodo, ACTRIS) for local use.
Getting started
To start downloading data, simply add --download or --dl to your command:
./bin/providentia --config='/path/to/file/example.conf' --download
./bin/providentia --config='/path/to/file/example.conf' --dl
Providentia reads your configuration file to determine which data to download and saves it in the directories specified for local in settings/data_paths.yaml.
The download mode fetches all the content specified in your configuration file across all sections.
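When the download is launched from a script rather than typed by hand, the same command can be assembled programmatically. The following is a minimal Python sketch, assuming the ./bin/providentia launcher is called from the repository root. The helper name build_download_command is hypothetical and is not part of Providentia.

```python
import subprocess


def build_download_command(config_path, short_flag=False):
    """Return the argument list for a Providentia download run."""
    flag = "--dl" if short_flag else "--download"
    # Passing a list to subprocess avoids shell quoting issues, so the
    # path needs no surrounding quotes here.
    return ["./bin/providentia", f"--config={config_path}", flag]


if __name__ == "__main__":
    command = build_download_command("/path/to/file/example.conf")
    print(" ".join(command))
    # Uncomment to launch the download (assumes the current directory
    # is the Providentia repository root):
    # subprocess.run(command, check=True)
```

Building the command as a list keeps the configuration path intact even if it contains spaces.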
Types of downloads
Providentia supports four types of downloads. For detailed instructions, please visit the respective pages:
Download from BSC HPC Machines
Downloads GHOST and non-GHOST data and model outputs from BSC HPC machines. You must have a BSC account to access this feature.
How to get this type of download:
For GHOST networks, answer y to the prompt:
Do you want to download observational data from the BSC remote machine? (Otherwise, GHOST observational data will be retrieved from Zenodo)
or set dl_ghost_source to bsc.
For non-GHOST networks and interpolated/non-interpolated model data, no special action is required.
To see more information, check the BSC download page.
Download of network from Zenodo
Downloads GHOST networks from the GHOST Zenodo webpage.
How to get this type of download: answer n to the HPC prompt:
Do you want to download observational data from the BSC remote machine? (Otherwise, GHOST observational data will be retrieved from Zenodo)
or set dl_ghost_source to zenodo.
To see more information, check the Zenodo download page.
Download of network from ACTRIS (Thredds)
Downloads observational networks from ACTRIS Thredds.
How to get this type of download: put actris/actris in the network field in your configuration.
To see more information, check the ACTRIS download page.
Download of non-interpolated model data from the Atmosphere Data Store (ADS)
Downloads model outputs from the Atmosphere Data Store.
How to get this type of download: specify the model as cams_analysis, cams_forecast or cams_reanalysis in your configuration, and set dl_interpolated to False.
To see more information, check the CAMS download page.
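The selection rules for the four download types can be summarised in a short Python sketch. It is only one reading of the rules described in this section, not Providentia's actual implementation. The function names and the is_ghost argument are hypothetical: the caller must say whether a network is a GHOST network, since how Providentia classifies networks is not covered here.

```python
CAMS_MODELS = {"cams_analysis", "cams_forecast", "cams_reanalysis"}


def observation_source(network, is_ghost, dl_ghost_source="bsc"):
    """Pick the download source for an observational network."""
    if network == "actris/actris":
        return "ACTRIS Thredds"
    if is_ghost:
        if dl_ghost_source not in ("bsc", "zenodo"):
            raise ValueError(f"unexpected dl_ghost_source: {dl_ghost_source!r}")
        return "BSC HPC" if dl_ghost_source == "bsc" else "Zenodo"
    # Non-GHOST networks need no special setting and come from BSC.
    return "BSC HPC"


def model_source(model, dl_interpolated=True):
    """Pick the download source for model output."""
    # Only non-interpolated CAMS products are fetched from the ADS.
    if model in CAMS_MODELS and not dl_interpolated:
        return "ADS"
    return "BSC HPC"


if __name__ == "__main__":
    print(observation_source("actris/actris", is_ghost=False))
    print(model_source("cams_forecast", dl_interpolated=False))
```

The sketch assumes the model field holds the bare model name. If a domain or ensemble is appended to it, the comparison against CAMS_MODELS would need adjusting.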
Download configuration fields
Only the following configuration fields are used during download. All required fields must be provided.
| Variable | Description | Required | Default |
|---|---|---|---|
| | GHOST version used when a GHOST network is selected | No | 1.5 |
| network | Observation network to use | Yes | - |
| | Model ID(s) to be interpolated | No | - |
| | Domain of the model, can be indicated in the model field (e.g. | No | - |
| | Ensemble of the model, can be indicated in the model field (e.g. | No | - |
| species | Species to load (e.g. | Yes | - |
| resolution | Observation data resolution (e.g. | Yes | - |
| | Model resolution if different from observations | No | Same as |
| start_date | Start date of download ( | Yes | - |
| end_date | End date of download ( | Yes | - |
| | Optional filter to select specific species | No | - |
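Because every required field must be provided, it can help to check a configuration before starting a long download. The Python sketch below is an illustration only. It assumes the five required fields are named network, species, resolution, start_date and end_date, matching the field names listed in the wildcard section to the required rows by their descriptions. The helper name missing_required_fields and the sample values are hypothetical.

```python
REQUIRED_FIELDS = ("network", "species", "resolution", "start_date", "end_date")


def missing_required_fields(section):
    """Return the required download fields that are absent or empty."""
    return [
        name
        for name in REQUIRED_FIELDS
        if not str(section.get(name, "")).strip()
    ]


if __name__ == "__main__":
    # Placeholder values; substitute the ones from your own configuration.
    section = {"network": "actris/actris", "species": "*"}
    missing = missing_required_fields(section)
    if missing:
        print("Missing required fields: " + ", ".join(missing))
```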
Automation of the download
To run the download from a script, or otherwise without user input, set the following variables. Each one answers the prompt shown in the Original Question column.
| Variable | Description | Original Question | Expected Values |
|---|---|---|---|
| | Indicates whether previously downloaded files should be overwritten. | There are some files that were already downloaded in a previous download, do you want to overwrite them ([y]/n)? | |
| dl_ghost_source | Determines where GHOST observations are downloaded from. | Do you want to download observational data from the BSC remote machine? (Otherwise, GHOST observational data will be retrieved from Zenodo) ([y]/n) | bsc, zenodo |
| dl_interpolated | Specifies whether the interpolated versions of the model output should be downloaded. | Model data was detected in the configuration file. Do you want to download the interpolated version? (Otherwise, the non-interpolated model data will be downloaded) ([y]/n) | |
| | Selects what to download when both observations and model output are present in the configuration file. | Which type of data do you want to download? Observational, modelled or both? ([both]/obs/mod) | |
| | Determines whether to use all GHOST or all non-GHOST networks when the observation field uses the | Do you want to download all the GHOST networks? (Otherwise all the non-GHOST networks will be downloaded) ([y]/n) | |
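As an illustration of an unattended setup, the Python sketch below writes a configuration section with two of the prompts already answered. It uses only the variable names that appear elsewhere on this page, dl_ghost_source and dl_interpolated. Several points are assumptions: that these variables are set in the configuration file alongside the other fields, that the file follows an INI-style layout, and the section name and field values used in the example. The helper name make_unattended_config is hypothetical.

```python
import configparser
import io


def make_unattended_config(section_name, fields, dl_ghost_source="zenodo",
                           dl_interpolated=False):
    """Return configuration text with two download prompts pre-answered."""
    parser = configparser.ConfigParser()
    # configparser only accepts string values, so convert everything.
    parser[section_name] = {key: str(value) for key, value in fields.items()}
    parser[section_name]["dl_ghost_source"] = dl_ghost_source
    parser[section_name]["dl_interpolated"] = str(dl_interpolated)
    buffer = io.StringIO()
    parser.write(buffer)
    return buffer.getvalue()


if __name__ == "__main__":
    # Hypothetical section name and placeholder field values.
    text = make_unattended_config("EXAMPLE", {"network": "actris/actris"})
    print(text)
```

The remaining prompts would need their own variables set in the same way before the run is fully non-interactive.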
Using wildcards
You can use the * wildcard in the following fields to automatically select all available values:
network, observation, framework
model, models, experiments, experiment
species
resolution
start_date
end_date
Note: Using wildcards may result in large downloads, so use with caution.
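To see why a wildcard can lead to a large download, the Python sketch below expands * against a list of available values and counts the resulting combinations. It is a rough illustration, not Providentia's own expansion logic. The function names, the sample values and the assumption that multiple values are written as a comma-separated list are all hypothetical.

```python
def expand_field(value, available):
    """Resolve one configuration value against the values that exist."""
    requested = [item.strip() for item in value.split(",") if item.strip()]
    if "*" in requested:
        # The wildcard selects every available value.
        return list(available)
    return requested


def count_combinations(selection):
    """Multiply the number of values chosen per field."""
    total = 1
    for values in selection.values():
        total *= len(values)
    return total


if __name__ == "__main__":
    # Placeholder names standing in for real networks and species.
    selection = {
        "network": expand_field("*", ["net_a", "net_b", "net_c"]),
        "species": expand_field("sp_x, sp_y", ["sp_x", "sp_y", "sp_z"]),
    }
    print(count_combinations(selection))
```

Each additional field set to * multiplies the number of combinations, and with it the amount of data requested.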