New data-near processing capabilities at the European centers DKRZ, CNRS-IPSL, STFC (at CEDA), and CMCC for model data hosted in the Earth System Grid Federation will be made accessible to a broader user community via the new Transnational Access (TNA) service. These processing capabilities support multi-model server-side data analysis through direct access to large data pools including replicated data from the European as well as non-European ESGF data nodes. CMCC joins the ENES TNA initiative through the CMCC Analytics-Hub facility, providing a data science environment, based on ECAS, with:
This environment is hosted as part of the CMCC data infrastructure and aims to support user groups with respect to climate data collection access, processing and analysis. The hosted datasets concentrate on model data generated as part of the CMIP climate model intercomparison project. Applying to the TNA call allows you to have direct access to CMCC compute facilities. An evaluation committee will supervise the selection of applications for access to these virtual workspaces. More information about the application procedure here.
Below more info about the site-specific deployment and facility.
CMCC will provide access to a set of specific CMIP variable-centric collections. Data will be downloaded and kept in sync with the ESGF federated data archive using the Synda replication tool. About 50 TB disk space have been allocated to this purpose.
The data pool is efficiently accessible from cluster resources as well as JupyterLab.
The JupyterLab environment is already equipped with a set of Python libraries to support end-users data analysis.
Users can request the installation of additional libraries by contacting the user support here.
Compute intensive parallel data analysis is supported by the submission of batch compute jobs via Ophidia to the CMCC Analytics-Hub cluster.
Besides a pre-defined set of variable-centric collections made available on the CMCC data pool, other ones will be set up to specifically address requests coming from the TNA applications.
The CMCC Analytics-Hub offers two different types of access:
1) Direct access via user registration: upon registration, the end user can implement his/her own data analysis use cases by exploiting the different services available (Jupyter Hub, Ophidia, CDO). The user can either upload some input data or analyse data from the collections available in the Analytics-Hub data pool. Access can be requested also for training, evaluation and testing purposes. For any request, please contact the support here.
2) Access via Trans-National (TNA) calls: after a selection process, the winning candidates will be granted access to the Analytics-Hub resources via a dedicated "workspace" where they will perform the data analysis. Besides resource allocation, the CMCC team will offer dedicated support to:
Also in this case, for any request, please contact the support here.
To access the CMCC Analytics-Hub, users needs to register at CMCC. You can register here.
Registered users can contact analytics-hub-support[at]cmcc[dot]it for any information request.
Compute resources, storage and installed software: