|
 The purpose of this service is to make data produced by ultimate numerical simulations available to the scientific community. The common characteristics of such data are high complexity, very large sizes, and lack of homogeneity and standardisation. Our vision is an advanced data search and exploration environment for effectively performing scientific discovery cultivating new research practices in the scientific community. 
Nowadays the scientific community is witnessing an unprecedented growth in the quality and quantity of data coming from simulations and real-world experiments. This is due to dramatically improved computational power of modern computer systems and resolution of new imaging modalities; often datasets are measured in hundreds (or even millions) of gigabytes. To be useful this output must be organized appropriately and then made easily accessible to scientists worldwide through advanced access tools. Our vision is to make this process as easy and as intuitive as navigating through a collection of information search results coming from a modern internet search engine. The purpose is to avoid the current situation of extensive data replication from multiple and time-expensive downloads due to not knowing in advance the relevant dataset's region of interest. The ultimate vision is to support personalisation through user profiles offering the potential for cultivating novel research practices, by making scientists aware of alternative scientific discovery methods through a new platform acting as a social network for sciences
Our final goal is to develop a robust, trustworthy methodology for setting up a world-wide reference infrastructure for effective sharing and access of multi-disciplinary scientific datasets coming from recently-performed, state-of-the-art numerical simulations of a total size in the magnitude of hundreds of terabytes up to petabytes. The core values underpinning the proposed infrastructure are standardisation, interoperability, extensibility and intuitive homogeneous and user-driven data access. The immediate result will be a prototype first realisation of a pioneering numerical simulation library. The legacy will be the development (in continuous consultation with the scientific community) of a foundation for standards, tools and services applicable across a range of disciplines and suitable for effectively handling the challenges imposed by the forthcoming, massive-scale datasets measured in hundreds of petabytes. |
|