The Data Intensive Research Initiative of South Africa (DIRISA) promotes, enables, and coordinates a data intensive research ecosystem in support of national science and strategic priorities. This is accomplished through several key objectives, including the provision of a robust and advanced national data infrastructure and services, the promotion of sound data stewardship practices and the development of expertise in data management and data intensive research. DIRISA operates as an overarching national data organization and as such, advocates research data sharing, coordinates publicly funded initiatives and advises on strategic agendas related to data intensive research.
DIRISA is expanding the national data infrastructure to accommodate increasing demands for research data storage, coordinates the national e-Science Postgraduate Training and Teaching Platform (NEPTTP), and a regional research data centre managed by a consortium of academic institutions. The data services developed by DIRISA include a research data repository providing 100GB of storage for registered users, a long-term data storage facility of 20 PB, as well as data discovery and analytical services that support more rapid data-based research innovation across all academic disciplines.
In supporting researchers to manage their research data, DIRISA has released an operational version of a research data management planning tool, called DMP-SA Online. It has become standard practice for research funders to require such a plan as part of the research proposal as it supports improved data-driven research across all disciplines. As a collaborative effort by DIRISA and Universities South African (USAf), DMP-SA Online has been demonstrated to, and adopted by several local universities and research institutions.
DIRISA is implementing a Digital Object Identifier (DOI) service that allows users to assign a unique persistent identifier to valuable research data collections. The use of these DOIs (also known as Handles) greatly improves the visibility and management of digital objects, including data and valuable national assets.
DIRISA hosts a national research data workshop annually, where researchers share their experiences and give input on their research data needs. DIRISA also conducts the Student Datathon Challenge that gives students an opportunity to use research data to produce creative and innovative solutions that help to solve some of South Africa’s challenges. This activity encourages the use of open data to promote open science and the development of data science skills early in the careers of students.
Provide a robust and reliable national data infrastructure for data intensive research with services such as a petascale 8 PB research data repository; and data sharing and archiving services through the deployment of a 20 PB long term storage facility. DIRISA itself, is conducting research and development on software defined data storage technologies specifically for local conditions.
Support R&D that yields improved and scaled up technologies that underpin manufacturing and logistics, and the Fourth Industrial Revolution. Localised services for research data management based on market needs, are being developed.
Federate existing research data repositories, such as Ilifu, into the DIRISA Tier 1 data node.
Together with, NRF and NIPMO, develop standards and policy recommendations to regulate the open and ethical use and management of research data. For DSI, develop strategies and frameworks for Open Data, Big Data and Open Science.
Continue to run and expand the National eScience Postgraduate Teaching and Training Platform (NEPTTP) programme to other universities; coordinate workshops and training events in the data sciences and data management to re-skill and upskill researchers.
Pursue partnerships with the IT industry and academia for co-funded collaborative development of IT services.
Provide recommended national strategies and frameworks for research big data and the establishment of regional tiered data nodes.
Develop Open Data policy recommendations supporting the national Open Science framework, and guidelines to preserve valuable and important research data collections hosted by institutions and projects such as SARIR and the NRS.
The Data Intensive Research Initiative of South Africa (DIRISA) promotes, enables, and coordinates a data intensive research ecosystem in support of national science and strategic priorities.