Big Data Innovation Hubs

A Shared Mission

Data science is happening everywhere—so let’s work together. The Big Data Innovation Hubs bring together a wide-ranging and inclusive community to collaborate with data on shared societal and scientific priorities.

National initiatives like the COVID Information Commons, the Keeping Data Science Broad program, and the Data Sharing & Cyberinfrastructure Working Group are how we convene this grassroots community and amplify impact.

COVID Information Commons

The COVID Information Commons (CIC) is an open resource for exploring NSF-funded research addressing the COVID-19 pandemic. Launched in July 2020, the CIC aims to facilitate knowledge sharing and collaboration across COVID research efforts addressing the many facets of the pandemic, through an open and accessible platform. It has since evolved into a growing and thriving community.

Data Sharing & Cyberinfrastructure Working Group

The Data Sharing and Cyberinfrastructure Working Group is a collaboration across all four of the Big Data Innovation Hubs. Each month, the working group brings together data scientists and cyberinfrastructure professionals to learn about topics of common interest. Areas covered include data management and analytics tools and frameworks, and the use of data science to solve real-world problems in areas such as transportation, water quality, public health, and disaster response.

The Trustworthy Data Working Group

Open science relies on data integrity, collaboration, high-performance computing, and scalable tools to achieve results, but it currently lacks effective cybersecurity programs that address the trustworthiness of scientific data. The Trustworthy Data Working Group (TDWG) is a collaboration between the four Big Data Hubs and Trusted CI, the NSF Cybersecurity Center of Excellence, along with other partners. The goal of the working group is to understand scientific data security concerns and provide guidance on ensuring data trustworthiness.

The Data Science Education and Workforce Working Group

Data literacy is essential for navigating the current deluge of data in almost every scientific discipline and business sector. The United States lacks a trained workforce to meet both current and emerging demands, making data science a critical element of educating students for the 21st century. The Data Science Education and Workforce Working Group is focused on a plan to collect and provide practical resources and models of engagement for broadening and deepening the available pool of data-literate talent.

Keeping Data Science Broad Webinar Series

Having the skills to understand and make sense of data can provide a sense of power; conversely, communities without these skills can be left with a sense of powerlessness. The economic and social consequences of the Data Divide severely limit the opportunities of those who are unable to take advantage of the data revolution. The goal of this series is to gather community input on pathways for keeping data science broadly inclusive as a discipline. Input is welcome from data science programs, traditional or alternative, in any region of the nation, and from a range of institution types, including minority-serving institutions, community colleges, liberal arts colleges, tribal colleges, universities, and industry partners.

Data Sharing & Cyberinfrastructure Working Group

The quantity of data available for research continues to grow exponentially. Investments are being made in academia, industry, government and not-for-profits to efficiently connect computers, laboratories, data, and people for research and application. Too many of these efforts occur in isolation, creating missed opportunities for collaboration and potential gaps in tools and capabilities to accelerate innovation. The Data Sharing and Cyberinfrastructure Working Group is a collaboration across all four of the Big Data Innovation Hubs, created to share research, provide linkages between research silos, and welcome new entrants in the national cyberinfrastructure (CI) space.

Harnessing the Data Revolution (HDR) PI Meeting and Coordination Activity

This is the inaugural PI meeting for the National Science Foundation’s Harnessing the Data Revolution (HDR) Big Idea. HDR is a visionary, national-scale activity to enable new modes of data-driven discovery, allowing fundamentally new questions to be asked and answered in science and engineering frontiers, generating new knowledge and understanding, and accelerating discovery and innovation.


Launched by the National Science Foundation (NSF) in 2015, the Big Data Innovation Hubs engage communities, share resources, and build partnerships that harness the data revolution to address societal and scientific challenges.

Metro/Urban Data Science

Precision Medicine

Natural Resources & Hazards 

Big Data Technology


Advanced Materials and Manufacturing

Digital Agriculture

Smart, Connected, and Resilient Communities

Water Quality

Big Data in Health

Health and Disparities

Smart Cities and Communities

Advanced Materials and Manufacturing

Environment and Coastal Hazards

Social Cybersecurity


Education + Data Literacy

Urban to Rural Communities

Responsible Data Science: Security + Privacy Ethics
