NREL: Sharing Data in Virtual Hubs Streamlines Energy Materials Research, Spurs Collaboration
High-quality research is an exercise in collaboration. And with remote work increasingly prevalent at leading research organizations like U.S. Department of Energy (DOE) national laboratories, virtual data-sharing and collaboration has become a crucial pillar for conducting science in the 21st century.
Nowhere is that truer than in energy materials research and development (R&D), according to a paper published in the International Journal of Advanced Computer Science and Applications. A modular and robust platform known as a “Data Hub”—developed to support DOE’s Energy Materials Network (EMN)—streamlines access to data, tools, and expertise across a range of materials science fields.
“The EMN’s cloud-based Data Hubs support open, reproducible, and on-demand data science, machine learning, and visualization,” said Robert R. White, National Renewable Energy Laboratory (NREL) scientist and corresponding author of the article. “We have seen several important benefits to this model, chiefly that research teams can more easily collaborate, regardless of where people and resources are housed.”
Authored by researchers at NREL and Pacific Northwest National Laboratory, the paper outlines the specifics on implementing, operating, and collaborating using EMN’s Data Hub platform.
A Supporting Framework for Diverse Materials Research
Explore the EMN Data Hubs
ChemCatBio
DuraMAT
ElectroCat
H-Mat
HydroGEN
HyMARC
LightMat
Initiated in 2016 and now representing seven EMN consortia, the EMN Data Hubs function as customizable virtual laboratories to accelerate research breakthroughs on even the toughest materials challenges in the energy sector. They accomplish this by providing a common infrastructure that teams can use to share and leverage resources—saving time and money in the process.
As a result, Data Hubs advance the core mission of the EMN. By improving data-sharing, integrating analytical tools, and securing access to resources even amid lab closures, Data Hubs better integrate all phases of materials R&D, from discovery to scale-up.
“The Data Hub platform is a tremendous cross cutting effort from researchers, developers, and operations teams at NREL in collaboration with other labs throughout these consortia,” explained NREL software engineer Nicholas Wunder. “I am excited to see what we create as we continue to build out these capabilities.”
As a lead and co-lead for multiple EMN consortia, including HydroGEN, DuraMAT, ChemCatBio, HyMARC, and ElectroCat, NREL has orchestrated much of the development of the EMN Data Hubs project. NREL researchers directly developed six of the seven Data Hubs, and they continue to add custom features to meet specific needs, such as visualization tools, sample tracking capabilities, security authorization methods, and more.
Such work is possible because of support from the DOE Office of Energy Efficiency and Renewable Energy’s Bioenergy Technologies Office, Hydrogen and Fuel Cell Technologies Office, and Solar Energy Technologies Office.
To date, seven DOE consortia have adopted the Data Hub platform to support a spectrum of materials research—from advanced water splitting materials to durable photovoltaic modules and more. All said, Data Hubs support 792 researchers, coordinate 200 projects, and archive and distribute over 49,000 data resources.
Improving the Virtual Laboratory Model
During its five-year tenure, the EMN has grown substantially, enabling materials researchers to share data and resources using this “virtual laboratory” model.
For example, the DuraMAT consortium is using the platform to advance knowledge on improving the design of photovoltaic modules and model accelerated testing processes. HydroGEN consortium modeling supported by the Data Hub could allow researchers to accelerate the selection of durable materials for use in hydrogen electrolyzers. Not least, ChemCatBio researchers are building tools and databases specific to catalysis research, such as the Catalyst Property Database.
“We built Data Hubs using a module framework, so the platform has been successfully customized to the needs of individual consortia in the EMN,” White said. “We think others can find similar success.”
The paper describes planned future updates that will improve Data Hubs across the EMN. These include:
Improved Data Hub operability and user experience
Improved metadata service that will reduce workload for developers
Expanded capabilities, such as integrated machine learning to analyze data in the cloud.
“The success we’ve seen so far is just the tip of the iceberg in terms of the potential of EMN Data Hubs,” White said. “We look forward to seeing how groups and users can wield this framework for different contexts and research.”