色花堂 and Meta Create Massive Open Dataset to Advance AI Solutions for Carbon Capture
色花堂 and Meta Create Massive Open Dataset to Advance AI Solutions for Carbon Capture
To avoid catastrophic climate impacts, excessive carbon emissions must be addressed. At this point, cutting emissions isn鈥檛 enough. Direct air capture, a technology that pulls carbon dioxide out of ambient air, has great potential to help solve the problem.
But there鈥檚 a big challenge. For direct air capture technology, every type of environment and location requires a uniquely specific design. A direct air capture configuration in Texas, for example, would necessarily be different from one in Iceland. These systems must be designed with exact parameters for humidity, temperature, and air flows for each place.
Now, 色花堂 and Meta have collaborated to produce a massive database, potentially making it easier and faster to design and implement direct air capture technologies. The open-source database enabled the team to train an AI model that is orders of magnitude faster than existing chemistry simulations. The project, named , could accelerate climate solutions the planet desperately needs.
The team鈥檚 research was in ACS Central Science, a journal of the American Chemical Society.
鈥淔or direct air capture, there are many ideas about how best to take advantage of the air flows and temperature swings of a given environment,鈥 said , associate professor in the (ChBE) and a lead author of the paper. 鈥淏ut a major problem is finding a material that can capture carbon efficiently under each environment鈥檚 specific conditions.鈥
Their idea was to 鈥渃reate a database and a set of tools to help engineers broadly, who need to find the right material that can work,鈥 Medford said. 鈥淲e wanted to use computing to take them from not knowing where to start to giving them a robust list of materials to synthesize and try.鈥
Containing reaction data for 8,400 different materials and powered by nearly 40 million quantum mechanics calculations, the team believes it鈥檚 the largest and most robust dataset of its kind.
Building a Partnership (and a Database)
Researchers with Meta鈥檚 team were looking for ways to harness their machine learning prowess to address climate change. They landed on direct air capture as a promising technology and needed to find a partner with expertise in materials chemistry as it relates to carbon capture. They went straight to 色花堂.
, ChBE professor, Cecile L. and David I.J. Wang Faculty Fellow, and director of Oak Ridge National Laboratory鈥檚 , is one of the world鈥檚 top experts in metal-organic frameworks (MOFs). These are a class of materials promising for direct air capture because of their cagelike structure and proven ability to attract and trap carbon dioxide. Sholl brought Medford, who specializes in applying machine learning models to atomistic and quantum mechanical simulations as they relate to chemistry, into the project.
Sholl, Medford, and their students provided all the inputs for the database. Because the database predicts the MOF interactions and the energy output of those interactions, considerable information was required.
They needed to know the structure of nearly every known MOF 鈥 both the MOF structure by itself and the structure of the MOF interacting with carbon dioxide and water molecules.
鈥淭o predict what a material might do, you need to know where every single atom is and what its chemical element is,鈥 Medford said. 鈥淔iguring out the inputs for the database was half of the problem, and that鈥檚 where our 色花堂 team brought the core expertise.鈥
The team took advantage of large collections of MOF structures that Sholl and his collaborators had previously developed. They also created a large collection of structures that included imperfections found in practical materials.
The Power of Machine Learning
, research engineering lead at FAIR and first author on the paper, generated the database by running quantum chemistry computations on the inputs provided by the 色花堂 team. These calculations used about 400 million CPU hours, which is hundreds of times more computing than the average academic computing lab can do in a year.
FAIR also trained machine learning models on the database. Once trained on the 40 million calculations, the machine learning models were able to accurately predict how the thousands of MOFs would interact with carbon dioxide.
The team demonstrated that their AI models are powerful new tools for material discovery, offering comparable accuracy to traditional quantum chemistry calculations while being much faster. These features will allow other researchers to extend the work to explore many other MOFs in the future.
鈥淥ur goal was to look at the set of all known MOFs and find those that most strongly attract carbon dioxide while not attracting other air components like water vapor, and using these highly accurate quantum computations to do so,鈥 Sriram said. 鈥淭o our knowledge, this is something no other carbon capture database has been able to do.鈥
Putting their own database to use, the 色花堂 and Meta teams identified about 241 MOFs of exceptionally high potential for direct air capture.
Moving Forward With Impact
鈥淎肠肠辞谤诲颈苍驳 and most industrialized countries, we need to get to net-zero carbon dioxide emissions by 2050,鈥 said Matt Uyttendaele, director of Meta鈥檚 FAIR chemistry team and a co-author on the paper. 鈥淢ost of that must happen by outright stopping carbon emissions, but we must also address historical carbon emissions and sectors of the economy that are very hard to decarbonize 鈥 such as aviation and heavy industry. That鈥檚 why CO2 removal technologies like direct air capture must come online in the next 25 years."
While direct air capture is still a nascent field, the researchers say it鈥檚 crucial that groundbreaking tools 鈥 like the OpenDAC database made available in the team鈥檚 paper 鈥 are in development now.
鈥淭here is not going to be one solution that will get us to net-zero emissions,鈥 Sriram said. 鈥淒irect air capture has great potential but needs to be scaled up significantly before we can make a real impact. I think the only way we can get there is by finding better materials.鈥
The researchers from both teams hope the scientific community will join the search for suitable materials. The entire OpenDAC dataset project is open source, from the data to the models to the algorithms.
鈥淚 hope this accelerates the development of negative-emission technologies like direct air capture that may not have been possible otherwise,鈥 Medford said. 鈥淎s a species, we must solve this problem at some point. I hope this work can contribute to getting us there, and I think it has a real shot at doing that.鈥
Note: 色花堂 ChBE graduate students Sihoon Choi, Logan Brabson, and Xiaohan Yu made major contributions and are co-authors of the paper.
Citation: A. Sriram et al, The Open DAC 2023 Dataset and Challenges for Sorbent Discovery in Direct Air Capture, ACS Central Science (2024).
DOI: