The Open Catalyst 2020 (OC20) Dataset and Community Challenges
Zachary Ulissi a
a Carnegie Mellon University, Department of Materials Science and Engineering, Pittsburgh, 0, United States
Proceedings of International Conference on Electrocatalysis for Energy Applications and Sustainable Chemicals (EcoCat)
Online, Spain, 2020 November 23rd - 25th
Organizers: Ward van der Stam, Marta Costa Figueiredo, Sixto Gimenez Julia, Núria López and Bastian Mei
Invited Speaker, Zachary Ulissi, presentation 018
Publication date: 6th November 2020

The Open Catalyst Project aims to develop new ML methods and models to accelerate the catalyst simulation process for renewable energy technologies and improve our ability to predict activity/selectivity across catalyst composition. To achieve that in the short term we need participation from the ML community in solving key challenges in catalysis. One path to interaction is the development of grand challenge datasets that are representative of common challenges in catalysis, large enough to excite the ML community, and large enough to take advantage of and encourage advances in deep learning models. Similar datasets have had a large impact in small molecule drug discovery, organic photovoltaics, and inorganic crystal structure prediction. We present the first open dataset from this effort on thermochemical intermediates across stable multi-metallic and p-block doped surfaces. This dataset includes full-accuracy DFT calculations across 53 elements and their binary/ternary materials, various low-index facets. Adsorbates span 56 common reaction intermediates with relevance to carbon, oxygen, and nitrogen thermal and electrochemical reactions. Off-equilibrium structures are also generated and included to aid in machine learning force field design and fitting. Collectively, this dataset represents the largest systematic dataset that bridges organic and inorganic chemistry and will enable a new generation of catalyst structure/property relationships. Fixed train/test splits that represent common chemical challenges and an open challenge website will be discussed to encourage competition and buy-in from the ML community.

The Open Catalyst Project is a joint collaboration between Facebook AI Research (FAIR) and Carnegie Mellon University.

© FUNDACIO DE LA COMUNITAT VALENCIANA SCITO
We use our own and third party cookies for analysing and measuring usage of our website to improve our services. If you continue browsing, we consider accepting its use. You can check our Cookies Policy in which you will also find how to configure your web browser for the use of cookies. More info