Active learning streamlines development of high performance catalysts for higher alcohol synthesis

Overview and scope of the active learning framework

We devised an active learning approach by integrating data-driven algorithms with experimental workflows, which continuously learns from existing and newly-generated data from iterative experimental cycles, to explore and identify FeCoCuZr compositions and reaction conditions optimizing catalyst performance metric(s) of interest (Fig. 1)^36,37,46. The core of the data-driven model combines Gaussian process (GP) and Bayesian optimization (BO) algorithms, along with human decision-making in order to accomplish single or multi-objective tasks^46,47.

**Fig. 1: Scheme of active learning workflow to develop FeCoCuZr catalysts.**

To showcase the feasibility of this approach to HAS, the study was systematically conducted in three distinct phases by progressively increasing the model complexity. In Phase 1, the catalyst composition was varied with the objective of maximizing STY_HA at fixed reaction conditions. In Phase 2, the dimensionality of the problem was increased by concurrently exploring the catalyst compositions and reaction conditions to maximize STY_HA. This approach was subsequently extended towards multi-objective capabilities by simultaneously maximizing STY_HA while minimizing combined selectivity to carbon dioxide and methane (\({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\)) in Phase 3. Iterative cycles comprising of six experiments were conducted during each phase, until the target performance metric(s) were achieved or reached saturation.

Phase 1: Optimal catalyst formulations for productivity

The first phase aimed to explore the suitability of the active learning framework in identifying optimal FeCoCuZr formulations for maximizing STY_HA under fixed reaction conditions, specifically the H₂:CO ratio (H₂:CO), reaction temperature (T), pressure (P), and the gas hourly space velocity (GHSV). This strategy enabled the exploration of a space containing >175,000 unique compositional possibilities, known as the chemical space^34,36 (Supplementary Note 1), and helped understand the sensitivity of higher alcohols productivity to catalyst composition in this family of materials. Without a priori composition and performance data for FeCoCuZr formulations, the initial model training was performed using 31 data points on the FeCoZr, FeCuZr, and CuCoZr catalysts recently reported by our group⁴⁵, denoted as seed experiments for Phase 1 (Supplementary Note 2, Supplementary Table 1). Reaction conditions were fixed at H₂:CO = 2.0, T = 533 K, P = 50 bar, and GHSV = 24,000 cm³ h⁻¹ g_cat⁻¹ across cycles to match those in the seed dataset (Supplementary Note 2).

In each cycle, the GP-BO model was trained using the molar content values of the four elements (Fe, Co, Cu, Zr) and the corresponding STY_HA of all catalysts in the dataset. Subsequently, we evaluated the expected improvement (EI) and predictive variance (PV) acquisition functions separately under specific constraints to generate candidate compositions (see Methods section, “Gaussian process and Bayesian optimization”, Supplementary Table 2). Six suitable catalysts were manually selected for experimentation by balancing the number of recommendations from EI, which searches for compositions maximizing the STY_HA objective (i.e., exploitation), and from PV, which seeks potential candidates in the unexplored chemical space (i.e., exploration)^23,48. Here, it is important to acknowledge the key role of human decision-making in providing a judgement and selection from the suggested compositions, which allowed us to supervise and fine-tune the implementation of active learning at this early stage. The experimentally evaluated performance together with measured compositions of these six catalysts were added to the dataset to re-train the model for the next cycle.

Five iterative cycles were performed (30 catalysts, see Supplementary Tables 3-7), which mapped the chemical space and identified regions with high STY_HA (Fig. 2a). Progressive improvements in STY_HA across cycles were achieved (Fig. 2b, Supplementary Tables 3-7), with the Fe₆₉Co₁₂Cu₁₀Zr₉ catalyst in Cycle 3 attaining the highest STY_HA = 0.39 g_HA h⁻¹ g_cat⁻¹, a 1.2-fold improvement over the Fe₇₉Co₁₀Zr₁₁ seed benchmark (STY_HA = 0.32 g_HA h⁻¹ g_cat⁻¹). This performance was retained for at least 100 h on stream with no visible sign of deactivation (Supplementary Fig. 1). Similar compositions and performances obtained by the best catalysts in Cycles 4 and 5 confirmed the convergence of results.

**Fig. 2: Exploration of catalyst compositions to maximize *STY*_HA at fixed reaction conditions.**

To reveal compositional trends driving performance, we used the k-means clustering algorithm^21,34. This allowed us to identify catalysts that had high STY_HA and S_HA – both being key performance metrics, and enabled informed decision making regarding experiment selection in subsequent phases. Four distinct clusters were observed (Fig. 2c, d). Catalysts in the Zr-rich and equimolar clusters exhibit low STY_HA, likely due to low contents and suboptimal ratios of the active metals, respectively. Fe stands out as a key active metal, as maximum STY_HA up to 0.39 g_HA h⁻¹ g_cat⁻¹ was attained by Fe-rich catalysts, while those in the Fe-Co rich cluster exhibit the highest S_HA up to 17%. Notably, Zr content converged towards 10% in the highest performing Fe-rich catalysts, mimicking that obtained over the seed catalysts⁴⁵. Irrespective of compositional differences, product distributions of best performing catalysts in each cycle largely resembled each other, with S_HA = 14 ± 2%, \({S}_{{{{{{\rm{CO}}}}}}_{2}}\) = 13 ± 5%, \({S}_{{{{{{\rm{CH}}}}}}_{4}}\) = 25 ± 5%, and \(({S}_{{{{{{\rm{CH-}}}}}}}+{S}_{{{{{{\rm{CH=}}}}}}})\) = 48 ± 4% (Supplementary Fig. 2, Supplementary Table 8).

The superior STY_HA of the Fe₆₉Co₁₂Cu₁₀Zr₉ optimal catalyst compared to the Fe₇₉Co₁₀Zr₁₁ seed benchmark stemmed from an increase in X_CO (45% vs. 36%), since they showed similar S_HA ∼ 13%. We thus conducted characterization studies to identify key features behind the activity gain. STEM-EDX maps of both catalysts (Fig. 3a, b, Supplementary Figs. 3, 4) identified small domains of ZrO₂ in intimate contact with larger active metal nanoparticles. These defect-rich ZrO₂ domains, as indicated by the presence of Zr^δ+ contributions in the Zr 3d region of XPS spectra (Fig. 3c), were previously identified to enhance surface iron and cobalt carbide formation in line with the C 1s signal of XPS spectra (Fig. 3d)⁴⁹. Carbides are known active phases in CO hydrogenation, and their interface with partially reduced iron oxide species is thought to enhance non-dissociative CO activation and higher alcohol formation in iron-rich catalysts^45,49,50. Notably, elemental distributions of Fe and Co suggest that these two metals are inter-dispersed and not segregated in both catalysts whether as calcined or after use (Fig. 3a, b, Supplementary Figs. 3, 4), in line with their tendency to form intermetallic alloys or mixed oxides. Initially dispersed Cu in the fresh Fe₆₉Co₁₂Cu₁₀Zr₉ catalyst agglomerated into distinct 10–20 nm-sized agglomerates in contact with Fe-Co nanoparticles following reduction and reaction (Fig. 3b, Supplementary Figs. 3, 4), accompanied by a decrease in surface area (Supplementary Table 9) and confirmed by XRD patterns (Supplementary Fig. 5). Such architectural features have also been observed for similar FeCoCu materials prepared by a sol-gel method without Zr present in the composition⁵¹.

**Fig. 3: Characterization of the best performing catalyst in Phase 1 and the reference seed catalyst.**

The presence of copper enhances the surface reducibility of Fe₆₉Co₁₂Cu₁₀Zr₉ as demonstrated by H₂-TPR (Fig. 3e), where oxidic Cu is readily reduced in the presence of H₂ to the metallic state, which in turn enhances hydrogen splitting and spillover to neighboring Fe-Co oxide domains⁵¹. Temperature-programmed H₂-D₂ exchange experiments also showed a 40 K decrease in the exchange temperature for Fe₆₉Co₁₂Cu₁₀Zr₉ (Fig. 3f, g) compared to Fe₇₉Co₁₀Zr₁₁, suggesting that Cu nanoparticles improve hydrogen activation and thus play a similar role as in other Cu-catalyzed reactions such as methanol or olefin synthesis from CO_x^51,52. As such, the increase in X_CO, formation rates of each alcohol (Supplementary Table 10) and therefore STY_HA attainable over iron-rich FeCoCuZr catalysts originates from its enhanced H₂ activation ability from copper, while retaining the characteristics imparted by well-mixed, carbide-rich Fe-Co phases promoted by dispersed and defective ZrO₂. Under reaction conditions, *CH_x, CH₃O*, and CH₃CH₂O* intermediates were detected by in situ DRIFTS (Supplementary Fig. 6), in line with the expected mechanisms of *CH_x coupling and non-dissociative *CO insertion for HAS over m-FTS catalysts.

It is worth noting that the model formulation did not include any explicit chemical information guiding the iterative cycle. Nevertheless, it was able to provide performance predictions with high accuracy. The overall performance was influenced by the balance between EI and PV functions used in the GP-BO algorithm^23,24,48, as the latter exhibits higher uncertainty leading to lower accuracy, and vice versa for the former. A total of 13 and 17 catalysts were evaluated based on recommendations from the EI and PV acquisition functions, respectively (Supplementary Fig. 7, Supplementary Table 11), progressively improving model performance from an initial coefficient of determination R² = 0.36 in Cycle 1 to R² = 0.84 by the final cycle. This improvement resulted from the expansion of available data generated during the active learning cycles and an increased number of experimental evaluations guided by the exploitation function probing regions of high performance. Accurate predictions on catalyst performance could be made owing to the standardized synthesis method that ensured consistency in structural properties.

Phase 2: Optimal catalyst formulations and reaction conditions for productivity

While basic reactivity patterns and the relevance of operating conditions are well known for HAS, there are no universally applicable set of optimal conditions as the influence of each parameter is catalyst-specific. The second phase of this study tackles this by expanding the exploration space to include reaction conditions, including H₂:CO, T, and GHSV, defined as the parametric space (Supplementary Note 1). As Phase 1 did not include variation of reaction conditions, 20 additional experiments were performed to broaden the range of reaction conditions initially covered by the model, denoted as seed experiments for Phase 2 (Supplementary Note 3, Supplementary Table 12). Phase 2 was initiated by training the GP-BO algorithm using these 50 data points under compositional and parametric constraints based on knowledge from previous experiments and the literature (Supplementary Note 4, Supplementary Table 13).

Reaction conditions were found to exert a significant impact on STY_HA, as Fe₆₁Co₂₀Cu₉Zr₁₀ identified as the best performer in Cycle 1 was already able to exceed 0.5 g_HA h⁻¹ g_cat⁻¹ at H₂:CO = 1.8, T = 552 K, P = 50 bar, and GHSV = 32,550 cm³ h⁻¹ g_cat⁻¹, almost 1.5-fold higher compared to the seed data used in Phase 1. Over the next two active learning cycles, STY_HA reached ~0.7 g_HA h⁻¹ g_cat⁻¹, nearly doubling productivity compared to the maximum achieved in Phase 1 (Fig. 4a, Supplementary Tables 14–16). We noticed that by the end of Cycle 3, the optimizer was locally constrained at the GHSV upper bound of 50,000 cm³ h⁻¹ g_cat⁻¹. As it is typically observed in literature that an increase in GHSV concurrently increases STY_HA owing to higher reactant flows despite a slight reduction in X_CO^5,7,9, the upper bound was set to 100,000 cm³ h⁻¹ g_cat⁻¹ in Cycle 4 to observe the behavior of the model. This adjustment led to the GP-BO extrapolating to previously unexplored GHSV values, suggesting catalytic systems that provided higher STY_HA of up to 0.9 g_HA h⁻¹ g_cat⁻¹. By Cycle 5, our framework recommended the highly active Fe₆₅Co₁₉Cu₅Zr₁₁ catalyst that attained STY_HA = 1.1 g_HA h⁻¹ g_cat⁻¹ at operating conditions of H₂:CO = 2.2, T = 551 K, P = 50 bar, and GHSV = 90,000 cm³ h⁻¹ g_cat⁻¹, marking a significant 3.5-fold increase from the original Phase 1 seed benchmark (Fig. 4a, Supplementary Table 18). The stability of this catalyst was evaluated in a 150-h catalytic run (Fig. 4b), where X_CO ≥ 40% and STY_HA ≥ 1 g_HA h⁻¹ g_cat⁻¹ were maintained throughout. Phase 2 concluded upon the completion of Cycle 6, during which catalytic systems yielding approximately 1 g_HA h⁻¹ g_cat⁻¹ were achieved once more, suggesting repeatability of results as well as model saturation (Fig. 4a, Supplementary Table 19).

**Fig. 4: Identification of catalyst compositions and reaction conditions maximizing *STY*_HA.**

Benchmarking FeCoCuZr catalytic systems from Phases 1 and 2 with literature-reported catalysts across various families such as Rh-based, Mo-based, and m-FTS-based revealed notable differences. The literature-reported catalysts (a total of 125 catalysts were examined) exhibited an average STY_HA ≈ 0.1 g_HA h⁻¹ g_cat⁻¹ with top performers in the 90^th percentile reaching 0.18 g_HA h⁻¹ g_cat⁻¹. Conversely, the best performing FeCoCuZr catalysts in the different cycles of Phases 1 and 2 had an average STY_HA of approximately 0.6 g_HA h⁻¹ g_cat⁻¹ highlighting significantly enhanced productivity compared to literature-reported counterparts for direct hydrocarbon synthesis from syngas for direct HAS from syngas^6,8 (Fig. 4c). For comparative analysis, selectivity to higher alcohols exhibited a less responsive nature than productivity in this family. Regardless of the diverse set of compositions and reaction conditions investigated, S_HA = 11 ± 2% was the mean of the most active systems in Phase 2 (Supplementary Fig. 8, Supplementary Table 20), similar to the observation made in Phase 1 and pointing to an intrinsic feature of the FeCoCuZr family.

The impact of varying reaction conditions on performance in Phase 2 could be visualized by mapping catalyst compositions and their respective STY_HA attained in four clusters dictated by GHSV and T ranges (Supplementary Fig. 9). The highest-performing catalytic systems share similar compositions to the iron-rich catalysts discovered to be optimal in Phase 1 (Fig. 2c, d), hinting that they retain the same catalytic features previously determined to boost activity. The model recommendations of maximizing GHSV, T = 550–570 K, and moderate H₂:CO ∼ 2 are also in line with established heuristics for syngas-based HAS (Supplementary Note 4). The accuracy remained high with R² = 0.78 in Phase 2, comparable to that in Phase 1. Progressive improvements in accuracy were evident in Phase 2, with the mean absolute percentage error (MAPE) between predicted and measured STY_HA for each cycle in Phase 2 decreasing from 33% in Cycle 1 to 7.6% in Cycle 6 (Supplementary Table 21). Considering predicted and measured STY_HA across both phases resulted in an overall performance accuracy of R² = 0.91 (Fig. 4d) with a low root mean squared error (RMSE) of 0.09 g h⁻¹ g_cat⁻¹.

Phase 3: Maximized productivity and minimized selectivity to by-products

The third phase of this study aimed to apply active learning to search for catalytic systems that could meet multiple performance criteria, better reflecting the real-world demands on catalysts. Given the modest S_HA across all catalysts developed in Phase 1 and 2, we focused on selectivities towards carbon dioxide \(({S}_{{{{{{{\rm{CO}}}}}}}_{2}})\) and methane \(({S}_{{{{{{{\rm{CH}}}}}}}_{4}})\), considered as the least valuable products in HAS^5,9. The 86 data points in Phase 1 and 2 exhibited \({S}_{{{{{{{\rm{CO}}}}}}}_{2}}\)= 16 ± 6% and \({S}_{{{{{{{\rm{CH}}}}}}}_{4}}\) = 25 ± 4%, highlighting the significance of the water-gas shift (WGS) and CO methanation reactions, especially at conditions favoring high X_CO and therefore STY_HA. (Supplementary Note 4, Supplementary Fig. 10). A plot of STY_HA vs. \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) suggested an intrinsic trade-off in the form of a Pareto front (Fig. 5a), in which the improvement of one metric would likely be at the expense of the other^53,54.

**Fig. 5: Uncovering Pareto-optimal catalysts and performance drivers.**

This scenario was explored in Phase 3 by varying the catalyst compositions and reaction conditions simultaneously to maximize STY_HA and minimize \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\). For this purpose, the GP-BO algorithm was trained with data from Phases 1 and 2 with STY_HA and \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) as target metrics, using the Expected Hypervolume Improvement (EHVI) acquisition function, which guides the optimization process to recommend catalyst composition and reaction conditions that are likely to lead to better trade-offs among conflicting objectives^23,35.

During Cycle 1, a significant discrepancy between predicted and measured values of STY_HA and \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) was observed (Supplementary Fig. 11, Supplementary Table 26) with none of the catalysts evaluated near the Pareto barrier (Fig. 5a, Supplementary Table 22). Upon entering Cycle 2, two of the six catalysts evaluated were situated on the Pareto front (Fig. 5a, Supplementary Table 23), while the model exhibited enhanced prediction accuracy. The highest performing system notably attained STY_HA = 1.04 g_HA h⁻¹ g_cat⁻¹, only 5% lower than the maximum attained in Phase 2 but with a drastically reduced \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) = 40% vs. 46%. By Cycle 3, model recommendations improved significantly as five of the six catalysts evaluated lie directly on the Pareto frontier without crossing it (Fig. 5a, Supplementary Table 24) attaining \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) = 34 ± 2% and STY_HA = 0.65 ± 0.05 g_HA h⁻¹ g_cat⁻¹. Herein, while the productivity remained ca. two times higher than average literature values, we highlight the selectivity of undesired CO₂ and CH₄ was minimized by around 10% (Fig. 5b), in comparison to some of the catalysts developed in Phase 2, suggesting an optimal trade-off between STY_HA and \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\). Notably, we identified optimal systems along the Pareto frontier, suggesting an intrinsic limitation of this family of HAS catalysts to achieve \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) < 30% without compromising STY_HA. However, within this constraint, our strategy eventually uncovered five Pareto-optimal catalytic systems which are otherwise non-intuitive and not easily accessed by human experts⁵⁵, thereby underscoring its versatility and significance.

Performance drivers and data-informed guidelines

We sought to elucidate the main performance drivers among the set of input features impacting STY_HA and \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\). However, an inherent challenge with most ML algorithms, including the GP regressor used in this study, lies in the complexity of deciphering the internal rationale behind predictions—rendering them black-box in nature. To address this challenge and make the model interpretable, we utilized the agnostic ML explainer, SHapley Additive exPlanations (SHAP)⁵⁶. This methodology facilitates the extraction of interpretable insights from the GP algorithm through the computation of feature-importance scores^57,58. Akin to sensitivity analysis, SHAP determines the individual or combined contributions of features to the model’s prediction, enabling catalysis practitioners to quantify the relative importance of different features affecting performance, that can be corroborated with existing knowledge or lead to testable hypotheses.

The overall influence of each feature was expressed by normalized SHAP values, revealing that reaction conditions and catalyst compositions contributed to ca. 60% and 40%, respectively, to the model predictions for both targeted metrics in Phases 2 and 3 (Fig. 5c). In the case of STY_HA, GHSV and T emerged as the two most important parameters, in alignment with earlier intuition in Phase 2 as well as findings in the literature for similar C₁ transformations^36,57. Fe content ranked as the most prominent compositional input, in line with the high productivities only attained by iron-rich catalysts, as highlighted in the discussion of Phase 1 and given the claimed role of Fe phases in C-C coupling⁶. For \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\)_, T was identified as the most significant variable, followed by Fe and Co contents. The role of T in dictating selectivity patterns could be ascribed to the coexistence of competing reaction networks, each with different temperature-dependent kinetic and thermodynamic barriers including HAS, WGS, or methanation. This highlights the importance of optimizing T to fine-tune the selectivity towards higher alcohols or by-products. The Fe-Co-rich cluster determined in Phase 1 (Fig. 2c) catered most favorably to HA selectivity, with Fe-Co surface carbides previously identified as a key feature for selective higher alcohol production from catalyst characterization⁴⁵.

Despite demonstrating the efficacy of active learning in uncovering catalytic systems that enhance multiple performance metrics for HAS, it is essential to acknowledge its scope and limitations in its current form. The lack of electronic or structural descriptors as inputs to the model and its inability to optimize performance metrics which are intrinsically unresponsive to screened variables, such as S_HA in this work, can be mentioned (see Supplementary Note 5 for extended discussion). Nonetheless, in the course of this study, three categories of catalytic systems within the FeCoCuZr family emerged exhibiting distinct performance characteristics, namely high STY_HA and \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) (STY_HA = 0.97 ± 0.08 g_HA h⁻¹ g_cat⁻¹, \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) = 44 ± 2%, S_HA = 10 ± 1%), low STY_HA and \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) (STY_HA = 0.25 ± 0.07 g_HA h⁻¹ g_cat⁻¹, \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) = 31 ± 3%, S_HA = 14 ± 2%), and Pareto-optimal catalysts (STY_HA = 0.63 ± 0.06 g_HA h⁻¹ g_cat⁻¹, \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) = 34 ± 2%, S_HA = 14.6 ± 0.3%) (Fig. 6). Each category favors unique catalyst compositions and reaction conditions; for instance, systems displaying high STY_HA and \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) are characterized by high molar Fe content, H₂:CO, and GHSV values, whereas the low STY_HA and \({S}_{{{{{{{\rm{CO}}}}}}}_{2}+{{{{{\rm{CH}}}}}}_{4}}\) counterparts are favored at equimolar Fe-Co contents, low H₂:CO, and milder T. The Pareto-optimal catalysts feature a combination of the aforementioned traits, recommending high Fe contents and operation at high GHSV and mild T. These quantitative guidelines, especially those relating to operating conditions, align with literature findings and are likely not dependent on specific catalyst formulations and could be relevant to HAS catalysts in general^6,9. However, the exact compositional guidelines provided herein apply to the FeCoCuZr catalysts investigated in this study and would arguably not be directly relevant in designing HAS systems with different active metals, promoters, and architectures prepared by different synthesis methods. Importantly, this methodology based on data analysis can be extended to other potential HAS systems or even other multi-product chemical transformations, provided sufficient experimental data is available. Other users are thus recommended to formulate specific guidelines for their application during the active learning process. Overall, in the absence of quantifiable techno-economic data and community consensus on practically relevant productivities or selectivities for HAS, this approach provides guidelines for optimizing key metrics, serving as valuable assets to catalysis practitioners and industry stakeholders to accelerate research efforts by assisting in the selection of appropriate catalytic systems and experiments, ultimately saving time and resources.

**Fig. 6: Establishment of guidelines for developing performance-specific catalysts.**

Active learning and sustainable laboratories

While the possible chemical and parametric space of the FeCoCuZr systems is in the order of billion combinations, practical and real-world studies on multicomponent catalysts range between hundreds to thousands screening experiments^35,36,38. By employing active learning we mapped the vast space of FeCoCuZr catalysts to a cumulative 104 experiments, across Phases 1–3 to meet the desired performance objectives, confirming the growing body of literature that claim active learning accelerates experimental efforts^34,35,36. This has a profound impact on the environmental and economic sustainability of catalyst development programs that has not been explored.

To this context, assuming this study as representative of a catalyst development endeavor, we assessed the degree to which active learning could impact both sustainability pillars in laboratories (see scope in Supplementary Note 6). Our analysis suggests average reductions exceeding 90% in carbon footprint and costs on benchmarking with traditional campaigns (Fig. 7, Supplementary Tables 27–29). We also observe a very mild dependency of this result with regional variations across the globe affecting, for example, composition of the energy mix or laboratory operational expenditure (Supplementary Fig. 12). Thus, by reducing consumption of chemicals and energy, and optimizing resource utilization, active learning remarkably fosters sustainable catalysis laboratories.