Ex) Article Title, Author, Keywords
Current Optics
and Photonics
Ex) Article Title, Author, Keywords
Curr. Opt. Photon. 2024; 8(6): 545-561
Published online December 25, 2024 https://doi.org/10.3807/COPP.2024.8.6.545
Copyright © Optical Society of Korea.
Geon-Tae Park1, Rira Kang2, Byunghong Lee2, Sun-Kyung Kim1
Corresponding author: *sunkim@khu.ac.kr, ORCID 0000-0002-0715-0066
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Multilayers composed of two or more materials enable the regulation of transmission, reflection, and absorption spectra across one or multiple bands. While analytic formulas based on well-established interference conditions, such as those employed in single-, double-, and triple-layer antireflective coatings and distributed Bragg reflectors, have provided suitable solutions for traditional optical coatings, they are limited in achieving the intricate spectral characteristics required by multifunctional optical coatings. To overcome this limitation, a variety of machine learning-based design algorithms have been rigorously studied. Among these, binary optimization has proven particularly effective for designing multilayer optical coatings. This approach transforms a given multilayer into a binary vector with multiple bits, where each bit represents one of the constituent materials, and quickly identifies an optimal figure-of-merit by analyzing the interactions among the elements of the binary vector. In this review article, we elucidate the principles of binary optimization and explore its applications in the design of multilayers for antireflective coatings for high-numerical-aperture lenses, transparent radiative coolers for energy-saving windows, and bandpass filters for thermophotovoltaics. Furthermore, we address the limitations, challenges, and perspectives of machine learning-based optical design to guide directions for future research in this field.
Keywords: Binary optimization, Machine learning, Multilayer, Optical coating, Optical design
OCIS codes: (200.0200) Optics in computing; (220.0220) Optical design and fabrication; (310.0310) Thin films; (310.1210) Antireflection coatings; (310.6845) Thin film devices and applications
Multilayer optical coatings are constructed by stacking dielectric and/or metallic layers with subwavelength thicknesses [1]. These multilayers are prevalent in everyday applications due to their capacity to precisely regulate light transmission, reflection, and absorption over specific or broad wavelengths [2]. This ability facilitates the creation of antireflective and antiglare coatings [3–6], mirrors, and spectral filters [7–10], which are integral to the functionality of solar cells [11–15], windows [16–20], lasers [21–23], and display technologies [24–27]. Analytical formulas such as single-, double-, and triple-layer configurations are often employed to design antireflective coatings (ARCs) [28, 29]. These formulas are used by selecting the appropriate configuration based on the availability of materials, specifically their refractive indices, along with the desired bandwidth and economic considerations. Similarly, distributed Bragg reflectors (DBRs) are designed using an analytical approach [30]; They consist of multiple layers of alternating materials with high and low refractive indices, where each repeated pair has a quarter-wave thickness. These examples illustrate the application of well-defined destructive or constructive interference for managing partial reflective waves.
Although analytical formulas provide exact solutions for straightforward optical designs, they are constrained to perform multispectral engineering or to generate stepwise spectral profiles, both of which necessitate a greater complexity of variables. Multispectral engineering entails the independent regulation of light transmission, reflection, and absorption across multiple spectral bands or within the wavelength-incident angle domain, thus presenting a formidable challenge from the perspective of classical approaches. For example, transparent radiative coolers (TRCs) designed for energy-saving windows must concurrently achieve high transmittance in the visible (400–700 nm), high reflectance in the near-infrared (700–2,500 nm), and high emissivity (characterized by absorption according to Kirchhoff’s law of thermal radiation) in the mid-infrared (8–14 µm) [31]. Likewise, stepwise spectral profiles are crucial for applications such as cooling windows [32], photon recycling in incandescent lighting [33], and near-infrared cutoff filters for complementary metal-oxide-semiconductor image sensors [34]. Although DBRs provide complete reflection within a photonic bandgap, they exhibit residual reflectance outside the bandgap, which undermines the creation of a stepwise spectrum. Moreover, challenges such as the lack of materials with desired refractive indices, in addition to considerations of fabrication tolerance and cost-effectiveness (e.g., the number of interfaces and total thickness), complicate the application of existing analytical formulas.
To address these limitations, various inverse design methods based on machine learning have been developed and employed, including needle optimization [35–38], memetic algorithm [39–41], genetic algorithm [42–47], neural networks [48–50], and reinforcement learning [51, 52]. It is well known that binary optimization is particularly effective for designing multilayer optical coatings [53]. This approach converts a given multilayer into a binary vector composed of multiple bits, where each bit corresponds to one of the constituent materials with a required thickness, typically one-tenth of an operational wavelength. It examines the interactions among the elements of the binary vector and swiftly identifies an optimal (or minimal) figure-of-merit (FoM) using sampling methods such as brute force (BF), simulated annealing (SA), and quantum annealing (QA). BF evaluates every possible solution within the parameter space, ensuring the identification of the global optimum in a given machine learning model. However, it is computationally expensive and impractical for large-scale problems with extensive parameter spaces. SA offers a probabilistic technique that explores the solution space by allowing occasional uphill moves, thus circumventing local minima. While more computationally efficient than BF, it may still require significant computational resources and careful tuning of its parameters to balance exploration and exploitation. QA promises faster convergence for certain problem classes, particularly those with intricate energy landscapes. However, its practical application is limited by current technological constraints (i.e., the availability of quantum bits). The selection of a sampling method depends on the specific problem characteristics, including the size and complexity of the parameter space, computational resources, and the required tolerance of the solution. In principle, binary optimization can reveal multiple multilayer configurations that achieve similar levels of FoM within an acceptable tolerance, thereby facilitating the selection of a more economically viable solution. For instance, minimizing the number of interfaces, the variety of materials employed, and the overall thickness can serve as guidelines for determining the optimal endpoint among multiple multilayer solutions.
In this review article, we elucidate the principles of binary optimization and investigate its applications in designing multilayers for ARCs [54, 55], TRCs for energy-saving windows [56], and bandpass filters for thermophotovoltaics (TPVs) [57, 58]. In addition, we discuss the limitations, challenges, and perspectives of machine learning-based optical design, thereby indicating potential directions for future research in this field.
Recently, the use of machine learning to design optical structures has gained immense popularity. As the demand for complex optical specifications such as broad spectrum, wide incident angles, and multispectral objectives, continues to rise, traditional analytical solutions became inadequate. Addressing these complex problems often requires non-intuitive designs, which can be successfully discovered using machine learning due to its ability to explore extensive parametric spaces within a limited computational timeframe [35–58]. For a machine learning-based approach to be effective, it is crucial to select an optimization method that strikes a balance between accuracy and computational cost. In this regard, binary-optimization-based multilayer design offers a robust optimization framework by compressing the dimensionality of the system into a simplified binary representation [53].
The binary-optimization-based multilayer design involves four main steps [Fig. 1(a)].
The first step is to encode the multilayer into a binary vector. This process transforms the continuous system into finite N dimensions representing both refractive indices and thicknesses as binary digits. This discretization is essential for mapping the physical problem into a quadratic unconstrained binary optimization (QUBO) problem, a common and versatile approach for addressing various optical systems and combinatorial optimizations.
The second step involves generating the initial training dataset, represented as a binary vector. An exact solver such as the transfer matrix method (TMM) yields the FoM, providing an accurate initial dataset for training the machine learning model. The FoM can represent various optical properties, including transmittance, reflectance, or emissivity, and can account for complex conditions such as wide-angle performance or multispectral requirements. This versatility highlights a key advantage of the binary-optimization-based method, allowing users to define any objective function by specifying the FoM equation.
The third step involves creating a surrogate function, a predictive model that approximates the FoM without requiring detailed internal analysis of the interactions of elements in a multilayer, which significantly reduces computational demands. There are various models capable of optimizing the QUBO problem, such as factorization machines (FM) and Bayesian optimization. The choice of an appropriate machine learning model depends on the complexity of the optimization problem. For example, FM is advantageous for sparse initial data, making it suitable for smaller parametric spaces.
The final step is searching for a quasi-optimal structure among the configurations evaluated by the surrogate function. The selection of a suitable sampling method is critical to efficiently identify the optimal point. BF evaluates all possible FoM values but requires extensive computational resources, while methods such as SA and QA use probabilistic approaches to approximate the quasi-optimal solution. Probabilistic methods are especially effective for large datasets and limited computational resources, as they enable significant advances during the search process and accelerate optimization. After identifying the quasi-optimal point, the corresponding binary vector and FoM value are fed back into the training dataset, and the optimization process is iterated until convergence on the desired point.
This methodology finds application in ARCs for high-numerical-aperture lenses, TRCs for energy-saving windows, and bandpass filters for TPVs [Fig. 1(b)] [54–58]. In the semiconductor fabrication and inspection industries, high-numerical-aperture lenses are in demand to ensure superior resolution [59, 60]. Binary-optimization-based multilayer design identifies ARC structures with minimal reflectance over a wide range of incident angles, which is essential for high-numerical-aperture lenses operating at a target wavelength (λ0) [54, 55]. Another application is TRCs for energy-saving windows, driven by the need to effectively manage the thermal properties of buildings and vehicles. TRCs aim to provide high transmittance in the visible spectrum, high reflectance in the ultraviolet (UV) and near-infrared (NIR) regions, and high emissivity in the mid-infrared range (8–14 µm). Given the multispectral and multi-objective requirements, machine learning-based optimization is advantageous for designing TRCs [56]. The third application is bandpass filters for TPVs. Bandpass filters act as selective emitters in TPV systems and significantly enhance photon absorption efficiency in photovoltaic (PV) cells. To reject non-usable photons, a bandpass filter must achieve unit emissivity within the bandwidth corresponding to maximum external quantum efficiency (EQE) of the PV cell. Hence, a machine learning approach allows for customization of bandpass filters to meet the target emissivity spectrum and enhance photon absorption for various PV cell types [57, 58].
In the semiconductor inspection and lithography sector, high-numerical-aperture lenses for short-wavelength lasers such as ArF (193 nm) and KrF (248 nm) are essential for achieving superior resolution [61–66]. It remains challenging to design ARCs with angle-independent antireflective performance for deep-ultraviolet (DUV) wavelengths (190–350 nm). A major issue is the limited selection of transparent DUV materials, which are primarily fluoride and oxide-based dielectrics such as CaF2, MgF2, LaF3, AlF3, and Al2O3 [67–71]. These dielectrics offer a narrow range of refractive indices, which restricts design flexibility. In addition, the requirement for ultrathin films with minimal interfaces to mitigate fabrication-induced stress adds another layer of complexity to the design of DUV ARCs. Furthermore, high-numerical-aperture lenses require ARCs that perform effectively over a broad range of incident angles, but it is difficult to achieve omnidirectional antireflection in the absence of well-established analytical formulas.
To address these challenges, Kim et al. [54] proposed a binary-optimization-based multilayer design approach for DUV ARCs with FM as a machine learning model [72] and BF as a sampling method. Figure 2(a) illustrates a schematic with the FoM equation and the building blocks for producing ARCs. The multilayers are designed to minimize the average reflectance of both s-polarized and p-polarized light across wide incident angles (0° to 45°) at a wavelength of 193 nm. For transparent DUV materials, MgF2 and LaF3 are encoded as 0 and 1, respectively. Each bit represents a thickness of 10 nm, and CaF2 is chosen as a substrate material. The structure depicted in Fig. 2(a) can be represented as a binary vector {0 1 1 0 0 0 1 1} from the top layer, with 8 bits (Nbit) and four physical layers (Nlayer). The design process involves two stages: Discrete optimization (DO) using FM and continuous optimization (CO) using the interior-point (IP) method, as shown in Fig. 2(b). Given the constant bit thickness of 10 nm, FM explores the parametric space in discrete, larger steps. After this initial optimization, the thicknesses of the continuous layers are further refined using the IP method with a resolution of 1 nm. This finer adjustment moves the solution closer to the quasi-optimum point, reducing the risk of being trapped in local minima. Figure 2(c) presents the simulation results for both one-step (DO only) and two-step (DO followed by CO) optimizations as Nbit increases. The top panel in Fig. 2(c) displays the trend of the log-scaled FoM values. The red and blue lines correspond to the one-step and two-step optimizations, respectively. As Nbit increases, the FoM, which is an average angular reflectance, decreases in both cases. Specifically, at Nbit values of 3 and 10, the one-step optimization results in FoMs of 0.025 and 0.003, while the two-step optimization achieves slightly lower values of 0.024 and 0.002. The FoM values remain marginally changed for Nbit from 10 to 19, decrease slightly to 0.0009 at Nbit = 20, and reach a minimum value of 0.0008 at Nbit = 21. The two-step optimization consistently exhibits a slight improvement over the one-step optimization for all Nbit values. These findings indicate that DO with FM effectively explores the vast parametric space and provides a strong starting point close to the optimal solution.
The bottom panel of Fig. 2(c) illustrates the structure of the optimized ARCs. The topmost layer is consistently identified as MgF2, a low refractive index material chosen to minimize reflectance at the air interface. As Nbit increases, the configurations of the multilayers become increasingly intricate, even though the number of physical layers does not directly scale with Nbit. For example, a multilayer optimized through two-step optimization at Nbit(Nlayer) = 15(4) yields a simple four-layer configuration. This emphasizes the need to balance achieving a low FoM with maintaining fabrication fidelity, aiming for designs that provide similar FoM within an acceptable tolerance without unnecessarily increasing the number of layers. Figure 2(d) shows the reflectance of multilayers optimized at Nbit values of 4, 7, 10, 15, and 19 as a function of the incident angle (0°–45°) and the wavelength (180–300 nm). Notably, structures with Nbit values of 10, 15, and 19 demonstrate superior antireflective performance across a wide range of incident angles and wavelengths (180–200 nm), indicating robustness in fabrication tolerance regarding thickness variations.
After the optimization process, experimental validation is performed. Figures 2(e) and 2(f) present the measured reflectance values at incident angles of 0° and 45° for fabricated samples with Nbit(Nlayer) values of 4(1), 7(2), and 10(3), along with the simulated results, which cover the entire range from Nbit = 3 to 21. For both incident angles, reflectance significantly decreases as Nbit increased, which is consistent with the simulation trends shown in Fig. 2(c). At normal incidence, the sample with Nbit(Nlayer) = 10(3) demonstrates the lowest reflectance, approximately 0.35%. For the 45° incident angle, the same sample shows a reflectance of approximately 0.8%. The insets in Figs. 2(e) and 2(f) highlight the good agreement between the experimental and simulated spectra.
Another study by Park et al. [55] focused on designing DUV ARCs for a KrF laser wavelength (248 nm) using FM, and additionally provided analysis for comparisons to analytically designed ARCs. The initial simulation setup involves a binary combination of materials consisting of a low-refractive-index (low-n) material, AlF3 (n = 1.36), paired with one of high-refractive-index (high-n) materials: MgF2 (n = 1.40), LaF3 (n = 1.55), or Al2O3 (n = 1.71) [Fig. 3(a)]. The low-n material is encoded as “0” and the high-n material as “1.” Each bit represents a thickness of 10 nm, and CaF2 is used as a substrate material. A pair with the highest refractive index contrast, AlF3/Al2O3, shows the most effective antireflective performance. To highlight the performance of optimized AlF3/Al2O3 multilayers, they are assessed against ARCs designed using analytical formulas based on quarter-wave thicknesses [Fig. 3(b)]. The reflectance of the optimized multilayer with Nbit = 18 remains below 0.005 for incident angles between 0° and 45°. The observation of minimum reflectance around an incident angle of 23° for all configurations indicates that the machine learning aims to minimize the average reflectance across the considered angular range, rather than focusing solely on specific incident angles. Simulation results indicate that each optimized AlF3/Al2O3 multilayer outperforms the analytical double-layer coating across all Nbit values. Notably, the AlF3/Al2O3 multilayer at Nbit = 18 achieves a FoM of 0.00094, which is almost identical to the FoM of 0.00096 from the analytical single-layer coating; the latter requires an extremely low refractive index (n = 1.21), which is challenging to achieve in practice.
Phasor analysis is conducted to gain a deeper understanding of the underlying physical mechanisms of the designed ARCs [73–75]. Figure 3(c) provides a schematic representation of how partially reflected waves can be interpreted as phasors. The complex reflection and transmission amplitudes are obtained using Fresnel coefficients, where rlm and tlm represent reflection and transmission coefficients between layers l and m, respectively. The phase shifts are calculated from the optical path differences, which relies on the transmitted angle θj, layer thickness dj, and wave vector k0. Ej represents the phasor from a first-order reflection wave at the jth interface, ignoring higher-order reflections involving multiple reflections. The squared magnitude of the sum of all partial phasors (Etot) is then used to estimate the reflectance of the multilayer, and antireflective performance is evaluated by assessing how close Etot converges to the origin. Phasor diagrams for the AlF3/MgF2, AlF3/LaF3, and AlF3/Al2O3 multilayers at Nbit = 16 under normal incidence are obtained [Fig. 3(d)]. The final phasor for the AlF3/MgF2 multilayer diverges significantly from the origin with a relatively high Etot value of 0.013, indicating its suboptimal antireflective performance. In contrast, the AlF3/Al2O3 multilayer exhibits a much lower Etot value of 0.0013, which is an order of magnitude smaller, demonstrating its superior antireflective performance. The improvement can be attributed to the larger refractive index contrast between the binary materials, which increases the magnitudes of individual phasors and facilitates the convergence of Etot toward the origin. Conversely, when the magnitudes of individual phasors are too small, the overall phasor sum remains distant from the origin, with E1 being the dominant phasor.
TRCs can be used as window coatings to reduce cooling energy requirements for buildings and vehicles, thus contributing significantly to mitigating climate change [31]. Currently, the energy used for cooling accounts for approximately 15% of total global energy consumption [76, 77]. Passive radiative coolers, which operate without external energy by dissipating heat into the cold outer space through the atmospheric window, have attracted much interest [78−86]. TRCs are designed to minimize optical heating by reflecting UV and NIR light (comprising ~50% of solar irradiance) while allowing high transmittance in the visible spectrum. In the mid-infrared range, which corresponds to the atmospheric window, TRCs require near-unity emissivity to maximize radiative cooling efficiency. Additionally, another recent study has focused on tuning visible transmittance to create customizable windows that adapt to user needs and different lighting conditions [87]. The main challenges in TRC design include the complexity of multispectral requirements and the limited availability of materials with negligible absorption across the solar spectrum. These types of multispectral, multi-objective design problems are beyond the capability of analytical methods and instead require machine learning-based optimization approaches.
Kim et al. [56] proposed a TRC design using FM as the machine learning model, with QA selected as the sampling method. QA, a quantum computing technique, is well-suited for optimizing all potential input combinations since qubits can evaluate the objective function instantaneously to find the minimum point and bypassing local minima [88−90]. Compared to exhaustive enumeration, which guarantees finding the global minimum but can be computationally expensive, the QA-assisted optimization is significantly faster, particularly for structures with high-level complexity. The TRC layer is positioned between the top polydimethylsiloxane (PDMS) layer and a glass substrate, which do not participate in the optimization process [Fig. 4(a)]. The TRC design employs quaternary bases (SiO2 = 00, Si3N₄ = 01, Al2O3 = 10, TiO2 = 11) defined by combinations of binary digits, enabling the use of materials more than two within the binary-optimization framework. The structure is divided into Nbit layers with a total thickness of ttot = 1,200 nm, and each layer has a uniform thickness of ttot/Nbit. The optimization is conducted for Nbit values of 8, 10, 12, 16, 20, and 24 to identify the optimal structure. As observed in the ARC case, TRC multilayers with larger Nbit values exhibit a tailored spectrum closer to the ideal spectrum due to the increased degrees of freedom. Figure 4(b) compares the best-performing structure at Nbit = 24 with the ideal spectrum. The ideal transmittance spectrum is unity in the visible region and zero elsewhere, and is represented by the red line. For the transmittance spectrum of the optimal multilayer, the deviation from the ideal spectrum is notable for wavelengths greater than 1,400 nm. This discrepancy arises because the intensity of sunlight in the spectral region is much lower than in others, leading the machine learning model to prioritize matching the ideal spectrum in the visible. For experimental validation, the Nbit = 24 multilayer (10 physical layers) is fabricated using thin-film vacuum deposition and tested in a thermally insulated chamber with the TRC on the top side next to a reference glass panel [Fig. 4(c)]. The sample is tested in Phoenix, CA, USA for 37 hours, and shows that the chamber with the TRC maintains a temperature up to 6.1 ℃ lower than the chamber with the glass panel [Fig. 4(d)]. In both chambers, the measured temperature is higher than the ambient temperature due to the visible transparency of the TRC, which allows solar heating. The annual energy savings for a standard office with the two windows in the USA is estimated using EnergyPlusTM software (National renewable energy laboratory, CO, USA). The average annual energy savings in the surveyed USA cities is 50 MJ·m−2 when the TRC is used as a window coating [Fig. 4(e)]. In cities with hot and dry climates, such as Phoenix, the TRC can potentially save 86.3 MJ·m−2 per year. Similar calculations are performed for major cities worldwide [Fig. 4(f)]. All locations can benefit from the use of TRC as window coatings to reduce cooling energy.
Light-matter interactions in nanophotonic structures at infrared wavelengths have attracted increasing interest due to their wide-ranging potential applications [91−94]. In conventional solar PV systems, only photons with energy exceeding the bandgap of the PV cell are absorbed to generate photocurrent, whereas other energy losses occur due to thermalization and non-absorbed photons. Unlike traditional solar PV systems, TPV systems use engineered selective emitters to focus on narrowband thermal emission [95]. This approach potentially elevates efficiencies up to the Shockley-Queisser threshold by optimizing photon absorption matched with the spectral response of PV cells [96]. Among the various structures proposed for selective emitters, one-dimensional multilayer Tamm plasmon polaritons (TPPs) have demonstrated advantages in terms of scalable manufacturing and adjustable performance [97]. TPPs are surface waves formed at the boundary between a metallic mirror and a DBR with a zero in-plane wavevector, enabling excitation by incident light of which the in-plane wavevector is less than that of free space [98]. This simple configuration negates the necessity for complex nanostructures to compensate for wavevector mismatch.
Hu et al. [57] conducted research on this design approach for TPV systems focusing on optimizing selective emitters to achieve targeted performance. In a typical TPV system [Fig. 5(a)], a high-efficiency absorber captures heat, which is then transferred efficiently to the emitter, maximizing its operating temperature. The emitter is mechanically stabilized by an optically thick refractory material such as tungsten (W), which also acts as a heat spreader for a uniformly high temperature over the emitter. A 10 nm Al2O3 layer is positioned between the tungsten and the Tamm structure and acts as a spacer. The schematic primarily shows a metal-side emitter, where the metal layer directly faces the PV cell. Alternatively, a DBR-side emitter configuration can be used, where the DBR is oriented toward the PV cell. The DBR structure, which is the focus of the optimization process, consists of alternating layers of TiO2 and SiO2, denoted as 0 and 1, with respective bit thicknesses of t0 = 30 nm for TiO2 and t1 = 51 nm for SiO2. The Tamm emitter directs tailored thermal emission onto the PV cell, where photons with energy greater than the bandgap are absorbed to produce photocurrent, thus generating electricity. The PV cell is kept at a uniform temperature of 300 K using a heat sink.
To design an effective narrow-band selective emitter, both metal-side and DBR-side emitters are optimized using a multilayer design approach based on binary optimization. The Monte Carlo tree search (MCTS) algorithm, combined with Bayesian optimization, is employed to identify the optimal DBR configuration by using a four-step process of selection, expansion, simulation, and backpropagation [99−101]. Each binary sequence of 0s and 1s, from the root to the leaf node, represents a specific configuration of TiO2 and SiO2 layers in the DBR. Once a DBR configuration is determined, it is integrated into either the metal-side or DBR-side Tamm emitter, after which the corresponding emissivity spectrum, power density, and system efficiency are calculated. The system efficiency (η) is defined as the ratio of useful power, calculated as absorbed power minus non-radiative losses, to the total incident power. The output from the PV cell model is fed back into the MCTS algorithm for the evaluation and improvement of the current DBR configuration. The objective function of the machine learning-based MCTS algorithm is to maximize both the power density and the system efficiency for metal-side and DBR-side Tamm emitters. The ideal bandwidth for unit emissivity is determined by analyzing the trends of power density and system efficiency as a function of bandwidth (∆λ) [Fig. 5(b)]. As ∆λ increases, power density (P) rises steadily until it reaches a saturation point, whereas system efficiency η peaks at ∆λ = 0.696 µm. This is because not all absorbed photons generate photocurrent; Excess photon energy is instead converted to heat, which lowers the PV cell’s power conversion efficiency. Therefore, the MCTS algorithm targets an emissivity spectrum with unit emissivity over a bandwidth of 0.696 µm, which yields the peak system efficiency.
To compare the performance of the optimized metal-side and DBR-side emitters, the incident and absorbed power spectra are presented in Fig. 5(c), along with the blackbody irradiance at the same temperature. It is observed that the metal-side emitter exhibits higher intensity near the bandgap wavelength (λbg) compared to the DBR-side emitter. The corresponding absorbed power spectra shows absorption exclusively below λbg, as only photons with energy above this threshold can be effectively absorbed by the PV cell. Additionally, the metal-side emitter demonstrates more efficient power absorption than the DBR-side emitter. Figure 5(d) provides a comparison between the optimized emitter with Nbit = 15 for both metal-side and DBR-side configurations and periodically designed emitters with the same Nbit, which serve as a competing model for assessing the machine learning-based designs. When the input voltage increases, both optimized metal-side and DBR-side emitters show superior system efficiency compared to the periodic emitters. Among the optimized emitters, the metal-side emitter exhibits a higher system efficiency, achieving system efficiency η of 0.41 at 0 V. In terms of the generated photocurrent, both MCTS-optimized emitters outperform the periodic emitters, and the metal-side emitter shows superior performance by achieving a photocurrent of 7.9 A·cm−2 at 0 V. These results indicate that the periodic DBR structures are limited in performance, whereas the aperiodic DBR structures optimized by MCTS demonstrate the effectiveness of this method in designing selective emitters.
Another study on designing narrow-bandpass filters was conducted by Sakurai et al. [58] using Bayesian optimization. Traditionally, thermal emission control has been demonstrated using Fabry–Perot resonators with DBRs [102]. However, these analytical, periodic structures are merely a small subset of the general multilayer configurations. As a result, aperiodic and non-intuitive structures can further enhance performance due to their ability to explore a much larger parametric space compared to conventional designs. Figure 6(a) illustrates the schematic of the design method, where candidate materials Ge, Si, and SiO2 are encoded as 1, 2, and 3, respectively, with a tungsten substrate. Each bit thickness is uniformly set, and the number of bits (Nbit) is fixed at 18. The total thickness ttot varies from 3.6 to 4.0 µm in increments of 0.02 µm. The objective function aims for unit emissivity at a target wavelength (λt) with a specific bandwidth (∆λ), while minimizing emissivity at other wavelengths within the range (λmin to λmax) to reduce heat loss, as shown in Fig. 6(b). Bayesian optimization is employed as the machine learning model to predict the surrogate function that searches for the design closest to the target spectrum. Initially, a filter design with λt = 6.0 µm, ∆λ = 4.0 nm, λmin = 4.0 µm, and λmax = 8.0 µm is optimized. The result is a multilayer containing only Ge and SiO2 with a total thickness of ttot = 3.80 µm. Two additional designs are optimized under similar conditions, targeting λt values of 5.0 µm and 7.0 µm. Based on the initial results, Ge and SiO2 are selected as the material bases for these subsequent optimizations. Figure 6(c) compares the simulated and measured emissivity spectra. The optimized emissivity spectra (dashed lines) for λt = 5.0 µm, 6.0 µm, and 7.0 µm exhibit sharp, high emissivity with Q-factors of 217, 273, and 233, respectively. The measured emissivity spectra of the fabricated samples are shown as solid lines, showing a peak wavelength shift of approximately 0.5 µm, with peak emissivity of 0.76, 0.83, and 0.61, and Q-factors of 132, 188, and 109, respectively. The discrepancy between the peak positions is attributed to fabrication errors, particularly deviations in deposited thickness compared to the simulated structures. Despite these discrepancies, the machine learning optimization approach demonstrates improved emissivity and Q-factor values compared to conventional narrow-bandpass filters, highlighting the advantages of aperiodic multilayers.
In summary, we review three practical applications where binary-optimization-based multilayer designs effectively achieve optical performance close to the target spectra. Binary optimization enables the design of complex multilayers that meet requirements that are unattainable with conventional analytical methods. Encoding the continuous system into a reduced binary space saves significant computational efforts without compromising accuracy. Additionally, unlike more resource-intensive platforms such as neural networks, the FM model used for QUBO problems reveals the underlying physics of aperiodic or non-intuitive multilayers. For instance, ARCs depend on the interaction between adjacent layers, where the reflectance is influenced by each layer’s refractive index and thickness. The FM model’s approach to predicting pairwise interactions successfully interprets these layer-to-layer dynamics to analyze overall antireflective performance. However, relying solely on pairwise interactions may limit the capacity of the model to represent complex systems. For more complex systems, the development of a higher-order machine learning model that considers interactions beyond simple pairwise relationships is necessary.
Figure 7(a) illustrates a schematic for classifying phasors by reflection order. The first-order reflection wave includes phasors resulting from a single reflection, while the third-order reflection wave includes those from three reflections. Note that only odd-numbered reflections are considered. Figures 7(b) and 7(c) display phasor diagrams for a DUV ARC and an IR camera filter. In the DUV ARC, which targets a single wavelength, a clear phasor convergence is observed as third-order reflection waves are added to first-order reflections. This indicates that the pairwise interaction of the FM model is well-suited to the design of the given system. In contrast, the IR camera filter, designed for a broader spectral range (8–13 µm), shows more dispersed phasors even with third-order reflections, suggesting the need for a higher-order machine learning model to interpret such complex interactions effectively.
Another perspective in binary-optimization-based multilayer design can involve enhancing fabrication fidelity. Defining the FoM equation solely based on optical properties does not account for practical fabrication constraints, such as achievable deposition thickness, potential deposition errors influenced by layer height, or refractive index shifts in ultrathin layers compared to bulk materials. A statistical approach to analyzing the tolerance of designed structures to thickness errors can help address potential fabrication discrepancies. In Fig. 8(a), we illustrate FoM distribution by randomly perturbing all layers within a specified range for the triple-layer DUV ARC design, with the original FoM indicated by a red line. By applying 1,000 random perturbations, we assess tolerance by measuring FoM deviation from its original value. This analysis can also be applied to individual layers, as shown in Fig. 8(b), where the black squares indicate mean FoM values, and error bars represent the maximum and minimum FoM values after perturbation. The incorporation of fabrication fidelity into the FoM allows for a multilayer design that remains robust under realistic fabrication conditions. In addition, cost optimization is feasible by applying penalties for layer increases, which encourages high-performing structures with fewer physical layers. In conclusion, binary-optimization-based multilayer design offers a versatile approach for creating high-performance optical structures, efficiently balancing computational time, performance, and fabrication requirements, all achievable with precise formulation of the FoM equation.
This work was supported by the Hyundai-NGV Program (Grant No. R232604.0003) and the National Research Foundation (NRF) of Korea through the Basic Science Research Program (Grant No. RS-2023-00207966).
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Data sharing is not applicable to this article since no new data were created or analyzed in this study.
Curr. Opt. Photon. 2024; 8(6): 545-561
Published online December 25, 2024 https://doi.org/10.3807/COPP.2024.8.6.545
Copyright © Optical Society of Korea.
Geon-Tae Park1, Rira Kang2, Byunghong Lee2, Sun-Kyung Kim1
1Department of Applied Physics, Kyung Hee University, Yongin 17104, Korea
2Energy Device Research Team, Advanced Vehicle Platform Division, Hyundai Motor Co., Uiwang 16082, Korea
Correspondence to:*sunkim@khu.ac.kr, ORCID 0000-0002-0715-0066
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Multilayers composed of two or more materials enable the regulation of transmission, reflection, and absorption spectra across one or multiple bands. While analytic formulas based on well-established interference conditions, such as those employed in single-, double-, and triple-layer antireflective coatings and distributed Bragg reflectors, have provided suitable solutions for traditional optical coatings, they are limited in achieving the intricate spectral characteristics required by multifunctional optical coatings. To overcome this limitation, a variety of machine learning-based design algorithms have been rigorously studied. Among these, binary optimization has proven particularly effective for designing multilayer optical coatings. This approach transforms a given multilayer into a binary vector with multiple bits, where each bit represents one of the constituent materials, and quickly identifies an optimal figure-of-merit by analyzing the interactions among the elements of the binary vector. In this review article, we elucidate the principles of binary optimization and explore its applications in the design of multilayers for antireflective coatings for high-numerical-aperture lenses, transparent radiative coolers for energy-saving windows, and bandpass filters for thermophotovoltaics. Furthermore, we address the limitations, challenges, and perspectives of machine learning-based optical design to guide directions for future research in this field.
Keywords: Binary optimization, Machine learning, Multilayer, Optical coating, Optical design
Multilayer optical coatings are constructed by stacking dielectric and/or metallic layers with subwavelength thicknesses [1]. These multilayers are prevalent in everyday applications due to their capacity to precisely regulate light transmission, reflection, and absorption over specific or broad wavelengths [2]. This ability facilitates the creation of antireflective and antiglare coatings [3–6], mirrors, and spectral filters [7–10], which are integral to the functionality of solar cells [11–15], windows [16–20], lasers [21–23], and display technologies [24–27]. Analytical formulas such as single-, double-, and triple-layer configurations are often employed to design antireflective coatings (ARCs) [28, 29]. These formulas are used by selecting the appropriate configuration based on the availability of materials, specifically their refractive indices, along with the desired bandwidth and economic considerations. Similarly, distributed Bragg reflectors (DBRs) are designed using an analytical approach [30]; They consist of multiple layers of alternating materials with high and low refractive indices, where each repeated pair has a quarter-wave thickness. These examples illustrate the application of well-defined destructive or constructive interference for managing partial reflective waves.
Although analytical formulas provide exact solutions for straightforward optical designs, they are constrained to perform multispectral engineering or to generate stepwise spectral profiles, both of which necessitate a greater complexity of variables. Multispectral engineering entails the independent regulation of light transmission, reflection, and absorption across multiple spectral bands or within the wavelength-incident angle domain, thus presenting a formidable challenge from the perspective of classical approaches. For example, transparent radiative coolers (TRCs) designed for energy-saving windows must concurrently achieve high transmittance in the visible (400–700 nm), high reflectance in the near-infrared (700–2,500 nm), and high emissivity (characterized by absorption according to Kirchhoff’s law of thermal radiation) in the mid-infrared (8–14 µm) [31]. Likewise, stepwise spectral profiles are crucial for applications such as cooling windows [32], photon recycling in incandescent lighting [33], and near-infrared cutoff filters for complementary metal-oxide-semiconductor image sensors [34]. Although DBRs provide complete reflection within a photonic bandgap, they exhibit residual reflectance outside the bandgap, which undermines the creation of a stepwise spectrum. Moreover, challenges such as the lack of materials with desired refractive indices, in addition to considerations of fabrication tolerance and cost-effectiveness (e.g., the number of interfaces and total thickness), complicate the application of existing analytical formulas.
To address these limitations, various inverse design methods based on machine learning have been developed and employed, including needle optimization [35–38], memetic algorithm [39–41], genetic algorithm [42–47], neural networks [48–50], and reinforcement learning [51, 52]. It is well known that binary optimization is particularly effective for designing multilayer optical coatings [53]. This approach converts a given multilayer into a binary vector composed of multiple bits, where each bit corresponds to one of the constituent materials with a required thickness, typically one-tenth of an operational wavelength. It examines the interactions among the elements of the binary vector and swiftly identifies an optimal (or minimal) figure-of-merit (FoM) using sampling methods such as brute force (BF), simulated annealing (SA), and quantum annealing (QA). BF evaluates every possible solution within the parameter space, ensuring the identification of the global optimum in a given machine learning model. However, it is computationally expensive and impractical for large-scale problems with extensive parameter spaces. SA offers a probabilistic technique that explores the solution space by allowing occasional uphill moves, thus circumventing local minima. While more computationally efficient than BF, it may still require significant computational resources and careful tuning of its parameters to balance exploration and exploitation. QA promises faster convergence for certain problem classes, particularly those with intricate energy landscapes. However, its practical application is limited by current technological constraints (i.e., the availability of quantum bits). The selection of a sampling method depends on the specific problem characteristics, including the size and complexity of the parameter space, computational resources, and the required tolerance of the solution. In principle, binary optimization can reveal multiple multilayer configurations that achieve similar levels of FoM within an acceptable tolerance, thereby facilitating the selection of a more economically viable solution. For instance, minimizing the number of interfaces, the variety of materials employed, and the overall thickness can serve as guidelines for determining the optimal endpoint among multiple multilayer solutions.
In this review article, we elucidate the principles of binary optimization and investigate its applications in designing multilayers for ARCs [54, 55], TRCs for energy-saving windows [56], and bandpass filters for thermophotovoltaics (TPVs) [57, 58]. In addition, we discuss the limitations, challenges, and perspectives of machine learning-based optical design, thereby indicating potential directions for future research in this field.
Recently, the use of machine learning to design optical structures has gained immense popularity. As the demand for complex optical specifications such as broad spectrum, wide incident angles, and multispectral objectives, continues to rise, traditional analytical solutions became inadequate. Addressing these complex problems often requires non-intuitive designs, which can be successfully discovered using machine learning due to its ability to explore extensive parametric spaces within a limited computational timeframe [35–58]. For a machine learning-based approach to be effective, it is crucial to select an optimization method that strikes a balance between accuracy and computational cost. In this regard, binary-optimization-based multilayer design offers a robust optimization framework by compressing the dimensionality of the system into a simplified binary representation [53].
The binary-optimization-based multilayer design involves four main steps [Fig. 1(a)].
The first step is to encode the multilayer into a binary vector. This process transforms the continuous system into finite N dimensions representing both refractive indices and thicknesses as binary digits. This discretization is essential for mapping the physical problem into a quadratic unconstrained binary optimization (QUBO) problem, a common and versatile approach for addressing various optical systems and combinatorial optimizations.
The second step involves generating the initial training dataset, represented as a binary vector. An exact solver such as the transfer matrix method (TMM) yields the FoM, providing an accurate initial dataset for training the machine learning model. The FoM can represent various optical properties, including transmittance, reflectance, or emissivity, and can account for complex conditions such as wide-angle performance or multispectral requirements. This versatility highlights a key advantage of the binary-optimization-based method, allowing users to define any objective function by specifying the FoM equation.
The third step involves creating a surrogate function, a predictive model that approximates the FoM without requiring detailed internal analysis of the interactions of elements in a multilayer, which significantly reduces computational demands. There are various models capable of optimizing the QUBO problem, such as factorization machines (FM) and Bayesian optimization. The choice of an appropriate machine learning model depends on the complexity of the optimization problem. For example, FM is advantageous for sparse initial data, making it suitable for smaller parametric spaces.
The final step is searching for a quasi-optimal structure among the configurations evaluated by the surrogate function. The selection of a suitable sampling method is critical to efficiently identify the optimal point. BF evaluates all possible FoM values but requires extensive computational resources, while methods such as SA and QA use probabilistic approaches to approximate the quasi-optimal solution. Probabilistic methods are especially effective for large datasets and limited computational resources, as they enable significant advances during the search process and accelerate optimization. After identifying the quasi-optimal point, the corresponding binary vector and FoM value are fed back into the training dataset, and the optimization process is iterated until convergence on the desired point.
This methodology finds application in ARCs for high-numerical-aperture lenses, TRCs for energy-saving windows, and bandpass filters for TPVs [Fig. 1(b)] [54–58]. In the semiconductor fabrication and inspection industries, high-numerical-aperture lenses are in demand to ensure superior resolution [59, 60]. Binary-optimization-based multilayer design identifies ARC structures with minimal reflectance over a wide range of incident angles, which is essential for high-numerical-aperture lenses operating at a target wavelength (λ0) [54, 55]. Another application is TRCs for energy-saving windows, driven by the need to effectively manage the thermal properties of buildings and vehicles. TRCs aim to provide high transmittance in the visible spectrum, high reflectance in the ultraviolet (UV) and near-infrared (NIR) regions, and high emissivity in the mid-infrared range (8–14 µm). Given the multispectral and multi-objective requirements, machine learning-based optimization is advantageous for designing TRCs [56]. The third application is bandpass filters for TPVs. Bandpass filters act as selective emitters in TPV systems and significantly enhance photon absorption efficiency in photovoltaic (PV) cells. To reject non-usable photons, a bandpass filter must achieve unit emissivity within the bandwidth corresponding to maximum external quantum efficiency (EQE) of the PV cell. Hence, a machine learning approach allows for customization of bandpass filters to meet the target emissivity spectrum and enhance photon absorption for various PV cell types [57, 58].
In the semiconductor inspection and lithography sector, high-numerical-aperture lenses for short-wavelength lasers such as ArF (193 nm) and KrF (248 nm) are essential for achieving superior resolution [61–66]. It remains challenging to design ARCs with angle-independent antireflective performance for deep-ultraviolet (DUV) wavelengths (190–350 nm). A major issue is the limited selection of transparent DUV materials, which are primarily fluoride and oxide-based dielectrics such as CaF2, MgF2, LaF3, AlF3, and Al2O3 [67–71]. These dielectrics offer a narrow range of refractive indices, which restricts design flexibility. In addition, the requirement for ultrathin films with minimal interfaces to mitigate fabrication-induced stress adds another layer of complexity to the design of DUV ARCs. Furthermore, high-numerical-aperture lenses require ARCs that perform effectively over a broad range of incident angles, but it is difficult to achieve omnidirectional antireflection in the absence of well-established analytical formulas.
To address these challenges, Kim et al. [54] proposed a binary-optimization-based multilayer design approach for DUV ARCs with FM as a machine learning model [72] and BF as a sampling method. Figure 2(a) illustrates a schematic with the FoM equation and the building blocks for producing ARCs. The multilayers are designed to minimize the average reflectance of both s-polarized and p-polarized light across wide incident angles (0° to 45°) at a wavelength of 193 nm. For transparent DUV materials, MgF2 and LaF3 are encoded as 0 and 1, respectively. Each bit represents a thickness of 10 nm, and CaF2 is chosen as a substrate material. The structure depicted in Fig. 2(a) can be represented as a binary vector {0 1 1 0 0 0 1 1} from the top layer, with 8 bits (Nbit) and four physical layers (Nlayer). The design process involves two stages: Discrete optimization (DO) using FM and continuous optimization (CO) using the interior-point (IP) method, as shown in Fig. 2(b). Given the constant bit thickness of 10 nm, FM explores the parametric space in discrete, larger steps. After this initial optimization, the thicknesses of the continuous layers are further refined using the IP method with a resolution of 1 nm. This finer adjustment moves the solution closer to the quasi-optimum point, reducing the risk of being trapped in local minima. Figure 2(c) presents the simulation results for both one-step (DO only) and two-step (DO followed by CO) optimizations as Nbit increases. The top panel in Fig. 2(c) displays the trend of the log-scaled FoM values. The red and blue lines correspond to the one-step and two-step optimizations, respectively. As Nbit increases, the FoM, which is an average angular reflectance, decreases in both cases. Specifically, at Nbit values of 3 and 10, the one-step optimization results in FoMs of 0.025 and 0.003, while the two-step optimization achieves slightly lower values of 0.024 and 0.002. The FoM values remain marginally changed for Nbit from 10 to 19, decrease slightly to 0.0009 at Nbit = 20, and reach a minimum value of 0.0008 at Nbit = 21. The two-step optimization consistently exhibits a slight improvement over the one-step optimization for all Nbit values. These findings indicate that DO with FM effectively explores the vast parametric space and provides a strong starting point close to the optimal solution.
The bottom panel of Fig. 2(c) illustrates the structure of the optimized ARCs. The topmost layer is consistently identified as MgF2, a low refractive index material chosen to minimize reflectance at the air interface. As Nbit increases, the configurations of the multilayers become increasingly intricate, even though the number of physical layers does not directly scale with Nbit. For example, a multilayer optimized through two-step optimization at Nbit(Nlayer) = 15(4) yields a simple four-layer configuration. This emphasizes the need to balance achieving a low FoM with maintaining fabrication fidelity, aiming for designs that provide similar FoM within an acceptable tolerance without unnecessarily increasing the number of layers. Figure 2(d) shows the reflectance of multilayers optimized at Nbit values of 4, 7, 10, 15, and 19 as a function of the incident angle (0°–45°) and the wavelength (180–300 nm). Notably, structures with Nbit values of 10, 15, and 19 demonstrate superior antireflective performance across a wide range of incident angles and wavelengths (180–200 nm), indicating robustness in fabrication tolerance regarding thickness variations.
After the optimization process, experimental validation is performed. Figures 2(e) and 2(f) present the measured reflectance values at incident angles of 0° and 45° for fabricated samples with Nbit(Nlayer) values of 4(1), 7(2), and 10(3), along with the simulated results, which cover the entire range from Nbit = 3 to 21. For both incident angles, reflectance significantly decreases as Nbit increased, which is consistent with the simulation trends shown in Fig. 2(c). At normal incidence, the sample with Nbit(Nlayer) = 10(3) demonstrates the lowest reflectance, approximately 0.35%. For the 45° incident angle, the same sample shows a reflectance of approximately 0.8%. The insets in Figs. 2(e) and 2(f) highlight the good agreement between the experimental and simulated spectra.
Another study by Park et al. [55] focused on designing DUV ARCs for a KrF laser wavelength (248 nm) using FM, and additionally provided analysis for comparisons to analytically designed ARCs. The initial simulation setup involves a binary combination of materials consisting of a low-refractive-index (low-n) material, AlF3 (n = 1.36), paired with one of high-refractive-index (high-n) materials: MgF2 (n = 1.40), LaF3 (n = 1.55), or Al2O3 (n = 1.71) [Fig. 3(a)]. The low-n material is encoded as “0” and the high-n material as “1.” Each bit represents a thickness of 10 nm, and CaF2 is used as a substrate material. A pair with the highest refractive index contrast, AlF3/Al2O3, shows the most effective antireflective performance. To highlight the performance of optimized AlF3/Al2O3 multilayers, they are assessed against ARCs designed using analytical formulas based on quarter-wave thicknesses [Fig. 3(b)]. The reflectance of the optimized multilayer with Nbit = 18 remains below 0.005 for incident angles between 0° and 45°. The observation of minimum reflectance around an incident angle of 23° for all configurations indicates that the machine learning aims to minimize the average reflectance across the considered angular range, rather than focusing solely on specific incident angles. Simulation results indicate that each optimized AlF3/Al2O3 multilayer outperforms the analytical double-layer coating across all Nbit values. Notably, the AlF3/Al2O3 multilayer at Nbit = 18 achieves a FoM of 0.00094, which is almost identical to the FoM of 0.00096 from the analytical single-layer coating; the latter requires an extremely low refractive index (n = 1.21), which is challenging to achieve in practice.
Phasor analysis is conducted to gain a deeper understanding of the underlying physical mechanisms of the designed ARCs [73–75]. Figure 3(c) provides a schematic representation of how partially reflected waves can be interpreted as phasors. The complex reflection and transmission amplitudes are obtained using Fresnel coefficients, where rlm and tlm represent reflection and transmission coefficients between layers l and m, respectively. The phase shifts are calculated from the optical path differences, which relies on the transmitted angle θj, layer thickness dj, and wave vector k0. Ej represents the phasor from a first-order reflection wave at the jth interface, ignoring higher-order reflections involving multiple reflections. The squared magnitude of the sum of all partial phasors (Etot) is then used to estimate the reflectance of the multilayer, and antireflective performance is evaluated by assessing how close Etot converges to the origin. Phasor diagrams for the AlF3/MgF2, AlF3/LaF3, and AlF3/Al2O3 multilayers at Nbit = 16 under normal incidence are obtained [Fig. 3(d)]. The final phasor for the AlF3/MgF2 multilayer diverges significantly from the origin with a relatively high Etot value of 0.013, indicating its suboptimal antireflective performance. In contrast, the AlF3/Al2O3 multilayer exhibits a much lower Etot value of 0.0013, which is an order of magnitude smaller, demonstrating its superior antireflective performance. The improvement can be attributed to the larger refractive index contrast between the binary materials, which increases the magnitudes of individual phasors and facilitates the convergence of Etot toward the origin. Conversely, when the magnitudes of individual phasors are too small, the overall phasor sum remains distant from the origin, with E1 being the dominant phasor.
TRCs can be used as window coatings to reduce cooling energy requirements for buildings and vehicles, thus contributing significantly to mitigating climate change [31]. Currently, the energy used for cooling accounts for approximately 15% of total global energy consumption [76, 77]. Passive radiative coolers, which operate without external energy by dissipating heat into the cold outer space through the atmospheric window, have attracted much interest [78−86]. TRCs are designed to minimize optical heating by reflecting UV and NIR light (comprising ~50% of solar irradiance) while allowing high transmittance in the visible spectrum. In the mid-infrared range, which corresponds to the atmospheric window, TRCs require near-unity emissivity to maximize radiative cooling efficiency. Additionally, another recent study has focused on tuning visible transmittance to create customizable windows that adapt to user needs and different lighting conditions [87]. The main challenges in TRC design include the complexity of multispectral requirements and the limited availability of materials with negligible absorption across the solar spectrum. These types of multispectral, multi-objective design problems are beyond the capability of analytical methods and instead require machine learning-based optimization approaches.
Kim et al. [56] proposed a TRC design using FM as the machine learning model, with QA selected as the sampling method. QA, a quantum computing technique, is well-suited for optimizing all potential input combinations since qubits can evaluate the objective function instantaneously to find the minimum point and bypassing local minima [88−90]. Compared to exhaustive enumeration, which guarantees finding the global minimum but can be computationally expensive, the QA-assisted optimization is significantly faster, particularly for structures with high-level complexity. The TRC layer is positioned between the top polydimethylsiloxane (PDMS) layer and a glass substrate, which do not participate in the optimization process [Fig. 4(a)]. The TRC design employs quaternary bases (SiO2 = 00, Si3N₄ = 01, Al2O3 = 10, TiO2 = 11) defined by combinations of binary digits, enabling the use of materials more than two within the binary-optimization framework. The structure is divided into Nbit layers with a total thickness of ttot = 1,200 nm, and each layer has a uniform thickness of ttot/Nbit. The optimization is conducted for Nbit values of 8, 10, 12, 16, 20, and 24 to identify the optimal structure. As observed in the ARC case, TRC multilayers with larger Nbit values exhibit a tailored spectrum closer to the ideal spectrum due to the increased degrees of freedom. Figure 4(b) compares the best-performing structure at Nbit = 24 with the ideal spectrum. The ideal transmittance spectrum is unity in the visible region and zero elsewhere, and is represented by the red line. For the transmittance spectrum of the optimal multilayer, the deviation from the ideal spectrum is notable for wavelengths greater than 1,400 nm. This discrepancy arises because the intensity of sunlight in the spectral region is much lower than in others, leading the machine learning model to prioritize matching the ideal spectrum in the visible. For experimental validation, the Nbit = 24 multilayer (10 physical layers) is fabricated using thin-film vacuum deposition and tested in a thermally insulated chamber with the TRC on the top side next to a reference glass panel [Fig. 4(c)]. The sample is tested in Phoenix, CA, USA for 37 hours, and shows that the chamber with the TRC maintains a temperature up to 6.1 ℃ lower than the chamber with the glass panel [Fig. 4(d)]. In both chambers, the measured temperature is higher than the ambient temperature due to the visible transparency of the TRC, which allows solar heating. The annual energy savings for a standard office with the two windows in the USA is estimated using EnergyPlusTM software (National renewable energy laboratory, CO, USA). The average annual energy savings in the surveyed USA cities is 50 MJ·m−2 when the TRC is used as a window coating [Fig. 4(e)]. In cities with hot and dry climates, such as Phoenix, the TRC can potentially save 86.3 MJ·m−2 per year. Similar calculations are performed for major cities worldwide [Fig. 4(f)]. All locations can benefit from the use of TRC as window coatings to reduce cooling energy.
Light-matter interactions in nanophotonic structures at infrared wavelengths have attracted increasing interest due to their wide-ranging potential applications [91−94]. In conventional solar PV systems, only photons with energy exceeding the bandgap of the PV cell are absorbed to generate photocurrent, whereas other energy losses occur due to thermalization and non-absorbed photons. Unlike traditional solar PV systems, TPV systems use engineered selective emitters to focus on narrowband thermal emission [95]. This approach potentially elevates efficiencies up to the Shockley-Queisser threshold by optimizing photon absorption matched with the spectral response of PV cells [96]. Among the various structures proposed for selective emitters, one-dimensional multilayer Tamm plasmon polaritons (TPPs) have demonstrated advantages in terms of scalable manufacturing and adjustable performance [97]. TPPs are surface waves formed at the boundary between a metallic mirror and a DBR with a zero in-plane wavevector, enabling excitation by incident light of which the in-plane wavevector is less than that of free space [98]. This simple configuration negates the necessity for complex nanostructures to compensate for wavevector mismatch.
Hu et al. [57] conducted research on this design approach for TPV systems focusing on optimizing selective emitters to achieve targeted performance. In a typical TPV system [Fig. 5(a)], a high-efficiency absorber captures heat, which is then transferred efficiently to the emitter, maximizing its operating temperature. The emitter is mechanically stabilized by an optically thick refractory material such as tungsten (W), which also acts as a heat spreader for a uniformly high temperature over the emitter. A 10 nm Al2O3 layer is positioned between the tungsten and the Tamm structure and acts as a spacer. The schematic primarily shows a metal-side emitter, where the metal layer directly faces the PV cell. Alternatively, a DBR-side emitter configuration can be used, where the DBR is oriented toward the PV cell. The DBR structure, which is the focus of the optimization process, consists of alternating layers of TiO2 and SiO2, denoted as 0 and 1, with respective bit thicknesses of t0 = 30 nm for TiO2 and t1 = 51 nm for SiO2. The Tamm emitter directs tailored thermal emission onto the PV cell, where photons with energy greater than the bandgap are absorbed to produce photocurrent, thus generating electricity. The PV cell is kept at a uniform temperature of 300 K using a heat sink.
To design an effective narrow-band selective emitter, both metal-side and DBR-side emitters are optimized using a multilayer design approach based on binary optimization. The Monte Carlo tree search (MCTS) algorithm, combined with Bayesian optimization, is employed to identify the optimal DBR configuration by using a four-step process of selection, expansion, simulation, and backpropagation [99−101]. Each binary sequence of 0s and 1s, from the root to the leaf node, represents a specific configuration of TiO2 and SiO2 layers in the DBR. Once a DBR configuration is determined, it is integrated into either the metal-side or DBR-side Tamm emitter, after which the corresponding emissivity spectrum, power density, and system efficiency are calculated. The system efficiency (η) is defined as the ratio of useful power, calculated as absorbed power minus non-radiative losses, to the total incident power. The output from the PV cell model is fed back into the MCTS algorithm for the evaluation and improvement of the current DBR configuration. The objective function of the machine learning-based MCTS algorithm is to maximize both the power density and the system efficiency for metal-side and DBR-side Tamm emitters. The ideal bandwidth for unit emissivity is determined by analyzing the trends of power density and system efficiency as a function of bandwidth (∆λ) [Fig. 5(b)]. As ∆λ increases, power density (P) rises steadily until it reaches a saturation point, whereas system efficiency η peaks at ∆λ = 0.696 µm. This is because not all absorbed photons generate photocurrent; Excess photon energy is instead converted to heat, which lowers the PV cell’s power conversion efficiency. Therefore, the MCTS algorithm targets an emissivity spectrum with unit emissivity over a bandwidth of 0.696 µm, which yields the peak system efficiency.
To compare the performance of the optimized metal-side and DBR-side emitters, the incident and absorbed power spectra are presented in Fig. 5(c), along with the blackbody irradiance at the same temperature. It is observed that the metal-side emitter exhibits higher intensity near the bandgap wavelength (λbg) compared to the DBR-side emitter. The corresponding absorbed power spectra shows absorption exclusively below λbg, as only photons with energy above this threshold can be effectively absorbed by the PV cell. Additionally, the metal-side emitter demonstrates more efficient power absorption than the DBR-side emitter. Figure 5(d) provides a comparison between the optimized emitter with Nbit = 15 for both metal-side and DBR-side configurations and periodically designed emitters with the same Nbit, which serve as a competing model for assessing the machine learning-based designs. When the input voltage increases, both optimized metal-side and DBR-side emitters show superior system efficiency compared to the periodic emitters. Among the optimized emitters, the metal-side emitter exhibits a higher system efficiency, achieving system efficiency η of 0.41 at 0 V. In terms of the generated photocurrent, both MCTS-optimized emitters outperform the periodic emitters, and the metal-side emitter shows superior performance by achieving a photocurrent of 7.9 A·cm−2 at 0 V. These results indicate that the periodic DBR structures are limited in performance, whereas the aperiodic DBR structures optimized by MCTS demonstrate the effectiveness of this method in designing selective emitters.
Another study on designing narrow-bandpass filters was conducted by Sakurai et al. [58] using Bayesian optimization. Traditionally, thermal emission control has been demonstrated using Fabry–Perot resonators with DBRs [102]. However, these analytical, periodic structures are merely a small subset of the general multilayer configurations. As a result, aperiodic and non-intuitive structures can further enhance performance due to their ability to explore a much larger parametric space compared to conventional designs. Figure 6(a) illustrates the schematic of the design method, where candidate materials Ge, Si, and SiO2 are encoded as 1, 2, and 3, respectively, with a tungsten substrate. Each bit thickness is uniformly set, and the number of bits (Nbit) is fixed at 18. The total thickness ttot varies from 3.6 to 4.0 µm in increments of 0.02 µm. The objective function aims for unit emissivity at a target wavelength (λt) with a specific bandwidth (∆λ), while minimizing emissivity at other wavelengths within the range (λmin to λmax) to reduce heat loss, as shown in Fig. 6(b). Bayesian optimization is employed as the machine learning model to predict the surrogate function that searches for the design closest to the target spectrum. Initially, a filter design with λt = 6.0 µm, ∆λ = 4.0 nm, λmin = 4.0 µm, and λmax = 8.0 µm is optimized. The result is a multilayer containing only Ge and SiO2 with a total thickness of ttot = 3.80 µm. Two additional designs are optimized under similar conditions, targeting λt values of 5.0 µm and 7.0 µm. Based on the initial results, Ge and SiO2 are selected as the material bases for these subsequent optimizations. Figure 6(c) compares the simulated and measured emissivity spectra. The optimized emissivity spectra (dashed lines) for λt = 5.0 µm, 6.0 µm, and 7.0 µm exhibit sharp, high emissivity with Q-factors of 217, 273, and 233, respectively. The measured emissivity spectra of the fabricated samples are shown as solid lines, showing a peak wavelength shift of approximately 0.5 µm, with peak emissivity of 0.76, 0.83, and 0.61, and Q-factors of 132, 188, and 109, respectively. The discrepancy between the peak positions is attributed to fabrication errors, particularly deviations in deposited thickness compared to the simulated structures. Despite these discrepancies, the machine learning optimization approach demonstrates improved emissivity and Q-factor values compared to conventional narrow-bandpass filters, highlighting the advantages of aperiodic multilayers.
In summary, we review three practical applications where binary-optimization-based multilayer designs effectively achieve optical performance close to the target spectra. Binary optimization enables the design of complex multilayers that meet requirements that are unattainable with conventional analytical methods. Encoding the continuous system into a reduced binary space saves significant computational efforts without compromising accuracy. Additionally, unlike more resource-intensive platforms such as neural networks, the FM model used for QUBO problems reveals the underlying physics of aperiodic or non-intuitive multilayers. For instance, ARCs depend on the interaction between adjacent layers, where the reflectance is influenced by each layer’s refractive index and thickness. The FM model’s approach to predicting pairwise interactions successfully interprets these layer-to-layer dynamics to analyze overall antireflective performance. However, relying solely on pairwise interactions may limit the capacity of the model to represent complex systems. For more complex systems, the development of a higher-order machine learning model that considers interactions beyond simple pairwise relationships is necessary.
Figure 7(a) illustrates a schematic for classifying phasors by reflection order. The first-order reflection wave includes phasors resulting from a single reflection, while the third-order reflection wave includes those from three reflections. Note that only odd-numbered reflections are considered. Figures 7(b) and 7(c) display phasor diagrams for a DUV ARC and an IR camera filter. In the DUV ARC, which targets a single wavelength, a clear phasor convergence is observed as third-order reflection waves are added to first-order reflections. This indicates that the pairwise interaction of the FM model is well-suited to the design of the given system. In contrast, the IR camera filter, designed for a broader spectral range (8–13 µm), shows more dispersed phasors even with third-order reflections, suggesting the need for a higher-order machine learning model to interpret such complex interactions effectively.
Another perspective in binary-optimization-based multilayer design can involve enhancing fabrication fidelity. Defining the FoM equation solely based on optical properties does not account for practical fabrication constraints, such as achievable deposition thickness, potential deposition errors influenced by layer height, or refractive index shifts in ultrathin layers compared to bulk materials. A statistical approach to analyzing the tolerance of designed structures to thickness errors can help address potential fabrication discrepancies. In Fig. 8(a), we illustrate FoM distribution by randomly perturbing all layers within a specified range for the triple-layer DUV ARC design, with the original FoM indicated by a red line. By applying 1,000 random perturbations, we assess tolerance by measuring FoM deviation from its original value. This analysis can also be applied to individual layers, as shown in Fig. 8(b), where the black squares indicate mean FoM values, and error bars represent the maximum and minimum FoM values after perturbation. The incorporation of fabrication fidelity into the FoM allows for a multilayer design that remains robust under realistic fabrication conditions. In addition, cost optimization is feasible by applying penalties for layer increases, which encourages high-performing structures with fewer physical layers. In conclusion, binary-optimization-based multilayer design offers a versatile approach for creating high-performance optical structures, efficiently balancing computational time, performance, and fabrication requirements, all achievable with precise formulation of the FoM equation.
This work was supported by the Hyundai-NGV Program (Grant No. R232604.0003) and the National Research Foundation (NRF) of Korea through the Basic Science Research Program (Grant No. RS-2023-00207966).
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Data sharing is not applicable to this article since no new data were created or analyzed in this study.