基于FASTER的近瞬时大气反演与模型比较

Research

arXiv

Near-instantaneous Atmospheric Retrievals and Model Comparison with FASTER

Anna Lueber ,

摘要 Abstract

随着詹姆斯·韦伯太空望远镜（JWST）时代的到来，系外行星大气光谱的质量得到了显著提升，这要求我们对这些光谱的分析能力也必须同步飞跃：需要对成千上万条光谱进行大气反演，并结合大量模型（探索大气化学、热剖面和云模型）来确定最佳模型。在这种情况下，传统的贝叶斯推理方法如嵌套抽样变得过于昂贵。我们引入了FASTER（快速摊销基于模拟的凌星系外行星反演），这是一种基于神经网络的方法，能够以传统技术一小部分的计算成本实现大气反演和贝叶斯模型比较。我们证明，利用模拟光谱以及真实NIRSpec PRISM光谱（WASP-39b），FASTER框架内模型的所有参数边缘后验分布以及所考虑模型的后验概率与嵌套抽样的计算结果一致。FASTER框架真正的强大之处在于其摊销特性，这使得训练后的网络能够在几乎不增加额外计算成本的情况下，对真实或模拟的光谱集合进行实用的贝叶斯推理和模型比较。这种方法为理解模型比较的结果（例如区分有云与无云、等温与非等温模型）及其对底层参数的依赖提供了宝贵的见解，而这些在嵌套抽样下是计算上不可行的。这一方法将在光谱分析领域带来类似于基于马尔可夫链蒙特卡洛的传统反演方法那样的巨大飞跃。

In the era of the James Webb Space Telescope (JWST), the dramatic improvement in the spectra of exoplanetary atmospheres demands a corresponding leap forward in our ability to analyze them: atmospheric retrievals need to be performed on thousands of spectra, applying to each large ensembles of models (that explore atmospheric chemistry, thermal profiles and cloud models) to identify the best one(s). In this limit, traditional Bayesian inference methods such as nested sampling become prohibitively expensive. We introduce FASTER (Fast Amortized Simulation-based Transiting Exoplanet Retrieval), a neural-network based method for performing atmospheric retrieval and Bayesian model comparison at a fraction of the computational cost of classical techniques. We demonstrate that the marginal posterior distributions of all parameters within a model as well as the posterior probabilities of the models we consider match those computed using nested sampling both on mock spectra, and for the real NIRSpec PRISM spectrum of WASP-39b. The true power of the FASTER framework comes from its amortized nature, which allows the trained networks to perform practically instantaneous Bayesian inference and model comparison over ensembles of spectra -- real or simulated -- at minimal additional computational cost. This offers valuable insight into the expected results of model comparison (e.g., distinguishing cloudy from cloud-free and isothermal from non-isothermal models), as well as their dependence on the underlying parameters, which is computationally unfeasible with nested sampling. This approach will constitute as large a leap in spectral analysis as the original retrieval methods based on Markov Chain Monte Carlo have proven to be.