Estimating the mass of galactic components using machine learning algorithms

Abstract

The masses of the bulge and disk, the main baryonic components of a galaxy, can be estimated using various approaches, but their implementation tends to be challenging because they often rely on strong assumptions about either the baryon dynamics or the dark matter model. In this work, we present an alternative method for predicting the masses of galactic components, including the disk, bulge, stellar and total mass, using a set of machine learning algorithms: k-nearest neighbours (KNN), linear regression (LR), random forest (RF) and neural network (NN). The rest-frame absolute magnitudes in the ugriz photometric system were selected as input features, and the training was performed using a sample of spiral galaxies hosting a bulge from Guo's mock catalogue \citep{Guo-Catalog}, derived from the Millennium simulation. In general, all the algorithms provide good predictions for the galaxy's mass components in the range from $10^9\,M_\odot$ to $10^{11}\,M_\odot$, which corresponds to the central region of the training mass domain; however, the NN yields the most precise predictions of the four methods. Additionally, to test the performance of the NN architecture, we used a sample of observed galaxies from the SDSS survey whose mass components are known. We found that the NN can predict the luminous masses of disk-dominant galaxies, within the same range of magnitudes as for the synthetic sample, up to a $99\%$ level of confidence, while the mass components of galaxies hosting larger bulges are well predicted up to a $95\%$ level of confidence. The NN algorithm can also reveal scaling relations between the masses of the different components and the magnitudes.
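As a rough illustration of the regression setup described above (five magnitudes in, one mass out), the following minimal sketch applies a k-nearest-neighbours regressor, one of the four algorithm families considered, to a toy sample. It is not the pipeline used in this work: the data are synthetic, and the magnitude-mass relation, the band offsets, the noise level, and the helper names `make_toy_sample` and `knn_predict` are all assumptions made purely for illustration.

```python
import math
import random


def knn_predict(train_X, train_y, x, k=5):
    """Predict a target as the mean over the k nearest training points."""
    # Euclidean distance in the 5-dimensional magnitude space.
    dists = sorted(
        (math.dist(row, x), target) for row, target in zip(train_X, train_y)
    )
    nearest = dists[:k]
    return sum(target for _, target in nearest) / len(nearest)


def make_toy_sample(n, rng):
    """Synthetic stand-in for a mock catalogue (NOT real data).

    Each object has five magnitudes (playing the role of u, g, r, i, z)
    and a target log10(M / M_sun) drawn uniformly between 9 and 11.
    """
    X, y = [], []
    for _ in range(n):
        log_mass = rng.uniform(9.0, 11.0)
        # Hypothetical relation: more massive objects are brighter,
        # i.e. have more negative absolute magnitudes.
        base_mag = -17.0 - 2.5 * (log_mass - 9.0)
        # Hypothetical per-band offsets plus Gaussian scatter of 0.1 mag.
        mags = [
            base_mag + offset + rng.gauss(0.0, 0.1)
            for offset in (1.5, 0.5, 0.0, -0.2, -0.4)
        ]
        X.append(mags)
        y.append(log_mass)
    return X, y


rng = random.Random(0)  # fixed seed so the sketch is reproducible
train_X, train_y = make_toy_sample(500, rng)
test_X, test_y = make_toy_sample(100, rng)

pred = [knn_predict(train_X, train_y, x, k=5) for x in test_X]
rmse = math.sqrt(
    sum((p - t) ** 2 for p, t in zip(pred, test_y)) / len(test_y)
)
print(f"RMSE in log10(M/M_sun): {rmse:.3f}")
```

The same train/predict/score pattern would apply to the other algorithms, with only the regressor swapped; a real analysis would replace the toy sample with catalogue magnitudes and component masses.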