论文标题
银河系分类:一种对斯隆数字天空调查图像进行分类的深度学习方法
Galaxy classification: a deep learning approach for classifying Sloan Digital Sky Survey images
论文作者
论文摘要
近几十年来,斯隆数字天空调查(SDSS)等大型天空调查导致了大量数据。天文学家对这种大量数据的分类很耗时。为了简化这一过程,引入了2007年的一个名为Galaxy Zoo的基于志愿者的公民科学项目,这使分类的时间大大减少了。但是,在现代的深度学习时代,自动化此分类任务是非常有益的,因为它减少了分类的时间。在过去的几年中,已经提出了许多算法,这些算法恰好在将星系分为多个类中做出了惊人的工作。但是所有这些算法倾向于将星系分类为少于六个类。但是,在考虑了我们了解星系的分钟信息之后,有必要将星系分为八个以上的类别。在这项研究中,提出了一个神经网络模型,以将SDSS数据分类为10个类别,从扩展的哈勃调整叉中分为10个类。非常小心,以圆盘边缘和光盘面向星系,从而区分与每个类别相关的各种子结构和微小特征。所提出的模型由卷积层组成,以提取使此方法完全自动的特征。实现的测试准确性为84.73%,在考虑课程中的细节之后,恰好是有希望的。除卷积层外,所提出的模型还有三层负责分类的层,这使算法消耗的时间更少。
In recent decades, large-scale sky surveys such as Sloan Digital Sky Survey (SDSS) have resulted in generation of tremendous amount of data. The classification of this enormous amount of data by astronomers is time consuming. To simplify this process, in 2007 a volunteer-based citizen science project called Galaxy Zoo was introduced, which has reduced the time for classification by a good extent. However, in this modern era of deep learning, automating this classification task is highly beneficial as it reduces the time for classification. For the last few years, many algorithms have been proposed which happen to do a phenomenal job in classifying galaxies into multiple classes. But all these algorithms tend to classify galaxies into less than six classes. However, after considering the minute information which we know about galaxies, it is necessary to classify galaxies into more than eight classes. In this study, a neural network model is proposed so as to classify SDSS data into 10 classes from an extended Hubble Tuning Fork. Great care is given to disc edge and disc face galaxies, distinguishing between a variety of substructures and minute features which are associated with each class. The proposed model consists of convolution layers to extract features making this method fully automatic. The achieved test accuracy is 84.73 per cent which happens to be promising after considering such minute details in classes. Along with convolution layers, the proposed model has three more layers responsible for classification, which makes the algorithm consume less time.