Exploring Machine Learning Methods for the Star/Galaxy Separation Problem

The students Eduardo Machado, Marcello Serqueira, and Riccardo Campisano had their paper, entitled “Exploring Machine Learning Methods for the Star/Galaxy Separation Problem”, accepted at the International Joint Conference on Neural Networks (IJCNN). The work was carried out in collaboration with the Observatório Nacional (ON/LINEA), represented by Ricardo Ogando, Marcio A. G. Maia, and Luiz Nicolaci da Costa. On the CEFET/RJ side, professors Gustavo Guedes, Eduardo Bezerra, and Eduardo Ogasawara also took part.

Abstract: For recent or planned deep astronomical surveys, it is important to tell stars and galaxies apart, a task known as the Star/Galaxy Separation Problem (SGSP). At faint magnitudes, the separation between point-like and extended sources is fuzzy, which makes SGSP a hard task. This problem is even harder for large surveys like the Dark Energy Survey (DES) and, in the near future, the Large Synoptic Survey Telescope (LSST), due to their large data volume. Hence, the search for classification methods that are both accurate and efficient is highly relevant. In this work, we present a comparative analysis of several machine learning methods targeted at solving the SGSP at faint magnitudes. The COSMOS survey was used to train the classification models. We use machine learning methods as distinct as artificial neural networks, k-nearest neighbors, Support Vector Machines, Random Forests, and Naive Bayes. The exploratory process was modeled as a data-centric workflow. The workflow was implemented on top of the Hadoop framework and was used to find the best parameter values for each classification method we considered, of which neural networks and random forests present superior performance.
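
To give a rough idea of what "finding the best parameter values" for a classifier can look like, here is a minimal sketch in plain Python. It is not the paper's workflow: it does not use Hadoop or COSMOS data, and it covers only a k-nearest-neighbor classifier. The synthetic two-feature catalog, the feature meanings, and all function names (`make_synthetic`, `knn_predict`, `grid_search`) are assumptions made for illustration.

```python
import random

def make_synthetic(n, seed=0):
    """Build a hypothetical catalog: two made-up shape features per source.

    Assumption: galaxies are, on average, more extended than stars, so both
    features are drawn around 2.0 for galaxies and 1.0 for stars.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        label = rng.choice(["star", "galaxy"])
        center = 1.0 if label == "star" else 2.0
        features = (rng.gauss(center, 0.5), rng.gauss(center, 0.5))
        data.append((features, label))
    return data

def knn_predict(train, x, k):
    """Majority vote among the k training points closest to x."""
    nearest = sorted(
        train,
        key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], x)),
    )[:k]
    votes = {}
    for _, label in nearest:
        votes[label] = votes.get(label, 0) + 1
    return max(votes, key=votes.get)

def accuracy(train, valid, k):
    """Fraction of validation points classified correctly."""
    hits = sum(knn_predict(train, x, k) == y for x, y in valid)
    return hits / len(valid)

def grid_search(train, valid, ks):
    """Score every candidate k on the validation set and keep the best."""
    scores = {k: accuracy(train, valid, k) for k in ks}
    best_k = max(scores, key=scores.get)
    return best_k, scores

data = make_synthetic(300)
train, valid = data[:200], data[200:]
best_k, scores = grid_search(train, valid, ks=[1, 3, 5, 9, 15])
print("best k:", best_k)
print("validation accuracy per k:", scores)
```

In a comparison like the one described in the abstract, the same loop would be repeated for each method and its own parameters, and the scores would come from real labeled survey data rather than a synthetic sample.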
