Improving expectation maximization algorithm over stellar data

Hasan Kurban, Can Kockan, Mark Jenne, Mehmet M. Dalkilic

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Stellar data, only a few years ago, measured in the.1M of objects. Now, sets are routinely 1M. With the launch of ESA's Gaia in 2013, we expect 1000M stellar objects measured more precisely and with more measurements. Without question, astronomy is about Big Data and clustering is a very common task over astronomy domain. The expectation-maximization algorithm is among the top 10 data mining algorithms used in scientific and industrial applications, however, we observe that astronomical community does not make use of it as a clustering algorithm. In this work, we cluster ∼ 1M stellar objects (simulated Galactic spectral data) via the traditional expectation-maximization algorithm for clustering (EM-T) and our extended EM-T algorithm that we call EM and present the experimental results.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE International Conference on Big Data, Big Data 2017
EditorsJian-Yun Nie, Zoran Obradovic, Toyotaro Suzumura, Rumi Ghosh, Raghunath Nambiar, Chonggang Wang, Hui Zang, Ricardo Baeza-Yates, Ricardo Baeza-Yates, Xiaohua Hu, Jeremy Kepner, Alfredo Cuzzocrea, Jian Tang, Masashi Toyoda
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2559-2568
Number of pages10
ISBN (Electronic)9781538627143
DOIs
Publication statusPublished - 1 Jul 2017
Externally publishedYes
Event5th IEEE International Conference on Big Data, Big Data 2017 - Boston, United States
Duration: 11 Dec 201714 Dec 2017

Publication series

NameProceedings - 2017 IEEE International Conference on Big Data, Big Data 2017
Volume2018-January

Conference

Conference5th IEEE International Conference on Big Data, Big Data 2017
Country/TerritoryUnited States
CityBoston
Period11/12/1714/12/17

Keywords

  • astronomy
  • big data
  • clustering
  • data reduction
  • expectation maximization
  • heap

Fingerprint

Dive into the research topics of 'Improving expectation maximization algorithm over stellar data'. Together they form a unique fingerprint.

Cite this