On Machine-Learned Classification of Variable Stars with Sparse and Noisy Time-Series Data


Date: 2011-01-10
arXiv: arxiv.org/abs/1101.1959v1
Journal reference: ApJ, 733, 10 (2011)
Subject: Astronomy and Astrophysics – Astrophysics – Instrumentation and Methods for Astrophysics
Comments: 23 pages, 9 figures
Type: Scientific paper
DOI: 10.1088/0004-637X/733/1/10
Abstract: With the coming data deluge from synoptic surveys, there is a growing need for frameworks that can quickly and automatically produce calibrated classification probabilities for newly observed variables based on a small number of time-series measurements. In this paper, we introduce a methodology for variable-star classification, drawing from modern machine-learning techniques. We describe how to homogenize the information gleaned from light curves by selection and computation of real-numbered metrics ("features"), detail methods to robustly estimate periodic light-curve features, introduce tree-ensemble methods for accurate variable-star classification, and show how to rigorously evaluate the classification results using cross-validation. On a 25-class data set of 1542 well-studied variable stars, we achieve a 22.8% overall classification error using the random forest classifier; this represents a 24% improvement over the best previous classifier on these data. This methodology is effective for identifying samples of specific science classes: for pulsational variables used in Milky Way tomography we obtain a discovery efficiency of 98.2%, and for eclipsing systems we find an efficiency of 99.1%, both at 95% purity. We show that the random forest (RF) classifier is superior to other machine-learned methods in terms of accuracy, speed, and relative immunity to features with no useful class information; the RF classifier can also be used to estimate the importance of each feature in classification. Additionally, we present the first astronomical use of hierarchical classification methods to incorporate a known class taxonomy in the classifier, which further reduces the catastrophic error rate to 7.8%. Excluding low-amplitude sources, our overall error rate improves to 14%, with a catastrophic error rate of 3.5%.
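As an illustration of the "efficiency at 95% purity" figures quoted in the abstract, the following is a minimal sketch of how such a number could be computed from classifier probabilities. It assumes that efficiency means the fraction of true class members that are selected and that purity means the fraction of selected sources that are true members; the abstract does not spell out the paper's exact definitions. The function name `efficiency_at_purity` and the toy scores are hypothetical and are not taken from the paper.

```python
def efficiency_at_purity(scores, labels, target_purity=0.95):
    """Highest efficiency reachable while purity stays >= target_purity.

    scores: classifier probability that each source is in the science class
    labels: True where the source really belongs to that class
    Assumed definitions (not confirmed by the abstract):
      purity     = true members selected / all sources selected
      efficiency = true members selected / all true members
    """
    n_true = sum(1 for y in labels if y)
    if n_true == 0:
        raise ValueError("no true members of the science class")

    # Rank sources from most to least confident.
    ranked = sorted(zip(scores, labels), key=lambda p: p[0], reverse=True)

    best = 0.0
    true_selected = 0
    i = 0
    while i < len(ranked):
        # Sources with identical scores cannot be separated by a threshold,
        # so admit each group of ties together.
        j = i
        while j < len(ranked) and ranked[j][0] == ranked[i][0]:
            if ranked[j][1]:
                true_selected += 1
            j += 1
        purity = true_selected / j  # j sources are selected at this cut
        if purity >= target_purity:
            best = max(best, true_selected / n_true)
        i = j
    return best


# Toy, made-up scores for ten sources; not data from the paper.
scores = [0.99, 0.97, 0.95, 0.90, 0.85, 0.80, 0.60, 0.40, 0.30, 0.10]
labels = [True, True, True, True, False, True, True, False, False, False]
print(efficiency_at_purity(scores, labels))
```

Sweeping the probability threshold in this way trades sample size against contamination, which is why the two quantities are quoted together.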

Affiliated with

Bloom Joshua S.

Brewer John M.

Butler Nathaniel R.

Crellin-Quick Arien

Higgins Justin
