Mathematics – Statistics Theory
Scientific paper
2009-03-17
Annals of Statistics 2010, Vol. 38, No. 5, 2781-2822
Mathematics
Statistics Theory
Published in at http://dx.doi.org/10.1214/10-AOS806 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of
Scientific paper
10.1214/10-AOS806
The aim of this paper is to provide a new method for the detection of either favored or avoided distances between genomic events along DNA sequences. These events are modeled by a Hawkes process. The biological problem is actually complex enough to need a nonasymptotic penalized model selection approach. We provide a theoretical penalty that satisfies an oracle inequality even for quite complex families of models. The consecutive theoretical estimator is shown to be adaptive minimax for H\"{o}lderian functions with regularity in $(1/2,1]$: those aspects have not yet been studied for the Hawkes' process. Moreover, we introduce an efficient strategy, named Islands, which is not classically used in model selection, but that happens to be particularly relevant to the biological question we want to answer. Since a multiplicative constant in the theoretical penalty is not computable in practice, we provide extensive simulations to find a data-driven calibration of this constant. The results obtained on real genomic data are coherent with biological knowledge and eventually refine them.
Reynaud-Bouret Patricia
Schbath Sophie
No associations
LandOfFree
Adaptive estimation for Hawkes processes; application to genome analysis does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Adaptive estimation for Hawkes processes; application to genome analysis, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Adaptive estimation for Hawkes processes; application to genome analysis will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-491198