Penalized Q-Learning for Dynamic Treatment Regimes

Statistics – Methodology

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

A dynamic treatment regime effectively incorporates both accrued information and long-term effects of treatment from specially designed clinical trials. As these become more and more popular in conjunction with longitudinal data from clinical studies, the development of statistical inference for optimal dynamic treatment regimes is a high priority. This is very challenging due to the difficulties arising form non-regularities in the treatment effect parameters. In this paper, we propose a new reinforcement learning framework called penalized Q-learning (PQ-learning), under which the non-regularities can be resolved and valid statistical inference established. We also propose a new statistical procedure---individual selection---and corresponding methods for incorporating individual selection within PQ-learning. Extensive numerical studies are presented which compare the proposed methods with existing methods, under a variety of non-regular scenarios, and demonstrate that the proposed approach is both inferentially and computationally superior. The proposed method is demonstrated with the data from a depression clinical trial study.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Penalized Q-Learning for Dynamic Treatment Regimes does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Penalized Q-Learning for Dynamic Treatment Regimes, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Penalized Q-Learning for Dynamic Treatment Regimes will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-502589

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.