Computer Science – Learning
Scientist
Computer Science
Learning
Scientist
Experiments with Infinite-Horizon, Policy-Gradient Estimation
Infinite-Horizon Policy-Gradient Estimation
KnightCap: A chess program that learns by combining TD(lambda) with game-tree search
TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search
No associations
LandOfFree
Jonathan Baxter does not yet have a rating. At this time, there are no reviews or comments for this scientist.
If you have personal experience with Jonathan Baxter, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Jonathan Baxter will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-P-232647