Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
Reinforcement learning methods have become more efficient in recent years. In particular, the A3C (asynchronous advantage actor critic) approach demonstrated in Mnih et al. (2016) was able to halve the training time of the existing state-of-the-art approaches. However, these methods still require re...
| Main Author: | |
|---|---|
| Other Authors: | |
| Format: | Thesis |
| Language: | English |
| Published: |
Department of Statistical Sciences
2020
|
| Subjects: | |
| Tags: |
No Tags, Be the first to tag this record!
|