Exponential lower bounds for policy iteration
Fearnley, J. (2010) Exponential lower bounds for policy iteration. In: 37th International Colloquium on Automata, Languages and Programming, Bordeaux, France, 06-10 Jul 2010. Published in: Automata, Languages and Programming Pt II Proceedings, Vol. 6199, pp. 551-562. doi:10.1007/978-3-642-14162-1_46 ISSN 0302-9743.
Abstract
We study policy iteration for infinite-horizon Markov decision processes. It has recently been shown that policy-iteration-style algorithms have exponential lower bounds in a two-player game setting. We extend these lower bounds to Markov decision processes with the total-reward and average-reward optimality criteria.
Item Type: | Conference Item (Paper)
---|---
Divisions: | Faculty of Science, Engineering and Medicine > Science > Computer Science
Journal or Publication Title: | Automata, Languages and Programming Pt II Proceedings
Publisher: | Springer
ISSN: | 0302-9743
Official Date: | 2010
Volume: | Vol. 6199
Page Range: | pp. 551-562
DOI: | 10.1007/978-3-642-14162-1_46
Status: | Peer Reviewed
Publication Status: | Published
Conference Paper Type: | Paper
Title of Event: | 37th International Colloquium on Automata, Languages and Programming
Type of Event: | Conference
Location of Event: | Bordeaux, France
Date(s) of Event: | 06-10 Jul 2010
Data sourced from Thomson Reuters' Web of Knowledge