Real-Time Dynamic Programming (RTDP) Algorithm: Difference between revisions

From GM-RKB
m (Text replacement - " [[" to " [[")
m (Text replacement - " In Proceedings of " to " In: Proceedings of ")
Line 22: Line 22:


=== 2009 ===
* (Sanner et al., 2009) ⇒ [[Scott Sanner]], [[Robby Goetschalckx]], [[Kurt Driessens]], and [[Guy Shani]] (2009). [https://www.ijcai.org/Proceedings/09/Papers/297.pdf "Bayesian Real-Time Dynamic Programming"]. In: In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI-09).
* (Sanner et al., 2009) ⇒ [[Scott Sanner]], [[Robby Goetschalckx]], [[Kurt Driessens]], and [[Guy Shani]] (2009). [https://www.ijcai.org/Proceedings/09/Papers/297.pdf "Bayesian Real-Time Dynamic Programming"]. In: In: Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI-09).


=== 2005 ===
* (McMahan et al., 2005) ⇒ [[H. Brendan McMahan]], [[Maxim Likhachev]], and [[Geoffrey J. Gordon]] (2005). [http://www.cs.cmu.edu/~ggordon/mcmahan-likhachev-gordon.brtdp.pdf Bounded Real-Time Dynamic Programming: RTDP with monotone upper bounds and performance guarantees]. In: In Proceedings of the 22nd International Conference on Machine learning (pp. 569-576).
* (McMahan et al., 2005) ⇒ [[H. Brendan McMahan]], [[Maxim Likhachev]], and [[Geoffrey J. Gordon]] (2005). [http://www.cs.cmu.edu/~ggordon/mcmahan-likhachev-gordon.brtdp.pdf Bounded Real-Time Dynamic Programming: RTDP with monotone upper bounds and performance guarantees]. In: In: Proceedings of the 22nd International Conference on Machine learning (pp. 569-576).


=== 2003 ===

Revision as of 05:21, 12 June 2024

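A Real-Time Dynamic Programming (RTDP) Algorithm is a Dynamic Programming Algorithm that is guided by heuristic search: it initializes state values with a heuristic and applies Bellman updates only to the states visited along greedy trajectories from a start state, rather than sweeping the entire state space.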
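
The definition can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and does not come from the cited papers: the five-state chain problem, the unit step cost, and the names `transitions`, `heuristic`, `q_value`, and `rtdp` are all hypothetical. It shows the two ingredients named above: values start from a heuristic, and Bellman backups are applied only to states reached while following the greedy action.

```python
import random

# Hypothetical toy problem: states 0..4 on a line, goal at state 4.
GOAL = 4
ACTIONS = ("left", "right")


def transitions(state, action):
    """Return [(probability, next_state)] for the toy chain."""
    if action == "right":
        # Moving right succeeds with probability 0.8, otherwise the agent stays.
        return [(0.8, state + 1), (0.2, state)]
    return [(1.0, max(state - 1, 0))]


def heuristic(state):
    """Admissible estimate: steps to the goal if every move succeeded."""
    return float(GOAL - state)


def q_value(values, state, action, step_cost=1.0):
    """Expected cost of taking `action` in `state` under current estimates.

    States that have never been backed up fall back to the heuristic.
    """
    return step_cost + sum(
        p * values.get(nxt, heuristic(nxt)) for p, nxt in transitions(state, action)
    )


def rtdp(start, trials=200, max_steps=100, seed=0):
    """Run RTDP trials from `start` and return the learned value table."""
    rng = random.Random(seed)
    values = {}
    for _ in range(trials):
        state = start
        for _ in range(max_steps):
            if state == GOAL:
                break
            # Greedy action under the current value estimates.
            best = min(ACTIONS, key=lambda a: q_value(values, state, a))
            # Bellman backup on the visited state only.
            values[state] = q_value(values, state, best)
            # Sample the outcome of the greedy action to pick the next state.
            probs, nexts = zip(*transitions(state, best))
            state = rng.choices(nexts, weights=probs)[0]
    return values


if __name__ == "__main__":
    learned = rtdp(start=0)
    print(learned[0])
```

In this toy chain the optimal expected cost from state 0 is 4 / 0.8 = 5, so the printed estimate should be very close to 5.0. The table only ever contains states that were visited during a trial, which is the practical difference from a full value-iteration sweep.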



References

2017

2009

2005

2003