Wyniki wyszukiwania - BazTech

Ograniczanie wyników

Znaleziono wyników: 1

Liczba wyników na stronie

Wyniki wyszukiwania

Wyszukiwano:
w słowach kluczowych: algorytm TD

Sortuj według:

Ogranicz wyniki do:

Poszukiwanie optymalnej strategii eksploracji z zastosowaniem uczenia ze wzmocnieniem

Pluciński M.

Metody Informatyki Stosowanej

2008

nr 1 (Tom 13)

127-137

The paper presents an application of the reinforcement learning for a searching of an optimal policy in an exploration problem (also known as a Jeep problem). The continuous problem, in unrealistic so the main work was concentrated on the discrete Jeep problem. There is examined and described an influence of main learning parameters on the learning speed and there are presented some found exemplary policies for different problem conditions.