Wyniki wyszukiwania - Biblioteka Nauki

Nowa wersja platformy, zawierająca wyłącznie zasoby pełnotekstowe, jest już dostępna.
Przejdź na https://bibliotekanauki.pl

Ograniczanie wyników

Znaleziono wyników: 2

Liczba wyników na stronie

Wyniki wyszukiwania

Wyszukiwano:
w słowach kluczowych: approximate dynamic programming

Sortuj według:

Ogranicz wyniki do:

Approximating the solution of a dynamic, stochastic multiple knapsack problem

100%

Hartman J. C. , Perry T. C.

2006

tom Vol. 35, no 3

535-550

We model an environment where orders arrive probabilistically over time, with their revenues and capacity requirements becoming known upon arrival. The decision is whether to accept an order, receiving a reward and reserving capacity, or reject an order, freeing capacity for possible future arrivals. We model the dynamic, stochastic multiple knapsack problem (DSMKP) with stochastic dynamic programming (SDP). Multiple knapsacks are used as orders may stay in the system for multiple periods. As the state space grows exponentially in the number of knapsacks and the number of possible orders per period, we utilize linear programming and duality to quickly approximate the end-of-horizon values for the SDP. This helps mitigate end-of-study effects when solving the SDP directly, allowing for the solution of larger problems and leading to increased quality in solutions.

Approximate dynamic programming in robust tracking control of wheeled mobile robot

67%

Hendzel Z. , Szuster M.

2009

tom Vol. LVI, nr 3

223-236

In this work, a novel approach to designing an on-line tracking controller for a nonholonomic wheeled mobile robot (WMR) is presented. The controller consists of nonlinear neural feedback compensator, PD control law and supervisory element, which assure stability of the system. Neural network for feedback compensation is learned through approximate dynamic programming (ADP). To obtain stability in the learning phase and robustness in face of disturbances, an additional control signal derived from Lyapunov stability theorem based on the variable structure systems theory is provided. Verification of the proposed control algorithm was realized on a wheeled mobile robot Pioneer-2DX, and confirmed the assumed behavior of the control system.

W pracy przedstawiono nowe ujęcie problematyki sterowania nadążnego mobilnym robotem dwukołowym. Algorytm bazuje na metodzie uczenia ze wzmocnieniem o strukturze aktor-krytyk i nie wymaga uczenia wstępnego, działa on-line bez znajomości modelu robota. Element generujący sterowania (aktor - ASE) oraz element generujący sygnał wewnętrznego wzmocnienia (krytyk - ACE) są zrealizowane w postaci sztucznej sieci neuronowej (SN). Prezentowany algorytm sterowania zweryfikowano na rzeczywistym obiekcie, dwukołowym robocie mobilnym Pioneer-2DX. Badania potwierdziły poprawność przyjętego rozwiązania.