Wyniki wyszukiwania - BazTech

1

Bottom-up learning of hierarchical models in a class of deterministic POMDP environments

Itoh H., Fukumoto H., Wakuya H., Furukawa T.

International Journal of Applied Mathematics and Computer Science

|

2015

|

Vol. 25, no. 3

597--615

EN

The theory of partially observable Markov decision processes (POMDPs) is a useful tool for developing various intelligent agents, and learning hierarchical POMDP models is one of the key approaches for building such agents when the environments of the agents are unknown and large. To learn hierarchical models, bottom-up learning methods in which learning takes place in a layer-by-layer manner from the lowest to the highest layer are already extensively used in some research fields such as hidden Markov models and neural networks. However, little attention has been paid to bottom-up approaches for learning POMDP models. In this paper, we present a novel bottom-up learning algorithm for hierarchical POMDP models and prove that, by using this algorithm, a perfect model (i.e., a model that can perfectly predict future observations) can be learned at least in a class of deterministic POMDP environments.

2

Application of Markov decision process as a mathematical model of operation and maintenance process

Landowski B.

Zeszyty Naukowe / Akademia Morska w Szczecinie

|

2010

|

nr 24 (96)

12-16

EN

This paper presents an example of applying Markov decision process to model and analyse the bus operation and maintenance process within an urban transport system and to forecast the influence of the operation and maintenance strategies realised for the technical objects on the transport system behaviour. Setting the values of the indices describing the process under the analysis is performed on the basis of a computerised simulation of the Markov decision process, being a mathematical model of the technical objects operation and maintenance process. In order to simulate the operation and maintenance process (Markov decision process) and to evaluate the influence of the decisions being made on the course and effectiveness of the process being realised within the study object a simulation algorithm has been elaborated and a computer calculation program has been written. The model presented herein has been created in such a way to assure that it shall be possible to use it in as extensive as possible class of the problems regarding the operation and maintenance of the technical objects.

PL

W artykule przedstawiono przykład zastosowania decyzyjnego procesu Markowa do modelowania i analizy procesu eksploatacji autobusów komunikacji miejskiej oraz prognozowania wpływu realizowanych strategii eksploatacji obiektów technicznych na zachowanie się systemu transportowego. Wyznaczanie wartości wskaźników charakteryzujących analizowany proces realizowane jest na podstawie komputerowej symulacji decyzyjnego procesu Markowa, będącego matematycznym modelem procesu eksploatacji obiektów technicznych. W celu symulacji procesu eksploatacji (decyzyjnego procesu Markowa) i oceny wpływu podejmowanych decyzji na przebieg i efektywność procesu realizowanego w obiekcie badań opracowano algorytm symulacji i napisano komputerowy program obliczeniowy. Przedstawiony w pracy model został skonstruowany w taki sposób, by zapewnić możliwość jego wykorzystania w jak najszerszej klasie problemów związanych z eksploatacją obiektów technicznych.

3

Wieloagentowe systemy decyzyjne

Kałuski J.

Zeszyty Naukowe. Organizacja i Zarządzanie / Politechnika Śląska

|

2009

|

z. 49

133-143

PL

W pracy przedstawiono definicje i podstawowe pojęcia Sieci Semantycznych Web oraz Wieloagentowych Systemów Informatycznych i Decyzyjnych. Omówiono ich genezę i rozwój. Pokazano dwa przykłady szczegółowe tych systemów.

EN

In the work some definitions and notions of the Semantic Web and Multi-Agent Systems are presented. The first part of the work contains some characteristic of concept and theory applied to semantic networks. The planning, grouping time and space, closing on semantic networks algorithm in two practical examples of One-and Three-Agent Systems are done.

4

Using of decision-making Semi-Markov processes for modelling and simulating of operation and maintenance process for buses

Woropay M., Landowski B., Neubauer A.

Archiwum Motoryzacji

|

2004

|

Vol. 7, nr 1

53-65

EN

The paper presents considerations of simulation method to optimise operation and maintenance process of the buses in an urban transportation system. The method is based on simulating the operation and maintenance process according to the elaborated decision-making operation and maintenance process model and on determining optimal, for the sake of the adopted criterion, decisions regarding the course of the analysed process based on the results of the simulation experiments. The semi-Markov process has been applied to describe the bus operation and maintenance process.Having identified real operation and maintenance system of the urban bus transportation system, significant bus operation and maintenance states have been selected and possible transitions between these states have been determined.In order to transform the discussed process to decision-making one it has been assumed that it is possible in each state to take some decisions that influence further course of the operation.

PL

W pracy rozważana jest symulacyjna metoda optymalizacji procesu eksploatacji autobusów komunikacji miejskiej. Metoda polega na symulacji procesu eksploatacji zgodnie z opracowanym decyzyjnym modelem procesu eksploatacji i wyznaczeniu na podstawie wyników eksperymentów symulacyjnych optymalnych w sensie przyjętego kryterium decyzji dotyczących przebiegu analizowanego procesu. Do opisu procesu eksploatacji autobusów zastosowano proces semi-Markowa. Na podstawie identyfikacji rzeczywistego systemu eksploatacji autobusów komunikacji miejskiej wyróżniono istotne stany eksploatacyjne autobusu oraz wyznaczono możliwe przejścia między tymi stanami. Dla przekształcenia omawianego procesu w proces decyzyjny przyjęto, że w każdym ze stanów możliwe są do podjęcia pewne decyzje mające wpływ na dalszy przebieg procesu eksploatacji. Decyzje podejmowane są zależnie od stanu, w którym znajduje się proces. Ciąg decyzji podejmowanych w kolejnych stanach eksploatacyjnych nazywany jest strategią. Strategia ma wpływ na dochody (koszty) uzyskiwane (ponoszone) w systemie transportowym w związku ze zmianami stanów obiektów technicznych. W celu ilustracji rozważań w pracy przedstawiono przykład obliczeniowy i wybrane wyniki eksperymentów symulacyjnych.

5

The machine learning approach: analysis of experimental results

Poliscuk J. E.

Journal of Applied Computer Science

|

2003

|

Vol. 11, nr 1

61-76

EN

The article analyses a reinforcement learning method in which the subject of learning is defined. The essence of this method is the selection of activities by a try and fail process and awarding deferred rewards. Theoretical analyses were supplemented by the practical studies, with reference to implementation of the Sarsa( Lambda) algorithm, with replacing eligibility traces and the Epsilon greedy policy.

6

Analityczna metoda wyznaczania niektórych macierzy zdyskontowanego decyzyjnego łańcucha Markowa

Kałuski J.

Zeszyty Naukowe. Organizacja i Zarządzanie / Politechnika Śląska

|

2000

|

z. 2

53-69

PL

W pracy przedstawiono analityczną metodę wyznaczania ergodycznej i różnicowych macierzy decyzyjnego łańcucha Markowa ze skończoną liczbą stanów z zadanym współczynnikiem dyskonta. Znany w literaturze wynik dla pełnych oczekiwanych dochodów takiego łańcucha zinterpretowano w aspekcie otrzymanych macierzy. Przedstawione wyniki dają możliwość wyróżnienia składowej stałej i zmiennej pełnego oczekiwanego dochodu. Metodę zilustrowano na dwóch prostych przykładach. W oparciu o zaprezentowaną metodę zaproponowano nowy wskaźnik jakości dla optymalizacji omawianego łańcucha Markowa.

EN

In the work the analytical method to calculate the ergodic and difference matrices of finite state discounted Markov decision processes is presented. On the basis well - known literature the result for overall discounted value, this one in interpretation of the calculated matrices is shown. The obtained results gives a possibility to distinguish the constant andvariable parts of the overall discounted value. The presented analitical method is illustrated by two simple examples. New performance index to discounted optimal Markov control problem is proposed.