Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Powiadomienia systemowe
  • Sesja wygasła!
  • Sesja wygasła!

Znaleziono wyników: 4

Liczba wyników na stronie
first rewind previous Strona / 1 next fast forward last
Wyniki wyszukiwania
Wyszukiwano:
w słowach kluczowych:  proces decyzyjny Markowa
help Sortuj według:

help Ogranicz wyniki do:
first rewind previous Strona / 1 next fast forward last
EN
In promoting the construction of prefabricated residential buildings in Yunnan villages and towns, the use of precast concrete elements is unstoppable. Due to the dense arrangement of steel bars at the joints of precast concrete elements, collisions are prone to occur, which can affect the stress of the components and even pose certain safety hazards for the entire construction project. Because the commonly used the steel bar obstacle avoidance method based on building information modeling has low adaptation rate and cannot change the trajectory of the steel bar to avoid collision, a multi-agent reinforcement learning-based model integrating building information modeling is proposed to solve the steel bar collision in reinforced concrete frame. The experimental results show that the probability of obstacle avoidance of the proposed model in three typical beam-column joints is 98.45%, 98.62% and 98.39% respectively, which is 5.16%, 12.81% and 17.50% higher than that of the building information modeling. In the collision-free path design of the same object, the research on the path design of different types of precast concrete elements takes about 3–4 minutes, which is far less than the time spent by experienced structural engineers on collision-free path modeling. The experimental results indicate that the model constructed by the research institute has good performance and has certain reference significance.
EN
A honeypot is used to attract and monitor attacker activities and capture valuable information that can be used to help practice good cybersecurity. Predictive modelling of a honeypot system based on a Markov decision process (MDP) and a partially observable Markov decision process (POMDP) is performed in this paper. Analyses over a finite planning horizon and an infinite planning horizon for a discounted MDP are respectively conducted. Four methods, including value iteration (VI), policy iteration (PI), linear programming (LP), and Q-learning, are used in the analyses over an infinite planning horizon for the discounted MDP. The results of the various methods are compared to evaluate the validity of the created MDP model and the parameters in the model. The optimal policy to maximise the total expected reward of the states of the honeypot system is achieved, based on the MDP model employed. In the modelling over an infinite planning horizon for the discounted POMDP of the honeypot system, the effects of the observation probability of receiving commands, the probability of attacking the honeypot, the probability of the honeypot being disclosed, and transition rewards on the total expected reward of the honeypot system are studied.
EN
Currently, air pollution and energy consumption are the main issues in the transportation area in large urban cities. In these cities, most people choose their transportation mode according to corresponding utility including traveller's and trip’s characteristics. Also, there is no effective solution in terms of population growth, urban space, and transportation demands, so it is essential to optimize systematically travel demands in the real network of roads in urban areas, especially in congested areas. Travel Demand Management (TDM) is one of the well-known ways to solve these problems. TDM defined as a strategy that aims to maximize the efficiency of the urban transport system by granting certain privileges for public transportation modes, Enforcement on the private car traffic prohibition in specific places or times, increase in the cost of using certain facilities like parking in congested areas. Network pricing is one of the most effective methods of managing transportation demands for reducing traffic and controlling air pollution especially in the crowded parts of downtown. A little paper may exist that optimize urban transportations in busy parts of cities with combined Markov decision making processes with reward and evolutionary-based algorithms and simultaneously considering customers’ and trip’s characteristics. Therefore, we present a new network traffic management for urban cities that optimizes a multi-objective function that related to the expected value of the Markov decision system’s reward using the Genetic Algorithm. The planned Shiraz city is taken as a benchmark for evaluating the performance of the proposed approach. At first, an analysis is also performed on the impact of the toll levels on the variation of the user and operator cost components, respectively. After choosing suitable values for the network parameters, simulation of the Markov decision process and GA is dynamically performed, then the optimal decision for the Markov decision process in terms of total reward is obtained. The results illustrate that the proposed cordon pricing has significant improvement in performance for all seasons including spring, autumn, and winter.
EN
Intermediate buffers often exist in practical production systems to reduce the influence of the breakdown and maintenance ef subsystems on system production. At the same time, the effects of intermediate buffers also make the degradation process of the system more difficult to model. Some existing papers investigate the performance evaluation and maintenance optimisation of a production system with intermediate buffers under a predetermined maintenance strategy structure. However, only few papers pay attention to the property of the optimal maintenance strategy structure. This paper develops a method based on the Markov decision process to identify the optimal maintenance strategy for a series-parallel system with two multi-component subsystems and an intermediate buffer. The structure of the obtained optimal maintenance strategy is analysed, which shows that the optimal strategy structure cannot be modelled by a limited number of parameters. However, some useful properties of the strategy structure are obtained, which can simplify the maintenance optimisation. Another interesting finding is that a large buffer capacity cannot always bring about high average revenue even through the cost of holding an item in the buffer is much smaller than the production revenue per item.
PL
W systemach produkcyjnych często stosuje się bufory pośrednie w celu zmniejszenia wpływu awarii i konserwacji podsystemów na system produkcji. Jednocześnie, oddziaływanie buforów pośrednich utrudnia modelowanie procesu degradacji systemu. Istnieją badania dotyczące oceny funkcjonowania i optymalizacji utrzymania systemów produkcyjnych wykorzystujących bufory pośrednie przy założeniu wcześniej określonej struktury strategii utrzymania ruchy. Jednak tylko nieliczne prace zwracają uwagę na własności optymalnej struktury strategii utrzymania ruchu. W przedstawionej pracy opracowano opartą na procesie decyzyjnym Markowa metodę określania optymalnej strategii utrzymania ruchu dla układu szeregowo-równoległego z dwoma podsystemami wieloskładnikowymi oraz buforem pośrednim. Przeanalizowano strukturę otrzymanej optymalnej strategii utrzymania i wykazano, że struktury takiej nie można zamodelować przy użyciu ograniczonej liczby parametrów. Jednak odkryto pewne przydatne właściwości struktury strategii, które mogą ułatwić optymalizację utrzymania ruchu. Innym interesującym odkryciem było to, że duża pojemność bufora nie zawsze daje wysoką średnią przychodów mimo iż koszty przechowywania obiektu w buforze są znacznie mniejsze niż przychody z produkcji w przeliczeniu na jeden obiekt.
first rewind previous Strona / 1 next fast forward last
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.