Search results - BazTech

1

Testbed for thermal and performance analysis in MPSoC systems

Sojka Michal, Benedikt Ondřej, Hanzálek Zdeněk, Zaykov Pavel

Annals of Computer Science and Information Systems

|

2020

|

Vol. 21

683--692

EN

Many modern computing platforms in safety-critical domains are based on heterogeneous Multiprocessor Systems-on-Chip (MPSoC). Such platforms are expected to guarantee high performance within a strict thermal envelope. This paper introduces a testbed for thermal and performance analysis. The testbed allows users to develop advanced scheduling and resource allocation techniques that aim for an optimal trade-off between peak temperature and achieved performance. The paper presents Thermobench, a new open-source tool for data collection and analysis of user-defined workloads. Furthermore, a methodology for shortening the time needed for data collection is proposed. Experiments show that a significant amount of time can be saved. Specifically, a reduction from 60 minutes to 15 minutes is achieved on the i.MX8 MPSoC from NXP while running a set of user-defined benchmarks that stress the CPU, the GPU, and different levels of the memory hierarchy.

2

Parallel implementation of a PIC simulation algorithm using OpenMP

Suciu Alin, Hangan Anca, Marginean Anca, Joldos Marius, Voitcu Gabriel, Echim Marius

Annals of Computer Science and Information Systems

|

2020

|

Vol. 21

381--385

EN

Particle-in-cell (PIC) simulations follow the individual trajectories of a very large number of particles in self-consistent and external electric and magnetic fields; they are widely used, for example, in the study of plasma jets. The main disadvantage of PIC simulations is the long simulation runtime, which often requires a parallel implementation of the algorithm. The paper focuses on a PIC1d3v simulation algorithm and describes the successful implementation of a parallel version on a multi-core architecture using OpenMP, with very promising experimental and theoretical results.

3

Efficient Computation of RNA Partition Functions Using McCaskill’s Algorithm

Zhao Chunchun, Sahni Sartaj

Annals of Computer Science and Information Systems

|

2020

|

Vol. 21

449--452

EN

We develop efficient single- and multi-core algorithms to compute partition functions for RNA sequences. Our algorithms, which are based on McCaskill's algorithm, are benchmarked against state-of-the-art fast algorithms obtained using the parallelizing source-to-source compilers PLUTO and TRACO. On our Intel I9 computational platform, our best single core algorithm takes up to 81.2% less time than the single core algorithm resulting from PLUTO, which is faster than that obtained from TRACO. Our best multi-core algorithm takes up to 84.7% less time than the multi-core algorithm obtained using TRACO when run with 20 threads (our I9 has 10 cores and supports hyperthreading); the TRACO multi-core algorithm is faster than the PLUTO one.

4

A Reactive Search-Based Algorithm for Scheduling Multiprocessor Tasks on Two Dedicated Processors

Aïder Méziane, Baatout Fatma Zohra, Hifi Mhand

Annals of Computer Science and Information Systems

|

2020

|

Vol. 21

257--261

EN

In this paper, we propose a reactive search-based algorithm for scheduling multiprocessor tasks on two dedicated processors. An instance of the problem is characterized by two processors and a set of tasks divided into three subsets, where some tasks can be executed on either one processor or two processors. The goal is to find a schedule of all tasks that minimizes the completion time of the last assigned task. The proposed reactive search starts from an initial greedy solution. Next, a series of local operators combined with a tabu list is introduced to intensify the search. The method is also reinforced with a drop-and-rebuild operator that diversifies the search. Finally, the performance of the proposed method is evaluated on a set of benchmark instances, and its results are compared with those achieved by a recent method from the literature. Encouraging results have been obtained.
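The objective described above can be illustrated with a minimal sketch. It is not the authors' algorithm: it only evaluates the completion time of the last task for a given task order, assuming each task is a hypothetical `(duration, procs)` pair where `procs` is `"P1"`, `"P2"`, or `"both"`, and that a two-processor task must occupy both processors simultaneously. A search method such as the one in the abstract would compare candidate orders with an evaluation of this kind.

```python
def makespan(order):
    """Completion time of the last task when tasks are started in the given order.

    Assumption (not from the source): tasks are (duration, procs) pairs and a
    task with procs == "both" starts once both processors are free.
    """
    free = {"P1": 0, "P2": 0}  # time at which each processor becomes free
    for duration, procs in order:
        if procs == "both":
            start = max(free["P1"], free["P2"])
            free["P1"] = free["P2"] = start + duration
        else:
            free[procs] += duration
    return max(free.values())


# Two orders of the same four tasks give different completion times.
order_a = [(3, "P1"), (2, "P2"), (4, "both"), (1, "P2")]
order_b = [(3, "P1"), (2, "P2"), (1, "P2"), (4, "both")]
result_a = makespan(order_a)
result_b = makespan(order_b)
```

In this toy instance, moving the short single-processor task ahead of the two-processor task fills idle time on the second processor, which is the kind of improvement a local move could find.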

5

Adaptation of the rate monotonic scheduling algorithm for multiprocessor architectures (Adaptacja algorytmu rate monotonic scheduling dla architektur wieloprocesorowych)

Gajer M., Handzel Z.

Informatyka Teoretyczna i Stosowana

|

2005

|

Vol. 5, no. 8

95-105

PL

This article proposes extending the applicability of the popular Rate Monotonic Scheduling (RMS) task scheduling algorithm to multiprocessor architectures. Until now, RMS has been used to schedule sets of independent, preemptive, periodic tasks intended for a single processor only. The growing popularity of multiprocessor solutions makes it necessary to adapt RMS so that it is also suitable for scheduling multiprocessor tasks. The article is devoted to multiprocessor architectures with a hypercube topology. This topology has a very favorable ratio of the number of communication links between processing units to the maximum length of a message transmission path, which is the direct reason for its large and still growing popularity. The modification of the classical RMS algorithm proposed by the authors, which allows it to be used for scheduling a set of tasks executed in a parallel system with a hypercube topology, consists in binarizing the task periods. As a result of binarization, selected tasks obtain identical period values, so they can be merged into one larger "supertask", whose execution requires the simultaneous availability of all processing units in the system. The set of supertasks can then be treated like a set of uniprocessor tasks whose execution requires a processing unit of a special type, namely a cluster built from the appropriate number of processors. From the point of view of the program that schedules the supertasks, however, the internal structure of the processing unit is irrelevant, and the supertasks can be treated in the same way as uniprocessor tasks, so the RMS algorithm can be applied directly.

EN

Real-time systems are becoming increasingly popular; most contemporary industrial and communication systems could not do without them. The popularity of real-time systems with hard real-time constraints has driven extensive development of task scheduling theory. In such systems it is not enough for a task to produce logically correct results; the results must also be delivered within their time constraints. Even logically correct results that violate their time constraints are useless. Moreover, the consequences of violating time constraints can be severe, causing great economic losses and even loss of human life, e.g. in control systems for nuclear reactors or spacecraft. The main goal of task scheduling theory is to prove, at the system design stage, that the time constraints of all tasks will always be met under any possible circumstances. Real-time systems with hard real-time constraints very often need to schedule a set of independent, preemptive, periodic tasks. The most popular algorithm for scheduling such a set is the Rate Monotonic Scheduling (RMS) algorithm. The paper proposes applying RMS to multiprocessor architectures. The hypercube architecture was chosen because of its many useful properties. The proposed method consists in binarizing the task periods and grouping the tasks into task clusters called supertasks. The supertasks are then scheduled as if they were normal uniprocessor tasks, except that the whole hypercube architecture must be available to execute them. The method is illustrated with an example of scheduling 20 tasks on a four-dimensional hypercube.
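The grouping step described in this abstract can be sketched as follows. The abstract does not define binarization precisely, so this sketch assumes one plausible reading: each integer period is rounded down to the nearest power of two, and tasks that end up with the same period are merged into one supertask. The function names and the sample task set are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def binarize_period(period: int) -> int:
    """Round a period down to the largest power of two not exceeding it.

    Assumption: this is only one possible interpretation of "binarization".
    """
    if period < 1:
        raise ValueError("period must be a positive integer")
    return 1 << (period.bit_length() - 1)


def group_into_supertasks(tasks):
    """Group (name, period) pairs by binarized period.

    The result is sorted by period, shortest first, which matches the
    rate monotonic rule that a shorter period means a higher priority.
    """
    groups = defaultdict(list)
    for name, period in tasks:
        groups[binarize_period(period)].append(name)
    return sorted(groups.items())


# Hypothetical task set: (name, period in time units).
tasks = [("a", 5), ("b", 7), ("c", 9), ("d", 16), ("e", 3)]
supertasks = group_into_supertasks(tasks)
```

Rounding down rather than up is a choice made for this sketch: a task that runs more often than required still meets its original period, at the cost of extra processor load.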

6

Comparison of the efficiency of various multiprocessor task scheduling algorithms (Porównanie efektywności różnych algorytmów szeregowania zadań wieloprocesowych)

Gajer M.

Kwartalnik Elektroniki i Telekomunikacji

|

2000

|

Vol. 46, no. 1

21-34

PL

The article considers the scheduling of multiprocessor tasks from the field of image processing on the TMS320C80 multiprocessor device. It examines the problem of scheduling a set of independent multiprocessor tasks on the three DSP processors that form part of the TMS320C80. The aim was to find a schedule of uniprocessor and biprocessor tasks, intended for dedicated DSP processors, such that the total time the processors remain idle is minimal. Algorithms proposed in the literature are reviewed, and a new, original solution to the problem is proposed. The article also reports experiments comparing the efficiency of the newly proposed algorithm with two algorithms previously presented in the literature.

EN

The paper discusses the problem of scheduling multiprocessor tasks for the TMS320C80 multiprocessor image processing device. The problem of scheduling a set of independent multiprocessor tasks on three DSP processors is examined. The main purpose is to find a schedule of uniprocessor and biprocessor tasks for three dedicated DSP processors such that the total idle time of all the processors is as short as possible. Algorithms proposed in the literature are discussed, and a new solution to the problem is presented. Experimental results comparing the effectiveness of the proposed task scheduling algorithm with two algorithms known from the literature are also presented.