Wyniki wyszukiwania - BazTech

1

Transformation of "user-object" matrix for the collaborative filtering

Savchuk T., Sakaliuk A., Wójcik W., Burlibay A.

Przegląd Elektrotechniczny

|

2014

|

R. 90, nr 1

55-59

EN

The paper is devoted to application of collaborative filtering that is one of the method of automatic data filtering in the Internet. The main disadvantage of the approach is the necessity of performing a large number of operations. The authors have presented a mean of overcoming this problem by reduction of the dimension of the input matrix. Experimental results show that it had led not only to reduction of computational time, but also increased the accuracy of recommendations obtained.

PL

Artykuł poświęcony jest filtrowaniu kolaboracyjnemu, które jest jedną z metod automatycznej filtracji danych w sieci Internet. Główną wadą wspomnianego podejścia jest konieczność wykonywania bardzo dużej liczby operacji. Autorzy przedstawili rozwiązanie tego problemu polegający na redukcji wymiarowości przetwarzanej macierzy. Rezultaty badań pokazują, że oprócz zmniejszenia czasu obliczeń, uzyskano poprawę dokładności uzyskiwanych rekomendacji.

2

Iterative methods for solving large-scale problems of structural mechanics using multi-core computers

Fialko S. Y.

Archives of Civil and Mechanical Engineering

|

2014

|

Vol. 14, no. 1

190--203

EN

The paper studies the conjugate gradient method for solving systems of linear algebraic equations with symmetric sparse matrices that arise when the finite-element method is applied to the problems of structural mechanics. The main focus is on designing effective preconditioning and parallelizing the method for multi-core desktop computers. Preconditioning is based on the incomplete Cholesky “by value” factorization method and implemented based on the technique of sparse matrices, which allows increasing convergence considerably without a significant increase of the computer's resources. Parallelization is implemented for the incomplete factorization as well as for iterative process stages. The method is integrated into the SCAD software package (www.scadsoft.com). The paper includes a discussion of the results of calculations done with direct and iterative methods for large-scale design models of tall buildings, originally from the SCAD Soft1 problem collection.

3

Computational aspects of GPU - accelerated sparse matrix - vector multiplication for solving Markov models

Bylina B., Bylina J., Karwacki M.

Theoretical and Applied Informatics

|

2011

|

Vol. 23, No. 2

127-145

EN

In this article we investigate some computational aspects of GPU-accelerated matrix-vector multiplication where matrix is sparse. Particularly, we deal with sparse matrices appearing in modelling with Markovian queuing models. The model we use for research is a Markovian queuing model of a wireless device. This model describes the device’s behavior during possible channel occupation by other devices. We study the efficiency of multiplication of a sparse matrix by a dense vector with the use of an appropriate, ready-to-use GPU-accelerated mathematical library, namely CUSP. For the CUSP library we discuss data structures and their impact on the CUDA platform for the fine-grained parallel architecture of the GPU. Our aim is to find the best format for storing a sparse matrix for GPU-computation (especially one associated with the Markovian model of a wireless device). We compare the time, the performance and the speed-up for the card NVIDIA Tesla C2050 (with ECC ON). For unstructured matrices (as our Markovian matrices), we observe speed-ups (in respect to CPU-only computations) of over 8 times.

PL

Łańcuchy Markowa są przydatnym narzędziem do modelowania systemów złożonych, takich jak systemy i sieci komputerowe. W ostatnich latach łańcuchy Markowa zostały z powodzeniem wykorzystane do oceny pracy sieci bezprzewodowych. Jednym z problemów jaki się pojawia przy wykorzystywaniu łańcuchów Markowa w modelowaniu sieci są problemy natury obliczeniowej. W artykule zajmiemy się badaniem mnożenia macierzy rzadkiej przez wektor, które jest jedną z głównych operacji podczas numerycznego rozwiązywania modeli Markowowskich. Aby, przyspieszyć czas obliczeń mnożenia macierz rzadkiej przez wektor wykorzystano funkcje z biblioteki CUSP. Biblioteka jest zbiorem funkcji wykonywanych na GPU (ang.Graphics Processing Unit) celem skrócenia czasu obliczeń. Do testowania operacji mnożenia macierzy rzadkiej przez wektor badano macierze z Markowowskiego modelu pracy sieci bezprzewodowej. Model ten opisuje zachowanie urządzenia, gdy kanał transmisyjnych może być zajęty przez inne urządzenia. Macierz przejść wspomnianego modelu jest macierzą rzadką i potrzeba specialnej struktury danych do jej przechowywania, dlatego w artykule dyskutowane są różne struktury danych dla macierzy rzadkich i ich przydatność do obliczen na kartach graficznych. W pracy porównano czas, wydajność i przyspieszenie jakie otrzymano podczas testowania biblioteki CUSP na karcie NVIDIA Tesla C2050 dla niestrukturalnych macierzy rzadkich opisujących model zajętości węzła w sieciach bezprzewodowych przy różnych formatach przechowywania macierzy rzadkich. Dla testowanych macierzy zauważono ośmiokrotne przyspieszenie obliczeń przy wykorzystaniu karty graficznej.

4

Incomplete column-row factorization method for general sparse matrices

Saukh S.

Przegląd Elektrotechniczny

|

2010

|

R. 86, nr 1

127-129

EN

Incomplete column-row factorization method is proposed. This method does not require permutations of rows and columns in submatrices. Calculable stability of the method is provided by search of pivots in submatrices to minimize divergence on the Frobenius norm between the transformed and regenerate submatrices. Significance of entries of the factor matrices is specified by comparison of norms of rows and columns of transformed and subtracted submatrices. The new method requires less memory than AINV, ILUT and RIF-Ns methods.

PL

Artykuł proponuje niekompletną faktoryzację kolumnowo-wierszową, która nie wymaga permutacji wierszy ani kolumn podmacierzy. Stabilność obliczeń jest zapewniana przez wybór elementów dominujących aby zminimalizować normę Frobeniusa dla odpowiednich podmacierzy. Znaczenie współczynników jest określane przez porównanie norm wierszy i kolumn. Nowa metoda ma mniejsze zapotrzebowanie na pamięć, niż AINV, ILUT i RIF-N.

5

Metody reprezentacji danych w aplikacjach inżynierskich wykorzystujacych architekturę SOA

Mikoda J., Sczygiol N., Wawszczak A.

Metody Informatyki Stosowanej

|

2010

|

nr 2 (23)

117-125

PL

Macierze rzadkie stanowią znaczną częśćc danych wykorzystywanych w symulacjach inżynierskich. W niniejszej pracy przedstawione zostały formaty zapisu takich macierzy w dokumentach SOAP. Dokumenty te stanowią podstawę komunikacji w aplikacjach zbudowanych w oparciu o architekturę SOA, a w szczególności w aplikacjach wykorzystujących technologie WebServices.Sposób zapisu dużych zbiorów danych, jakimi są między innymi rozważane w pracy macierze, ma znaczny wpływ na wydajność takich aplikacji. Przeprowadzone badania pokazały, że zastosowanie zaproponowanego w pracy sposobu serializacji macierzy znacznie zmniejsza ilość przesyłanych danych, a tym samym korzystnie wpływa na poprawę wydajności aplikacji. W sekcji 5 zaprezentowane zostały wyniki symulacji ustalonego przepływu ciepła, przeprowadzonej za pomocą metody elementów skończonych. Rezultaty te pokazuja, że narzut czasowy wynikający z wykorzystania technologii WebServices jest niewielki. Prezentowana w pracy architektura oprogramowania może być z powodzeniem stosowana w połączeniu z serwerami obliczeniowymi (GPU, FPGA, klastry) umożliwiającymi zrównoleglenie pewnych fragmentów symulacji, na przykład procesu rozwiązywania układu równań. Takie rozwiązanie pozwoli na znaczne skrócenie procesu obliczeń i w zupełności zniweluje opóźnienia spowodowane wykorzystaniem technologii WebServices.

EN

Over recent years the development of the Internet has resulted in significant increase in the popularity of applications based on Service Oriented Architecture. This kind of architecture can be also successfully used while designing engineering packages. This article contains analysis of problem that can occur in such solutions, that is sending large amount of data, in particular sparse matrices. The modified method of sparse matrices transcription in SOAP documents was described. This method enables to decrease the amount of data that is to be sent. The exemplary simulation of heat transfer was also presented. This simulation utilizes Finite Element Method and WebServices technology. The results of conducted research show that SOA architecture can be successfully used while designing engineering ackages.

6

A block sparse shared-memory multifrontal finite element solver for problems of structural mechanics

Fialko S. Y.

Computer Assisted Mechanics and Engineering Sciences

|

2009

|

Vol. 16, No. 2

117-131

EN

The presented method is used in finite-element analysis software developed for multicore and multiprocessor shared-memory computers, or it can be used on single-processor personal computers under the operating systems Windows 2000, Windows XP, or Windows Vista, widely popular in small or medium-sized design offices. The method has the following peculiar features: it works with any ordering; it uses an object-oriented approach on which a dynamic, highly memory-efficient algorithm is based; it performs a block factoring in the frontal matrix that entails a high-performance arithmetic on each processor and ensures a good scalability in shared-memory systems. Many years of experience with this solver in the SCAD software system have shown the method's high efficiency and reliability with various large-scale problems of structural mechanics (hundreds of thousands to millions of equations).

7

Classic and object-oriented general sparse matrices storage formats

Baron B., Pasierbek A., Połomski M.

Prace Naukowe Politechniki Śląskiej. Elektryka

|

2008

|

z. 4

7-22

EN

Computational algorithms used in power systems engineering and simulation are often based on large matrices, mainly of sparse character. There are many various sparse matrix notation formats. However, they are designed for reading simplicity and efficiency, while complicating matrix data generation and update code. This paper reviews most common sparse matrix notation formats, highlighting their common features and shortcomings. It also presents new sparse matrix storage format and its C++ interface, designed as universal, independent module. Its most notable features are: ease of use, efficiency, math friendliness and simplicity of usage (both in data consumer and producer code). The work has been funded from the science budget for years 2007-2010 as a research project No N511 001 32/0852 (PBU-10/RE3/2007).

PL

Algorytmy obliczeniowe stosowane w obliczeniach dotyczących optymalizacji systemu energetycznego bazują najczęściej na wielkich macierzach, często o rzadkim zapisie. Znanych jest wiele różnorodnych sposobów zapisu macierzy rzadkiej w pamięci komputera. Większość z nich została jednak stworzona z myślą o szybkości odczytywania elementów, co komplikuje jednak kod wypełniający macierz i zmieniający jej zawartość. Publikacja zawiera przegląd najpopularniejszych formatów zapisu macierzy rzadkich w pamięci, podkreślając ich zalety oraz wady. Prezentuje także nowy sposób zapisu wraz z interfejsem w języku C++, zaprojektowanym jako uniwersalny, niezależny moduł. Jego najważniejsze cechy to: łatwość wykorzystania, efektywność, dostosowanie do zapisu matematycznego oraz prostota struktur danych. Interfejs został stworzony w ramach grantu N511 001 32/0852 (PBU-10/RE3/2007) na lata 2007-2010.

8

A parallel block Lanczos algorithm and its implementation for the evaluation of some eigenvalues of large sparse symmetric matrices on multicomputers

Guarracino M. R., Perla F., Zanetti P.

International Journal of Applied Mathematics and Computer Science

|

2006

|

Vol. 16, no 2

241-249

EN

In the present work we describe HPEC (High Performance Eigenvalues Computation), a parallel software package for the evaluation of some eigenvalues of a large sparse symmetric matrix. It implements an efficient and portable Block Lanczos algorithm for distributed memory multicomputers. HPEC is based on basic linear algebra operations for sparse and dense matrices, some of which have been derived by ScaLAPACK library modules. Numerical experiments have been carried out to evaluate HPEC performance on a cluster of workstations with test matrices from Matrix Market and Higham’s collections. A comparison with a PARPACKroutine is also detailed. Finally, parallel performance is evaluated on random matrices, using standard parameters.

9

New high performance algorithms for sparse matrix envelope compression

Kamińska A., Pańczak B., Stabrowski M. M.

Kwartalnik Elektroniki i Telekomunikacji

|

2000

|

Vol. 46, nr 2

211-223

EN

The first of new methods defines the row weights and performs quick sorting of the rows and columns using weights criterion. The second method relies on swapping with diminishing threshold in order to find true global optimum. The third methods is a completely redeveloped and modified simulated annealing method. All three methods are compared with two classic algorithms. Matrices used in the tests are more exacting as in other tests up to date.

PL

Pierwsza z nowych metod definiuje wagi wierszy macierzy oraz wykonuje szybkie sortowanie wierszy i kolumn używając kryterium wagi. Druga metoda polega na przestawianiu wierszy ze zmniejszającą się wartością progową aż do znalezienia minimum globalnego. Trzecia metoda jest całkowicie zmodyfikowaną metodą symulowanego wyżarzania. Wszystkie trzy metody porównano z dwoma algorytmami klasycznymi. Użyte w testach macierze są trudniejsze do skompresowania niż w innych dotychczasowych testach.

10

Efficient bandwidth compression in sparse matrices

Kamińska A., Pańczak B., Stabrowski M. M.

Kwartalnik Elektroniki i Telekomunikacji

|

2000

|

Vol. 46, nr 2

225-235

EN

The first of new methods performs quick sorting of the rows and columns without explicit usage of bandwidth criterion. The second swapping method and the third - simulated annealing - rely on direct application of bandwidth optimisation criterion. These methods are compared with classic reverse Cuthill - McKee method and Akyuz and Utku method.

PL

Pierwsza z nowych metod dokonuje szybkiego sortowania wierszy i kolumn bez jawnego użycia kryterium szerokosci pasma. Druga - metoda progowa oraz trzecia - symulowanego wyżarzania - polegają na bezpośrednim zastosowaniu kryterium optymalizacji wartosci szerokości pasma. Metody te zostały porównane z klasyczną odwrotna metodą Cuthill-Mc-Kee oraz z metodą Akyuza-Utku.