Wyniki wyszukiwania - BazTech

Ograniczanie wyników

Znaleziono wyników: 2

Liczba wyników na stronie

Wyniki wyszukiwania

Wyszukiwano:
w słowach kluczowych: komputery wielordzeniowe

Sortuj według:

Ogranicz wyniki do:

Iterative methods for solving large-scale problems of structural mechanics using multi-core computers

Fialko S. Y.

Archives of Civil and Mechanical Engineering

2014

Vol. 14, no. 1

190--203

The paper studies the conjugate gradient method for solving systems of linear algebraic equations with symmetric sparse matrices that arise when the finite-element method is applied to the problems of structural mechanics. The main focus is on designing effective preconditioning and parallelizing the method for multi-core desktop computers. Preconditioning is based on the incomplete Cholesky “by value” factorization method and implemented based on the technique of sparse matrices, which allows increasing convergence considerably without a significant increase of the computer's resources. Parallelization is implemented for the incomplete factorization as well as for iterative process stages. The method is integrated into the SCAD software package (www.scadsoft.com). The paper includes a discussion of the results of calculations done with direct and iterative methods for large-scale design models of tall buildings, originally from the SCAD Soft1 problem collection.

Programming NVIDIA cards by means of transitive closure based parallelization algorithms

Palkowski M., Bielecki W.

Przegląd Elektrotechniczny

2012

R. 88, nr 10b

217-222

Massively parallel processing is a type of computing that uses many separate CPUs or GPUs running in parallel to execute a single program. Because most computations are contained in program loops, automatic extraction of parallelism available in loops is extremely important for many-core systems. In this paper, we study speed-up and scalability of parallel code scanning synchronization-free slices and time partitions by means of a 960 CUDA Cores machine, Tesla S1070.

Przetwarzanie równoległe na wielką skalę wykonywane jest za pomocą wielu procesorów (również graficznych) wykonujących jednocześnie instrukcje pojedynczego programu. Ponieważ większość obliczeń zlokalizowana jest w pętlach programowych, automatyczne zrównoleglanie kodu jest ważne dla maszyn wielordzeniowych. W artykule zbadano przyspieszenie i skalowalność równoległego kodu złożonego z niezależnych fragmentów lub harmonogramowania swobodnego za pomocą maszyny Tesla S1070 zbudowanej z 960 rdzeni CUDA.