Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Powiadomienia systemowe
  • Sesja wygasła!

Znaleziono wyników: 2

Liczba wyników na stronie
first rewind previous Strona / 1 next fast forward last
Wyniki wyszukiwania
Wyszukiwano:
w słowach kluczowych:  prawo Amdahla
help Sortuj według:

help Ogranicz wyniki do:
first rewind previous Strona / 1 next fast forward last
EN
The aim of this paper is to investigate dense linear algebra algorithms on shared memory multicore architectures. The design and implementation of a parallel tiled WZ factorization algorithm which can fully exploit such architectures are presented. Three parallel implementations of the algorithm are studied. The first one relies only on exploiting multithreaded BLAS (basic linear algebra subprograms) operations. The second implementation, except for BLAS operations, employs the OpenMP standard to use the loop-level parallelism. The third implementation, except for BLAS operations, employs the OpenMP task directive with the depend clause. We report the computational performance and the speedup of the parallel tiled WZ factorization algorithm on shared memory multicore architectures for dense square diagonally dominant matrices. Then we compare our parallel implementations with the respective LU factorization from a vendor implemented LAPACK library. We also analyze the numerical accuracy. Two of our implementations can be achieved with near maximal theoretical speedup implied by Amdahl’s law.
2
Content available remote Parallel FDTD simulation using Task Parallel Library (TPL)
EN
The finite-difference time-domain (FDTD) is a numerical analysis technique used for solving computational electrodynamic problems. The nature of the FDTD method is that simulation of big and complicated electromagnetic field problems requires a vast amount of computer operational memory and runtime. Parallel-processing techniques have been broadly applied to FDTD to accelerate the simulations. The parallelism of the FDTD algorithm is based on a fact that the computational domain can be divided into parts (sub-domains), and each processor in a parallel system deals with one or several sub-domains. The FDTD algorithm belongs to data parallelism model and can be effectively implemented on shared memory system architecture. The parallel FDTD method was implemented using TPL library. The Task Parallel Library (TPL) is a library for .NET that makes easy to parallelize the program using the advantages of .NET Framework. The speedup metrics of parallel FDTD algorithm were calculated and compared with Amdahl’s estimated speedup.
first rewind previous Strona / 1 next fast forward last
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.