Akceleracja metody elementów skończonych przy użyciu procesora graficznego

Dziekoński, A.; Lamęcki, A.; Mrozowski, M.

doi:10.15199/48.2016.09.03

Article - details

Article title

Akceleracja metody elementów skończonych przy użyciu procesora graficznego

Authors

Dziekoński A., Lamęcki A., Mrozowski M.

Selected full texts from this journal

http://pe.org.pl/

Identifiers

DOI

10.15199/48.2016.09.03

Title variants

Acceleration of the Finite Element Method with GPU

Publication languages

Abstracts

The article presents the results of accelerating finite element method computations with a graphics processor. By running the two most computationally expensive stages, generation of the coefficient matrix and solution of the system of equations by the conjugate gradient method with a multilevel V-cycle preconditioner, as massively parallel computations on the graphics processor, the simulation time of the finite element method was reduced five-fold.

This paper presents the results of accelerating the computations involved in the finite element method with graphics processors. A 5-fold acceleration was achieved thanks to the massive parallelization of the two most time-consuming steps of the finite element method, namely matrix generation and the solution of a sparse system of linear equations with the conjugate gradient method and a V-cycle multilevel preconditioner.
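The solver named in the abstract, conjugate gradients with a V-cycle multilevel preconditioner, can be illustrated with a minimal CPU-only sketch. This is not the authors' GPU implementation: it assumes a 1D Poisson model matrix tridiag(-1, 2, -1) in place of a finite element matrix, a geometric grid hierarchy with linear interpolation, and damped Jacobi smoothing. All function names (`apply_A`, `v_cycle`, `pcg`) are hypothetical and chosen only for this illustration.

```python
def apply_A(x, scale=1.0):
    """Multiply x by scale * tridiag(-1, 2, -1), an assumed 1D Poisson model matrix."""
    n = len(x)
    y = []
    for i in range(n):
        left = x[i - 1] if i > 0 else 0.0
        right = x[i + 1] if i < n - 1 else 0.0
        y.append(scale * (2.0 * x[i] - left - right))
    return y


def v_cycle(r, scale=1.0, omega=2.0 / 3.0):
    """One symmetric V(1,1) cycle approximating A^{-1} r; len(r) must be 2**k - 1."""
    n = len(r)
    diag = 2.0 * scale
    if n == 1:
        return [r[0] / diag]  # coarsest level: exact solve
    # Pre-smoothing: one damped Jacobi step starting from a zero guess.
    z = [omega * ri / diag for ri in r]
    res = [ri - ai for ri, ai in zip(r, apply_A(z, scale))]
    # Restriction with R = P^T, where P is linear interpolation.
    nc = (n - 1) // 2
    rc = [0.5 * res[2 * j] + res[2 * j + 1] + 0.5 * res[2 * j + 2] for j in range(nc)]
    # For this model problem the Galerkin coarse matrix P^T A P equals
    # 0.5 * scale * tridiag(-1, 2, -1), so the recursion only rescales.
    ec = v_cycle(rc, 0.5 * scale, omega)
    # Prolongation: add the interpolated coarse correction.
    for j in range(nc):
        z[2 * j] += 0.5 * ec[j]
        z[2 * j + 1] += ec[j]
        z[2 * j + 2] += 0.5 * ec[j]
    # Post-smoothing: one damped Jacobi step (keeps the preconditioner symmetric).
    az = apply_A(z, scale)
    return [zi + omega * (ri - ai) / diag for zi, ri, ai in zip(z, r, az)]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def pcg(b, tol=1e-10, max_iter=100):
    """Conjugate gradients on A x = b, preconditioned by one V-cycle per iteration."""
    x = [0.0] * len(b)
    r = list(b)
    z = v_cycle(r)
    p = list(z)
    rz = dot(r, z)
    b_norm = dot(b, b) ** 0.5
    for it in range(1, max_iter + 1):
        ap = apply_A(p)
        alpha = rz / dot(p, ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * ai for ri, ai in zip(r, ap)]
        if dot(r, r) ** 0.5 <= tol * b_norm:
            return x, it
        z = v_cycle(r)
        rz_new = dot(r, z)
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x, max_iter
```

Calling `pcg([1.0] * 63)` returns the approximate solution together with the number of iterations used. In a GPU setting the matrix-vector product, the smoother, and the grid transfers are the natural candidates for parallel kernels, but that mapping is beyond what this sketch shows.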

Keywords

graphics processor; finite element method; multilevel preconditioner

GPU; FEM; multilevel preconditioner

Publisher

Wydawnictwo SIGMA-NOT

Journal

Przegląd Elektrotechniczny

Year

2016

Volume

Vol. 92, no. 9

Pages

12-15

Physical description

Bibliography: 24 items

Contributors

author

Dziekoński A.

adam.dziekonski@gmail.com

Politechnika Gdańska, Wydział Elektroniki, Telekomunikacji i Informatyki, ul. Gabriela Narutowicza 11/12, 80-233 Gdańsk

author

Lamęcki A.

adlam@eti.pg.gda.pl

Politechnika Gdańska, Wydział Elektroniki, Telekomunikacji i Informatyki, ul. Gabriela Narutowicza 11/12, 80-233 Gdańsk

author

Mrozowski M.

michal.mrozowski@eti.pg.gda.pl

Politechnika Gdańska, Wydział Elektroniki, Telekomunikacji i Informatyki, ul. Gabriela Narutowicza 11/12, 80-233 Gdańsk

Bibliography

[1] Jin J., The Finite Element Method in Electromagnetics, John Wiley and Sons Inc., 2002
[2] Volakis J.L., Chatterjee A., Kempel L.C., Finite Element Method for Electromagnetics. Antennas, Microwave Circuits and Scattering Applications, IEEE Series on Electromagnetic Wave Theory, 1998
[3] Dziekonski A., Optymalizacja wydajności obliczeniowej metody elementów skończonych w architekturze CUDA, PhD dissertation, Politechnika Gdańska, 2015
[4] Meng H.-T., Nie B.-L., Wong S., Macon C., Jin J.-M., GPU Accelerated Finite Element Computation for Electromagnetic Analysis, IEEE Antennas Propag. Mag., vol. 56, (2014) no. 2, 39-62
[5] Kreutzer M., Hager G., Wellein G., Fehske H., Bishop A.R., A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units, SIAM Journal on Scientific Computing, vol. 36, (2014) no. 5, C401-C423
[6] Anzt H., Tomov S., Dongarra J., Implementing a Sparse Matrix Vector Product for the SELL-C/SELL-C-σ formats on NVIDIA GPUs, technical report, University of Tennessee, Department of Electrical Engineering & Computer Science, 2014
[7] Langr D., Tvrdik P., Evaluation Criteria for Sparse Matrix Storage Formats, IEEE Transactions on Parallel and Distributed Systems, vol. 27, no. 2, 428-440, Feb. 1, 2016
[8] Li R., Saad Y., GPU-Accelerated Preconditioned Iterative Linear Solvers, Journal of Supercomputing, vol. 63, (2013) no. 2, 443-466
[9] Naumov M., Incomplete-LU and Cholesky Preconditioned Iterative Methods Using CUSPARSE and CUBLAS, http://docs.nvidia.com/cuda/incomplete-lu-cholesky/
[10] Goeddeke D., Strzodka R., Mohd-Yusof J., McCormick P., Wobker H., Becker C., Turek S., Using GPUs to Improve Multigrid Solver Performance on a Cluster, International Journal on Computer Science and Engineering, vol. 4, (2008) no. 1, 36-55
[11] Matrix Algebra on GPU and Multicore Architectures, http://icl.cs.utk.edu/magma/index.html
[12] AmgX, https://developer.nvidia.com/amgx
[13] Banas K., Plaszewski P., Maciol P., Numerical Integration on GPUs for Higher Order Finite Elements. Computers and Mathematics with Applications, vol. 67, (2014) 1319-1344
[14] Markall G., Slemmer A., Ham D., Kelly P., Cantwell C., Sherwin S., Finite Element Assembly Strategies on Multi-Core and Many-Core Architectures, Int. J. Numer. Meth. Fluids, vol. 71, (2013) no. 1, 80-97
[15] Cecka C., Lew A.J., Darve E., Assembly of Finite Element Methods on Graphics Processors, International Journal for Numerical Methods in Engineering, vol. 85, (2011) no. 3, 640-669
[16] Georgescu S., Chow P., Okuda H., GPU Acceleration for FEM-Based Structural Analysis, Archives of Computational Methods in Engineering, vol. 20, (2013) no. 2, 111-121
[17] Reguly I.Z., Giles M.B., Finite Element Algorithms and Data Structures on Graphical Processing Units, International Journal of Parallel Programming, vol. 43, (2015) no. 2, 203-239
[18] Fu Z., Lewis T.J., Kirby R.M., Whitaker R.T., Architecting the Finite Element Method Pipeline for the GPU, Journal of Computational and Applied Mathematics, vol. 257, (2014) 195-211
[19] Rubio J., Arroyo J., Zapata J., Analysis of Passive Microwave Circuits by Using a Hybrid 2-D and 3-D Finite-Element Mode-Matching Method, IEEE Trans. Microw. Theory Techn., vol. 47, (1999) no. 9, 1746-1749
[20] Sanders J., Kandrot E., CUDA by Example: An Introduction to General-Purpose GPU Programming, NVIDIA Corporation, 2011
[21] Nvidia. CUDA Programming Guide Version 4.0. Nvidia, June 2011. http://www.nvidia.com/object/cuda_develop.html.
[22] Dziekonski A., Sypek P., Lamecki A., Mrozowski M., Generation of Large Finite Element Matrices on Multiple Graphics Processors, Int. J. Numer. Meth. Eng., vol. 94, (2012) no. 2, 204-220
[23] Dziekonski A., Lamecki A., Mrozowski M., A Memory Efficient and Fast Sparse Matrix Vector Product on a GPU, Progress In Electromagnetics Research, vol. 116, (2011) 49-63
[24] Dziekonski A., Lamecki A., Mrozowski M., GPU Acceleration of Multilevel Solvers for Analysis of Microwave Components With Finite Element Method, IEEE Microw. Compon. Lett., vol. 1, (2011) 1-3

Notes

Prepared with funds from MNiSW (the Polish Ministry of Science and Higher Education) under agreement 812/P-DUN/2016 for science dissemination activities.

Document type

Bibliography

YADDA identifier

bwmeta1.element.baztech-0c387dcb-6ddd-4c2a-a8ad-c2365f8fbe76