Wyniki wyszukiwania - BazTech

1

The Use of the CUDA Architecture to Increase the Computing Effectiveness of the Simulation Module of a Ceramic Mould Quality Forecasting System

Hojny M., Żaba K., Dębiński T., Porada J.

Archives of Foundry Engineering

|

2020

|

Vol. 20, iss. 4

5--12

EN

This paper presents practical capabilities of a system for ceramic mould quality forecasting implemented in an industrial plant (foundry). The main assumption of the developed solution is the possibility of eliminating a faulty mould from a production line just before the casting operation. It allows relative savings to be achieved, and faulty moulds, and thus faulty castings occurrence in the production cycle to be minimized. The numerical computing module (the DEFFEM 3D package), based on the smoothed particle hydrodynamics (SPH) is one of key solutions of the system implemented. Due to very long computing times, the developed numerical module cannot be effectively used to carry out multi-variant simulations of mould filling and solidification of castings. To utilize the benefits from application of the CUDA architecture to improve the computing effectiveness, the most time consuming procedure of looking for neighbours was parallelized (cell-linked list method). The study is complemented by examples of results of performance tests and their analysis.

2

Lattice structure for parallel calculation of orthogonal wavelet transform on GPUs with CUDA architecture

Puchala D., Szczepaniak B., Yatsymirskyy M.

Przegląd Elektrotechniczny

|

2015

|

R. 91, nr 7

52-54

EN

In this paper, we present a novel approach to calculation of discrete wavelet transform (DWT) on modern Graphics Processing Units (GPUs) with CUDA architecture which takes advantage of highly parallel lattice structure. The experimental results obtained for model signals show that the proposed approach allows to obtain several times acceleration compared with sequential calculations carried out on the CPU while taking into account not only time of calculations but also time required for data transfers.

XX

W artykule zaproponowano nowe podejście do obliczania dyskretnego przekształcenia falkowego (DWT) na nowoczesnych procesorach graficznych (GPUs) o architekturze CUDA oparte o wysoce równoległe struktury kratowe. Wyniki badań eksperymentalnych przeprowadzonych na sygnałach modelowych pokazują, że zaproponowane podejście daje możliwość uzyskania kilkukrotnego przyśpieszenia obliczeń w porównaniu do obliczeń sekwencyjnych na CPU, biorąc pod uwagę nie tylko czas obliczeń , ale również czasy przesyłu danych.

3

Fetal state evaluation with fuzzy analysis of newborn attributes using CUDA architecture

Czabański R., Wróbel J., Jeżewski J.

Journal of Medical Informatics & Technologies

|

2013

|

Vol. 22

125--133

EN

Cardiotocography is a biophysical method of fetal state evaluation involving the recording and analysis of the fetal heart rate (FHR). Since a proper interpretation of the signal is relatively difficult, an automatic classification is often based on computational intelligence methods. The quality of classifiers based on supervised learning algorithms depends on a proper selection of learning data. In case of the fetal state evaluation, the learning is usually based on a set of quantitative parameters of FHR signal and the corresponding reference information determined on the basis of the retrospective analysis of newborn attributes. Values of the single attribute have been used so far as a reference. As a result, a part of information on the actual neonatal outcome has always been lost. The following paper presents a method of the fuzzy reasoning leading to an evaluation of neonatal outcome as a function of three newborn attributes. The fuzzy system was used in the process of a qualitative evaluation of the fetal state based on quantitative analysis of FHR signal using a support vector machine (SVM). In order to improve computational effectiveness, the learning algorithm was implemented in Compute Unified Device Architecture (CUDA). The results of these studies confirm the effectiveness of the proposed method and indicate the possibility of practical usage of the fuzzy system in supervised learning algorithms for the qualitative evaluation of the fetal state.

4

Modelowanie dynamiki pierścienia wirowego metodą cząstek wirowych z wykorzystaniem obliczeń równoległych na kartach graficznych

Kosior A., Kudela H.

Modelowanie Inżynierskie

|

2012

|

T. 13, nr 44

145-150

PL

W artykule przedstawiono metodę wiru w komórce (VIC -Vortex in Cell) służącą do rozwiązywania równań ruchu płynu. Ze względu na długi czas obliczeń przy użyciu jednego procesora w pracy została przedstawiona implementacja metody VIC z wykorzystaniem obliczeń równoległych w środowisku kart graficznych. Uzyskano dzięki temu 46-krotne skrócenie czasu obliczeń. Przedstawione zostały dwa przypadki testowe. Pierwszy prezentuje poprawność obliczeń prowadzonych z wykorzystaniem kart graficznych. Drugi pokazuje wpływ lepkości na prędkość przemieszczania się pierścienia wirowego.

EN

The paper presented the Vortex in Cell (VIC) method for solving the fluid motion equations in 3D. Due to the long time of computation on single processor the parallel implementation of the VIC method was presented. The speed-up for the entire VIC method implementation on the GPU was 46 times. Two test cases were presented. First one, shows correctness of the parallel implementation on GPU. Second example shows influence of viscosity on the translation velocity of the vortex ring.

5

Przyśpieszenie algorytmu Smitha-Watermana z użyciem procesora graficznego

Pawłowski R., Mrozek D.

Studia Informatica

|

2011

|

Vol. 32, nr 2A

181-198

PL

Wraz z wprowadzeniem przez firmę NVIDIA technologii CUDA wykorzystanie potencjału kart graficznych stało się łatwiejsze. W artykule przedstawiono metodę istotnego przyśpieszenia wykonania algorytmu Smitha-Watermana, znajdującego optymalne, lokalne dopasowanie dwóch sekwencji, takich jak sekwencje aminokwasów lub nukleotydów. Uzyskane wyniki sugerują, że możliwe jest dokładne przeszukiwanie bioinformatycznych baz danych w rozsądnym czasie.

EN

CUDA is a technology introduced by NVIDIA Corporation, which allows software developers to take advantage of GPU resources relatively easily. This paper presents an approach leading to significant acceleration of the execution of the Smith-Waterman algorithm. The algorithm finds the best local alignment of two sequences, such as amino acid or nucleotide sequences. The results show that it is possible to search bio-informatics databases accurately within a reasonable time.

6

Architektura CUDA w bezopóźnieniowych hurtowniach danych

Gorawski M., Lis D.

Studia Informatica

|

2011

|

Vol. 32, nr 2A

157-167

PL

Rośnie znaczenie i potrzeba zapewnienia aktualności danych oraz efektywności ich przetwarzania w bezopóźnieniowych hurtowniach danych. Standardowe podejście, oparte na tradycyjnym procesie ekstrakcji danych (ETL), okazało się niewystarczające ze względu na potrzebę podziału czasu dostępu, na czas odświeżania danych oraz czas ich analizy. W artykule przedstawiono system ETL dla bezopóźnieniowych hurtowniach danych. System ten realizuje algorytm WINE-HYBRIS, bazujący na architekturze CUDA oraz CPU. Przedstawiono testy wydajnościowe tego systemu opartego na dwóch całkowicie różnych architekturach, umożliwiając zobaczenie możliwości, jakie niesie za sobą wykorzystanie architektury CUDA w systemach hurtowni danych.

EN

There is a growing importance and the need to ensure data actualisation and efficiency of their processing in zero-latency data warehouses. The standard approach, based on the traditional process of data extraction (ETL) was not sufficient because of the need for time-sharing access during the process of refreshing the data and the time of analysis. The paper presents an ETL system for the zero-latency data warehouse. This system implements the WINE-HYBRIS algorithm based on CUDA and CPU architectures. Presented performance testing of the system, is based on two completely different architectures, allowing the possibility of observing upcoming opportunities that arises during usage of the CUDA architecture in data warehousing systems.

7

Zarządzanie pamięcią i blokami wątków w obliczeniach równoległych z użyciem architektury CUDA

Widuch J.

Studia Informatica

|

2010

|

Vol. 31, nr 4A

75-96

PL

Dzięki upowszechnieniu się procesorów wielordzeniowych przetwarzanie danych za pomocą obliczeń równoległych staje się coraz bardziej dostępne dla szerokiego grona użytkowników. Przykładem jest opracowana przez firmę NVIDIA architektura CUDA, będąca architekturą wielordzeniowych procesorów graficznych. Procesor graficzny może być traktowany jako procesor SIMD z pamięcią wspólną. Na przykładzie operacji mnożenia macierzy zbadano wpływ zarządzania pamięcią i blokami wątków na czas obliczeń z użyciem architektury CUDA.

EN

With the propagation of a multi-core processors a parallel data processing becomes more accessible to a wide range of users. An example is CUDA architecture developed by NVIDIA, which is a multi-core GPU architecture. The GPU can be treated as a SIMD processor with shared memory. The influence of memory management and blocks of threads management on time of computation using CUDA architecture was researched on the basis of matrix multiplication.