Znaleziono wyników: 2

Liczba wyników na stronie
first rewind previous Strona / 1 next fast forward last
Wyniki wyszukiwania
help Sortuj według:

help Ogranicz wyniki do:
first rewind previous Strona / 1 next fast forward last
EN
This paper explores cost-effective alternatives for resource-constrained environments in the context of language models by investigating methods such as quantization and CPUbased model implementations. The study addresses the computational efficiency of language models during inference and
the development of infrastructure for text document processing. The paper discusses related technologies, the CLARIN-PL infrastructure architecture, and implementations of small and large language models. The emphasis is on model formats, data precision, and runtime environments (GPU and CPU). It identifies optimal solutions through extensive experimentation. In addition, the paper advocates for a more comprehensive performance evaluation approach. Instead of reporting only average token throughput, it suggests considering the curve’s shape, which can vary from constant to monotonically increasing or decreasing functions. Evaluating token throughput at various curve points, especially for different output token counts, provides a more informative perspective.
Mniej
Więcej
EN
The chapter discusses the performance aspects of intelligent agents in Complex Event Processing (CEP) systems. The contemporary solution for implementing CEP systems is based on available software components (Siddhi) and modern implementation techniques (Kubernetes). However, Siddhi lacks the
implementation of modern deep learning algorithms. Hence, the concept of intelligent agent is introduced. A case study with a set of intelligent agents designed to handle real-world events related to environmental data monitoring is presented. The results of the case study discussion indicate a reasonable scale for tuning the Event Processing Element (EPA) topology with correct responses and the required output performance level. These results have important implications for the practical implementation of the EPA structure, i.e., the use of GPUs in CEP systems. Finally, the results of performance analysis of different implementations of intelligent agents are presented and discussed.
Mniej
Więcej
Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
first rewind previous Strona / 1 next fast forward last