Wyniki wyszukiwania - BazTech

1

Methods of automatic topic mining in publications in agriculture domain

Karwowski W., Wrzeciono P.

Information Systems in Management

|

2017

|

Vol. 6, No. 3

192--202

EN

Today the vast majority of resources are available in digital form. Publications frequently are related to topics not set out in the title or even summary. In this paper we presented and discussed examples of methods of finding the common topic of a publication in the field of agriculture with the use of AGROVOC dictionary. The focus is on publications in the Polish language, and the possibilities of the use of the semantics defined in the multi-language thesaurus AGROVOC. First indexing tools, especially Agrotagger, which is useful for documents in the field of agriculture, are presented, and also the test results of Agrotagger are discussed. Next the semantic technologies implemented in the AGROVOC thesaurus are discussed. In the final part, we described the design and implementation of a system, based on Polish language dictionary and AGROVOC. Additionally some tests of implemented system are discussed.

2

Wpływ mechanizmu indeksowania danych na szybkość realizacji zapytań SQL

Gryglewicz-Kacerka W., Kacerka J.

Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej

|

2015

|

Nr 47

59--62

PL

Przedmiotem pracy jest analiza doświadczalna wpływu wybranych metod indeksowania na czas wykonania polecenia SQL. Badaniom poddano bazę testową wydzieloną z rzeczywistego systemu bankowego. Pomiary czasu wykonania zapytania SQL wykonano dla tabel zawierających do 1000000 rekordów. Badania przeprowadzono dla tabel nie zawierających indeksów oraz tabel zawierających indeksy. Do badań wykorzystano relacyjny system zarządzania bazą danych oparty na Oracle 9. Wyniki badań doświadczalnych pozwoliły na modernizację pracy systemu bankowego.

EN

The subject of the study is to analyse the impact of selected experimental methods of crawling on the execution time of SQL commands. The study involved test database separate from the actual banking system. Measurements runtime SQL queries made for tables containing up to 1000000 records. The study was conducted for tables that do not contain indexes and tables with indexes. The study used relational database management system based on Oracle. The experimental results have enabled the modernization work of the banking system. Analysis of the results showed that the use of indices in most of the cases can significantly reduce the waiting time for the results of SQL queries, especially for tables containing a large amount of records. In the cases studied to reduce the time it was even more than 6few hundred times.

3

Indeksowanie tabel dla różnych gęstości grup zapytań SQL (na przykładzie ORACLE 11G)

Boroński R., Bocewicz G.

Studia Informatica

|

2014

|

Vol. 35, nr 2

127--138

PL

Powszechnie stosowane komercyjne narzędzia doboru indeksów działają na podstawie metod umożliwiających indeksowanie tabel będących częścią niezależnych zapytań SQL. W artykule przedstawiono ideę indeksowania tabel uwzględniającą gęstość grupy zapytań. Przedstawiono wyniki uzyskane autorską Metodą Doboru Indeksów (MDI), opierającą się na algorytmie genetycznym. Przeprowadzone badania pokazują, że dla różnych gęstości zastosowanie indeksu grupowego pozwala skrócić czas wykonania zapytań (o 15%), a także zmniejszyć rozmiar indeksów (o 68-90%).

EN

Commonly used commercial tools are based on a methodology that enables tables indexing for individual SQL queries. The article presents an original method, based on a genetic algorithm, for indexing tables for groups of queries in a relational database. Conducted experiments have shown that the use of indices for a group of queries can reduce the group execution time by 15% as well as can reduce the memory needs by 68-90%.

4

Automatic indexer for Polish agricultural texts

Karwowski W., Wrzeciono P.

Information Systems in Management

|

2014

|

Vol. 3, No. 4

229--238

EN

Today, the majority of resources are available in digital forms to acquire information. We have to search through collections of documents. In this paper text indexing which can improve searching is described. Next, indexing tool, the Agrotagger, which is useful for documents in the field of agriculture, is presented. Two available versions of the Agrotagger are tested and discussed. The Agrotagger is useful only for the English language despite the fact that it uses multilingual thesaurus Agrovoc. Because of the Agrotagger is not useful for texts in Polish, it is important to create similar tool appropriate for the Polish language. The problems connected with extensive inflection in languages such as Polish language in the process of indexing were discussed. In the final part of the paper, it is presented design and implementation of a system, based on the Polish language dictionary and the Agrovoc. Additionally some tests of implemented system are discussed.

5

Semantic text indexing

Kaleta Z.

Computer Science

|

2014

|

Vol. 15 (1)

19--34

EN

The following article presents a specific issue of semantic analysis of texts in natural language – text indexing and describes one field of its application (web browsing). The main part of this article describes a computer system assigning a set of semantic indexes (similar to keywords) to a particular text. The indexing algorithm employs a semantic dictionary to find specific words in a text that represent a text content. Furthermore, it compares two given sets of semantic indexes to determine similarities between texts (assigning a numerical value). The article describes the semantic dictionary – a tool essential to accomplish this task and its usefulness, the main concepts of the algorithm, and the test results.

6

Automatic indexing of information resources concerning agriculture in Polish

Karwowski W., Wrzeciono P.

Agricultural Engineering

|

2014

|

Vol. 18, No. 4

103--110

EN

Contemporary research and production activity require searching and collecting a variety of information, this also applies to issues in the field of agriculture. Today, the vast majority of resources are available in a digital form. FAO on the portal of the Agricultural Information Management Standards presents an AgroTagger, tool for indexing documents in the field of agriculture, which is designed for the English language. Extraction of knowledge is not very convenient in languages such as Polish language with a very extensive inflection. In Polish, the following parts of speech inflect: verbs, nouns, numerals, adjectives, and pronouns. Proper indexing requires an initial reduction of grammatical forms, to which the authors have used the dictionary of the Polish language and have developed a programme of reducing. Moreover the algorithms for determining weights corresponding to the validity of the appointments taking into account the prevalence of terms and their position in the document were developed and implemented.

PL

Współcześnie działalność badawcza i produkcyjna wymaga wyszukiwania i gromadzenia różnorodnych informacji, dotyczy to także zagadnień z dziedziny rolnictwa. Obecnie większość zasobów dostępna jest w formie cyfrowej. FAO w ramach portalu Agricultural Information Management Standards prezentuje AgroTagger narzędzie do indeksowania dokumentów z dziedziny rolnictwa, które przeznaczone jest dla języka angielskiego. Ekstrakcja wiedzy jest utrudniona w językach takich jak język polski, posiadających bardzo rozbudowaną fleksję. W języku polskim odmienia się rzeczowniki, czasowniki, przymiotniki oraz zaimki osobowe. Właściwa indeksacja wymaga wstępnej redukcji form fleksyjnych, wobec czego wykorzystano słownik odmian języka polskiego i opracowano program redukujący. Ponadto opracowano i zaimplementowano algorytmy wyznaczania wag odpowiadających ważności terminów uwzględniające częstość występowania terminów i ich pozycję w dokumencie.

7

Indeksowanie tabel dla grupowych zapytań SQL z uwzględnieniem kryterium rozmiaru

Boroński R.

Studia Informatica

|

2013

|

Vol. 34, nr 2A

85--96

PL

Indeksowanie jest kluczowym elementem optymalizacyjnym systemów relacyjnych baz danych. Komercyjne narzędzia doboru indeksów (np. Toad, SQL Server Database Tuning Advisor) działają na podstawie metod przeznaczonych dla pojedynczych zapytań. W artykule przedstawiono podejście indeksowania tabel w ramach grupowych zapytań SQL uwzględniające kryterium rozmiaru indeksów. Przedstawione przykłady ilustrują, że zastosowanie podejścia grupowego pozwala zmniejszyć czas wykonania zapytań nawet o 30% w stosunku do rozwiązań uzyskanych klasycznymi metodami.

EN

This paper discusses the problem of minimizing the response time for a given database workload by a proper choice of indexes. The main objective of our contribution is to illustrate the database queries as a group and search for good indexes for the group instead of an individual query, including the size criterion. Examples illustrate that the use of a group approach can reduce queries block execution time of 30% compared to classical methods.

8

Interactive 3D architectural visualization with semantics in web browsers

Książek M., Pietruszka M.

Journal of Applied Computer Science

|

2012

|

Vol. 20, nr 2

59-70

EN

This paper focuses on rendering, and access to visual and descriptive information about the digital architectural models on the Web. It was proposed to reach these contents with a help of deep linking, which allows to access to different views and descriptions from both the internal navigation system or from the browser, or search engine. Along with the HTML5 and WebGL it allows updating the link during the exploration of a virtual model, and remembers to re-use. Although all the methods were tested on architecture's models, it can be used in other interactive 3D applications.

9

Sprzętowo-programowa analiza obrazu otrzymanego z detektora obiektów ruchomych

Pankiewicz B., Wójcikowski M., Żaglewski R.

Elektronika : konstrukcje, technologie, zastosowania

|

2010

|

Vol. 51, nr 2

119-122

PL

W artykule przedstawiono budowę wewnętrzną oraz zasadę działania sprzętowo-programowego bloku realizującego analizę danych z obrazowego detektora ruchu. System zrealizowano za pomocą 2 identycznych procesorów 8-bitowych pracujących synchronicznie, jednego 32-bitowego procesora typu BA12 [4] oraz zestawu tablic pamięci. Algorytm analizy obrazu jest dwuetapowy. W pierwszym etapie następuje transformacja geometryczna umożliwiająca w przyszłości analizę odległości, wielkości i prędkości wykrytych obiektów. Drugi etap jest typowym indeksowaniem wykrytych obiektów. System został wykonany praktycznie z wykorzystaniem układu FPGA i potwierdza prawidłowość działania zaproponowanego rozwiązania.

EN

In the paper the structure and operation of hardware-software block for data analysis FROM image detector is presented. The system consists of 2 identical 8-bit custom processors working synchronously, single 32-bit processor BA12 [4] and the set of memory tables. The algorithm is composed of two phases. In the first phase, the geometrical transformation needed for distance and size measuring of the detected objects is calculated. During the second phase, the detected objects are indexed. The system has been practically realized using FPGA and works correctly, which confirms the correctness of the proposed idea.

10

Time complexity of page filling algorithms in Materialized Aggregate List (MAL) and MAL/TRIGG materialization cost

Gorawski M.

Control and Cybernetics

|

2009

|

Vol. 38, no 1

153-172

EN

The Materialized Aggregate List (MAL) enables effective storing and processing of long aggregates lists. The MAL structure contains an iterator table divided into pages that stores adequate number of aggregates. Time complexity of three algorithms was calculated and, in comparison with experimental results, the best configuration of MAL parameters (number of pages, single page size and number of database connections) was estimated. MAL can be also applied to every aggregation level in different indexing structures, like for instance the aR-tree.

11

How to improve efficiency of analysis of sequential data?

Andrzejewski W., Królikowski Z., Morzy T.

Control and Cybernetics

|

2009

|

Vol. 38, no 1

107-126

EN

Many of todays database applications, including market basket analysis, web log analysis, DNA and protein sequence analysis utilize databases to store and retrieve sequential data. Commercial database management systems allow to store sequential data, but they do not support efficient querying of such data. To increase the efficiency of analysis of sequential data new index structures need to be developed. In this paper we propose an indexing scheme for non-timestamped sequences of sets, which supports set subsequence queries. Our contribution is threefold. First, we describe the index logical and physical structure, second, we provide algorithms for set subsequence queries utilizing this structure, and finally we perform experimental evaluation of the index, which proves its feasibility and advantages in set subsequence query processing.

12

Uogólnione podejście do aktualizacji indeksów w obiektowej bazie danych

Kowalski T. M., Kuliberda K., Draus C., Adamus R., Wiślicki J.

Automatyka / Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie

|

2009

|

T. 13, z. 3/2

1545--1561

PL

W artykule opisujemy uogólnione podejście do problemu automatycznej aktualizacji indeksów w reakcji na zmiany odpowiadających im danych. W celu umożliwienia tworzenia i przezroczystej konserwacji indeksów wspierających klucze oparte na dowolnych, deterministycznych i wolnych od efektów ubocznych wyrażeniach autorzy zaproponowali zastosowanie specjalnego rodzaju procedur wyzwalanych. Języki zapytań dla obiektowych modeli (klasy, dziedziczenie, polimorfizm, metody, itp.) pozwalają na łatwe definiowanie bardziej złożonych warunków selekcji. W celu zapewnienia pełnej przezroczystości indeksowania, mechanizmy aktualizacji indeksów wymagają znaczącej rewizji. Niewystarczająca kontrola poprawności indeksów może prowadzić do poważnych błędów podczas przetwarzania zapytań. Praca autorów jest oparta na architekturze stosowej SBA (Stack-Based Architecture) i została zaimplementowana w prototypowej obiektowej bazie danych ODRA (Object Database for Rapid Developmeni).

EN

We describe a generalized approach to the problem of the automatic index updating in response to modification of corresponding data. To enable creation and transparent maintenance of indices supporting keys defined using arbitrary, deterministic and side effects free expressions the authors propose applying a special kind of database triggers. Query language for object-oriented model (classes, inheritance, polymorphism, class methods, etc.) allows easy defining of more complex selection predicates; nevertheless, in order to provide full indexing transparency, index updating requires substantial revising. Inadequate index maintenance can lead to serious errors in query processing. The authors work is based on the Stack-Based Architecture (SBA) and has been implemented in the ODRA (Object Database for Rapid Applications Development) OODBMS prototype.

13

Metody optymalizacji przez indeksowanie dla obiektowego języka zapytań

Kowalski T. M., Cebula P., Kuliberda K., Wiślicki J., Adamus R.

Automatyka / Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie

|

2009

|

T. 13, z. 3/2

1529-1543

PL

W artykule zostały opisany ogólne zasady optymalizacji zapytań przez indeksowanie dla obiektowego języka zapytań SBQL (Stack-Based Query Language). Opracowane metody zostały zaimplementowane i przetestowane w prototypie systemu ODRA. Implementacja indeksowania na potrzeby systemu ODRA opiera się na liniowym haszowaniu i działa lokalnie w zakresie samodzielnej bazy danych. Składa się ona z przezroczystej optymalizacji zapytań, automatycznej aktualizacji indeksów oraz modułu zarządzającego. Na kilku przykładach zostały omówione kwestie semantycznej równoważności zaproponowanych metod optymalizacji w kontekście obiektowego modelu danych i języka zapytań.

EN

In paper we present an overview of query optimization by indexing for SBQL (Stack-Based Query Language). Developed methods have been implemented and tested in ODRA prototype system. The ODRA index implementation is based on linear hashing and works in a scope of a standalone database. It consists of transparent optimization, automatic index updating and management facilities. The semantic equivalence of proposed query optimization methods in the context of object data model and query language is discussed on several examples.

14

Integracja oraz indeksowanie rozproszonych zasobów danych w technologii "data grid"

Kuliberda K., Kowalski T. M., Wiślicki J., Adamus R., Meina M.

Automatyka / Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie

|

2009

|

T. 13, z. 3/2

1563-1570

PL

Problemy integracji rozproszonych zasobów są obecnie jednym z podstawowych zagadnień w dziedzinie gromadzenia danych i uzyskiwania spójnej i wiarygodnej informacji - odpowiedź została zawarta w artykule. Autorzy opisują aspekty przezroczystej integracji rozproszonych danych do obiektowego gridu bazodanowego poprzez technologię p2p z uwzględnieniem niezwykle istotnej kwestii ich indeksowania. Przedstawione rozwiązanie zostało zaimplementowane i zweryfikowane poprzez w pełni funkcjonalny prototyp. Tekst prezentuje podstawy wykorzystania architektury p2p oraz procedury indeksowania danych pochodzących z odległych źródeł, dzięki którym dostęp do nich staje się szybszy o rzędy wielkości, a transport przez sieć ograniczony do niezbędnego minimum.

EN

The problems of integration of distributed resources are currently one of the most substantial issues in the domain of collecting data and retrieving consistent and reliable information - the answer has been included in the following paper. Authors describe aspects of transparent integration of distributed data into an object-oriented data grid with application of the p2p technology and introducing extremely crucial issues of indexing. The presented solution has been implemented and verified in the completely functional prototype. The paper presents basics of application of the p2p architecture and procedures of indexing data originating from remote sources. These procedures accelerate data access by orders of magnitude and data transportation becomes limited to the necessary minimum.

15

CBA-drzewa-kompaktowa struktura dla indeksowania przestrzennych obiektów niepunktowych

Gorawski M., Tlatlik D.

Studia Informatica

|

2008

|

Vol. 29, nr 1

99-118

PL

Artykuł ten prezentuje nowe struktury danych: CBA-drzewa i QCBA-drzewa zaprojektowane jako alternatywa dla BA-drzew eliminująca niektóre spośród ich wad. Omówiona została ogólna charakterystyka wprowadzonych struktur, a także przeprowadzone dla nich testy porównawcze. Opisany został również problem agregacji przestrzennej, dla którego w głównej mierze adresowane są przedstawione rozwiązania.

EN

This article presents new data structures for both: CBA and QCBA-trees designed as an options for BA-tree and as an elimination of several flows. As part of this article, general description of introduced structures was not only discussed but also compared by tests. In addition to this, the problem of spatial aggregation, for which the solutions are mainly addressed, was described.

16

Indexing biomedical streams in data management system

Widera M., Wróbel J., Matonia A., Jeżewski M., Horoba K., Kupka T.

Journal of Medical Informatics & Technologies

|

2005

|

Vol. 9

107--112

EN

We are developing Data Stream Management System that can be applied in medical monitoring system. One of the problems is how to support efficient data access methods. This paper considers indexing issues in data management system. The indexing is a commonly used method of data access acceleration. Classical and new streaming methods of indexing have been presented. The method was developed in data stream management system and first applied in a centralized neonatal monitoring system.

17

Rodzaje indeksowania w hurtowniach danych

Stopczyk M.

Mikroelektronika i Informatyka : prace naukowe

|

2004

|

Z. nr 4

189-195

18

Media cyfrowe

Skarbek W.

Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne

|

2003

|

nr 8-9

404-408

PL

Dokonano przeglądu osiągnięć grupy normalizacyjnej MPEG na tle podstawowych technik multimedialnych, tj. kompresji, selekcji, transportu i integracji. Scharakteryzowano aktualne prace i omówiono planowane aktywności grupy MPEG. Przeprowadzono dyskusję tematów badawczych podejmowanych w obszarze hybrydowych mediów cyfrowych integrujących multimedialne sceny rzeczywiste z multimedialnymi scenami wirtualnymi.

EN

A survey of achievements within multimedia normalization activities by MPEG group is included with basic multimedia techniques as the background: compression, selection, transportation and integration. It describes current MPEG works and presents nearest plans of the group. Finally a discussion of research topics proposed within hybrid digital media that integrate multimedia real scenes with multimedia virtual scenes.

19

Center-Based Indexing in Vector and Metric Spaces

Wojna A.

Fundamenta Informaticae

|

2003

|

Vol. 56, nr 3

285-310

EN

The paper addresses the problem of indexing data for k nearest neighbors (k-nn) search. Given a collection of data objects and a similarity measure the searching goal is to find quickly the k most similar objects to a given query object. We present a top-down indexing method that employs a widely used scheme of indexing algorithms. It starts with the whole set of objects at the root of an indexing tree and iteratively splits data at each level of indexing hierarchy. In the paper two different data models are considered. In the first, objects are represented by vectors from a multi-dimensional vector space. The second, more general, is based on an assumption that objects satisfy only the axioms of a metric space. We propose an iterative k-means algorithm for tree node splitting in case of a vector space and an iterative k-approximate-centers algorithm in case when only a metric space is provided. The experiments show that the iterative k-means splitting procedure accelerates significantly k-nn searching over the one-step procedure used in other indexing structures such as GNAT, SS-tree and M-tree and that the relevant representation of a tree node is an important issue for the performance of the search process. We also combine different search pruning criteria used in BST, GHT nad GNAT structures into one and show that such a combination outperforms significantly each single pruning criterion. The experiments are performed for benchmark data sets of the size up to several hundreds of thousands of objects. The indexing tree with the k-means splitting procedure and the combined search criteria is particularly effective for the largest tested data sets for which this tree accelerates searching up to several thousands times

20

A new approach to color person image indexing and retrieval

Muselet D., Macaire L., Postaire J. G.

Machine Graphics and Vision

|

2002

|

Vol. 11, No. 2/3

257-283

EN

In the context if image indexing for the purpose of retrieval, colored object recognition methods tend to fail when the illumination of the objects varies from an image to another. A new approach to indexing image of persons is proposed, which copes with the variations of the lighting conditions. We assume that illumination changes can be described using a simple linear transform. For comparing two images, we transform the color of the target one according to the colors of the query one by means of an original color histogram specification based on color invariant evaluation. For the retrieval purpose, we evaluate invariant color signatures of the query image and the transformed target image through the use of color co-occurrence matrices. Tests on real images are very encouraging, with substantially better results than those obtained with other well--established indexing and retrieval schmes.