The last few years have seen very dynamic development of the Internet worldwide, accompanied by rapid growth in the amount of information stored in its resources. This volume of data, far too large for a person to analyze, makes it very difficult to find and select valuable information among the many results returned by search engines. Another problem is the low quality of the data in a large part of those results. These issues become serious when one searches for detailed information related to a specific area of industry or science. The situation is aggravated by the lack of effective solutions for continuously monitoring the Web for newly emerging information while maintaining high quality of the returned results. A system that continuously monitors the Web and searches selected Internet resources for valuable information would therefore be highly welcome. This paper describes a concept of such a system, along with its initial implementation and its application to searching for information in the foundry industry. Results of the prototype implementation are presented, and plans for its further development and adaptation to other industry sectors are outlined.
The paper summarizes a system for monitoring Web resources based on a defined query. An experiment compares the results returned by the proposed system with those provided by the Google Search and Google Alerts services. The results indicate that the system could be a solid base for developing and testing pattern detection and information retrieval mechanisms, while providing more data than the Google solutions. Drawbacks of the system and plans for its further development are also presented.
The article presents the architecture of a system that monitors Web resources with respect to a defined query. The system's results were compared with monitoring carried out over the same period using the mechanisms offered by Google. The results indicate that the system can be a useful base for studying pattern detection and information retrieval mechanisms, providing more data than Google's mechanisms. Shortcomings of the current version of the system, stemming from the specific nature of the data sources, are also identified, and directions for its development are proposed.