Domain WEB Monitoring

Kluska-Nawarecka, S.; Opaliński, A.; Wilk-Kołodziejczyk, D.

Abstract

The last few years have seen very dynamic development of the Internet worldwide. This is related to the rapid growth in the amount of information stored in its resources. The volume of data is too vast for a person to analyze, so finding and selecting valuable information among the many results returned by search engines has become a very difficult task. Another problem is the low quality of the data in a large part of those results. This poses serious problems when one searches for detailed information related to a specific area of industry or science. The lack of effective solutions for continuous monitoring of the Web for emerging information, while maintaining high quality of the returned results, only aggravates the situation. A system allowing for continuous monitoring of the Web and searching for valuable information in selected Internet resources would therefore be highly welcome. This paper describes a concept of such a system, along with its initial implementation and its application to searching for information in the foundry industry. The results of a prototype implementation of the system are presented, and plans for its further development and adaptation to other sectors of industry are outlined.

Keywords

web monitoring; foundry industry; data integration

Keywords (Polish)

monitoring Internetu; przemysł odlewniczy; integracja danych

Publisher

Komisja Odlewnictwa Polskiej Akademii Nauk Oddział w Katowicach

Journal

Archives of Foundry Engineering

Year

2015

Volume

Vol. 15, special issue 2

Pages

43-46

Physical description

14 references, figures

Authors

author

Kluska-Nawarecka S.

Foundry Research Institute, Cracow, Poland

author

Opaliński A.

AGH University of Science and Technology, Cracow, Poland

author

Wilk-Kołodziejczyk D.

dorota.wilk@iod.krakow.pl

Foundry Research Institute, Cracow, Poland

References

[1] International Telecommunication Union: Measuring the Information Society 2012, Place des Nations, CH-1211 Geneva Switzerland, ISBN 978-92-61-14071-7.
[2] Miniwatts Marketing Group: World internet usage and population statistics (June 30, 2012), http://www.internetworldstats.com.
[3] Bell, S. (2004). The infodiet: how libraries can offer an appetizing alternative to Google, The Chronicle of Higher Education. 50(24), B15.
[4] Regulski, K., Kluska-Nawarecka, S. & Wilk-Kołodziejczyk, D. (2015). Codification as a part of knowledge management in the research projects in the field of metallurgy. Applied Mechanics and Materials. 708, 288-293. DOI:10.4028/www.scientific.net/AMM.708.288.
[5] The Global Search & Social Report, Q1 2014, http://internationaldigitalhub.com/en/publications/thewebcertain-global-search-and-social-report-2014.
[6] Opalinski, A., Turek, W., Cetnarowicz, K. (2013). Scalable web monitoring system. In Computer Science and Information Systems (FedCSIS), 2013 Federated Conference on, pp. 1273-1279.
[7] Chang, K.C.C., He, B., Li, C., Patel, M., & Zhang, Z. (2004). Structured databases on the web: Observations and implications. ACM SIGMOD Record. 33(3), 61-70.
[8] Burner, M. (1997). Crawling towards eternity: Building an archive of the world wide web. Web Techniques Magazine. 2(5).
[9] Olston, Ch. & Najork, M. (2010). Web Crawling. Foundations and Trends in Information Retrieval. 4(3), 175-246.
[10] Gruhl, D. et al. (2004). How to build a WebFountain: An architecture for very large-scale text analytics. IBM Systems Journal. 43(1), 64-77.
[11] Khare, R., Cutting, D., Sitaker, K., & Rifkin, A. (2004). Nutch: A flexible and scalable open-source web search engine. Oregon State University. 1, 32-32.
[12] Vesna, H. (2005). Open source libraries for information retrieval. IEEE Software. 22(5), 78-82.
[13] Mohr, G., Stack, M., Ranitovic, I., Avery, D., & Kimpton, M. (2004). An introduction to Heritrix: An open source archival quality web crawler. 4th International Web Archiving Workshop.
[14] Turek, W., Opalinski, A., & Kisiel-Dorohinicki, M. (2011). Extensible web crawler–towards multimedia material analysis. In Multimedia Communications, Services and Security, CCIS, vol. 149, pp. 183-190. Berlin: Springer Heidelberg.

Document type

Bibliography

YADDA identifier

bwmeta1.element.baztech-a694eb0a-f64d-46bf-8f02-bb68d181f049