Article title

Domain WEB Monitoring

Publication languages
EN
Abstracts
EN
The last few years have seen very dynamic development of the Internet worldwide, tied to rapid growth in the amount of information stored in its resources. This volume of data is far too large for a person to analyze, so finding and selecting valuable information among the many results returned by search engines has become a very difficult task. Another problem is the low quality of the data in a large share of those results. This poses serious difficulties for anyone searching for detailed information on a specific area of industry or science. The situation is aggravated by the lack of effective solutions for continuously monitoring the Web for newly emerging information while keeping the quality of the returned results high. A highly welcome solution would therefore be a system that continuously monitors the Web and searches selected Internet resources for valuable information. This paper describes the concept of such a system, along with its initial implementation and its application to searching for information in the foundry industry. The results of a prototype implementation are presented, and plans for its further development and adaptation to other industrial sectors are outlined.
Year
Pages
43-46
Physical description
Bibliography: 14 items, figures.
Authors
  • Foundry Research Institute, Cracow, Poland
  • AGH University of Science and Technology, Cracow, Poland
  • Foundry Research Institute, Cracow, Poland
Bibliography
  • [1] International Telecommunication Union: Measuring the Information Society 2012, Place des Nations, CH-1211 Geneva Switzerland, ISBN 978-92-61-14071-7.
  • [2] Miniwatts Marketing Group: World internet usage and population statistics (June 30, 2012), http://www.internetworldstats.com.
  • [3] Bell, S. (2004). The infodiet: how libraries can offer an appetizing alternative to Google, The Chronicle of Higher Education. 50(24), B15.
  • [4] Regulski, K., Kluska-Nawarecka, S. & Wilk-Kołodziejczyk, D. (2015). Codification as a part of knowledge management in the research projects in the field of metallurgy. Applied Mechanics and Materials. 708, 288-293. DOI:10.4028/www.scientific.net/AMM.708.288.
  • [5] The Global Search & Social Report, Q1 2014, http://internationaldigitalhub.com/en/publications/thewebcertain-global-search-and-social-report-2014.
  • [6] Opalinski, A., Turek, W. & Cetnarowicz, K. (2013). Scalable web monitoring system. In Computer Science and Information Systems (FedCSIS), 2013 Federated Conference on, pp. 1273-1279.
  • [7] Chang, K.C.C., He, B., Li, C., Patel, M., & Zhang, Z. (2004). Structured databases on the web: Observations and implications. ACM SIGMOD Record. 33(3), 61-70.
  • [8] Burner, M. (1997). Crawling towards eternity: Building an archive of the world wide web. Web Techniques Magazine. 2(5).
  • [9] Olston, Ch. & Najork, M. (2010). Web Crawling. Foundations and Trends in Information Retrieval. 4(3), 175-246.
  • [10] Gruhl, D. et al. (2004). How to build a WebFountain: An architecture for very large-scale text analytics. IBM Systems Journal. 43(1), 64-77.
  • [11] Khare, R., Cutting, D., Sitaker, K., & Rifkin, A. (2004). Nutch: A flexible and scalable open-source web search engine. Oregon State University. 1, 32-32.
  • [12] Vesna, H. (2005). Open source libraries for information retrieval. IEEE Software. 22(5), 78-82.
  • [13] Mohr, G., Stack, M., Ranitovic, I., Avery, D., & Kimpton, M. (2004). An introduction to Heritrix. An open source archival quality web crawler. 4th International Web Archiving Workshop.
  • [14] Turek, W., Opalinski, A., & Kisiel-Dorohinicki, M. (2011). Extensible web crawler: towards multimedia material analysis. In Multimedia Communications, Services and Security, CCIS, vol. 149, pp. 183-190. Berlin, Heidelberg: Springer.
Document type
Bibliography
YADDA identifier
bwmeta1.element.baztech-a694eb0a-f64d-46bf-8f02-bb68d181f049