PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Desktop and Web services for Internet documents retrieval

Autorzy
Identyfikatory
Warianty tytułu
Konferencja
Information Technology and Knowledge Management ITKM'08 / sympozjum [1; 17-18.04.2008; Jastrzębia Góra]
Języki publikacji
EN
Abstrakty
EN
Various reasons motivate the development of desktop and Web services for Internet documents retrieval. Search engines are used for retrieval document on the World Wide Web whereas the desktop search tools are employed when Internet documents are stored locally. In this paper we present an intermediary service that would help the Web client to gain information from the World Wide Web. We present a Web searching system with text mining functionality that has been developed based on the IBM's Intelligent Miner for Text system. This system can be employed as the search engine in our intermediary service. Another presented system is a new desktop search engine called Needle Desktop Search. It has some unique features such as on-line index update for supervised folders, document structuralization to improve information retrieval and API for preparation own applications using system functions and structures.
Rocznik
Tom
Strony
21--34
Opis fizyczny
Bibliogr. 21 poz.
Twórcy
autor
  • Institute of Information Science and Engineering, Wrocław University of Technology, 50-370 Wrocław, Wybrzeże Wyspiańskiego 27, Poland
Bibliografia
  • [1]Barker J., Best search tools chart. Infopeople Project, California State Library (http://infopeople.org/search/), 2004.
  • [2]Berry M.W., Survey of text mining: Clustering, classification, and retrieval, Springer Verlag, New York, 2004.
  • [3]Borzemski L., Data mining in evaluation of Internet path performance, Lecture Notes in Artificial Intelligence, 2004, Vol. 3029, Springer Verlag, Berlin, 643-652.
  • [4]Borzemski L., The use of data mining to predict Web performance. Cybernetics &Systems: An International Journal, 2006, 37(6), 587-608.
  • [5]Borzemski L., Lopatka P., Complementing search engines with text mining, Lecture Notes in Artificial Intelligence, 2005, Vol. 3533, Springer Verlag, Berlin, 743-745.
  • [6]Borzemski L., Miduch P., Needle Desktop Search: local documents search tool, in: Knowledge management and information technologies, PWNT, Gdansk, 2008 (in Polish).
  • [7]Borzemski L., Nowak Z., Using the geographic distance for selecting the nearest agent in intermediary-based access to Internet resources, Lecture Notes in Artificial Intelligence, 2005, Vol. 3683, Springer Verlag, Berlin, 261-267.
  • [8]Broder A., A taxonomy of Web search, ACM SIGIR Forum Archive, 2002, Vol. 36, Iss. 2, 3-10.
  • [9]Brin S., Page L., The anatomy of a large-scale hypertextual Web search engine, Computer Networks, 1998, 30 (1-7), 107-117.
  • [10]Chakrabarti S., Mining the Web: Analysis of hypertext and semi structured data, Morgan Kaufmann Publishers, Elsevier, San Francisco, 2003.
  • [11]Gospodnetic O., Hatcher E., Lucene in action, Manning, Greenwich, 2005.
  • [12]IBM Intelligent Miner for Text ver. 2.3, 1998.
  • [13]Levene M., An Introduction to search engines and Web navigation, Addison-Wesley, Harlow, England, 2006.
  • [14]Nasukawa T., Nagano T., Text analysis and knowledge mining system, IBM Systems J., 2001, Vol. 40, No. 4, 967-984.
  • [15]Spek S., Dorp K., Mathijsen E., Hass T., Herik J., Advanced information search within a research-based multinational, in: Proc. of the Belgium-Netherlands Artificial Conference BNAIC'03, Nijmegen, The Netherlands, 2003.
  • [16]The BEST Search Engines. UC Berkeley - Teaching Library Internet Workshops (http://www.lib.berkeley.edu/TeachingLib/Guides/Internet/SearchEngines.html), 2004.
  • [17]Using Intelligent Miner for Data. V8.1, IBM Redbooks, SH12-6394-00, 2002.
  • [18]http://www.dotlucene.net (2006).
  • [19]http://www.egothor.org (2006).
  • [20]http://mg4j.dsi.unimi.it (2006).
  • [21]http://www. xapian.org (2006).
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BPP1-0092-0055
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.