The number of papers published every year in scientific journals is growing tremendously, especially in biological sciences. Keeping the track of a given branch of science is therefore a difficult task. This was one of the reasons for developing the classification tool we called Cerberus. The classification categories may correspond to some areas of research defined by the user. We have used the tool to classify papers as containing marine metagenomic, terrestrial metagenomic or non-metagenomic information. Cerberus is based on special filters using weighted domain vocabularies. Depending on the number of occurrences of the keywords from the vocabularies in the paper, the program classifies the paper to a predefined category. This classification can precede the information extraction since it can reduce the number of papers to be analyzed. Classification of papers using the method we propose results in an accurate and precise result set of articles that are relevant to the scientist. This can reduce the resources needed to find the data required in ones field of studies.
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.