Integration of structural pattern recognition methods into knowledge-based framework. Application to geographic map image analysis

Stąpor, K.

Artykuł - szczegóły

Tytuł artykułu

Integration of structural pattern recognition methods into knowledge-based framework. Application to geographic map image analysis

Autorzy

Stąpor K.

Identyfikatory

Warianty tytułu

Integracja metod strukturalnych rozpoznawania obrazów z opartą na wiedzy strukturą wnioskującą. Zastosowanie do automatycznej analizy map geograficznych

Języki publikacji

Abstrakty

In this thesis the generally applicable reasoning framework with the developed and properly integrated within it structural classifiers for automatic conversion of paper documents into a digital form is proposed. As the application of the proposed methods, the Polish Fundamental Land Map has been chosen. The proposed framework is based on the knowledge-based approach, and is composed of the three schemas: 1) map model, represented in the form of the developed hybrid semantic network which is a combination of the different knowledge representation formalisms: semantic network, rules, frame system and the object-oriented computation paradigm, 2) object detection methods (structural classifiers), 3) image analysis flow scheme composed of the two structures: mixed control mechanism and the non-monotonic reasoning method called complementary reasoning. The image analysis flow scheme, which is based on the proposed intelligent analysis strategy, is accomplished by having the initial recognition of seed objects followed by the progressive extraction of the remaining layers of geographic objects through the iteration of the 4-step interpretation cycle: hypotheses generation, compatibility examination, scenario selection, and scenario verification. Each of the developed structural classifiers of the different graphics/text components of map drawing is based on the appropriate object model. The simplest is the polygon detector which is based on the defined polygon models. The second detector relies on the relational model of a general 2D curve, and the recognition method consists of matching relational structures (i.e. searching for relational homomorphism), followed by distance calculation in a parameter space. The third detector is based on the error-tolerant graph matching procedure between objects represented by the attributed graphs which is a search for an optimal many-to-one graphs homomorphism. The most general detector is the fourth one, in which objects are represented by the programmed, higher-dimensional extension of a string grammar, and the recognition is performed by error-correcting, mixed (top-down/bottom-up) parsing. All detectors are translational and rotational invariant and are tolerant to many complex structural deformations, including topological ones. The proposed framework is composed of the following modules: 1) reasoning engine which is is a global coordinator of the reasoning performed by the independent object instances, 2) detectors manager, 3) detectors which are based on the different pattern recognition techniques and use the intrinsic properties of objects for model-based recognition, 4) vector graph construction module, 5) resegmentation module which realizes the refining matches by resegmentation technique to recover the missing object features, and 5) evidence pool which is a database where accumulation of object instances and hypotheses is performed during the analysis. The following 4-level structure of geographic map image analysis process is proposed in the thesis: 1) pre-processing level composed of binarization, filled region extraction and thinning, 2) vector graph construction level, i.e. line structure tracking, vector graph segmentation, small and big graph vectorizations, 3) graphics and text recognition level, 4) understanding level. The proposed, general reasoning framework, together with the developed structural classifiers respond to the all posed problems of geographic map image analysis (reasoning with incomplete information, control strategy, contextual layer separation, reliable analysis) , as well as of the structural pattern recognition (limited descriptive power of string grammars, errorneous patterns, high number of prototypes, interaction of segmentation with recognition, selection of region of interest). The proposed approach to document image analysis enables for increased flexibilty: the image analysis flow scheme is independent of the application, the map model, dependent on the application, can be used for another ones only after small adjustments. The proposed framework has been implemented as the map conversion module MA PIN, which replaced the manual input of paper maps into the Geographic Information System, converting correctly about 70% of the information in the maps and leaving the remaining 30% to the operator.

W rozprawie przedstawiono zaproponowaną, ogólną strukturę wnioskującą wraz z wbudowanymi strukturalnymi klasyfikatorami dla celu automatycznej analizy (konwersji) dokumentów z postaci papierowej do odpowiedniego formatu cyfrowego. Jako zastosowanie opracowanych metod wybrano Polską Zasadniczą Mapę Kraju. Zaproponowana struktura wnioskująca bazująca na opartym na wiedzy podejściu złożona jest z trzech schematów: 1) modelu mapy w postaci hybrydowej sieci semantycznej opartej na różnych formalizmach reprezentacji wiedzy: sieci semantycznej, reguł, ram i podejścia obiektowo-zorientowanego, 2) metod detekcji obiektów (strukturalnych klasyfikatorów), 3) schematu przebiegu analizy obrazu złożonego z dwóch podschematów: mieszanego mechanizmu kontroli oraz metody wnioskowania niemonotonicznego, zwanego wnioskowaniem uzupełniającym. Schemat przebiegu analizy obrazu, oparty na zaproponowanej, inteligentnej strategii analizy składa się z wstępnego rozpoznawania obiektów zarodkowych, po którym następuje ekstrakcja pozostałych warstw obiektów geograficznych poprzez iterację cyklu interpretacji złożonego z czterech kroków: generacji hipotez, testu kompatybilności, wyboru scenariusza oraz jego weryfikacji. Każdy z zaprojektowanych strukturalnych klasyfikatorów graficznych/tekstowych komponentów mapy wykorzystuje odpowiedni model obiektu. Najprostszym jest detektor wielokątów, który oparty jest na zdefiniowanych modelach wielokątów. Drugi z detektorów oparty jest na relacyjnym modelu krzywej, zaś algorytm rozpoznawania polega na uzgadnianiu struktur relacyjnych (poszukiwaniu relacyjnego homomorfizmu), po którym następuje obliczanie odległości w przestrzeni cech. Metoda rozpoznawania w trzecim detektorze jest przybliżonym, odpornym na strukturalne zniekształcenia porównywaniem grafów atrybutywnych reprezentujących obiekty wzorca oraz nieznanego obiektu. Najbardziej uniwersalnym jest detektor czwarty, w którym obiekty są reprezentowane za pomocą rozszerzonej o specjalne operatory, programowej gramatyki łańcuchowej, zaś rozpoznawanie polega na niedokładnej, mieszanej analizie syntaktycznej, z wbudowaną korekcją błędów. Wszystkie opracowane klasyfikatory są odporne na translację oraz obroty, a także wykazują dużą tolerancję na różnego typu deformacje strukturalne obiektów. Zaproponowana struktura wnioskująca zrealizowana jest za pomocą następujących modułów: 1) mechanizmu wnioskującego, który jest globalnym koordynatorem procesów wnioskowania poszczególnych obiektów, 2) modułu zarządzania detektorami obiektów, 3) detektorów, opartych na różnych metodach strukturalnego rozpoznawania obrazów, 4) modułu konstrukcji grafu wektorowego, 5) modułu resegmentacji, w którym wykonywane jest wielokrotne powtarzanie procedury segmentacji ze zmienionymi wartościami parametrów w przypadku niemożliwości detekcji poszukiwanych obiektów przez detektory, 5) bazy faktów, w której akumulowane są wszystkie pośrednie wyniki rozpoznawania (obiektów i hipotez). W pracy zaproponowano i zrealizowano następującą, 4-poziomową strukturę analizy map geograficznych: 1) przetwarzanie wstępne, obejmujące binaryzację, ekstrakcję regionów wypełnionych oraz szkieletyzację, 2) konstrukcję grafu wektorowego (śledzenie linii, segmentacja, wektoryzacja małego i dużego grafu, 3) rozpoznawanie komponentów graficznych oraz tekstowych, 4) interpretacja. Zaproponowana struktura wnioskująca wraz z wbudowanymi strukturalnymi klasyfikatorami pozwala na rozwiązanie postawionych w rozprawie problemów dotyczących zarówno samej automatycznej analizy map geograficznych (wnioskowanie z niepełną informacją, mechanizm kontroli, kontekstowa segmentacja, rzetelna analiza), jak również problemów strukturalnego rozpoznawania obrazów (ograniczona moc opisowa gramatyk łańcuchowych, generacja błędnych struktur, duża liczba strukturalnych prototypów, interakcja rozpoznawania i segmentacji, wybór właściwego obszaru do analizy). Zaproponowane podejście do automatycznej analizy dokumentów jest w dużym stopniu uniwersalne: schemat przebiegu analizy obrazu jest niezależny od rodzaju aplikacji, natomiast model mapy, zależny od typu dokumentu (syntaktyka), może być w prosty sposób zaadaptowany dla celów innej aplikacji. Przedstawiona struktura wnioskująca wraz z klasyfikatorami została zaimplementowana w postaci modułu półautomatycznej konwersji map, co pozwoliło na jej 70% automatyzację.

Słowa kluczowe

pattern recognition geographic map analysis structura pattern recognition knowledge representation syntax semantics syntactic classifier

Wydawca

Wydawnictwo Politechniki Śląskiej

Czasopismo

Studia Informatica

Rocznik

2000

Tom

Vol. 21, nr 4

Strony

1--212

Opis fizyczny

bibliogr. 381 poz.

Twórcy

autor

Stąpor K.

Instytut Informatyki Politechniki Śląskiej Gliwice, ul.Akademicka 16 tel.: (0-32) 2371143, katarzyna.stapor@polsl.pl

Bibliografia

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-article-BSL7-0025-0003