PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Jakość danych w kontekście danych zintegrowanych

Autorzy
Identyfikatory
Warianty tytułu
EN
Data in the context of integrated data
Języki publikacji
PL
Abstrakty
PL
Integracja danych to zadanie jakie stawiane jest przed aplikacjami, dla których konieczne jest pc skiwanie informacji z wielu autonomicznych i heterogenicznych źródeł. Potrzeba integracji szczegó widoczna jest w dynamicznie rozwijających się instytucjach, jakimi są banki, ale także np. w duż projektach związanych z informatyzacją instytucji rządowych i wielu innych miejscach. Jakość danych jest w literaturze definiowana w różny sposób. Potrzeba pomiaru jakości dan pojawia się wraz z napływem coraz większej ich ilości. W niniejszej pracy przedstawimy podsta' we zagadnienia związane z jakością danych, w szczególności zaś skoncentrujemy się na jakości nych zintegrowanych.
EN
The data integration is a problem rises in the areas where applications require information from multiple autonomous and heterogeneous sources. The need of integration is seen especially in dynamically growing organizations such as banks, but also in large projects done for governmental institutions. The data quality is in the literature defined in many different ways. The need on measurement of the quality rises with the increasing number of incoming data. In this chapter we have shown basic concepts of data integration and data quality emphasising the problem of the quality of integrated data.
Rocznik
Tom
Strony
75--98
Opis fizyczny
Bibliogr. 68 poz.
Twórcy
autor
  • Department of Computer Science and Engineering, York University, 4700 Keele Street Toronto Ontario M3J 1P3, Canada, pawluk@ese.yorku.ca
Bibliografia
  • [AlHaSh 2005] Alasound A., Haarslev V., Shiri N., A hybrid approach for ontology integration. In: proceedings of the 31 st VLDB Conference 2005
  • [ArBeCho 1999] Arenas M., Bertossi L., Chomicki J., Consistent queiy answers in inconsistent databasi In: PODS '99: Proceedings of the eighteenth ACM SIGMODSIGACT-SIGART symp sium on Principles of database systems, ACM, New York, NY, USA, 195 68-79.
  • [ArBeCHR 2003] Arenas M., Bertossi L., Chomicki J., He X., Raghavan V, Spinrad J., Scalar aggr gation in inconsistent databases, Theor. Comput. Sci., 296(3), 2003, 405-434.
  • [BaPa 1985] Ballou D., Pazer H., Modeling data and process quality in multi-input, multioutp information systems, Management Science, 31(2), 1985, 150-162.
  • [BaWaPaTa 1998] Ballou D., Wang R., Pazer H., Tayi G.K., Modeling information manufacture systems to determine information product quality. Manage. Sci., 44(4), 1998, 462-484.
  • [BaSc 2006] Batini C, Scannapieco M., Data Quality: Concepts, Methodologies and Tea niques. Data-Centric Systems and Applications, Springer-Verlag, New York, Ine Secaucus, NJ, USA, 2006.
  • [BeMe 2007] Bernstein P.A., Melnik S., Model management 2.0: manipulating richer mapping. In: Chee Yong Chan, Beng Chin Ooi, and Aoying Zhou, editors, SIGMOD Confa enee, ACM, 2007, 1-12.
  • [Bertossi 2006] Bertossi L., Consistent query answering in databases, SIGMOD Ree, 35(2), 200( 68-76.
  • [BINa 2008] Bleiholder J., Naumann F., Data fusion, ACM Comput. Surv., 41(1), 2008.
  • [BoMaYa 1998] Bobrowski M., Marr M., Yankelevich D., A software engineering view of dat quality. In: European Quality Week Conference, 1998.
  • [BoKe 2002] Bouzeghoub M., Kedad Z., Quality in Data Warehousing, Kluwer Academic Pub lisher, 2002.
  • [BraBer 2004] Bravo L., Bertossi L., Consistent query answering under inclusion dependencia In: CASCON '04. Proceedings of the 2004 conference of the Centre for Advancei Studies on Collaborative research, IBM Press, 2004, 202-216.
  • [BraJo 2007] Brazhnik O., Jones J.F., Anatomy of data integration, J. of Biomedical Informatics 40(3), 2007, 252-269
  • [CaCaGiLe 2002] Cali A., Calvanese D., Giacomo G, Lenzerini M., On the role of integrity con straints in data integration, IEEE Data Eng. Bull., 25(3), 2002, 39-45.
  • [CaCaGiLe 2004] Cali A., Calvanese D., Giacomo G, Lenzerini M., Data integration under integrity constraints, Inf. Syst., 29(2), 2004, 147-163.
  • [Crosby 1979] Crosby P.B., Quality is free: the art of making quality certain, McGraw-Hill, New York 1979.
  • [CuWiWi 2000] Cui Y., Widom J., Wiener J.L., Tracing the lineage of view data in a warehousing environment, ACM Trans. Database Syst., 25(2), 2000, 179-227.
  • [DaJo 2003] Dasu T., Johnson T., Exploratory Data Mining and Data Cleaning, Wiley-Interscience. May 2003.
  • [DViJaPaSa 2007] De Capitani di Vimercati S., Jajodia S., Paraboschi S., Samarati P., Trust management services in relational databases. In: ASIACCS '07. Proceedings of the 2nd ACM symposium on Information, computer and communications security, ACM, New York, NY, USA, 2007, 149-160.
  • [DiBe 2004] Dimmock N., Belokosztolszki A., Eyers D., Bacon J., Moody K., Using trust and risk in role-based access control policies. In: SACMAT '04. Proceedings of the ninth ACM symposium on Access control models and technologies, ACM, New York, NY, USA, 2004, 156-162.
  • [DrHeMaObS 2007] Dreibelbis A., Hechler E., Mathews B., Oberhofer M., Sauter G., Master data management architecture patterns, 2007.
  • [EllpVe 2007] Elmagarmid A.K., Ipeirotis P.G., Verykios V.S., Duplicate record detection: A survey, Knowledge and Data Engineering, IEEE Transactions on, 19(1), 2007, 1-16.
  • [English 1996] English L., Information quality improvement: Principles, methods, and management, Seminar, 1996.
  • [Fagin 2006] Fagin R., Inverting schema mappings. In: PODS '06. Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, ACM, New York, NY, USA, 2006, 50-59.
  • [FaKoTaP 2004] Fagin R., Kolaitis P.G., Tan W.C, Popa L., Composing schema mappings: second-order dependencies to the rescue. In: PODS '04. Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, ACM, New York, NY, USA, 2004, 83-94.
  • [Fan 2008] Fan W., Dependencies revisited for improving data quality. In: PODS '08. Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, ACM, New York, NY, USA, 2008, 159-170.
  • [FaGeJiK 2008] Fan W., Geerts F., Jia X., Kementsietsidis A., Conditional functional dependencies for capturing data inconsistencies, ACM Trans. Database Syst., 33(2), 2008, 1 48.
  • [FoHe 2007] Foley O., Helfert M., The development of an objective metric for the accessibility dimension of data quality. In: Proceedings of International Conference on Innovations in Information Technology, IEEE, Dublin 2007, 11-15.
  • [GeSch 2007] Gertz M., Schmitt I., Data Integration Techniques based on Data Quality Aspects. In I. Schmitt, C. Türker, E. Hildebrandt, M. Höding, editors, Proceedings 3. Workshop „Föderierte Datenbanken", Magdeburg, 10-11 Dezember 1998, Shaker Verlag, Aachen, 1-19.
  • [Gertz 1996] Gertz M., Managing data quality and integrity in federated databases. In: 2nd Annual IFIP TC-11 WG 11.5 Working Conf. on Integrity and Internal Control in Information Systems, 1996, 211-230.
  • [GrGLZ 2004] Gryz J., Guo J., Liu L., Zuzarte C, Query sampling in dbl universal database. In: SIGMOD '04. Proceedings of the 2004 ACM SIGMOD international conference on Management of data, ACM, New York, NY, USA, 2004, 839-843.
  • [GuWi 1993] Gupta A., Widom J., Local verification of global integrity constraints in distributed databases. In: P. Buneman, S. Jajodia (Eds.), Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, Washington, D.C., May 26-28, 1993, ACM Press, 1993, 49-58.
  • [Hass 2007] Haas L.M., Beauty and the beast: The theory and practice of information integra-tion. In: Th. Schwentick, D. Suciu (Eds.), ICDT, Vol. 4353 of Lecture Notes in Computer Science, Springer, 2007, 28—43.
  • [HaRaOr 2006] Halevy A., Rajaraman A., Ordille J., Data integration: the teenage years. In: VLDB '06. Proceedings of the 32nd international conference on Very large data bases, VLDB Endowment, 2006, 9-16.
  • [Halevy 2001] Halevy A.Y., Answering queries using views: A survey, The VLDB Journal, 10(4), 2001,270-294.
  • [ISO 1994] ISO8402. Quality management and quality assurance: Vocabulary. Published standard, 1994.
  • [JaVa 1997] Jarke M., Vassiliou Y., Data warehouse quality design: A review of the DWQ project. In: Proc. 2nd Conference on Information Quality, 1997.
  • [JaLeVa 2001] Jarke M., Lenzerini M., Vassiliou Y., Vassiliadis P., Fundamentals of Data Warehouses, Springer-Verlag, New York, Inc., Secaucus, NJ, USA, 2001.
  • [JoIsBo 2007] Josang A., Ismail R., Boyd C, A survey of trust and reputation systems for onlin service provision, Decision Support Systems, 43(2), March 2007, 618-644.
  • [Kolaitis 2005] Kolaitis P.G., Schema mappings, data exchange, and metadata management. Ir PODS '05. Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGAR symposium on Principles of database systems, ACM, New York, NY, USA, 2005 61-75.
  • [KriMo 1982] Kriebel CH., Moore J.H., Economics and management information svstems SIGMIS Database, 14(1), 1982, 30-40.
  • [LePiStWa 2004] Lee Y.W., Pipino L., Strong D.M., Wang R.Y., Process embedded data integrity J. Database Manag., 15(1), 2004, 87-103.
  • [Lenz 2002] Lenzerini M., Data integration: a theoretical perspective. In: PODS '02. Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principies of database systems, ACM, New York, NY, USA, 2002, 233-246.
  • [Naumann 2002] Naumann F., Quality-driven query answering for integrated information systems Springer-Verlag, New York, Inc., New York, NY, USA, 2002.
  • [Olken 1993] Olken F., Random Sampling from Databases, PhD thesis, University of California at Berkeley, 1993.
  • [Olson 2002] Olson J., Data Quality: The Accuracy Dimension. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 2002.
  • [Pankowski 2007] Pankowski T., Integracja danych w teorii i praktyce-przegląd problemów i rozwiązań. W: Bazy Danych: Nowe Technologie, WK, 2007, 45-55.
  • [PaSaJa 2002] Parssian A., Sarkar S., Jacob V.S., Assessing information quality for the composite relational operation join. In: IQ, 2002, 225-237.
  • [PaSaJa 2004] Parssian A., Sarkar S., Jacob V.S., Assessing data quality for information products: Impact of selection, projection, and cartesian product, Manage. Sei., 50(7), 2004, 967-982.
  • [Peralta 2006] Peralta V., Data Quality Evaluation in Data Integration Systems, PhD thesis, Université de Versailles Saint-Quentin-en-Yvelines (France) and Univeridad de la República (Uruguay), 2006.
  • [PiLeWa 2002] Pipino L.L., Lee Y.W, Wang R.Y., Data quality' assessment. Communications of the ACM, 45, 2002,211-218.
  • [Ra Be 2001] Rahm E., Bernstein P.A., A survey of approaches to automatic schema matching, VLDB Journal, Very Large Data Bases, 10(4), 2001, 334-350.
  • [ReWa 1995] Reddy M.P., Wang R.Y., Estimating data accuracy in a federated database environment. In: CISMOD, 1995, 115-134.
  • [Redman 1997] Redman T.C., Data Quality for the Information Age, Foreword by Godfrey, A. Blanton, Artech House, Inc., Norwood, MA, USA, 1997.
  • [Redman2001] Redman T.C., Data quality: the field guide, Digital Pr. [u.a.], Boston 2001.
  • [SeFa 1990] Segev A., Fang W., Currency-based updates to distributed materialized views. In: Proceedings of the Sixth International Conference on Data Engineeringwashington, IEEE Computer Society, DC, USA, 1990, 512-520.
  • [ShSh 2006] Shahri H.H., Shahri S.H., Eliminating duplicates in information integration: An adaptive, extensible framework, IEEE Intelligent Systems, 21(5), 2006, 63-71.
  • [Shin 2003] Shin B., An exploratory investigation of system success factors in data warehousing, J. AIS, 4, 2003.
  • [SiWe 2009] Simpson J., Weiner E., Oxford english dictionary, 2009.
  • [StLeWa 1997] Strong D.M., Lee Y.W., Wang R.Y., Data quality in context, Commun. ACM, 40(5), 1997, 103-110.
  • [TaBa 1998] Tayi G.K., Ballou D.P., Examining data quality, Commun. ACM, 41(2), 1998, 54-57.
  • [Tupek 2006] Tupek A.R., Definition of data quality, 2006.
  • [Ullman 1997] Ullman J.D., Information integration using logical views. In: ICDT '97. Proceedings of the 6th International Conference on Database Theory, Springer-Verlag, London, UK, 1997, 19-40.
  • [WaWa 1996] Wand Y., Wang R.Y., Anchoring data quality dimensions in ontological founda-tions, Commun. ACM, 39(11), 1996, 86-95.
  • [WaChe 2005] Wang, R.Y., Chettayar K., Dravis F., Funk J., Katz-Haas R, Lee C, Lee Y., Xian X., Bhansali S., Exemplifying business oppurtunities for improving data quality from corporate household research. In: Advances in Management Information Systems - Information Quality (AMIS-IQ) Monograph, April 2005.
  • [WaSt 1996] Wang R.Y., Strong D.M., Beyond accuracy: what data quality means to data con-sumers, J. Manage. Inf. Syst., 12(4), 1996, 5-33.
  • [XuEm 2003] Xu L., Embley D.W., Combining the best of global-as-view and local-as-view for data integration. In: Information Systems Technologies and its Applications - 1ST A, 2003, 123-136.
  • [ZuPa 2005] Zuo Y., Panda B., Component based trust management in the context of a virtualorganization. In: SAC '05. Proceedings of the 2005 ACM symposium on Applied computing, ACM, New York, NY, USA, 2005, 1582-1588.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BPW9-0009-0089
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.