Identyfikatory
Warianty tytułu
Języki publikacji
Abstrakty
The article investigates the possibility of measuring the strength of a linear corre lation relationship between nominal data and numerical data. Correlation coeffi cients for variables coded with real numbers as well as for variables coded with complex numbers were studied. For variables coded with real numbers, unam biguous measures of real linear correlation were obtained. In the case of complex coding, it has been observed that the obtained complex correlation coefficients change with the permutation of the phases in the complex numbers used to code classes of elements with equal cardinalities. It was found that a necessary condi tion for linear correlation is the possibility of linear ordering of a set with data. Since linear order is not possible in the set of complex numbers, complex correla tion coefficients cannot be used as a measure of linear correlation. In the event of such a situation, a substitute action was suggested that would prevent equal cardi nality of classes of identical elements contained in the set with nominal data. This action would consist in the correction of data, analogous to the correction during preprocessing or cleaning of data containing missing or outlier values.
Rocznik
Tom
Strony
57--82
Opis fizyczny
Bibliogr. 16 poz., rys., tab.
Twórcy
Bibliografia
- [1] H. M. Blalock, Social Statistics. McGraw-Hill, 1960.
- [2] P. Francuz and R. Mackiewicz, Liczby nie wiedzą skąd pochodzą. Przewodnik po metodologii i statystyce nie tylko dla psychologów. Lublin: Wydawnictwo KUL, 2007.
- [3] Z. Gniazdowski and M. Grabowski, “Numerical Coding of Nominal Data,” Zeszyty Naukowe WWSI, vol. 9, no. 12, pp. 53-61, 2015. [Online]. Available: http://doi.org/10.26348/znwwsi.12.53
- [4] Z. Gniazdowski, “Geometric interpretation of a correlation,” Zeszyty Naukowe WWSI, vol. 7, no. 9, pp. 27-35, 2013. [Online]. Available: http://doi.org/10.26348/znwwsi.9.27
- [5] Z. Gniazdowski, “O relacjach i algorytmach,” in Zbiór wykładów wszechnicy popołudniowej: Algorytmika i programowanie. Zastosowania informatyki. Warszawska Wyższa Szkoła Informatyki, 2011, pp. 265-286. [Online]. Available: http://akademickaseriawwsi.wwsi.edu.pl/ksiazki/5/O_relacjachi_algorytmach.pdf
- [6] S. S. Stevens, “On the theory of scales of measurement,” Science, vol. 103, no. 2684, pp. 677-680, 1946. [Online]. Available: https://psychology.okstate.edu/faculty/jgrice/psyc3120/Stevens_FourScales_1946.pdf
- [7] StatSoft, “Elektroniczny podręcznik statystyki,” 2011. [Online]. Available: https://www.statsoft.pl/textbook/stbasic.html
- [8] C. R. Mehta and N. R. Patel, “A network algorithm for performing fisher’s exact test in r × c contingency tables,” Journal of the American Statistical Association, vol. 78, no. 382, pp. 427-434, 1983. [Online]. Available: https: //doi.org/10.1080/01621459.1983.10477989
- [9] F. Wilcoxon, “Individual comparisons by ranking methods,” Biometrics Bulletin, vol. 1, no. 6, pp. 80-83, 1945.
- [10] T. Mok and H. B. Iz, “Vector regression introduced,” Journal of geodetic science, vol. 4, no. 1, pp. 57-64, 2014. [Online]. Available: https://core.ac.uk/download/pdf/61032201.pdf
- [11] whuber, “Analysis with complex data, anything different?” 2013. [Online]. Available: https://stats.stackexchange.com/q/66268
- [12] M. Dryja, J. Jankowska, and M. Jankowski, Przegląd metod i algorytmów numerycznych, część 2. Warszawa: Wydawnictwa Naukowo-Techniczne, 1982.
- [13] A. Kiełbasiński and H. Schwetlick, Numeryczna algebra liniowa: wprowadzenie do obliczeń zautomatyzowanych. Wydawnictwa Naukowo-Techniczne, 1992.
- [14] O. Maimon and L. Rokach, Eds., Data mining and knowledge discovery handbook. Springer, 2010. [Online]. Available: https://link.springer.com/content/pdf/10.1007/b107408.pdf
- [15] D. J. Hand, H. Mannila, and P. Smyth, Principles of data mining. MIT press, 2001.
- [16] M. Berthold and D. J. Hand, Intelligent data analysis. An introduction. Springer, 2007.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-61882bbe-ea67-4069-804e-d96c7de43418