Identyfikatory
Warianty tytułu
Języki publikacji
Abstrakty
Symbolic sequence decomposition into a set of consecutive, distinct subsequences (mers) is presented. Several statistical distributions of nucleotide subsequences are defined and analysed. Sequence entropy and similarity between sequences in terms of mer lengths distribution are defined. An alignment-free method of phylogenetic tree construction is proposed.
Słowa kluczowe
Rocznik
Tom
Strony
5--10
Opis fizyczny
Bibliogr. 11 poz., rys.
Twórcy
autor
- University of Information Technology and Management ul. H. Sucharskiego 2, 35-225 Rzeszów, Poland, bkozarzewski@wsiz.rzeszow.pl
Bibliografia
- [1] A. Lempel, J. Ziv, On the complexity of finite sequences. IEEE Trans. Inform. Theory 22, 75-81 (1976).
- [2] H.H. Out, K. Sayood, A new sequence distance measure for phylogenetic tree construction. Bioinformatics 19, 2122-2130 (2003).
- [3] D.-G. Ke, Q.-Y. Tong, Easily adaptable complexity measure for finite time series. Phys. Rev. E77, 066215 (2008).
- [4] Z. Kása, On the d-complexity of strings. http://arxiv.org/abs/1002.2721v1.
- [5] C. Adami, N.J. Ceref, 1999. Physical complexity of symbolic sequences. arxiv: adap-org/9605002v3
- [6] J. Wen, C. Li, Similarity analysis of DNA sequences based on the LZ complexity. Internet Electron. J. Mol. Des. 6, 1-12 (2007).
- [7] B. Kozarzewski, Multilevel time series complexity. Journal of Applied Computer Science 19, 2, 61-71 (2011).
- [8] J.-B. Brissaud, The meaning of entropy. Entropy 7, 68-96 (2005).
- [9] Y.-H. Chen, S.-L. Nyeo, C.-Y. Yeh, Model for distribution of k-mers in DNA sequences. Physical Review E72, 011908 (2005).
- [10] W.K. Brown, K.H. Wohletz, Derivation of the Weibull distribution based on physical principles and its connection to the Rossin-Rammler and lognormal distributions. Journal of Applied Physics 78, 2758-2763 (1995).
- [11] M. van Oven, http://www.phylotree.org (2009)
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BUJ8-0016-0025