We consider finite-alphabet and real-valued time series and the following four problems: i) estimation of the (limiting) probability P(x0 1/4xs) for every s and each sequence x0 1/4xs of letters from the process alphabet (or estimation of the density p(x0, 1/4, xs) for real-valued time series), ii) the so-called on-line prediction, where the conditional probability P(xt+1| x1x2 1/4xt) (or the conditional density p(xt+1| x1x2 1/4xt)) should be estimated, where x1x21/4xt are given, iii) regression and iv) classification (or so-called problems with side information). We show that Kolmogorov complexity (KC) and universal codes (or universal data compressors), whose codeword length can be considered as an estimation of KC, can be used as a basis for constructing asymptotically optimal methods for the above problems. (By definition, a universal code can "compress" any sequence generated by a stationary and ergodic source asymptotically to the Shannon entropy of the source.)