Variable-order Markov model

Markov-based processes with variable "memory"

In the mathematical theory of stochastic processes, variable-order Markov (VOM) models are a class of models that extend the well-known Markov chain models. In a Markov chain model, each random variable in a sequence with the Markov property depends on a fixed number of random variables; in a VOM model, the number of conditioning random variables may vary with the specific observed realization.

This realization sequence is often called the context; therefore, VOM models are also called context trees.^[1] VOM models can be rendered as colorized probabilistic suffix trees (PST).^[2] The flexibility in the number of conditioning random variables is an advantage in many applications, such as statistical analysis, classification and prediction.^[3]^[4]^[5]

Definition

Let $A$ be a state space (finite alphabet) of size $|A|$.

Consider a sequence with the Markov property $x_{1}^{n}=x_{1}x_{2}\dots x_{n}$ of $n$ realizations of random variables, where $x_{i}\in A$ is the state (symbol) at position $i$ ($1\leq i\leq n$), and the concatenation of states $x_{i}$ and $x_{i+1}$ is denoted by $x_{i}x_{i+1}$.

Given a training set of observed states, $x_{1}^{n}$, the construction algorithm of the VOM models^[3]^[4]^[5] learns a model $P$ that provides a probability assignment for each state in the sequence given its past (previously observed symbols) or future states.

Specifically, the learner generates a conditional probability distribution $P(x_{i}\mid s)$ for a symbol $x_{i}\in A$ given a context $s\in A^{*}$, where $A^{*}$ denotes the set of sequences of states of any length, including the empty context.

VOM models attempt to estimate conditional distributions of the form $P(x_{i}\mid s)$, where the context length $|s|\leq D$ varies depending on the available statistics and $D$ is the maximal context length. In contrast, conventional Markov models estimate these conditional distributions by assuming a fixed context length $|s|=D$ and, hence, can be considered special cases of the VOM models.
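The idea of a context whose length depends on the available statistics can be illustrated with a small Python sketch. It is not one of the published VOM construction algorithms; it is a simplified back-off scheme in which a context is used only if it was seen at least `min_count` times in the training sequence. The function names, the `min_count` threshold and the add-one smoothing are assumptions made for this illustration, and `max_order` plays the role of $D$.

```python
from collections import Counter


def train_counts(sequence, max_order):
    """Count which symbol follows every context of length 0..max_order."""
    counts = {}
    for i, symbol in enumerate(sequence):
        for k in range(min(i, max_order) + 1):
            context = sequence[i - k:i]  # the k symbols preceding position i
            counts.setdefault(context, Counter())[symbol] += 1
    return counts


def select_context(counts, history, max_order, min_count):
    """Return the longest suffix of `history` observed at least `min_count` times."""
    for k in range(min(len(history), max_order), 0, -1):
        suffix = history[-k:]
        if sum(counts.get(suffix, Counter()).values()) >= min_count:
            return suffix
    return ""  # fall back to the empty context


def predict(counts, history, alphabet, max_order=3, min_count=2):
    """Estimate P(x | s) for every x in the alphabet, with s chosen by select_context."""
    context = select_context(counts, history, max_order, min_count)
    observed = counts.get(context, Counter())
    total = sum(observed.values())
    # Add-one smoothing (an assumption of this sketch) keeps unseen symbols above zero.
    probs = {a: (observed[a] + 1) / (total + len(alphabet)) for a in alphabet}
    return context, probs


counts = train_counts("aabaabaabaab", max_order=3)
context, probs = predict(counts, "bb", alphabet="ab", max_order=3, min_count=2)
print(repr(context), probs)
```

In this toy run the history `"bb"` never occurs in the training sequence, so the sketch backs off to the shorter context `"b"`. A fixed-order model of order 2 would have no statistics to draw on for that history.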

Effectively, for a given training sequence, VOM models are found to obtain a better model parameterization than fixed-order Markov models, which leads to a better variance-bias tradeoff in the learned models.^[3]^[4]^[5]
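A rough parameter count shows where the saving can come from. The symbol $k$, denoting the number of contexts retained by a VOM model, is introduced here for illustration and is not part of the notation above. A fixed-order model of order $D$ needs one distribution over $A$ for each of the $|A|^{D}$ possible contexts, whereas a VOM model needs one per retained context:

$$|A|^{D}\,(|A|-1) \quad \text{versus} \quad k\,(|A|-1), \qquad k\leq |A|^{D}.$$

For example, with $|A|=4$ and $D=5$ the fixed-order model has $4^{5}\cdot 3=3072$ free parameters. A VOM model that keeps long contexts only where the data support them can have far fewer, so each parameter is estimated from more observations.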

This article uses material from the Wikipedia article Variable-order_Markov_model, and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.

References

[1] Rissanen, J. (Sep 1983). "A Universal Data Compression System". IEEE Transactions on Information Theory. 29 (5): 656–664. doi:10.1109/TIT.1983.1056741.

[2] Gabadinho, Alexis; Ritschard, Gilbert (2016). "Analyzing State Sequences with Probabilistic Suffix Trees: The PST R Package". Journal of Statistical Software. 72 (3). doi:10.18637/jss.v072.i03. ISSN 1548-7660. S2CID 63681202.

[3] Shmilovici, A.; Ben-Gal, I. (2007). "Using a VOM Model for Reconstructing Potential Coding Regions in EST Sequences". Computational Statistics. 22 (1): 49–69. doi:10.1007/s00180-007-0021-8. S2CID 2737235.

[4] Begleiter, R.; El-Yaniv, R.; Yona, G. (2004). "On Prediction Using Variable Order Markov models". Journal of Artificial Intelligence Research. 22: 385–421. arXiv:1107.0051. doi:10.1613/jair.1491.

[5] Ben-Gal, I.; Morag, G.; Shmilovici, A. (2003). "Context-Based Statistical Process Control: A Monitoring Procedure for State-Dependent Processes" (PDF). Technometrics. 45 (4): 293–311. doi:10.1198/004017003000000122. ISSN 0040-1706. S2CID 5227793.

[6] Grau, J.; Ben-Gal, I.; Posch, S.; Grosse, I. (2006). "VOMBAT: Prediction of Transcription Factor Binding Sites using Variable Order Bayesian Trees" (PDF). Nucleic Acids Research. 34 (Web Server issue): W529–W533. doi:10.1093/nar/gkl212. PMC 1538886. PMID 16845064.

[7] Bratko, A.; Cormack, G. V.; Filipic, B.; Lynam, T.; Zupan, B. (2006). "Spam Filtering Using Statistical Data Compression Models" (PDF). Journal of Machine Learning Research. 7: 2673–2698.

[8] Browning, Sharon R. (2006). "Multilocus association mapping using variable-length Markov chains". The American Journal of Human Genetics. 78 (6): 903–913.

[9] Smith, A.; Denenberg, J.; Slack, T.; Tan, C.; Wohlford, R. (1985). "Application of a sequential pattern learning system to connected speech recognition". ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing. Vol. 10. Tampa, FL, USA: Institute of Electrical and Electronics Engineers. pp. 1201–1204. doi:10.1109/ICASSP.1985.1168282. S2CID 60991068.
