Home | Business News | Browse by Publication | J | Journal of the American Statistical Association

Estimation in the mixture of Markov chains moving with different speeds.

Publication: Journal of the American Statistical Association
Publication Date: 01-SEP-05
Format: Online - approximately 6866 words
Delivery: Immediate Online Access

Article Excerpt
1. INTRODUCTION

Markov chains in both discrete-time and continuous-time have been very popular modeling tools in applications, and the estimation methods for these models have been well established. Modeling the evolution of a population of individual entities with a Markov chain assumes that the population is homogeneous with respect to the movement behavior among the states. However, in many applications the studied population is heterogeneous with respect to the dynamics of transitions among states, and a Markov chain provides an inaccurate description of the evolution of the population.

The purpose of this article is to define and estimate the mixture of time-homogeneous Markov chains that captures the population heterogeneity in the rate of movement among states. Identification of different regimes with respect to the rate of movement, or with respect to some other aspect of movement behavior, is of obvious interest for describing the dynamics of the observed process. It also has an exploratory value for identifying and characterizing possible nonhomogeneous subpopulations. In addition, the non-Markovian property of a mixture process offers a possibility of taking into account not only a present state, but also the past history of the process when making forecasts of the future states.

Both continuous-time and discrete-time versions of the mixture process are considered herein. To define a continuous-time version, let X = {X(t), t > 0} be a stochastic process with state space D = {1, 2,...,w}, which, conditionally on the initial state, is a mixture of homogeneous Markov chains, [X.sub.m] = {[X.sub.m](t), t > 0}, with generators [A.sub.m], 1 [less than or equal to] m [less than or equal to] N. It is assumed that the generators are related by

[A.sub.m] = [[GAMMA].sub.m]Q, 1 [less than or equal to] m [less than or equal to] N, (1)

where [[GAMMA].sub.m] = diag([[gamma].sub.1,m], [[gamma].sub.2,m],...,[[gamma].sub.w,m]), 1 [less than or equal to] m [less than or equal to] N - 1, [[GAMMA].sub.N] = I. The mth generator, [A.sub.m], is a matrix with entries [a.sub.ij] satisfying

[a.sub.ii,m] [less than or equal to] 0, [a.sub.ij,m] [greater than or equal to] 0,

[summation over (j[not equal to]i)] [a.sub.ij,m] = -[a.sub.ii,m] [equivalent to] [a.sub.i,m], i [member of] D,

where [a.sub.ij,m] = [[gamma].sub.i,m][q.sub.i,j], i [not equal to] j, and [a.sub.i,m] = [[gamma].sub.i,m][q.sub.i]. In particular, 1/(-[q.sub.ii]) is the expected length of time that [X.sub.N](t) remains in state i, i [member of] D. The discrete mixing distribution on these Markov chains is defined conditionally on X(0). The mixture probabilities, [s.sub.i,m], depend on the initial state i,

[s.sub.i,m] = P(X = [X.sub.m]|X(0) = i),

and for each i satisfy [s.sub.i,m] [greater than or equal to] 0, m = 1, 2,..., N, and [[summation].sub.m=1.sup.N] [s.sub.i,m] = 1. The transition probability matrix of the mixture process is

P(t) = [N.summation over (m=1)][S.sub.m][P.sub.m](t), t [greater than or equal to] 0, (2)

where [P.sub.m](t) = exp(t[A.sub.m]), and [S.sub.m] = diag([s.sub.1,m], [s.sub.2,m],..., [s.sub.w,m]). The Markov chains that compose the mixture may differ in the rates ([a.sub.i,m]) at which they move among the states, but all chains when leaving a state i have the same probability of entering state j. Depending on whether [[gamma].sub.i,m] = 0, < [[gamma].sub.i,m] < 1, or [[gamma].sub.i,m] > 1, the realizations generated by [A.sub.m] do not move out of state i or move out of state i at a lower or a higher rate than those generated by Q.

The particular case of the proposed mixture process, known as a mover-stayer model, arises when N = 2 and [[gamma].sub.1,1] = [[gamma].sub.2,1] = ... = [[gamma].sub.w,1] = 0. It postulates that population consists of "movers," who evolve according to a Markov chain and of "stayers," who never leave their initial states. The motivation for the new mixture was to relax the "mover-stayer" dichotomy and allow for subpopulations to have different rates of movement from states. Despite its rather restrictive assumptions, the mover-stayer mixture, particularly its discrete-time version, has gained considerable popularity since its introduction by Blumen, Kogan, and McCarthy (1955). It has been applied to modeling of occupational mobility (Sampson 1990), income dynamics (Dutta, Sefton, and Weale 2001), consumer brand preferences (Chatterjee and Ramaswamy 1996; Colombo and Morrison 1989), bond ratings migrations (Altman and Kao 1991), credit behavior (Frydman, Kallberg, and Kao 1985), and tumor progression (Tabar et al. 1996; Chen, Duffy, and Tabar 1997).

A discrete-time analog of the mixture process in (2) is defined in Section 4. For both continuous-time and discrete-time mixtures, this article develops the maximum likelihood estimation of their parameters from a sample of independent, continuously observed realizations of the mixture process. For each mixture, the maximum likelihood estimators (MLEs) of their parameters are first obtained under the complete information, that is, when it is known which Markov chain generated each realization. These are then used in the expectation step of the EM algorithm for estimating the mixtures under incomplete information.

This development shows that in both the continuous-time and discrete-time cases, all mixture parameters can be identified. In the continuous-time case, this is a consequence of restriction (1) on the form of generators, and in the discrete-time case, it is a consequence of restriction (10). As suggested by a referee, it is of interest to consider more general identifiable mixtures than the one defined in (2). One possible generalization of (2) is to assume that [A.sub.m] is obtained from the basic generator Q by introducing two gamma parameters for each row (in the present model there is one gamma parameter, [[gamma].sub.i,m], for the ith row), a gamma parameter for the upward transitions (above diagonal) and one for the downward transitions (below diagonal). The estimation in this extension of (2) and in more general ones is left to the future investigation.

Maximum likelihood estimation in the discrete-time mover-stayer model was developed by Frydman (1984) and Fuchs and Greenhouse (1988). Some additional aspects of this estimation were discussed by Swensen (1996). Estimation in the continuous-time mover-stayer model from continuous data was considered by Frydman and Kadam (2004) and from panel data by Fougere and Kamionka (2003). Cook, Kalbfleisch, and Yi (2002) considered estimation of a generalized mover-stayer model from panel data. Because the mover-stayer model is a very special case of the mixture of Markov chains proposed...

View this article FREE - Now for a Limited Time, try Goliath Business News
Free for 3 Days!



More articles from Journal of the American Statistical Association
Ranked Set Sampling: Theory and Applications.(book by Zehua Chen, Zhid..., September 01, 2005
Association Schemes: Designed Experiments, Algebra and Combinatorics.(..., September 01, 2005
Diagnostic Checks in Time Series.(book by Wai Keung Li )(Book Review), September 01, 2005
Introduction to Rare Event Simulation.(book by James Antonio Bucklew)(..., September 01, 2005
Transdimensional Markov chains: a decade of progress and future perspe..., September 01, 2005

Looking for additional articles?
Search our database of over 3 million articles.

Looking for more in-depth information on this industry?
Search our complete database of Industry & Market reports by text, subject, publication name or publication date.

About Goliath
Whether you're looking for sales prospects, competitive information, company analysis or best practices in managing your organization, Goliath can help you meet your business needs.

Our extensive business information databases empower business professionals with both the breadth and depth of credible, authoritative information they need to support their business goals. Whether it be strategic planning, sales prospecting, company research or defining management best practices - Goliath is your leading source for accurate information.