I've been trying to think of a way of modifying the standard transition matrix method to describe a process where you have many "particles," but they may only switch states in some fixed number higher than 1. For example, assuming the particles may only hop between states in
pairs, but are otherwise independent, is there a simple way to predict the system's development over many time steps?
Here is the standard approach. Removing the "must hop in pairs" constraint, the transition matrix is a simple way of analyzing many time steps for a single particle. Suppose there are two states a particle may be in, state 1 and state 2 (denoted by
s1 and
s2). During each time step, a particle in state 1 has probability
a11 of remaining in state 1, and probability
a21 of switching to state 2. A particle in state 2 has probability
a22 of remaining in state 2, and probability
a12 of switching to state 1. The transition matrix,
A, is
The probability of finding the particle in each state after
t time steps is found from
At. If we're given its initial location, then
The components of the final state vector,
s(
t), are mutually exclusive: either the particle has found its way to state 1 or state 2. So the total probability of the particle being
somewhere is just their sum (which is, of course, 1). To calculate the statistics for
N independent particles, this sum is raised to the
N power. For 1 time step, if the particle starts in state 1, this is a simple binomial:
Q = (a11 + a21)N
Q, the sum over all available system states, is the partition function. For multiple time steps,
Q is calculated as
Q = ([1 1]Ats(0))N
where [1 1] just "flattens" the final state vector into a sum, so the multiparticle expansion is all-to-all, as required. This is mathematically convenient in this case, but it's worth noting that the final state may be kept in vector form: we can still do an all-to-all multiplication by convoluting the vector with itself (which is, in an unfortunate convention, denoted by *),
Ats(0) * Ats(0) * · · · * Ats(0)
Writing the multiparticle expansion as a convolution seems like a pointless complication, since we're going to ultimately flatten the vector to calculate
Q. However, keeping the vector's structure intact has the advantage of being able to keep track of
pairs of particles switching states, because that information has not been flattened out. This is most clearly seen by writing the convolution in matrix form. The convolution matrix,
C, will have the state vector as its first row, and a "shifted" state vector as its second row. (A matrix created using this rule is called a "circulant matrix.") After 1 time step,
This circulant matrix is handy because it lets us calculate convolutions by doing ordinary matrix multiplication. The total expansion for a two-particle system (
N = 2) is:
C2 = | | a112 + a212 | 2a11a21 |
2a11a21 | a112 + a212 |
| |
|
The internal structure of the circulant stores
even powers of the hop probability
a21, corresponding to pairs of particles switching states, on its diagonal, and
odd powers on its anti-diagonal. To see this more clearly, look at the circulant for a four-particle system,
C4 = | | a114 + 6a112a212 + a214 | 4a113a21 + 4a11a213 |
4a113a21 + 4a11a213 | a114 + 6a112a212 + a214 |
| |
|
So the partition function for 1 time step for pairs of particles can be calculated as 1/2 of the trace of the circulant raised to the number of particles! Alternately, to store every third power of the hop probability on its diagonal (a "three particles must jump simultaneously" rule), a 3x3 circulant matrix may be used:
C = | | a11 | a21 | 0 |
0 | a11 | a21 |
a21 | 0 | a11 |
| |
|
and so on, for every fourth (4x4), fifth (5x5), etc. power.
This is not that useful as-is, because it's only for a single time-step, where it is not difficult to brute-force calculate every possible trajectory by hand. It would be very nice to be able to do this for multiple time steps: in effect, combining the transition matrix and circulant matrix to allow you to easily develop a multiparticle system for many time steps, while keeping the allowed trajectories separated from the forbidden trajectories inside the circulant's structure. This is a bit frustrating, because I think there should be a simple way to do this, but for some reason I just can't see what it is...
This explanation makes the problem much clearer to me... maybe because I now know what A^t is? I see what you mean: there is obviously a pattern there, and probably a method for tracking which particles stay or jump... as for what that method is, I am not sure, either. Now that I understand this a little better, I will keep musing on it... have you thought more about the explanation with calculus that you told me about earlier (I am not sure I really understood it, but something with a ring or circle of numbers, I think)? Maybe there is a way to turn that explanation into something on a matrix basis and see a pattern? I'm sorry I'm not much help yet, but that would probably be one avenue of thought I would take if I knew more math.... *sigh* must learn more math!!
ReplyDeleteThe calculus method, as I remember it, uses a delta function to select out the allowed powers in the binomial expansion. (I think Steve wrote the delta function as a loop integral, which may be what you're thinking of?) This method, however, also only works for 1 time step. The underlying difficulty is that, normally, the time evolution is calculated prior to doing the multiparticle expansion. However, if you impose a stoichiometric constraint (e.g., "only pairs of particles may jump"), you have to incorporate the multiparticle expansion after every time step, since you have to get rid of the forbidden trajectories. If you do the full time evolution, and then just zero out for example all odd powers of the hop probabilities, you end up including a lot of forbidden trajectories, because this permits any net even number of particles to jump over all time steps, rather than restricting it to pairs at each time step!
ReplyDeleteIt seems like there should be a straightforward way of incorporating this constraint, which is encoded by the circulant matrix, into the transition matrix. So far I haven't been able to figure out what that should be, though. It's possible I could write a third-order transition tensor that would include the circulant's structure as the third mode of the tensor. However, I am not sure this would work, and higher-order tensors are a giant pain in the butt to work with, so I am trying to avoid that!