Exploiting fluctuations in gene expression to detect causal interactions between genes

Definition 1. A co-transition event is a transition event in the system where two or more components change simultaneously. That is, a reaction where the step size d has more than one vector component.

An example would be a conversion event, where a chemical species $z_{m}$ converts to another chemical species $z_{n}$

$(z_{m}, z_{n}) \overset{W (z_{m})}{\to} (z_{m} - 1, z_{n} + 1)$

According to our framework, an arrow would be drawn from $z_{m}$ to $z_{n}$ in the network if the above reaction was part of the system. Another example would be two molecules $z_{m}$ , $z_{n}$ that bind to form a complex $z_{l}$

$(z_{m}, z_{n}, z_{l}) \overset{W (z_{m}, z_{n})}{\to} (z_{m} - 1, z_{n} - 1, z_{l} + 1) .$

If the above reaction was part of the system, an arrow would be drawn from $z_{m}$ to $z_{n}$ , from $z_{n}$ to $z_{m}$ , and from both $z_{m}$ , $z_{n}$ to $z_{l}$ .

In order to derive Equation 2, we need to assume that no components in $z_{aff}^{c}$ are part of a co-transition event with any variable in $z_{aff}$ . We thus begin by proving the following lemma.

Lemma 1. Let Z_k be a component that is not affected by X or Y. For any network of the class in Appendix 1—figure 21B (Equations 4.3; 4.4), there exists another network with the exact same dynamics in X, Y, and Z_k, but where no components in $z_{aff}^{c}$ are part of a co-transition event with any component in $z_{aff}$ .

Proof of Lemma 1. Let the following co-transition reaction be part of the system, where ${a_{k}}$ are variables in $z_{aff}$ and ${b_{k}}$ are variables in $z_{aff}^{c}$

(5.1)

$(a, b) \overset{W (b)}{\to} (a + d_{a}, b + d_{b}) .$

By definition of $z_{aff}$ and $z_{aff}^{c}$ , the reaction rate $W$ in the above reaction cannot depend on the variables ${a_{k}}$ . However, because a is part of $z_{aff}$ , the ${a_{k}}$ variables must be affected by X or Y. Therefore, there must exist one or more reactions, labeled with $i \in {1, 2, \dots}$ , in the system, that change a with reaction rates that depend on the variables affected by X or Y

(5.2)

$a \overset{W_{i} (z_{aff}, z_{aff}^{c})}{\to} a + d_{i} for i = 1, 2, \dots$

Note that the above reactions cannot make changes to b; otherwise, the ${b_{k}}$ variables would not be in the variables not affected by X or Y. We now consider an exact copy of the whole system z and the system reactions, with the change that the a variables are decomposed into two sets of mock variables. Specifically, we define the variables ${a_{k}^{i n t}}$ and ${a_{k}^{b}}$ that undergo the following reactions

$\begin{aligned} (a^{b}, b) & \overset{W (b)}{\to} (a^{b} + d_{a}, b + d_{b}) \\ a^{i n t} & \overset{W_{i} (z_{aff}, z_{aff}^{c})}{\to} a^{i n t} + d_{i} for i = 1, 2, \dots, \end{aligned}$

which correspond to the reactions in Equations 5.1; 5.2. We replace all the explicit a dependencies in the reaction rates of the system with $a^{i n t} + a^{b}$ . As a result, all the reactions that depended on a are unchanged, but now we can put the $a^{b}$ variables in $z_{aff}^{c}$ , and we can put the $a^{i n t}$ variables in $z_{aff}$ . We are left with a system where the $a^{i n t}$ are not part of a co-transition event with variables in $z_{aff}^{c}$ , and the dynamics of X and Y remain unchanged. Moreover, none of the reaction rates that govern the dynamics of any component in the original $z_{aff}^{c}$ have been altered, meaning the dynamics of Z_k remain unchanged. Such a decomposition can be done for any co-transition reaction that involves components from $z_{aff}$ and $z_{aff}^{c}$ .

We now let $Z_{k}$ correspond to another component of interest in the system, as a continuous-time Markov process. It can be any stochastic process that can be measured, like the abundance of a molecular species in the network, the size of the cell, or any parameter that can influence the reaction rates of the system.

Theorem 1. Let $Z_{k}$ be a component in $z_{aff}^{c}$ . If the averages $⟨ x_{t} ⟩$ , $⟨ y_{t} ⟩$ and the covariances $Cov (x_{t}, z_{k, t})$ , $Cov (x_{t}, z_{k, t})$ over the ensemble have reached a stationary state (i.e. they have become constant over time), then $η_{x z_{k}} = η_{y z_{k}}$ .

Proof of Theorem 1. We condition on the components not affected by X or Y, $z_{aff}^{c} [- \infty, t]$ . This corresponds to a hypothetical system where all the variables in $z_{aff}^{c}$ become deterministic time-varying signals ${z_{k} (t)}$ . The reactions governing X and Y in the conditional system become

(5.3)

$\begin{array}{cc} x \overset{R (z_{aff}, t)}{\to} x + 1 & y \overset{α R (z_{aff}, t)}{\to} y + 1 \\ x \overset{x β (t)}{\to} x - 1 & y \overset{y β (t)}{\to} y - 1 \end{array}$

where $R (z_{aff}, t) = R (z_{aff}, z_{aff}^{c} (t))$ and $β (t) = β (z_{aff}^{c} (t))$ now have an explicit time dependence from the conditioned history $z_{aff}^{c} [- \infty, t]$ . We let $A$ be the set of all integers k such that the k-th reaction in Equation 4.1 leads to a change in at least one of the components in $z_{aff}$ . In the conditional probability space, the components affected in $z_{aff}$ follow the following reactions

(5.4)

$z_{aff} \overset{W_{k} (z_{aff}, t)}{\to} z_{aff} + d_{k} \forall k \in A,$

where $W_{k} (z_{aff}, t) = W_{k} (z_{aff}, z_{aff}^{c} (t))$ now has an explicit time dependence from the conditioned history of the variables not affected by X or Y. Note that if some components in $z_{aff}^{c}$ were part of a co-transition event with components in $z_{aff}$ , then Equations 5.3; 5.4 would not hold. This is because conditioning on the history of those extrinsic variables effectively conditions on those birth events in $z_{aff}$ that are caused by those co-transitions. However, from Lemma 1, we can always work with another network in which there are no such co-transition events, and where the dynamics of X and Y remain unchanged.

This conditional system follows the following master equation

$\begin{array}{ll} \frac{d}{d t} & P (x, y, z_{a f f}, t ∣ z_{a f f}^{c} [- \infty, t]) = \\ \sum_{k \in A} [W_{k} (z_{a f f} - d_{k}, t) P (x, y, z_{a f f} - d_{k}, t ∣ z_{a f f}^{c} [- \infty, t]) - W_{k} (z_{a f f}, t) P (x, y, z_{a f f}, t ∣ z_{a f f}^{c} [- \infty, t])] \\ + R (x - 1, y, z_{a f f}, t) P (x - 1, y, z_{a f f}, t ∣ z_{a f f}^{c} [- \infty, t]) - R (x, y, z_{a f f}, t) P (x, y, z_{a f f}, t ∣ z_{a f f}^{c} [- \infty, t]) \\ + α R (x, y - 1, z_{a f f}, t) P (x, y - 1, z_{a f f}, t ∣ z_{a f f}^{c} [- \infty, t]) - α R (x, y, z_{a f f}, t) P (x, y, z_{a f f}; t ∣ z_{a f f}^{c} [- \infty, t]) \\ + (x + 1) β (t) P (x + 1, y, z_{a f f}, t ∣ z_{a f f}^{c} [- \infty, t]) - x β (t) P (x, y, z_{a f f}, t ∣ z_{a f f}^{c} [- \infty, t]) \\ + (y + 1) β (t) P (x, y + 1, z_{a f f}, t ∣ z_{a f f}^{c} [- \infty, t]) - y β (t) P (x, y, z_{a f f}, t ∣ z_{a f f}^{c} [- \infty, t]) . \end{array}$

We consider the averages of X and Y conditioned on the upstream history, $\bar{x} (t) = E [x_{t} | z_{aff}^{c} [- \infty, t]]$ and $\bar{y} (t) = E [y_{t} | z_{aff}^{c} [- \infty, t]]$ , where $x_{t}$ and $y_{t}$ are the X and Y abundances at time t. From the above master equation, the time-evolution for these first moments can be derived (Joly-Smith et al., 2021; Hilfinger and Paulsson, 2011)

(5.5)

$\frac{d \bar{x}}{d t} = \bar{R} (t) - \bar{x} \bar{β} (t) & \frac{d \bar{y}}{d t} = α \bar{R} (t) - \bar{y} \bar{β} (t),$

where $\bar{R} (t) = E [R (x_{t}, y_{t}, z_{aff}, t) | z_{aff}^{c} [- \infty, t]]$ and $\bar{β} (t) = E [β (z_{aff}^{c}) | z_{aff}^{c} [- \infty, t]]$ are the average production and degradation rates conditioned on the history of the variables not affected by X or Y. Note that $\bar{β} (t) = E [β (z_{aff}^{c}) | z_{aff}^{c} [- \infty, t]] = β (z_{aff}^{c} (t)) = β (t)$ , because the time trajectory of $z_{aff}^{c}$ is set through the conditioning on the upstream history. We can then take the expectation of $\bar{x}$ over all possible histories of $z_{aff}^{c}$ to get

(5.6)

$E [\bar{x} (t)]_{histories} = E {[E [x_{t} ∣ z_{aff}^{c} [- \infty, t]]]}_{histories} = ⟨ x_{t} ⟩,$

which follows from the law of total expectation. We now let $Z_{k}$ correspond to any component in the network that is not affected by X or Y. It can be a molecular abundance, concentration, or another stochastic cellular variable like the growth rate of the cell. It follows that

(5.7)

$\begin{aligned} E {[\bar{x} (t) z_{k} (t)]}_{histories} & = E {[E [x_{t} ∣ z_{aff}^{c} [- \infty, t]] \cdot E [z_{k, t} ∣ z_{aff}^{c} [- \infty, t]]]}_{histories} \\ = E {[E [x_{t} z_{k, t} ∣ z_{aff}^{c} [- \infty, t]]]}_{histories} = ⟨ x_{t} z_{k, t} ⟩ \end{aligned},$

where the second step comes from the fact that conditioning on the history of $z_{aff}^{c}$ effectively also conditions on the history of $Z_{k}$ with $z_{k} (t) = E [z_{k, t} | z_{aff}^{c} [- \infty, t]]$ (so x and $z_{k}$ are independent when conditioning on the $z_{aff}^{c}$ history), the last step follows from the law of total expectation, and $z_{k, t}$ is the measured amount of $Z_{k}$ at time t. From Equations 5.6; 5.7, it follows that

(5.8)

$\begin{matrix} Cov (x_{t}, z_{k, t}) = Cov (\bar{x} (t), {\bar{z}}_{k} (t)), \end{matrix}$

where ${\bar{z}}_{k} (t) = E [z_{k, t} | z_{aff}^{c} [- \infty, t]] = z_{k} (t)$ , where the last step comes from the fact that the time trajectory of $z_{k}$ is set through the conditioning of the upstream histories. Intuitively, Equation 5.8 says that when X does not affect $Z_{k}$ , the stochastic fluctuations of X average out when taking the covariance between X and $Z_{k}$ . Strikingly, this is independent of any type of feedback that X may impose through interactions in the cloud of components $z (t)$ . As a result, the same should hold for Y, and since $\bar{y} (t)$ is governed by the same differential equation as $\bar{x}$ , the intrinsic fluctuations that differentiate X and Y will average out when taking the covariances with $Z_{k}$ .

That is, dividing the right equation in Equation 5.5 with α, we write the general solution for $\bar{x} (t)$ and $\bar{y} (t) / α$ , and find

(5.9)

$\bar{x} (t) = \bar{x} (0) e^{- \int_{0}^{t} β (u) d u} + \int_{0}^{t} e^{- (\int_{0}^{t} β (u) d u - \int_{0}^{t^{'}} β (v) d v)} R (t^{'}) d t^{'}$

(5.10)

$\bar{y} (t) / α = \bar{y} (0) e^{- \int_{0}^{t} β (u) d u} / α + \int_{0}^{t} e^{- (\int_{0}^{t} β (u) d u - \int_{0}^{t^{'}} β (v) d v)} R (t^{'}) d t^{'}$

Taking the average over all histories and subtracting the equations we have

(5.11)

$\begin{aligned} E [\bar{x} (t)]_{histories} - E [\bar{y} (t)]_{histories} & = E {[(\bar{x} (0) - \bar{y} (0) / α) e^{- \int_{0}^{t} β (u) d u}]}_{histories} \\ \Rightarrow ⟨ x_{t} ⟩ - ⟨ y_{t} ⟩ / α & = E {[(\bar{x} (0) - \bar{y} (0) / α) e^{- \int_{0}^{t} β (u) d u}]}_{histories}, \end{aligned}$

where in the second step we used Equation 5.6 which also holds for $y$ by symmetry. We now invoke the requirement that the averages $⟨ x_{t} ⟩$ and $⟨ y_{t} ⟩$ are stationary, meaning they are constant over time. In that case, $⟨ x_{t} ⟩ - ⟨ y_{t} ⟩ / α$ is constant over time, and so

(5.12)

$⟨ x_{t} ⟩ - ⟨ y_{t} ⟩ / α = lim_{t \to \infty} (⟨ x_{t} ⟩ - ⟨ y_{t} ⟩ / α) .$

Substituting Equation 5.11, we thus have

(5.13)

$⟨ x_{t} ⟩ - ⟨ y_{t} ⟩ / α = E {[(\bar{x} (0) - \bar{y} (0) / α) e^{- \int_{0}^{\infty} β (u) d u}]}_{histories} .$

Now, we must have $\int_{0}^{\infty} β (u) d u \to \infty$ when $⟨ β ⟩_{t} = lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} β (u) d u > 0$ . Therefore, the right-hand side of Equation 5.13 becomes 0:

(5.14)

$⟨ x_{t} ⟩ - ⟨ y_{t} ⟩ / α = E {[(\bar{x} (0) - \bar{y} (0) / α) e^{- \int_{0}^{\infty} β (u) d u}]}_{histories} = 0,$

Similarly, we now multiply Equations 5.10; 5.9 with $z_{k} (t)$ , average over all histories, and subtract to obtain

(5.15)

$\begin{aligned} l l E [\bar{x} (t) z_{k} (t)]_{histories} - E [\bar{y} (t) z_{k} (t)]_{histories} & = E {[(\bar{x} (0) - \bar{y} (0) / α) z_{k} (t) e^{- \int_{0}^{t} β (u) d u}]}_{histories} \\ \Rightarrow ⟨ x_{t} z_{k, t} ⟩ - ⟨ y_{t} z_{k, t} ⟩ / α & = E {[(\bar{x} (0) - \bar{y} (0) / α) z_{k} (t) e^{- \int_{0}^{t} β (u) d u}]}_{histories} = 0, \end{aligned}$

where in the second step we used Equation 5.7 which also holds for y by symmetry. The last step follows when $⟨ x_{t} z_{k, t} ⟩$ and $⟨ y_{t} z_{k, t} ⟩$ have reached stationarity and are constant over time, along with $⟨ β ⟩_{t} = lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} β (u) d u > 0$ .

It then follows from Equations 5.14; 5.15 that

(5.16)

$⟨ x ⟩ = ⟨ y ⟩ / α & Cov (x, z_{k}) = Cov (y, z_{k}) / α .$

Dividing $Cov (x, z_{k})$ by $⟨ x ⟩ ⟨ z ⟩$ , and $Cov (y, z_{k}) / α$ by $⟨ y ⟩ ⟨ z ⟩ / α$ , we find $η_{x z_{k}} = η_{y z_{k}}$ .

If the reporter Y is engineered to be passive (i.e., it does not affect components in the network), then a violation of Equation 2 would imply that X affects $Z_{k}$ . Otherwise, such a violation would imply that X or Y affect $Z_{k}$ .

Thus far, we assumed that all the components not affected by X or Y are part of a continuous-time Markov chain. This was in order to make a rigorous definition of causal interaction in our framework as a path in the topology of the transition rates. Alternatively, if we relax the requirement that the components not affected by X or Y be Markov chains (i.e. they can be a set of arbitrary stochastic processes), we can operationally define ‘no causal interaction from X or Y’ to mean that we can condition on the history of those stochastic processes and write down Equations 5.3; 5.4. We can then operationally define any violation of Equation 2 as a ‘causal interaction from X or Y’ to a stochastic process $Z_{k}$ .

Exploiting fluctuations in gene expression to detect causal interactions between genes

Continue Reading

More posts

ISS and Glass Lewis Issue Proxy Voting Policy Updates for 2026

Marquette Theatre alum takes lighting design to Broadway

Cyclones in the NFL: Week 15 Recap

UH study reveals how ocean’s most abundant bacteria diversify