Linking complex microbial interactions and dysbiosis through a disordered Lotka–Volterra model

We thus collect all the parameters estimated from the data in a vector π=(h,qd,q0,K). As we will better detail in the Methods, we develop a moment matching inference algorithm to infer the model parameters θ, as depicted in Figure 1. The idea of the method is to introduce a cost function C(θ|π), representing a total relative error for some self-consistent equations. If the parameters θ are such that the right part of the self-consistent equation equals the left part, the problem is considered solved. Because the landscape associated with this cost function presents several minima, we perform multiple optimization procedures to collect an ensemble of possible solutions, from which we retain the top 30. First, we find that different solutions θ of the optimization problem provide ecological insights into the underlying microbiome populations.

As originally predicted in Altieri et al., 2021, among all the parameters that define Equation 1, the only ones relevant for reproducing the theoretical phase diagram are the amplitude of demographic noise and the heterogeneity of interactions. The mean interaction strength, provided it is sufficiently positive, does not play a significant role. This prediction is fully confirmed by the inference procedure applied to the two microbiome datasets, allowing us to identify a universal signature that distinguishes healthy from unhealthy states. Figure 3a shows, indeed, that inferred noise (T) and interaction heterogeneity strength (σ) for healthy and diseased microbiomes are clustered in the two-dimensional plane.

Distinct ecological organization in healthy vs diseased microbiomes.

(a) Inferred T (demographic noise strength) and σ (interactions heterogeneity) for healthy (blue) and diseased (red) microbiomes are clustered. Darker dots correspond to better solutions (i.e., solutions with a lower value of the cost function C), while the two points with hexagonal markers correspond to the best two (healthy and diseased, respectively) solutions. In the first panel inset, we also show (in log–log scale) the species abundance distributions (SADs) corresponding to each solution. To have a more concise representation, we present each SAD fixing the disorder to its average ζ¯=Kμh. (b) The probability density function of the inferred interactions αi,j for healthy (blue) and diseased (red) microbiomes. Dysbiosis reduces the heterogeneity of the interaction strengths. The quantities reported in the legend are the average and standard deviation of αi,j. They are calculated as μXα=μX/SX and σXα=σX/SX, where SX is the species pool size, estimated as the set of all observed species in a dataset, X can denote healthy (H) or diseased (D) individuals.

In particular, the SAD for the healthy cohort is robust among the different solutions of the inference procedure, as depicted by the superposition of the different curves in the inset of Figure 3a. On the other hand, SADs inferred from unhealthy patients have high sensitivity to different solutions. In particular, some of them display a mode for high-abundance species (light red lines in Figure 3a), a signature of dominant strain in the gut. Consistently, the distribution of the interactions P(αi,j) generated through the inferred parameters μ and σ is different between healthy and diseased cohorts, giving a distinct pattern of interactions (see Figure 3b), a result that is compatible with that found by Bashan et al., 2016. Remarkably, we find that dysbiosis reduces the heterogeneity of interaction strengths, a result also observed when taking correlations as a proxy for interactions (Seppi et al., 2023).

We then assess how close the inferred σ and β=1/T (a.k.a. inverse temperature in a statistical physics approach) are to the critical RSB line of the dgLV (R=0), evaluated by keeping all the other parameters constant (see Methods). We find again that the replicon values R corresponding to each solution of our optimization protocol are significantly different for the two investigated microbiome phenotypes (see Figure 4a). In particular, diseased microbiomes are closer to marginal stability within the RS ansatz (Altieri et al., 2021; Mézard et al., 1987; de Almeida and Thouless, 1978). Furthermore, by investigating the shape of the SAD given by Equation 2, we can estimate the ratio between niche (represented by species interaction) and neutral (represented by birth/death and immigration) ecological forces, which can be captured by the quantity ψ (Wu et al., 2021). It detects the emergence of peaks in the SAD as a hallmark of niche processes (see Appendix 2).


Stability of healthy vs diseased microbiomes.

(a) The replicon eigenvalue corresponding to each solution of our optimization procedure (shaded dots). The solid hexagon represents the replicon corresponding to the best solutions that minimize the error in predicting the order parameters of the theory (minimum C). The two investigated microbiome phenotypes (healthy in blue, diseased in red) are significantly different. In particular, diseased microbiomes are closer to the marginal stability of replica-symmetric ansatz (gray horizontal line). (b) Solutions of the moment-matching objective function are shown as a function of ψ and m, which in turn depend on the species abundance distribution (SAD) parameters (see main text). Healthy (blue) and diseased (red) microbiomes appear to be clustered. Therefore, distinct ecological organization scenarios (strong neutrality/emergent neutrality) take place. Darker dots correspond to solutions with lower values of the cost function, while hexagonal markers correspond to the two best solutions.

Inspired by field-theory arguments (see Methods and Appendix, Section S2), we can call the mass of the theory the m parameter, as defined above in Equation 2. In classical and quantum field theory, the particle–particle interaction embedded in the quadratic term is typically referred to as a mass source. In our context, m=1βσ2(qdq0) captures quadratic fluctuations of species abundances, as also appearing in the expression of the leading eigenvalue of the stability matrix. When m0, the analytical order parameters diverge and the system enters the unphysical regime of unbounded growth. As such, the mass term can be considered a complementary stability measure, capable of capturing the transition to the unbounded growth regime.

In the model, two kinds of effects compete to shape the community structure. On the one hand, we have niche effects, encoded in disordered interactions and thus tracked by the parameters μ, σ, and K. Their overall effect is selective and tends to concentrate the SAD around the typical abundance value. On the other hand, we have neutral effects encoded in the stochastic dynamics and immigration, governing the low-abundance regime of the SAD. When the demographic noise amplitude is stronger than immigration (ν<1, as in our case), the SAD exhibits a low-abundance integrable divergence. In the opposite scenario, for ν>1, there is no divergence, and the SAD is modal. Since interactions are random, the probability of observing an internal mode can be estimated as the fraction of SADs realizations having non-trivial solutions to the stationary point equation. Such a quantity, dubbed as the niche–neutral ratio, can be analytically evaluated:

(5)

ψ=12Erfc(ζ+ζ¯2σζ)+12Erfc(ζζ¯2σζ) ,

where ζ=4(1ν)mβ and ζ¯=Kμh. When ψ1, niche and neutral forces give comparable contributions to the dynamics, as both low-abundance divergence and a finite abundance mode coexist in the SAD. Finally, if the typical abundance diverges, we enter the unbounded growth phase, which means that the mass m and the niche–neutral ratio ψ are not independent, as suggested by the analytical expression for ψ. For an exhaustive derivation of this result, see Appendix 2. With the obtained model parameters, we are able to evaluate m and ψ for healthy and diseased microbiomes. Also, in this case, healthy and diseased microbiomes are visibly clustered, as shown in Figure 4. Unhealthy microbiomes turn out to be closer to the unbounded growth phase, and the niche–neutral ratio is larger by five orders of magnitude than the healthy case ψD105ψH. This leads us to argue that selective pressure is way larger in diseased states, while in the healthy one, birth and death effects are the key drivers of the dynamics. These results are also confirmed by the SAD shapes in the inset of Figure 3 (panel a).

In summary, in the Results section, we show that (i) the inference pipeline robustly recovers demographic noise and interaction heterogeneity by calculating h, q0, and qd from the data; and (ii) these parameters cluster according to health status, with diseased microbiomes lying closer to the replica-symmetry-breaking threshold, indicating reduced ecological resilience.

Continue Reading