4  Application—Bacterial Chemotaxis

The contents of this chapter have been published in Phys. Rev. X 13, 041017 (2023)  [1].

The chemotaxis signaling network of the bacterium Escherichia coli is a sophisticated information processing system, enabling the bacterium to sense nutrient gradients and dynamically adjust its movement. The bacterium’s ability to climb chemical gradients is constrained by the mutual information rate between the sensed nutrient concentration and the phosphorylated messenger protein CheYp. A recent study by Mattingly et al.  [2] used a Gaussian approximation to estimate this rate, based on the assumption that in shallow gradients the chemotactic response is approximately linear. However, the nonlinear nature of chemotaxis suggests that Gaussian methods may only approximate the true information rate. In this chapter, we apply an exact technique, Path Weight Sampling (PWS), to precisely quantify information transmission in E. coli chemotaxis and compare the results against the Gaussian approximation. We build a stochastic model based on literature data, which we use to simulate nonlinear chemotactic responses to time-dependent stimuli. Our PWS results for this model yield information rates 4–5 times higher than those obtained experimentally. While this finding can be viewed as surprisingly accurate for an ab initio prediction, the question remains whether the discrepancy is due to the limitations of the Gaussian framework used by Mattingly et al.  [2] or due to the assumptions of our stochastic model. Examining the latter question reveals that our initial model underestimates both the magnitude of the response and the biochemical noise. We refined the model by changing two key parameters that describe the receptor array, namely the number of clusters and their size. This leads to information rate estimates that closely align with experimental data, indicating that the number of receptor clusters is much smaller than hitherto believed, while their size is much larger. Finally, our analysis confirms the accuracy of the Gaussian framework for studying chemotaxis in shallow gradients, validating its use by Mattingly et al.  [2] a posteriori.

4.1 Introduction

The chemotaxis system of the bacterium Escherichia coli is a complex information processing system. It is responsible for detecting nutrient gradients in the cell’s environment and using that information to guide the bacterium’s movement. E. coli navigates through its environment by performing a biased random walk, successively alternating between so-called runs, during which it swims with a nearly constant speed, and tumbles, during which it randomly chooses a new direction  [3]. By adaptively varying the tumbling probability and thus adjusting the relative duration of runs and tumbles, the bacterium is able to climb a chemical gradient.

Recently, Mattingly et al.  [2] found that the ability of the bacterium to climb chemical gradients is fundamentally limited by the information it can acquire via its receptors. However, to calculate the information rate from their experimental data, they relied on a Gaussian approximation which assumes linear Gaussian statistics for the trajectories. The use of the approximation is justified by the argument that in shallow concentration gradients the behavior of the chemotaxis network is approximately linear. Nevertheless, it is well known that chemotaxis generally exhibits a highly nonlinear response, which suggests limitations to the Gaussian framework’s accuracy in capturing the full dynamics of the system. Moreover, we have seen in the previous chapter that the Gaussian approximation may fail in surprising ways.

In this chapter, we use Path Weight Sampling (PWS) to exactly quantify information transmission in E. coli chemotaxis, and to assess the accuracy of the Gaussian approximation. To study information transmission in E. coli chemotaxis using PWS, we first develop a stochastic model of the biochemical chemotaxis network based on the MWC model for receptor cooperativity  [46]. This model accurately describes the nonlinear behavior of the chemotaxis network and allows us to simulate the chemotactic response to an arbitrary time-dependent stimulus. Furthermore, we can use PWS to compute the mutual information rate between arbitrary input signals and the response signal generated by the model. To obtain realistic time-dependent input signals, we assume a cell performing a random walk in a static exponential gradient, following Mattingly et al.  [2]. By using the same input dynamics as those in Ref.  [2], we can rigorously compare our results against the experiments.

One of our principal findings is that the information transmission rate computed by PWS for the model based on the literature data—which we refer to as the “literature-based model”—is approximately 4–5 times higher than the rate measured experimentally by Mattingly et al.  [2]. Given that this model relies entirely on available literature data without any fitting to the results of Mattingly et al.  [2], an agreement within a factor of 4–5 is perhaps unexpectedly good. Still, the source of the discrepancy remains unclear: does it stem from the inaccuracy of the Gaussian framework used by Mattingly et al.  [2] or from the inaccuracy of our literature-based model?

To address this question, we examined the data of Mattingly et al.  [2] on the response and the noise amplitude. We found that the literature-based model underestimates not only the response strength but also the noise. By fitting the response kernel and the noise correlation function, we developed a “fitted model” in which receptor clusters are larger, enhancing the response amplitude, but significantly fewer in number, leading to much greater noise and thus a lower information transmission rate. Recomputing the information rate with PWS for this model, fitted to the linear response and the noise, yielded close agreement with the information rate measurements of Mattingly et al.  [2], suggesting that the receptor array composition differs substantially from previous assumptions. In particular, while the predicted cluster size is about a factor of 2 larger than previous estimates, the number of clusters is about tenfold lower. Lastly, we found that the Gaussian framework is highly accurate in the regime of shallow gradients as studied by Mattingly et al.  [2], thus verifying their Gaussian approach a posteriori.

In Section 4.2 we describe the dynamics of the input signal. Section 4.3 introduces the stochastic model of the chemotaxis system that we developed based on available literature. In Section 4.4 we describe the Gaussian approximation used by Mattingly et al.  [2] to compute the information transmission rate and in Section 4.5 we present the results. We conclude with a discussion of our findings, particularly regarding the number of clusters and their size.

4.2 Stochastic Dynamics of the Input Signal for Chemotaxis

The information transmission rate depends not only on the biochemical chemotaxis network, but also on the dynamics of the input signal. It is therefore important that the dynamics of this signal in our model agree with those in the experiments of Mattingly et al.  [2]. For these experiments the input signal is the time-dependent ligand concentration c(t)c(t) that is experienced by the swimming bacterium.

In a shallow exponential gradient c(x)c(x), the bacterium diffuses nearly freely in the xx-direction. The variance of the position increases with time, the hallmark of a random walk. The input signal is the concentration c(t)=c(x(t))c(t)=c(x(t)) as experienced by the bacterium at time tt.

We consider an Escherichia coli bacterium that swims in a static nutrient concentration gradient c(x)c(x). Following Mattingly et al.  [2], the gradient is exponential: c(x)=c0egxc(x)=c_0 e^{gx} with steepness gg. In a shallow gradient, the speed vx(t)v_x(t) of E. coli along the gradient axis can be considered as a stochastic process that fluctuates around the net chemotactic drift velocity. Following Mattingly et al.  [2], we assume that in a shallow gradient the bacterial swimming dynamics are, to a good approximation, the same as they are in the absence of a gradient. Their experimental evidence shows that the velocity fluctuations in absence of a gradient are described by an exponentially decaying auto-correlation function:

V(t)=vx(0)vx(t)=aveλ|t|.(4.1)V(t) = \langle v_x(0) v_x(t) \rangle = a_v e^{-\lambda |t|} \,. \qquad(4.1)

Therefore, in a shallow gradient, the gradient-climbing speed can be modeled as a zero-mean Ornstein-Uhlenbeck process

dvxdt=λvx+σξ(t)(4.2)\frac{dv_x}{dt} = -\lambda v_x + \sigma \xi(t) \qquad(4.2)

where σ=2avλ\sigma = \sqrt{2 a_v \lambda}, and ξ(t)\xi(t) is white noise with ξ(t)ξ(t)=δ(tt)\langle \xi(t) \xi(t^\prime) \rangle = \delta (t - t^\prime). The xx-position of the bacterium is given by the integral of the velocity, i.e., x(t)=0tdτvx(τ)x(t)=\int^t_0d\tau\, v_x(\tau). Thus, when projected onto the gradient axis, the bacterium performs a 1D random walk described by the Langevin equation

d2xdt2=λdxdt+σξ(t).(4.3)\frac{d^2 x}{dt^2} = -\lambda \frac{dx}{dt} + \sigma \xi(t) \,. \label{eq-random_walk} \qquad(4.3)

Since the bacterium moves in a static concentration gradient described by c(x)c(x), the concentration dynamics that the cell observes are generated directly from its own movement dynamics, see Fig. . At time tt the cell is at position x(t)x(t) and thus measures the concentration c(t)=c(x(t))c(t)=c(x(t)). We find the stochastic dynamics of cc by differentiating using the chain rule

dcdt=cxxt=gc(t)vx(t).(4.4)\frac{dc}{dt} = \frac{\partial c}{\partial x} \frac{\partial x}{\partial t} = g c(t)\,v_x(t)\,. \label{eq-concentration_dynamics} \qquad(4.4)

The concentration dynamics are thus fully determined by the stochastic dynamics of the cell’s swimming velocity vx(t)v_x(t) in the absence of a gradient and by the shape of the concentration gradient c(x)c(x). The resulting stochastic dynamics are visualized in Figure 4.1.

Figure 4.1: Input dynamics. Left column: two example time traces of the up-gradient velocity vx(t)v_x(t) and the observed concentration c(t)c(t) obtained by integrating Equation 4.4. Right column: averages of velocity and concentration traces obtained from 1000 simulated trajectories. The solid lines show the mean as a function of time and boundaries of the shaded regions indicate the 5% and 95% quantiles.

In the PWS simulations we use c(t)c(t) directly as the input to our system. Yet, for the Gaussian approximation, to which we will compare the PWS result, we need to use a different input signal because the chemotaxis system does not respond linearly to c(t)c(t). Instead, Mattingly et al.  [2] show that the chemotaxis system responds approximately linear to an input s(t)s(t) defined by

s(t)=ddtlnc(t)=gvx(t).(4.5)s(t) = \frac{d}{dt} \ln c(t) = gv_x(t) \,. \qquad(4.5)

The correlation function of s(t)s(t) is given by

Css(t)=s(τ)s(t+τ)=g2V(t).(4.6)C_{ss}(t) = \langle s(\tau) s(t + \tau) \rangle = g^2 V(t)\,. \qquad(4.6)

The power spectral density of this signal is given by the Fourier transform of its correlation function:

Pss(ω)=g2V(ω)=g22avλω2+λ2.(4.7)P_{ss}(\omega) = g^2 V(\omega) = g^2 \frac{2a_v \lambda}{\omega^2 + \lambda^2}\,. \qquad(4.7)

We use this same input below in Section 4.4 to compute the Gaussian approximation of the mutual information rate. As discussed in more detail in the main text, we note that the mutual information between the output and the input trajectory c(t)c(t), as measured in the PWS simulations, is identical to that between the output and the input trajectory s(t)s(t), as computed in the Gaussian model because of the deterministic and monotonic mapping between c(t)c(t) and s(t)s(t).

4.3 Stochastic Chemotaxis Model

Figure 4.2: Cartoon of the chemotaxis network of E. coli. Receptors form clusters with an associated CheA kinase. A cluster can either be active or inactive, depending on the number of bound ligands (green dots) and methylated sites (orange dots). Active CheA can phosphorylate CheY; phosphorylated CheY controls the rotation direction of the flagellar motors and thereby the movement of the bacterium.

We apply PWS to a stochastic model of the chemotaxis network that describes individual reactions via a master equation. In this model, the receptors are grouped in clusters. Each receptor can switch between an active and an inactive conformational state, but, in the spirit of the Monod-Wyman-Changeux model  [4], the energetic cost of having two different conformations in the same cluster is prohibitively large. We can then speak of each cluster as either being active or inactive. Each receptor in a cluster can bind ligand and be (de)methylated, which, together, control the probability that the cluster is active. In the simulations, receptor (de)methylation is modeled explicitly, because the (de)methylation reactions are slow. In contrast, the timescale of receptor-ligand (un)binding is much faster than the other timescales in the system, i.e., those of the input dynamics, CheY (de)phosphorylation, and receptor (de)methylation. The receptor-ligand binding dynamics can therefore be integrated out without affecting information transmission, in order to avoid wasting CPU time. In addition, the receptor clusters can phosphorylate CheY, while phosphorylated CheY is dephosphorylated at a constant rate. The dynamics of the kinase CheA and the phosphatase CheZ which drive (de)phosphorylation are not modeled explicitly. Figure 4.2 shows a depiction of the bacterial chemotaxis network.

Table Table 4.1 shows the parameter values of our chemotaxis model, which are all based on values reported in the literature. For what follows below, the key parameters are the number of receptors per cluster, which is taken to be N=6N=6 based on Refs.  [7,8], while the number of clusters is Nc=Nr/N=400N_{\mathrm c} = N_{\mathrm r} / N = 400, where 103<Nr<10410^3 < N_{\mathrm r} < 10^4 is an estimate for the total number of receptors based on Ref.  [9]. These are the numbers of the “literature-based model”. The key parameters, the number of clusters and their size, change in the “fitted model”, which is based on fitting the response kernel and noise correlation function to the data of Mattingly et al.  [2].

Table 4.1: The parameters required for the chemotaxis model, based on literature values. These are the parameters used in the so-called literature-based model. In the fitted model (see Section 4.5) the same parameter values are chosen, except for N=15N=15 and Nc=9N_c=9, which were obtained by fitting to the data of Mattingly et al.  [2]; we note that changing NN and NcN_c also requires updating kAk_A to keep the fraction ϕY\phi_Y of phosphorylated CheY constant.
parameter value description
ava_v 157.1 μm²/s² variance of up-gradient velocity  [2]
λ\lambda 0.86 s⁻¹ velocity correlation decay constant  [2]
c0c_0 100 μM mean ligand concentration  [2]
NN 6 number of receptor units per cluster  [7]
NcN_c 400 number of receptor clusters  [9]
MM 4 number of methylation sites per receptor  [7]
NYN_Y 10 000 total copy number of CheY proteins (phosphorylated and unphosphorylated)  [9]
KaK_a 2900 μM ligand dissociation constant of active receptors  [8]
KiK_i 18 μM ligand dissociation constant of inactive receptors  [8]
kRk_R 0.1 s⁻¹ methylation rate  [2,7]
kBk_B 0.2 s⁻¹ demethylation rate  [2,7]
kAk_A 0.015 s⁻¹ phosphorylation rate  [10,11]
kZk_Z 10.0 s⁻¹ dephosphorylation rate  [10,11]
ϕY\phi_Y 0.17 steady-state fraction of phosphorylated CheY  [10]
m0/Nm_0 / N 0.5 receptor methylation level without ligands  [7]
δfm\delta\!f_m -2.0 kBTk_\mathrm{B}T free energy change of active conformation from attachment of one methyl group  [7]

4.3.1 MWC Model

In our model, each cluster consists of NN receptors. report a typical value for the cluster size of N=6N=6. Detailed balance requires that the ligand binding affinity depends on whether a cluster is in the active or inactive state. Consequently, we have a dissociation constant KaK_a for a ligand bound to an active receptor and another dissociation constant KiK_i for a ligand bound to an inactive receptor. For chemotaxis, KaKiK_a\gg K_i, i.e. the ligand binding affinity is higher for the inactive state.

Additionally, each receptor monomer has MM methylation sites that can affect its conformation and therefore the kinase activity. The aspartate receptor Tar has M=4M=4 methylation sites  [7]. We model the receptors’ methylation dynamics such that CheB only demethylates active receptors while CheR only methylates inactive receptors, following previous models for chemotaxis  [5,12]. This approach represents arguably the simplest way to model methylation with exact receptor adaptation.

In an environment with ligand concentration cc, the probability of a receptor cluster with mm methylated sites to be active, pa(c,m)p_a(c, m), is determined by the free-energy difference between the active and inactive receptor states

pa(c,m)=11+ef(c,m)(4.8)p_a(c, m) = \frac{1}{1 + e^{-f(c, m)}} \label{eq-prob_active} \qquad(4.8)

where

f(c,m)=Nln(1+c/Ki1+c/Ka)+δfm(mm0).(4.9)f(c, m) = N \ln\left( \frac{1 + c/K_i}{1 + c/K_a} \right) + \delta f_m (m - m_0) \,. \label{eq-free_energy_active} \qquad(4.9)

Here, the number of methylated sites of a cluster (not receptor) is denoted by mm, ranging from 00 to NMNM. The parameters are again taken from Shimizu et al.  [7]. Their experimental results indicate that δfm=2kBT,m0=N/2\delta\!f_m = -2 k_\mathrm{B}T,\ m_0=-N/2. Kamino et al.  [8] report ligand dissociation constants of Ka=2900μMK_a = 2900\,\mu\text{M} for active receptors and Ki=18μMK_i = 18\,\mu\text{M} for inactive Tar receptors (for MeASP). Note that in the equations we assume units such that kBT=1k_\mathrm{B} T=1.

The dynamics of methylation in our model are described by the following mean-field equation

dmdt=(1pa(c,m))kRpa(c,m)kB.(4.10)\frac{dm}{dt} = (1-p_a(c, m)) k_R - p_a(c, m) k_B \,. \label{eq-methylation_dynamics} \qquad(4.10)

The system reaches a steady state for the adapted activity pa(c,m)=a0p_a(c, m)=a_0 where

a0=kRkR+kB.(4.11)a_0 = \frac{k_R}{k_R + k_B} \,. \label{eq-adapted_activity} \qquad(4.11)

The steady-state methylation mm^\star can be obtained from Equation 4.8, Equation 4.9 by solving pa(c,m)=a0p_a(c, m^\star)=a_0:

m=m0+Nln(1+c/Ki1+c/Ka)+ln(1a0a0)δfm.(4.12)m^\star = m_0 + \frac{N \ln\left( \frac{1 + c/K_i}{1 + c/K_a} \right)+ \ln\left( \frac{1-a_0}{a_0} \right)}{-\delta f_m}\,. \qquad(4.12)

To characterize the methylation timescale, we linearize the dynamics of m(t)m(t) around the steady state (at constant ligand concentration c(t)=c0c(t)=c_0). To first order, we can write

dmdt=m(t)mτm.(4.13)\frac{d m}{d t} = -\frac{m(t) - m^\star}{\tau_m}\,. \label{eq-dmdt} \qquad(4.13)

where τm\tau_m is the characteristic timescale of the methylation dynamics. We find τm\tau_m by expanding pap_a [Equation 4.8] around m=mm=m^\star:

pa(c,m)=pa(c,m)+pam|m(mm)+𝒪(m2)=a0[1δfm(1a0)(mm)]+𝒪(m2),(4.14)\begin{aligned} p_a(c, m) &= p_a(c, m^\star) + \frac{\partial p_a}{\partial m} \Bigg|_{m^\star} (m - m^\star) + \mathcal{O}(m^2) \\ &= a_0 [1 - \delta f_m (1-a_0) (m-m^\star)] + \mathcal{O}(m^2) \,, \end{aligned} \qquad(4.14)

and then plugging this first-order expansion into Equation 4.10 to get

dmdt=δfm(mm)kR1+kB1.(4.15)\frac{d m}{d t} = \frac{\delta f_m (m - m^\star)}{k_R^{-1} + k_B^{-1}} \,. \qquad(4.15)

By comparing with Equation 4.13 we find that for small perturbations, the timescale for methylation to approach steady state is given by

τm=kR1+kB1δfm.(4.16)\tau_m = \frac{k^{-1}_R + k^{-1}_B}{-\delta f_m} \,. \qquad(4.16)

Thus, the parameters kRk_R and kRk_R determine two important characteristics of the methylation system: the adapted activity a0a_0 and the methylation time scale τm\tau_m. Shimizu et al.  [7] report an adapted activity of a0=1/3a_0 = 1/3 and based on experimental data  [2,7] we assume a methylation time scale of τm=10s\tau_m = 10\,\text{s}. Our parameter choice, which is consistent with both of these observations, is kR=0.075s1k_R=0.075\,\text{s}^{-1} and kB=0.15s1k_B=0.15\,\text{s}^{-1}.

CheY is phosphorylated by CheA, the receptor-associated kinase. The kinase activity is directly linked to the activity of a receptor cluster. Therefore, we assume that CheY is phosphorylated by active receptor clusters. Dephosphorylation of CheY-p is catalyzed by the phosphatase CheZ, which we assume to be present at a constant concentration. The CheZ-catalyzed dephosphorylation rate was reported to be 2.2s12.2\,\text{s}^{-1} for attractant response and 22s122\,\text{s}^{-1} for repellent response  [11]. Based on this data, we use the approximate dephosphorylation rate kZ=10s1k_Z = 10\,\text{s}^{-1} in our model. In the fully adapted state the fraction of active receptors is a0a_0 and therefore the mean fraction of phosphorylated CheY, ϕY=[CheYp]/([CheY]+[CheYp])\phi_Y = [\text{CheYp}]/([\text{CheY}] + [\text{CheYp}]), is given by

ϕY=a0NckAkZ+a0NckA.(4.17)\phi_Y = \frac{a_0 N_c k_A}{k_Z + a_0 N_c k_A} \,. \qquad(4.17)

In the fully adapted state the phosphorylated fraction was found to be ϕY0.16\phi_Y \approx 0.16  [10]. Hence, we infer a phosphorylation rate of kA=kZϕY/(a0Nc(1ϕY))=0.015s1k_A = k_Z \phi_Y / (a_0 N_c (1 - \phi_Y)) = 0.015\,\text{s}^{-1} for the literature-based model. Accordingly, for the “fitted model”, based on fitting K(t)K(t) and N(t)N(t) to those measured by Mattingly et al.  [2], we use a larger phosphorylation rate due to the smaller number of clusters NcN_c.

4.3.2 Reaction Kinetics

Since the timescale of conformational switching of active and inactive receptors and ligand binding is much faster  [13] than the timescale of phosphorylation or methylation, we don’t explicitly model ligand (un)binding and conformational switching. Each cluster is characterized by its methylation state mm. This ranges from 0 to the total number of methylation sites, which equals the number of sites per receptor MM times the number of receptors per cluster NN. In our Gillespie simulation, each possible state of a cluster is its own species, i.e., we have species Cm\mathrm{C}_m for m=0,,NMm=0,\ldots,NM. Overall, our chemotaxis model consists of four types of reactions that describe (a) the methylation of a receptor CmCm+1\mathrm{C}_m\to\mathrm{C}_{m+1}, (b) the demethylation of a receptor CmCm1\mathrm{C}_m\to\mathrm{C}_{m-1}, (c) the phosphorylation of CheY Cm+YCm+Yp\mathrm{C}_m + \mathrm{Y} \to \mathrm{C}_m + \mathrm{Y_p}, and (d) the single dephosphorylation reaction YpY\mathrm{Y_p} \to \mathrm{Y}. Thus, due to the combinatorial explosion of receptor states, the system has a total number of 3NM+23NM+2 elementary reactions (which amounts to 75 reactions in the literature-based model and 182 reactions in the fitted model).

The ligand-concentration dependent methylation rate for CmCm+1\mathrm{C}_m\rightarrow \mathrm{C}_{m+1} is given by

km+(c,m)=(1pa(c,m))kR.(4.18)k_{m+}(c,m) = (1-p_a(c,m)) k_R \,. \qquad(4.18)

The term 1pa(c,m)1-p_a(c,m) is needed because only inactive receptors can be methylated. The demethylation rate for CmCm1\mathrm{C}_{m}\rightarrow \mathrm{C}_{m-1} is given by

km(c,m)=pa(c,m)kB(4.19)k_{m-}(c, m) = p_a(c,m) k_B \qquad(4.19)

where only active receptors can be demethylated. These zero-order dynamics of (de)methylation of receptors lead to the adaptive behavior of the chemotaxis system as described above.

Only active receptors can phosphorylate the CheY protein using the receptor-associated kinase CheA. We therefore model phosphorylation as a reaction Cm+YCm+Yp\mathrm{C}_{m} + \mathrm{Y} \rightarrow \mathrm{C}_{m} + \mathrm{Y_p} with rate

kYYp(c,m)=pa(c,m)kA(4.20)k_{\mathrm{Y}\rightarrow\mathrm{Y_p}}(c,m) = p_a(c, m) k_A \qquad(4.20)

where kAk_A is a constant that represents the phosphorylation rate of an active cluster. The dephosphorylation YpY\mathrm{Y_p}\rightarrow \mathrm{Y} is carried out by the phosphatase CheZ at a constant rate kZ=10s1k_Z=10\,\text{s}^{-1}.

4.4 Mutual Information Rate for the Chemotaxis System in the Gaussian Approximation

To test the validity of the Gaussian approach used by Mattingly et al.  [2], we also compared the exact PWS results for our discrete, stochastic model to the prediction of the Gaussian approximation for this same model. In continuous time, the information transmission rate R(𝒮,𝒳)R(\mathcal{S}, \mathcal{X}) of a Gaussian system in steady state can be computed exactly from the power spectral density functions of the system  [14]:

R(𝒮,𝒳)=14πdωln[1|Psx(ω)|2Pss(ω)Pxx(ω)].(4.21)R(\mathcal{S}, \mathcal{X}) = -\frac{1}{4\pi} \int^\infty_{-\infty} d\omega\ \ln\left[1 - \frac{|P_{sx}(\omega)|^2}{P_{ss}(\omega)P_{xx}(\omega)}\right] \,. \label{eq-info_rate_gaussian} \qquad(4.21)

Here, the power spectral density Pαβ(ω)P_{\alpha\beta}(\omega) is defined as

Pαβ(ω)=dteiωtCαβ(t)(4.22)P_{\alpha\beta}(\omega) = \int^\infty_{-\infty} dt\ e^{-i\omega t} C_{\alpha\beta}(t) \qquad(4.22)

where Cαβ(titj)=α(ti)β(tj)C_{\alpha\beta}(t_i - t_j) = \langle \alpha(t_i) \beta(t_j) \rangle denote the stationary (cross-)correlation functions of the system.

Thus, we need to obtain the (cross-)correlation functions to compute the information rate in the Gaussian framework. In their experiments with E. coli bacteria, Mattingly et al.  [2] don’t obtain these correlation functions directly, however. Instead, they obtain three kernels, V(t)V(t), K(t)K(t) and N(t)N(t), from which the correlation functions can be inferred. We follow this approach for calculating the Gaussian information transmission rate.

4.4.1 Computing the Gaussian information rate using linear response kernels V(t), K(t), and N(t)

V(t)V(t) denotes the autocorrelation function of the swimming velocity of bacteria, i.e., V(t)=vx(τ)vx(τ+t)V(t)=\langle v_x(\tau) v_x(\tau + t)\rangle. As explained in Section 4.2, the swimming dynamics of the bacteria determine the statistics of the input signal s(t)=ddtlnc(t)s(t) = \frac{d}{dt}\ln c(t), where c(t)c(t) is the ligand concentration as experienced by the bacterium and gg is the gradient steepness. The input signal correlation function, denoted by Css(t)C_{ss}(t), can then be expressed as

Css(t)=g2V(t).(4.23)C_{ss}(t) = g^2 V(t) \,. \qquad(4.23)

The response kernel, denoted by K(t)K(t), represents the time evolution of the average activity of the receptors in response to an instantaneous step change in the input concentration. More precisely, K(t)K(t) is defined as

K(t)=θ(t)a(t)a0lncsc0(4.24)K(t) = \theta(t) \langle a(t) - a_0 \rangle \ln\frac{c_s}{c_0} \qquad(4.24)

where we assume the input concentration jumps instantaneously from c0c_0 to csc_s at time t=0t=0. θ(t)\theta(t) is the Heaviside step function. Note that because the signal s(t)s(t) is defined as the time-derivative of the concentration c(t)c(t), a step-change in c(t)c(t) corresponds to a delta impulse in s(t)s(t). Thus, K(t)K(t) describes the deterministic dynamics of the system after being subjected to a delta stimulus s(t)=δ(t)s(t) = \delta(t), making K(t)K(t) the Green’s function of the system. The activity a(t)a(t) resulting from arbitrary time-dependent signal s(t)s(t) can be written as a convolution of K(t)K(t) with s(t)s(t)

a(t)=a0+tdtK(tt)s(t)+ηa(t)(4.25)a(t) = a_0 + \int^t_{-\infty} dt^\prime\ K(t-t^\prime) s(t^\prime) + \eta_a(t) \qquad(4.25)

where ηa(t)\eta_a(t) is the receptor activity noise. We define the response x(t)=a(t)a0x(t)=a(t) - a_0. Assuming the input statistics are stationary and described by the correlation function Css(t)C_{ss}(t), it is easy to show that the cross-correlation between s(t)s(t) and x(t)x(t) is given by

Csx(t)=s(τ)x(τ+t)=tdtK(tt)Css(t).(4.26)C_{sx}(t) = \langle s(\tau) x(\tau + t) \rangle = \int^t_{-\infty} dt^\prime\ K(t-t^\prime) C_{ss}(t^\prime) \,. \label{eq-conv_correlation} \qquad(4.26)

In other words, the cross-correlation between s(t)s(t) and x(t)x(t) is given by the convolution of the response kernel with the input correlation function.

The noise kernel N(t)N(t) describes the autocorrelation of the activity fluctuations in the absence of an input stimulus. In particular, N(t)=ηa(τ)ηa(τ+t)N(t) = \langle \eta_a(\tau) \eta_a(\tau + t) \rangle.

We now rewrite Equation 4.21 for the mutual information rate in terms of the three kernels described above. We express the power spectra Pαβ(ω)P_{\alpha\beta}(\omega) in terms of the Fourier-transformed kernels V(ω)V(\omega), K(ω)K(\omega), and N(ω)N(\omega). In Section 4.2 we already showed that Pss(ω)=g2V(ω)P_{ss}(\omega) = g^2 V(\omega). The cross power spectrum is given by Psx(ω)=K(ω)Pss(ω)P_{sx}(\omega) = K(\omega) P_{ss}(\omega) which follows from Equation 4.26. Finally, from Ref.  [14] we use the identity Pxx(ω)=Pss(ω)|K(ω)|2+N(ω)P_{xx}(\omega) = P_{ss}(\omega) |K(\omega)|^2 + N(\omega) to express the output power spectrum. We insert these expressions into Equation 4.21 which yields

R(𝒮,𝒳)=14πdωln(1+g2V(ω)|K(ω)|2N(ω)).(4.27)R(\mathcal{S}, \mathcal{X}) = \frac{1}{4\pi} \int^\infty_{-\infty} d\omega\ \ln\left(1+\frac{g^2 V(\omega) |K(\omega)|^2}{N(\omega)}\right). \qquad(4.27)

Then, for shallow gradients, we can make a Taylor approximation in gg to obtain

R(𝒮,𝒳)=g24πdωV(ω)|K(ω)|2N(ω)+𝒪(g4).(4.28)R(\mathcal{S}, \mathcal{X}) = \frac{g^2}{4\pi} \int^\infty_{-\infty} d\omega\ \frac{V(\omega)|K(\omega)|^2}{N(\omega)} + \mathcal{O}(g^4) \,. \label{eq-gauss_mi_approx} \qquad(4.28)

This result shows that the information rate in shallow gradients is proportional to g2g^2 and the proportionality constant is determined by the measured kernels. Mattingly et al.  [2] obtain the relevant kernels V(ω)V(\omega), K(ω)K(\omega), and N(ω)N(\omega) from experiments by fitting phenomenological models to their single-cell data. We obtain the kernels from the simulation outputs of our stochastic chemotaxis model.

4.4.2 Estimating the kernels from simulations

4.4.2.1 Response Kernel K(t)

To compute the response kernel from simulations of our model, we study how the system responds to a sudden increase in ligand concentration. First, we allow the system to reach steady state by adapting it to an initial ligand concentration of c0=100μMc_0=100\,\mu\text{M} for t0=50st_0 = 50\,\text{s}. At time t=t0t=t_0, we instantaneously increase the concentration by 10% to cs=c0+0.1c0c_s=c_0 + 0.1 c_0. We then record the system’s response over the next 200 s, sampling at intervals of 0.01 s.

Rather than directly obtaining the receptor activity from the simulations, we follow the experimental approach of inferring the receptor activity from the phosphorylated CheY levels. Specifically, we record the fraction f(t)=[Yp]/[Y]f(t)=[\mathrm{Y_p}]/[\mathrm{Y}] between phosphorylated and unphosphorylated CheY. Since the copy number of CheY is relatively large, this fraction serves as a good proxy for the activity a(t)a(t). We relate the f(t)f(t) to the activity a(t)a(t) via the expression

a(t)=kZkANcf(t)(4.29)a(t) = \frac{k_Z}{k_A N_c} f(t) \label{eq-frac_to_act} \qquad(4.29)

where kAk_A and kZk_Z are the phosphorylation and dephosphorylation rate, respectively, and NcN_c is the number of receptor clusters.

Finally, we estimate the response Kernel K(t)K(t) by averaging the changes in measured activity over 10510^5 simulated trajectories

K(t)=ln(csc0)a(tt0)a(t0).(4.30)K(t) = \ln\left( \frac{c_s}{c_0} \right) \langle a(t-t_0)-a(t_0) \rangle\,. \qquad(4.30)

4.4.2.2 Output Noise Kernel N(t)

We can similarly obtain the noise statistics of the output from simulations of our chemotaxis model. In this case, we stochastically evolve the chemotaxis model at constant ligand concentration c0=100μMc_0=100\,\mu\text{M} for a very long time of 1 × 104 s. The result is a time trace of the activity a(t)a(t), which we again obtain from the fraction f(t)f(t) using Equation 4.29. We discretize this time trace at a resolution of 0.01 s. This results in a time series 𝒂=(a1,,aN)T\mathbfit{a}=(a_1,\ldots,a_N)^T where ai=a(ti)a_i=a(t_i). To estimate the correlations in the time series we subtract the overall average activity from each data point and thus obtain the data vector 𝒙\mathbfit{x} where xi=aij=1Naj/Nx_i=a_i - \sum^N_{j=1} a_j/N. From 𝒙\mathbfit{x} we estimate the auto-correlation function Cxx(t)=x(τ)x(τ+t)C_{xx}(t) = \langle x(\tau) x(\tau + t) \rangle of the activity. To obtain precise results we average the correlation function for 10510^5 trajectories.

4.4.2.3 Obtaining the Fourier Kernels using the FFT

To compute the Gaussian information rate, we need the frequency-space representations of the kernels V(t)V(t), K(t)K(t), and N(t)N(t). We already derived the analytical form of V(ω)V(\omega) in Section 4.2. We obtain K(ω)K(\omega) and N(ω)N(\omega) numerically via a discrete Fourier transform of the corresponding time-domain kernel.

As explained above, we compute time-discretized kernels Ki=K(ti)K_i = K(t_i) and Ni=N(ti)N_i = N(t_i) from time traces obtained via stochastic simulations of our model. We sample these functions at the instants t0,,tN1t_0,\ldots,t_{N-1}, the sampling frequency being fs=100Hzf_s = 100\,\text{Hz}. Then, we use the discrete Fourier transform (DFT) to obtain approximations for K(ω)K(\omega) and N(ω)N(\omega) as follows. The DFT coefficients K̃k\tilde{K}_k are given by

K̃k=n=0N1Knei2πnk/N(4.31)\tilde{K}_k = \sum_{n=0}^{N-1} K_n e^{-i2\pi nk/N} \qquad(4.31)

where k=0,1,,N1k=0,1,\ldots,N-1. These DFT coefficients can be computed efficiently using the Fast Fourier Transform (FFT) algorithm. The DFT provides point estimates for the Fourier-domain kernel K(ω)K(\omega) at discrete frequencies

ωk=2πfskN,k=0,1,...,N1,(4.32)\omega_k = \frac{2\pi f_s k}{N},\quad k=0,1,...,N-1 \,, \qquad(4.32)

i.e., K(ωk)K̃kK(\omega_k) \approx \tilde{K}_k. This approximation introduces some level of error, known as spectral leakage, due to the finite duration and sampling of the signal. This error can be reduced by multiplying the time-domain kernel with a window function. Thus, before computing the DFT, we multiply the kernel with a Hanning window, which is a smooth function that tapers at the edges of the kernel, reducing the effect of discontinuities at the beginning and end of the time series. The Hanning window is defined as:

hn=12[1cos(2πnN1)],n=0,1,...,N1.(4.33)h_n = \frac{1}{2}\left[1 - \cos\left(\frac{2\pi n}{N-1}\right)\right],\quad n=0,1,...,N-1 \,. \qquad(4.33)

The windowed kernel knk_n is obtained by multiplying the time-domain kernel KnK_n with the Hanning window hnh_n:

kn=Knhn,n=0,1,...,N1(4.34)k_n = K_n h_n,\quad n=0,1,...,N-1 \qquad(4.34)

Using the FFT algorithm we then compute the DFT coefficients k̃k\tilde{k}_k of the windowed kernel.

The procedure described above to obtain the DFT coefficients k̃k\tilde{k}_k from K(t)K(t) is also applied to N(t)N(t) to obtain the coefficients ñk\tilde{n}_k.

We can then evaluate the information rate using Equation 4.28 by discretizing the integral dωF(ω)kΔωF(ωk)\int d\omega\,F(\omega) \to \sum_k \Delta\omega\,F(\omega_k) with Δω=2πfs/N\Delta\omega = 2\pi f_s / N. More precisely, we compute the Gaussian information rate as

R(𝒮,𝒳)=g24πk=0N1ΔωV(ωk)|k̃k|2ñk.(4.35)R(\mathcal{S}, \mathcal{X}) = \frac{g^2}{4\pi} \sum^{N-1}_{k=0} \Delta\omega\ \frac{V(\omega_k)|\tilde{k}_k|^2}{\tilde{n}_k} \,. \qquad(4.35)

4.5 Results

We first asked whether our chemotaxis model based on the current literature can reproduce the information transmission rate as recently measured by Mattingly et al.  [2]. In what follows, we call this model the “literature-based” model.

4.5.1 Discrepancy between experimental results and literature-based model

In our model, the output is the concentration of phosphorylated CheY, while in the experiments of Mattingly et al.  [2] it is the average activity of the receptor clusters as obtained via FRET measurements. We argue that this difference does not significantly affect the obtained information rates, and thus, that it is valid to compare our results to the experiments. In particular, since the copy number of CheY is much larger than the number of receptor clusters, the fluctuations in CheY are dominated by the extrinsic fluctuations coming from the receptor activity noise rather than from the intrinsic fluctuations associated with CheY (de)phosphorylation. To a good approximation, the copy number of phosphorylated CheY, Yp(t){\mathrm Yp}(t), is thus a deterministic function of the average receptor activity a(t)a(t). Mathematically, the mutual information I(X;Y)I(X;Y) between two stochastic variables XX and YY is the same as the mutual information I(f(X);g(Y))I(f(X);g(Y)) for deterministic and monotonic functions ff and gg. It follows that the mutual information between c(t)c(t) and Yp(t){\mathrm Yp}(t), is nearly the same as that between c(t)c(t) and the receptor activity a(t)a(t). It is therefore meaningful to compare the information transmission rates as predicted by our PWS simulations to those measured by Mattingly et al.  [2].

We use RR-PWS to compute the mutual information for the literature-based model. Specifically, we measure the mutual information I(𝐂,𝐘𝐩;T)I(\mathbf{C}, \mathbf{Y_p}; T) between the input trajectory of the ligand concentration c(t)c(t) and the output trajectory of phosphorylated CheY, yp(t)y_p(t), and where each trajectory is of duration TT. With RR-PWS it is possible to compute I(𝐂,𝐘𝐩;τ)I(\mathbf{C}, \mathbf{Y_p}; \tau) for all τT\tau \leq T within a single PWS simulation of duration TT by saving intermediate results after each sampled segment, see Section 3.3. The receptor states are considered hidden internal states, and we use the technique described in Section 2.3 to integrate them out.

Figure 4.3: PWS simulations for the trajectory mutual information of chemotaxis in the shallow gradient regime. a Mutual information I(𝐂T,𝐘T)I(\mathbf{C}_T, \mathbf{Y}_T) between input trajectories c(t)c(t) and output trajectories yp(t)y_p(t) as a function of trajectory duration TT. In each RR-PWS simulation, N=7200N = 7200 Monte Carlo samples were used (M=256M = 256 for the particle filter). b The information transmission rate is defined as I(𝐂T,𝐘T)/TI(\mathbf{C}_T, \mathbf{Y}_T)/T in the limit TT\to\infty.

Figure 4.3a shows the PWS estimate of the information transmission rate for cells swimming in gradients of different steepnesses gg. The information transmission rate is obtained from the PWS estimate of the trajectory mutual information I(𝐂,𝐘𝐩;T)I(\mathbf{C}, \mathbf{Y_p}; T), different trajectory durations TT. As seen in Figure 4.3b, for short trajectories the mutual information increases non-linearly with trajectory duration TT, but in the long-duration limit the slope becomes constant. This asymptotic rate of increase of the mutual information with TT is the desired information transmission rate R(𝐂,𝐘𝐩)R(\mathbf{C}, \mathbf{Y_p}). The precise definition is given by

R(𝐂,𝐘𝐩)=limTI(𝐂,𝐘𝐩;T)T.(4.36)R(\mathbf{C}, \mathbf{Y_p}) = \lim_{T\to\infty} \frac{I(\mathbf{C}, \mathbf{Y_p}; T)}{T} \,. \qquad(4.36)

Figure 4.4: Comparison of theoretical models with experimental data for bacterial chemotaxis system. Panels a and b show the response and noise kernels, respectively, for the model based on literature parameters (green), parameters fitted to experiments (blue), and experiments from  [2] (orange). In panel c, the information transmission rate is shown for each model as a function of gradient steepness, with results from the Gaussian approximation shown alongside exact PWS calculations. The fitted model closely matches the experiments, while the literature-based model over-estimates information transmission rate by a factor of 4\approx 4 despite having a lower response amplitude (panel a). This is because the literature-based model has a large number of independents receptor clusters NcN_c, resulting in much lower noise in the output (panel b). In all cases, the Gaussian approximation matches the exact PWS results, providing support for the accuracy of the measurements of  [2].

We then compared our results for the information transmission rate of the literature-based model to those of Mattingly et al.  [2]. Figure 4.4c shows that the model predictions differ from the experiments by a factor of 4\approx 4. Despite this discrepancy, we believe that the agreement between experiment and theory is, in fact, remarkable, because these predictions were made ab initio: the model was developed based on the existing literature and we did not fit our model to the data of Mattingly et al.  [2]

Yet, the question about the origin of the discrepancy remains. The difference between their measurements and our predictions could be attributed either to the inaccuracy of our model or to the approximation that Mattingly et al.  [2] had to employ to compute the information transmission rate from experimental data. Concerning the latter hypothesis, due to the curse of dimensionality and experimental constraints, Mattingly et al.  [2] could not directly obtain the information transmission rate from measured time traces of the input and output of the system. Instead, they measured three different kernels that describe the system in the linear regime. Specifically, they obtained the response K(t)K(t) of the kinase activity to a step-change in input signal, the autocorrelation function of the input signal V(t)V(t), and the autocorrelation N(t)N(t) of the kinase activity in a constant background concentration. Then they used a Gaussian model to compute the information transmission rate from these measured functions K(t)K(t), V(t)V(t), and N(t)N(t)  [2,14] (see also Section 4.4). This Gaussian model is based on a linear noise assumption and cannot perfectly capture the true non-linear dynamics of the biochemical network. This could be the cause for the observed discrepancies in the information rate. We have indeed already seen in Chapter 3 that there can be substantial differences between exact computations and the Gaussian approximation for the trajectory mutual information.

4.5.2 Revising the literature-based model

To uncover the reason for the discrepancy we first tested whether our literature-based model reproduces the experimentally measured kernels. If the kernels do not match, then, clearly, the discrepancy in the information rate may be caused by the difference between our model and the experimental system, as opposed to the inaccuracy of the Gaussian framework. Our input correlation function, V(t)V(t), is, by construction, the same as that of Mattingly et al.  [2]. The simulation protocol we used for measuring the other kernels was directly modeled after the experimental protocol  [2].

We find that the response kernel K(t)K(t) and the autocorrelation function of the noise N(t)N(t) of our system are different. Figure 4.4a, b shows that our model reproduces the timescales of N(t)N(t) and K(t)K(t) as measured experimentally. This is perhaps not surprising, because the decay of both N(t)N(t) and K(t)K(t) is set by the (de)methylation rate, which has been well-characterized experimentally. Yet, the figure also shows that our model significantly underestimates the amplitudes of both N(t)N(t) and K(t)K(t).

This raises the question of whether other parameter values would allow our model to better reproduce the measured kernels K(t)K(t) and N(t)N(t), and, secondly, whether this would resolve the discrepancy in information rate between our simulations and the experiments.

The amplitude σN2\sigma^2_N of the output noise correlation function N(t)N(t) is bounded by the number of receptor clusters NcN_{\mathrm c}. In particular, the variance of the receptor activity is σN2=σa2/Nc1/4Nc\sigma^2_N = \sigma^2_a / N_{\mathrm c} \leq 1 / 4 N_{\mathrm c}, where σa21/4\sigma^2_a\leq 1/4 is the variance of the activity of a single receptor cluster. Comparing this bound to the measured receptor noise strength σN2\sigma^2_N reveals that NcN_{\mathrm c} needs to be much smaller than our original model assumes: the number of clusters needs to be as small as Nc10N_{\mathrm c} \lesssim 10. Indeed, Figure 4.4b shows that with Nc=9N_{\mathrm c}=9, our model quantitatively fits the correlation function N(t)N(t) of the receptor activity in a constant background concentration, as measured experimentally  [2].

The amplitude of K(t)K(t), i.e. the gain, depends on the ratio KDA/KDIK_{\mathrm D}^{\mathrm A}/ K_{\mathrm D}^{\mathrm I} of the dissociation constants of the receptor for ligand binding in its active or inactive state, respectively, as well as on the number of receptors per cluster, NN. Both dissociation constants have been well characterized experimentally  [7,15], but the number of receptors per cluster has only been inferred indirectly from experiments  [7,8]. The higher gain as measured experimentally by Mattingly et al.  [2] indicates that NN is larger than assumed in our model: with N=15N=15 our model can quantitatively fit K(t)K(t) (Figure 4.4a).

We thus find that by reducing the number of clusters from Nc=400N_{\mathrm c} = 400 to Nc=9N_{\mathrm c}=9 while simultaneously increasing their size from N=6N=6 to N=15N=15, our model is able to quantitatively fit both N(t)N(t) and K(t)K(t)  [2], see Figure 4.4 and Figure 4.5 for their Fourier representations. This suggests that the number of independent receptor clusters is smaller than hitherto believed, while their size is larger.

Figure 4.5: Fourier representation of the kernels for computing the information transmission rate in the Gaussian approximation, the velocity power spectrum V(ω)V(\omega) [(mms1)2][(\si{\milli\meter\text{s}^{-1}})^2], the squared frequency response |K(ω)|2|K(\omega)|^2, and the noise power spectrum N(ω)N(\omega). The top-left panel shows the individual Fourier kernels as a function of frequency ω\omega for the different models. On the top-right the normalized kernels are shown with linear axis scales. In the bottom panels the integrand for computing the mutual information rate in the Gaussian approximation is shown. In the bottom right, the area under the curves represents the proportionality between the squared gradient steepness g2g^2 and the information rate (units  s mm−−2). In the bottom left plot, the integrand is multiplied by ω\omega, so that with log scaling of the axes the area under the curve is equal to the integral.

4.5.3 Comparing the chemotaxis information rate of the models against experiments

Figure 4.6: The information rate as a function of the number of receptor clusters NcN_c. The cluster size is fixed at N=15N=15. The left panel shows the increase of information rate as a function of gradient steepness for different values of NcN_c, including a line for the experimental data from  [2]. The right panel shows the same data but highlights the increase of the information rate and when increasing the number of receptor clusters. A quadratic fit (shown as dotted lines) is used to extrapolate the information rate. All results were obtained using RR-PWS.

How accurately can our revised model reproduce the measured information rate, and how accurate is the Gaussian framework for the experimental system in the regime studied by Mattingly et al.  [2]? In the revised model, called the “fitted model”, with Nc=9N_{\mathrm c}=9 and N=15N=15, all key quantities for computing the information transmission rate within the Gaussian framework, V(t)V(t), N(t)N(t) and K(t)K(t), are nearly identical to the experiments of Mattingly et al.  [2], see Figure 4.4. Within the Gaussian framework (see Section 4.4), the information transmission rate in our model is thus expected to be very similar to the experimentally measured one, and Figure 4.4c shows that this is indeed the case. To quantify the accuracy of the Gaussian framework, we then recomputed the information transmission rate for the revised model, using exact PWS. We found that the result matches the Gaussian prediction very well. For these shallow and static chemical gradients, the Gaussian model is thus highly accurate. Our analysis validates a posteriori the Gaussian framework adopted by Mattingly et al.  [2].

4.6 Discussion

The application of PWS to the bacterial chemotaxis system shows how crucial it is to have a simulation technique that is exact. Without the latter it would be impossible to determine whether the difference between our predictions and the Mattingly data  [2] is due to the inaccuracy of the model, the inaccuracy of the numerical technique to simulate the model, or the approximations used by Mattingly and coworkers in analyzing the data. In contrast, because PWS is exact, we knew the difference between theory and experiment is either due to the inaccuracy of the model or the approximations used to analyze the data. By then employing the same Gaussian framework to analyze the behavior of the model and the experimental system, we were able to establish that the difference is due to the inaccuracy of our original model.

Our analysis indicates that the size of the receptor clusters in the E. coli chemotaxis system, N15N\approx 15, is larger than that based on previous estimates, N6N\sim 6  [7,8,16]. The early estimates of the cluster size were based on bulk dose-response measurements with a relatively slow ligand exchange, yielding N6N \approx 6  [7,16]. More recent dose-response measurements, at the single cell level and with faster ligand exchange, yield an average that is higher, N8\langle N\rangle\approx 8, and with a broad distribution around it, arising from cell-to-cell variability  [8]. Our estimate, N15N\approx 15, based on fitting the response kernel K(t)K(t) to that measured by Mattingly et al.  [2], therefore appears reasonable. At the same time, the number of clusters, obtained by fitting the noise correlation function N(t)N(t) to the data of Mattingly et al.  [2] is surprisingly low, Nc10N_c \sim 10, given the total number of receptors, Nr103104N_r \sim 10^3\text{--}10^4  [9]. Interestingly, recent experiments indicate that the receptor array is poised near the critical point  [17], where receptor switching becomes correlated over large distances. This effectively partitions receptors into a few large domains, which may explain our fitted values for NN and NcN_c.

It has been suggested that information processing systems are positioned close to a critical point to maximize information transmission  [18,19], although it has been argued that the sensing error of the E. coli chemotaxis system is minimized for independent receptors  [20]. Mattingly et al.  [2] have demonstrated that the chemotactic drift speed in shallow exponential gradients is limited by the information transmission rate  [2], but whether the system has been optimized for information transmission, and how the latter affects chemotactic performance in other spatio-temporal concentration profiles, remain interesting questions for future work.

References

[1]
M. Reinhardt, G. Tkačik, and P. R. ten Wolde, Path Weight Sampling: Exact Monte Carlo Computation of the Mutual Information between Stochastic Trajectories, Physical Review X 13, 041017 (2023).
[2]
H. H. Mattingly, K. Kamino, B. B. Machta, and T. Emonet, Escherichia coli chemotaxis is information limited, Nature Physics 17, 1426 (2021).
[3]
H. C. Berg and E. M. Purcell, Physics of chemoreception, Biophysical Journal 20, 193 (1977).
[4]
J. Monod, J. Wyman, and J.-P. Changeux, On the nature of allosteric transitions: A plausible model, Journal of Molecular Biology 12, 88 (1965).
[5]
N. Barkai and S. Leibler, Robustness in simple biochemical networks, Nature 387, 913 (1997).
[6]
V. Sourjik and H. C. Berg, Functional interactions between receptors in bacterial chemotaxis, Nature 428, 437 (2004).
[7]
T. S. Shimizu, Y. Tu, and H. C. Berg, A modular gradient‐sensing network for chemotaxis in Escherichia coli revealed by responses to time‐varying stimuli, Molecular Systems Biology 6, 382 (2010).
[8]
K. Kamino, J. M. Keegstra, J. Long, T. Emonet, and T. S. Shimizu, Adaptive tuning of cell sensory diversity without changes in gene expression, Science Advances 6, eabc1087 (2020).
[9]
M. Li and G. L. Hazelbauer, Cellular Stoichiometry of the Components of the Chemotaxis Signaling Complex, Journal of Bacteriology 186, 3687 (2004).
[10]
V. Sourjik and H. C. Berg, Receptor sensitivity in bacterial chemotaxis, Proceedings of the National Academy of Sciences 99, 123 (2002).
[11]
V. Sourjik and H. C. Berg, Binding of the Escherichia coli response regulator CheY to its target measured in vivo by fluorescence resonance energy transfer, Proceedings of the National Academy of Sciences 99, 12669 (2002).
[12]
C. J. Morton-Firth, T. S. Shimizu, and D. Bray, A free-energy-based stochastic simulation of the tar receptor complex, Journal of Molecular Biology 286, 1059 (1999).
[13]
D. R. Ortega, C. Yang, P. Ames, J. Baudry, J. S. Parkinson, and I. B. Zhulin, A phenylalanine rotameric switch for signal-state control in bacterial chemoreceptors, Nature Communications 4, 2881 (2013).
[14]
F. Tostevin and P. R. ten Wolde, Mutual Information between Input and Output Trajectories of Biochemical Networks, Physical Review Letters 102, 218101 (2009).
[15]
M. D. Lazova, T. Ahmed, D. Bellomo, R. Stocker, and T. S. Shimizu, Response rescaling in bacterial chemotaxis, Proceedings of the National Academy of Sciences 108, 13870 (2011).
[16]
[17]
J. M. Keegstra, F. Avgidis, Y. Mulla, J. S. Parkinson, and T. S. Shimizu, Near-critical tuning of cooperativity revealed by spontaneous switching in a protein signalling array, bioRxiv 2022.12.04.518992 (2023).
[18]
G. Tkačik, T. Mora, O. Marre, D. Amodei, S. E. Palmer, M. J. Berry, and W. Bialek, Thermodynamics and signatures of criticality in a network of neurons, Proceedings of the National Academy of Sciences 112, 11508 (2015).
[19]
M. Meijers, S. Ito, and P. R. ten Wolde, Behavior of information flow near criticality, Physical Review E 103, L010102 (2021).
[20]
M. Skoge, Y. Meir, and N. S. Wingreen, Dynamics of Cooperativity in Chemical Sensing among Cell-Surface Receptors, Physical Review Letters 107, 178101 (2011).