Pointwise ergodic theorems for nonconventional bilinear polynomial averages along prime orbits

Background

Goldbach conjecture

In 1742, Goldbach wrote a letter to Euler proposing the following conjecture:
that every number which is composed of two primes is an aggregate of as many primes as one wishes (unity being counted among them), down to the collection of all unities.
In modern mathematical language, he is asking whether
Every even integer greater than 4 can be written as a sum of two odd primes.
This problem is still open despite much effort and partial progress. The first notable breakthrough was due to Vinogradov in 1937, who proved the so-called ternary Goldbach conjecture for large integers, namely that every sufficiently large odd integer can be written as a sum of three primes.
His method had two further consequences that interest me.

• Binary Goldbach: his method shows that almost every even integer, in the sense of density, can be written as a sum of two odd primes.
• Ergodic theorem along primes: he implicitly proved mean convergence of ergodic averages along the primes.
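As a purely illustrative aside, the binary statement can be checked by brute force for small even integers. The sketch below is not part of Vinogradov's method; the function names and the search limit are hypothetical choices made for this example.

```python
def prime_sieve(limit):
    """Return a list is_prime[0..limit] built with the sieve of Eratosthenes (limit >= 2)."""
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    for p in range(2, int(limit ** 0.5) + 1):
        if is_prime[p]:
            for m in range(p * p, limit + 1, p):
                is_prime[m] = False
    return is_prime


def goldbach_exceptions(limit):
    """Even n in [6, limit] that are NOT a sum of two odd primes."""
    is_prime = prime_sieve(limit)
    odd_primes = [p for p in range(3, limit + 1) if is_prime[p]]
    exceptions = []
    for n in range(6, limit + 1, 2):
        # n - p is odd and at least n/2 >= 3, so a hit is automatically an odd prime.
        if not any(is_prime[n - p] for p in odd_primes if p <= n // 2):
            exceptions.append(n)
    return exceptions


if __name__ == "__main__":
    print(goldbach_exceptions(2000))
```

A finite search of this kind says nothing about the conjecture itself; it only makes the statement concrete.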

Although the binary Goldbach conjecture still seems to be far out of reach, even with current technology, there have been many important advances, two of which motivated my research in prime number theory.
• Representing almost every even integer as a sum of two odd primes from a restricted (proper) subset of the primes; this line of research is closely related to Green-Tao pseudorandomness theory, see for example (Shao, 2014b).
• Obtaining Goldbach-type theorems for sums of, e.g., primes and semi-primes, or sums of several squares of primes; the most famous theorem in this direction is due to Chen (1973). This line of research is closely related to the Gauss circle problem and the Waring problem.
Although the statement of Goldbach’s conjecture is very accessible, progress on this line of inquiry has had far-reaching consequences in additive combinatorics and combinatorial number theory, encompassing the work of Green-Tao-Ziegler on detecting polynomial systems of equations inside the primes, most notably the Green-Tao theorem, Tao’s proof of the logarithmic Chowla conjecture, and Yitang Zhang’s prime gaps theorem.
These results, in particular, have shaped my research programme in prime number theory.

Pointwise ergodic theorem

The study of pointwise convergence of ergodic averages dates back to Birkhoff (1931), whose theorem says that on any probability space (X, μ) equipped with a measure-preserving, ‘sufficiently randomising’ transformation, T : X → X,
\[ \frac{1}{N} \sum_{n=1}^{N} f(T^n x) \to \int_X f \, d\mu \tag{1} \]

almost surely, whenever f ∈ L∞(X) is (say) bounded; informally: the ‘time averages’ of f converge to its ‘spatial average.’ This theorem was proven essentially concurrently with von Neumann’s mean ergodic theorem (von Neumann, 1932), which established norm convergence of (1).
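To make the ‘time average versus spatial average’ statement concrete, here is a minimal numerical sketch. It assumes a specific system chosen only for illustration: the circle rotation T(x) = x + α mod 1 with α = √2 − 1 and the test function f(x) = cos²(2πx), whose integral over [0, 1) is 1/2. None of these choices come from the text above.

```python
import math

ALPHA = math.sqrt(2) - 1  # illustrative irrational rotation angle (an assumption)


def rotation(x):
    """Circle rotation T(x) = x + ALPHA mod 1; it preserves Lebesgue measure."""
    return (x + ALPHA) % 1.0


def observable(x):
    """Bounded test function f(x) = cos^2(2*pi*x); its integral over [0, 1) is 1/2."""
    return math.cos(2 * math.pi * x) ** 2


def birkhoff_average(f, T, x, N):
    """Time average (1/N) * sum_{n=1}^{N} f(T^n x)."""
    total = 0.0
    for _ in range(N):
        x = T(x)          # advance the orbit by one step
        total += f(x)
    return total / N


if __name__ == "__main__":
    for N in (10, 100, 1000, 10000):
        print(N, birkhoff_average(observable, rotation, 0.1, N))
```

For this particular system the averages approach 1/2 at every starting point; Birkhoff's theorem gives the almost-sure statement for general ergodic systems.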
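A natural question, posed in the 1970s by Furstenberg and Bellow, is the existence of subsets E ⊂ ℕ with zero upper Banach density,
\[ \limsup_{N \to \infty} \sup_{M \ge 0} \frac{|E \cap \{M+1, \dots, M+N\}|}{N} = 0, \]
so that
\[ \frac{1}{|E \cap [N]|} \sum_{n \in E \cap [N]} f(T^n x) \tag{2} \]

converges almost everywhere, initially for bounded functions f on a probability space; here and throughout, we use [N] := {1, . . . , N}.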
A natural candidate, first explored by Bourgain in the late eighties, is the case where E = ℙ is the set of primes; he proved the following theorem.
Theorem 1 (Bourgain’s prime ergodic theorem). In the setting of Birkhoff’s theorem, suppose additionally that all powers of T, {T^2, T^3, . . . }, are sufficiently randomising. Then
\[ \frac{1}{|\mathbb{P} \cap [N]|} \sum_{p \in \mathbb{P} \cap [N]} f(T^p x) \to \int_X f \, d\mu \tag{3} \]

almost surely, whenever f ∈ L∞(X).
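The averages along the primes can also be explored numerically. The sketch below is an illustration only: it assumes the rotation T(x) = x + α mod 1 with α = √2, so that T^p x = x + pα mod 1 can be evaluated directly, and the same test function cos²(2πx) with spatial average 1/2. The helper names are hypothetical.

```python
import math


def primes_up_to(N):
    """List of primes p <= N (N >= 1), via the sieve of Eratosthenes."""
    sieve = bytearray([1]) * (N + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, math.isqrt(N) + 1):
        if sieve[p]:
            # Clear every multiple of p starting from p*p.
            sieve[p * p :: p] = bytearray(len(range(p * p, N + 1, p)))
    return [n for n in range(N + 1) if sieve[n]]


def prime_average(f, alpha, x, N):
    """Average of f(T^p x) over primes p <= N, for the rotation T(x) = x + alpha mod 1."""
    primes = primes_up_to(N)
    return sum(f((x + p * alpha) % 1.0) for p in primes) / len(primes)


def observable(x):
    """Bounded test function with integral 1/2 over [0, 1)."""
    return math.cos(2 * math.pi * x) ** 2


if __name__ == "__main__":
    for N in (10**3, 10**4, 10**5):
        print(N, prime_average(observable, math.sqrt(2), 0.1, N))
```

For rotations the convergence reflects the classical equidistribution of pα mod 1; the content of the theorem is that an analogous statement holds almost surely in general systems.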
Despite the simplicity of this statement, extensions of (3) occupied the ergodic-theoretic community until the last decade, as Bourgain did not fully resolve this line of inquiry; see Wierdl (1988), Mirek and Trojan (2015), and Mirek, Trojan and Zorin-Kranich (2017). In the meantime, Bourgain proved his famous polynomial ergodic theorems (Bourgain, 1988a, 1988b, 1989), which established pointwise convergence of the corresponding ergodic averages when ℙ is replaced by a polynomial orbit, {P(1), P(2), . . . }, for integer-valued P.
Motivated by Furstenberg’s (1977) celebrated ergodic-theoretic proof of Szemerédi’s theorem, Bourgain shortly thereafter proved that the bilinear averages
\[ \frac{1}{N} \sum_{n \in [N]} f_1(T^{an} x) \, f_2(T^{bn} x), \qquad a, b \in \mathbb{Z}, \tag{4} \]

converge almost surely whenever f1, f2 ∈ L∞(X) are bounded (Bourgain, 1990), and this remained essentially the state of the art in pointwise ergodic theory for more than 30 years; crucial to this argument, and more generally to Furstenberg’s, was the fact that the orbits in question were all linear.
The first breakthrough in the theory of multiple ergodic averages along non-linear polynomial orbits was due to Furstenberg and Weiss (1996), who established norm convergence for the bilinear averages
\[ \frac{1}{N} \sum_{n \in [N]} f_1(T^n x) \, f_2(T^{n^2} x); \tag{5} \]

this result admitted profound extensions, both dynamically and in combinatorial Ramsey theory, ultimately culminating in Miguel Walsh’s (2012) ergodic theorem, an optimal result in the category of norm convergence of multiple ergodic averages along polynomial orbits.
The pointwise counterpart of Walsh’s ergodic theorem remains a conjecture, due to Furstenberg, Bergelson and Leibman, and posed as Question 9 in Bergelson’s (1996) survey on ergodic Ramsey theory:
Conjecture 1 (Furstenberg-Bergelson-Leibman conjecture, commutative case). For any polynomials P1, . . . , Pm with integer coefficients, commuting measure-preserving transformations T1, . . . , Tm : X → X, and bounded functions f1, . . . , fm ∈ L∞(X, μ), the ergodic averages
\[ \frac{1}{N} \sum_{n \in [N]} f_1(T_1^{P_1(n)} x) \cdots f_m(T_m^{P_m(n)} x) \]
converge μ-a.e.
This conjecture remains the ‘holy grail’ of pointwise ergodic theory. To give a sense of the difficulty, even the simplest non-linear bilinear case, involving a single measure-preserving transformation, namely the pointwise convergence of (5), resisted significant efforts from Bourgain and others (Assani, 1998, 2005, 2010; Berend, 1985, 1988; Derrien and Lesigne, 1996; Donoso and Sun, 2018a, 2018b; Donoso, Koutsogiannis and Sun, 2020; Huang, Shao and Ye, 2019; Leibman, 2005; Lesigne, 1993; Lesigne, Rittaud and de la Rue, 2003). Indeed, pointwise convergence was not established until 2022, when Ben Krause (University of Bristol), Mariusz Mirek (Rutgers University), and Terence Tao (UCLA) proved the first joint extension of Bourgain’s polynomial ergodic theorem (Bourgain, 1988a, 1988b, 1989) and bilinear ergodic theorem (Bourgain, 1990); see (Krause, Mirek and Tao, 2022).

Current programme

Currently, my work on Goldbach and my study of pointwise ergodic theory have intertwined to produce my strongest result, jointly with Professor Krause, Professor Tao, and Dr Joni Teräväinen (University of Cambridge). What follows is a brief summary of the relevant work in this direction, followed by an overview of KMTT (Krause-Mousavi-Tao-Teräväinen).

Goldbach conjecture

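The departure point for my investigation of the density version of the Goldbach phenomenon was the work of Shao (2014a, 2014b); below, for a set of primes P ⊂ ℙ, we use
\[ d_*(P) := \liminf_{N \to \infty} \frac{|P \cap [N]|}{|\mathbb{P} \cap [N]|} \]
to denote the lower density of P relative to the primes.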
Theorem 2. Let P ⊂ ℙ with d_*(P) > 5/8. Then every sufficiently large odd integer can be written as p_1 + p_2 + p_3 with p_i ∈ P. Moreover, the constant 5/8 is sharp.
More recently he showed that there is no ‘reasonable’ density version of the binary Goldbach theorem.
Theorem 3. For any ϵ > 0 there exists a subset A ⊆ ℙ with d_*(A) > 1 − ϵ, such that a positive proportion of the even positive integers, depending on ϵ, cannot be written as a sum of two primes in A.
In-progress work with Michael Lacey (Georgia Tech), Yaghoub Rahimi (Georgia Tech), and Naga Manasa Venpati (LSU) concerns the behaviour of sum-sets of fairly dense subsets of primes.
Theorem 4 (Special Case). Let P ⊂ ℙ be a subset of primes with d_*(P) > 0. Then P is an additive basis for ℕ; namely, every sufficiently large integer n lies in the s-fold sum-set P + · · · + P in the following regimes:
• 3-fold sum-set: if d_*(P) > 1/2, P is equidistributed mod 100!, and n is odd, then n ∈ P + P + P.
• 4-fold sum-set: if d_*(P) > 1/2 and n is even, then n ∈ P + P + P + P.
• s-fold sum-set: if d_*(P) > <Insert equation A> and P is equidistributed mod s!, then n lies in the s-fold sum-set P + · · · + P, provided that s ≥ 5 and that n and s share the same parity.
By combining the techniques behind the results above with discrete harmonic-analytic compactness arguments, I have established the following Goldbach-type result, which will appear in Winter 2025; it can be viewed as a natural follow-up to the works of Matomäki (2008) and Teräväinen (2018).
Theorem 5. Almost every large integer of the form 24k + 12 can be written as a² + b² + p² + 1, where p ∈ ℙ, a² + b² + 1 ∈ ℙ, and (a, b) = 1.
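The shape of the representation in Theorem 5 can be illustrated by a brute-force search for small n. This is a sketch under assumptions that are mine rather than the theorem's: the exponents are read as squares, a and b are taken to be positive, and the function names are hypothetical. It has no bearing on the proof.

```python
from math import gcd, isqrt


def is_prime(n):
    """Trial-division primality test, adequate for small n."""
    if n < 2:
        return False
    if n % 2 == 0:
        return n == 2
    d = 3
    while d * d <= n:
        if n % d == 0:
            return False
        d += 2
    return True


def find_representation(n):
    """Return (a, b, p) with n = a^2 + b^2 + p^2 + 1, p prime, a^2 + b^2 + 1 prime,
    gcd(a, b) = 1 and 1 <= a <= b, or None if the search finds nothing."""
    p = 2
    while p * p + 3 <= n:              # leaves room for a^2 + b^2 >= 2
        if is_prime(p):
            m = n - p * p - 1          # required value of a^2 + b^2
            if is_prime(m + 1):
                for a in range(1, isqrt(m // 2) + 1):
                    b = isqrt(m - a * a)
                    if b * b == m - a * a and gcd(a, b) == 1:
                        return a, b, p
        p += 1
    return None


if __name__ == "__main__":
    for k in range(10):
        n = 24 * k + 12
        print(n, find_representation(n))
```

For example, 12 = 1² + 1² + 3² + 1 with 1² + 1² + 1 = 3 prime, and 36 = 1² + 3² + 5² + 1 with 1² + 3² + 1 = 11 prime.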

Pointwise ergodic theory along the primes: combinatorial consequences

I currently hold the sharpest results on pointwise convergence of ergodic averages along prime orbits (Giannitsi et al., 2022; Lacey, Mousavi and Rahimi, 2022); the following two quantitative visibility results derive from this work and have proven instrumental as I prepare to consider more exotic, sparser averages along the primes. The upshot of this work is as follows: ‘small’ subsets of intervals are generically ‘invisible’ along prime orbits.
Proposition 1 (Structure theorem I). Suppose that F ⊂ I ⊂ ℤ is a subset of an interval, I, with relative density δ ≥ C_ϵ · |I|^{−ϵ}, |F| = δ|I|. Then there exists a decomposition I = I1 ∪ I2, where I1, I2 depend only on F, so that
• F is barely visible along prime orbits:

and
• I2 is small:

This result is nearly optimal, in that it is known to fail without the logarithmic weighting (see LaVictoire, 2011); conditional on the generalised Riemann hypothesis, we reduced the power of 2 to a single power of the logarithm, which is conjecturally sharp.
If we are willing to replace the logarithmic correction with a small power loss, we can in fact introduce a large degree of arithmetic uniformity, which has proved important in work on Theorem 8, and will likely impact joint extensions of Bourgain and KMTT, namely Conjecture 2.
Proposition 2 (Structure theorem II, (Giannitsi et al., 2022)). Let ϵ > 0 be arbitrarily small, and R ≥ 1 be arbitrary. Then whenever I is an interval of length |I| ≥ 2^{C_ϵ R^ϵ}, and F ⊂ I is a subset of relative density δ ≥ C_ϵ |I|^{−ϵ}, one may decompose I = I1 ∪ I2, so that
• F is barely visible along prime orbits in many arithmetic progressions: if we let Prob denote the uniform measure on arithmetic progressions of gap size ≤ R inside of I, then

where
ℙ_{b,y;N} := {p ≤ N prime : p ≡ b mod y};
• I2 is small: |I2| ≤ C_ϵ · δ^ϵ · |I|.
In other words, membership in a shift of F and membership in the primes of a given arithmetic progression are usually ‘almost’ independent events.
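For readers who prefer code to notation, the set of primes p ≤ N in the residue class b mod y that appears in Proposition 2 can be enumerated as follows. This is a small illustrative sketch with hypothetical function names; it plays no role in the proofs.

```python
def primes_up_to(N):
    """List of primes p <= N via the sieve of Eratosthenes."""
    if N < 2:
        return []
    is_prime = [True] * (N + 1)
    is_prime[0] = is_prime[1] = False
    for p in range(2, int(N ** 0.5) + 1):
        if is_prime[p]:
            for m in range(p * p, N + 1, p):
                is_prime[m] = False
    return [n for n in range(N + 1) if is_prime[n]]


def primes_in_progression(b, y, N):
    """The set {p <= N prime : p = b mod y}, returned as a sorted list."""
    return [p for p in primes_up_to(N) if p % y == b % y]


if __name__ == "__main__":
    # Primes up to 10^5 split almost evenly between the classes 1 and 3 mod 4.
    print(len(primes_in_progression(1, 4, 10**5)),
          len(primes_in_progression(3, 4, 10**5)))
```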
These two families of results left me well-situated for KMTT, which I will now describe.

Bilinear pointwise ergodic theory along the primes

In 2022, breakthrough work of Krause-Mirek-Tao established the following special case of the Furstenberg-Bergelson-Leibman conjecture.
Theorem 6 (Krause-Mirek-Tao, special case). Suppose that f1, f2 ∈ L∞(X, μ) are bounded, and that P ∈ ℤ[·] has degree ≥ 2. Then provided that {T, T^2, . . . } are ‘sufficiently randomising,’
\[ \frac{1}{N} \sum_{n \in [N]} f_1(T^n x) \, f_2(T^{P(n)} x) \]
converges almost surely to the product of expectations; μ-a.e. convergence is always guaranteed provided T is measure-preserving.
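A toy computation may help fix ideas about these bilinear averages. The sketch below assumes a very special system, the circle rotation T(x) = x + α mod 1 with α = √2, and takes f1 = f2 to be the character e(x) = exp(2πix), whose integral is 0. For this choice the average reduces to a quadratic Weyl sum, so it tends to 0, the product of the two integrals. All names and parameter choices are illustrative assumptions; the example does not reflect the general argument.

```python
import cmath
import math


def e(t):
    """The character e(t) = exp(2*pi*i*t) on the circle; its integral over [0, 1) is 0."""
    return cmath.exp(2j * math.pi * t)


def bilinear_average(f1, f2, alpha, x, N, P=lambda n: n * n):
    """(1/N) * sum_{n=1}^{N} f1(T^n x) * f2(T^{P(n)} x) for T(x) = x + alpha mod 1."""
    total = 0j
    for n in range(1, N + 1):
        # For a rotation, T^k x = x + k*alpha mod 1, so both orbits are explicit.
        total += f1((x + n * alpha) % 1.0) * f2((x + P(n) * alpha) % 1.0)
    return total / N


if __name__ == "__main__":
    for N in (100, 1000, 10000):
        print(N, abs(bilinear_average(e, e, math.sqrt(2), 0.3, N)))
```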
Although this problem presents dynamically, the arguments live at the interface of additive combinatorics and adelic harmonic analysis; the delicacy of these arguments has proven sufficiently restrictive that, prior to KMTT, no genuine extensions of Theorem 6 were known.
On the other hand, Teräväinen (2024) developed an additive-combinatorial method to address the Furstenberg-Bergelson-Leibman averages weighted by the Möbius function.
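Theorem 7. Let (X, ν, T) be a measure-preserving system, let μ be the Möbius function, and let P1, P2, . . . , Pk be polynomials with integer coefficients. Let f1, f2, . . . , fk ∈ L∞(X). Then
\[ \lim_{N \to \infty} \frac{1}{N} \sum_{n \in [N]} \mu(n) \, f_1(T^{P_1(n)} x) \cdots f_k(T^{P_k(n)} x) = 0 \tag{6} \]

ν-almost everywhere.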
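While this result follows directly from the later work of Leng, the significance of Teräväinen’s work is that it provided a general mechanism to address multiple ergodic averages weighted by ‘pseudo-random’ functions, e.g. the von Mangoldt function,
\[ \Lambda(n) := \begin{cases} \log p & \text{if } n = p^k \text{ for some prime } p \text{ and integer } k \ge 1, \\ 0 & \text{otherwise,} \end{cases} \]
which behaves like a (weighted) indicator of the set of primes,
\[ \Lambda(n) \approx \log n \cdot 1_{\mathbb{P}}(n). \]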
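A short sketch of the von Mangoldt function may be useful here. It uses plain trial division, and the function names are hypothetical. The final loop compares the Chebyshev sum ψ(N) = Σ_{n ≤ N} Λ(n) with N, which is one way of seeing that Λ has average value 1, in line with the prime number theorem.

```python
import math


def von_mangoldt(n):
    """Lambda(n) = log p if n = p^k for a prime p and k >= 1, and 0 otherwise."""
    if n < 2:
        return 0.0
    p = 2
    while p * p <= n:
        if n % p == 0:
            # p is the smallest prime factor; n is a prime power iff nothing else remains.
            while n % p == 0:
                n //= p
            return math.log(p) if n == 1 else 0.0
        p += 1
    return math.log(n)  # no factor up to sqrt(n), so n itself is prime


def chebyshev_psi(N):
    """psi(N) = sum of Lambda(n) over 1 <= n <= N."""
    return sum(von_mangoldt(n) for n in range(1, N + 1))


if __name__ == "__main__":
    for N in (100, 1000, 10000):
        print(N, chebyshev_psi(N) / N)
```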

These new techniques, combined with methods of KMT and appropriate additive combinatorial approximations, partially developed in the course of work on Goldbach, have allowed us to establish my strongest result thus far.
Theorem 8 (special case). Suppose that f1, f2 ∈ L∞(X, μ) are bounded, and that P ∈ ℤ[·] has degree ≥ 2. Then provided that {T, T^2, . . . } are ‘sufficiently randomising,’
\[ \frac{1}{|\mathbb{P} \cap [N]|} \sum_{p \in \mathbb{P} \cap [N]} f_1(T^p x) \, f_2(T^{P(p)} x) \]
converges almost surely to the product of expectations; μ-a.e. convergence is always guaranteed, provided T is measure-preserving.
Compared to KMT, establishing Theorem 8 required that we synthesise existing Ramsey-theoretic compactness arguments with the modern theory of pseudo-random approximation; in particular, sieve-theoretic techniques were imported to the field for the first time. Moreover, arithmetic considerations unique to the set of primes interposed in our adelic harmonic analysis, which required novel combinatorial arguments.

Future plans

In light of Bourgain’s convergence result concerning (4), and the recent work of Matomäki, Shao, Tao and Teräväinen on ‘short intervals’ (Matomäki et al., 2023), KMTT admits two natural follow-ups, which I expect to pursue over the remainder of my fellowship.
Conjecture 2 Suppose that f1, f2 ∈ L∞(X, μ) are bounded. Then, provided that T is weakly mixing—very randomising—

converges almost surely to the product of expectations; μ-a.e. convergence is always guaranteed, provided T is measure-preserving.
And
Conjecture 3 Suppose that f1, f2 ∈ L∞(X, μ) are bounded, let P ∈ ℤ[·] be a polynomial with integer coefficients, and let ϵ > 0 be very small, possibly depending on P. Then, provided that T is weakly mixing

converges almost surely to the product of expectations along sparse sequences; μ-a.e. convergence is always guaranteed, provided T is measure-preserving.
In the previous conjecture, the restriction to the sparse set of times is in fact essentially necessary.

REFERENCES

Assani, I. (1998) ‘Multiple recurrence and almost sure convergence for weakly mixing dynamical systems’, Israel Journal of Mathematics, 103, pp. 111–124. doi: 10.1007/BF02762270.
Assani, I. (2005) ‘Pointwise convergence of nonconventional averages’, Colloquium Mathematicae, 102(2), pp. 245–262.
Assani, I. (2010) ‘Pointwise convergence of ergodic averages along cubes’, Journal d’Analyse Mathématique, 110, pp. 241–269. doi: 10.1007/s11854-010-0006-3.
Berend, D. (1985) ‘Joint ergodicity and mixing’, Journal d’Analyse Mathématique, 45, pp. 255–284. doi: 10.1007/BF02792552.
Berend, D. (1988) ‘Multiple ergodic theorems’, Journal d’Analyse Mathématique, 50, pp. 123–142. doi: 10.1007/BF02796117.
Bergelson, V. (1996) ‘Ergodic Ramsey theory – an update’, in Ergodic Theory of ℤ^d-actions. Warwick, 1993–1994. London Mathematical Society Lecture Note Series, 228. Cambridge: Cambridge University Press, pp. 1–61.
Birkhoff, G. (1931) ‘Proof of the ergodic theorem’, Proceedings of the National Academy of Sciences of the United States of America, 17(12), pp. 656–660. doi: 10.1073/pnas.17.12.656.
Bourgain, J. (1988a) ‘On the maximal ergodic theorem for certain subsets of the positive integers’, Israel Journal of Mathematics, 61(1), pp. 39–72. doi: 10.1007/BF02776301.
Bourgain, J. (1988b) ‘On the pointwise ergodic theorem on Lp for arithmetic sets’, Israel Journal of Mathematics, 61(1), pp. 73–84.
Bourgain, J. (1989) ‘Pointwise ergodic theorems for arithmetic sets’, Publications Mathématiques de l’IHÉS, 69, pp. 5–45.
Chen, J.R. (1973) ‘On the representation of a larger even integer as the sum of a prime and the product of at most two primes’, Scientia Sinica, 16, pp. 157–176.
Derrien, J.-M. and Lesigne, E. (1996) ‘Un théorème ergodique polynomial ponctuel pour les endomorphismes exacts et les K-systèmes’, Annales de l’Institut Henri Poincaré, Probabilités et Statistiques, 32(6), pp. 765–778.
Donoso, S. and Sun, W. (2018a) ‘Pointwise convergence of some multiple ergodic averages’, Advances in Mathematics, 330, pp. 946–996. doi: 10.1016/j.aim.2018.03.022.
Donoso, S. and Sun, W. (2018b) ‘Pointwise multiple averages for systems with two commuting transformations’, Ergodic Theory and Dynamical Systems, 38(6), pp. 2132–2157. doi: 10.1017/etds.2016.127.
Donoso, S., Koutsogiannis, A. and Sun, W. (2020) ‘Pointwise multiple averages for sublinear functions’, Ergodic Theory and Dynamical Systems, 40(6), pp. 1594–1618. doi: 10.1017/etds.2018.118.
Furstenberg, H. (1977) ‘Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions’, Journal d’Analyse Mathématique, 31, pp. 204–256. doi: 10.1007/BF02813304.
Furstenberg, H. and Weiss, B. (1996) ‘A mean ergodic theorem for $\frac{1}{N}\sum_{n=1}^{N} f(T^n x) g(T^{n^2} x)$’, in Convergence in Ergodic Theory and Probability. Columbus, OH, 1993. Ohio State University Math Research Institute Publications, 5. Berlin: De Gruyter, pp. 193–227.
Giannitsi, C., Lacey, M.T., Mousavi, H. and Rahimi, Y. (2022) ‘Improving and maximal inequalities for primes in progressions’, Banach Journal of Mathematical Analysis, 16(3), p. 42. doi: 10.1007/s43037-022-00191-9.
Giannitsi, C., Krause, B., Lacey, M., Mousavi, H. and Rahimi, Y. (2023) ‘Averages over the Gaussian Primes: Goldbach’s Conjecture and Improving Estimates’, arXiv preprint, arXiv:2309.14249.
Krause, B., Mirek, M. and Tao, T. (2022) ‘Pointwise ergodic theorems for non-conventional bilinear polynomial averages’, Annals of Mathematics, 195(3), pp. 997–1109. doi: 10.4007/annals.2022.195.3.4.
Huang, W., Shao, S. and Ye, X. (2019) ‘Pointwise convergence of multiple ergodic averages and strictly ergodic models’, Journal d’Analyse Mathématique, 139(1), pp. 265–305. doi: 10.1007/s11854-019-0061-3.
Lacey, M.T., Mousavi, H. and Rahimi, Y. (2022) ‘Endpoint ℓr improving estimates for Prime averages’, Mathematical Research Letters, 29(6), pp. 1767–1791.
LaVictoire, P. (2011) ‘Universally L1-bad arithmetic sequences’, Journal d’Analyse Mathématique, 113(1), pp. 241–263. doi: 10.1007/s11854-011-0006-y.
Leibman, A. (2005) ‘Convergence of multiple ergodic averages along polynomials of several variables’, Israel Journal of Mathematics, 146, pp. 303–315. doi: 10.1007/BF02773538.
Lesigne, E. (1993) ‘Equations fonctionnelles, couplages de produits gauches et théorèmes ergodiques pour mesures diagonales’, Bulletin de la Société Mathématique de France, 121(3), pp. 315–351. doi: 10.24033/bsmf.2211.
Lesigne, E., Rittaud, B. and de la Rue, T. (2003) ‘Weak disjointness of measure preserving dynamical systems’, Ergodic Theory and Dynamical Systems, 23(4), pp. 1173–1198. doi: 10.1017/S0143385702001505.
Mirek, M. and Trojan, B. (2015) ‘Cotlar’s ergodic theorem along the prime numbers’, Journal of Fourier Analysis and Applications, 21(4), pp. 822–848. doi: 10.1007/s00041-015-9388-z.
Mirek, M., Trojan, B. and Zorin-Kranich, P. (2017) ‘Variational estimates for averages and truncated singular integrals along the prime numbers’, Transactions of the American Mathematical Society, 369(8), pp. 5403–5425.
Matomäki, K. (2008) ‘The binary Goldbach problem with one prime of the form p= k2+ l2+ 1’, Journal of Number Theory, 128(5), pp. 1195–1210. doi: 10.1016/j.jnt.2007.01.013.
Matomäki, K., Shao, X., Tao, T. and Teräväinen, J. (2023) ‘Higher uniformity of arithmetic functions in short intervals I. All intervals’, Forum of Mathematics, Pi, 11, e29. doi: 10.1017/fmp.2023.28.
Peluse, S. (2020) ‘Bounds for sets with no polynomial progressions’, Forum of Mathematics, Pi, 8, e16. doi: 10.1017/fmp.2020.11.
Peluse, S. and Prendiville, S. (2022) ‘A Polylogarithmic Bound in the Nonlinear Roth Theorem’, International Mathematics Research Notices, 2022(8), pp. 5658–5684. doi: 10.1093/imrn/rnaa261.
von Neumann, J. (1932) ‘Proof of the Quasi-ergodic Hypothesis’, Proceedings of the National Academy of Sciences of the United States of America, 18(1), pp. 70–82. doi: 10.1073/pnas.18.1.70.
Shao, X. (2014a) ‘An L-function-free proof of Vinogradov’s three prime theorem’, Forum of Mathematics, Sigma, 2, e27. doi: 10.1017/fms.2014.27.
Shao, X. (2014b) ‘A density version of the Vinogradov three primes theorem’, Duke Mathematical Journal, 163(3), pp. 489–512. doi: 10.1215/00127094-2410176.
Teräväinen, J. (2018) ‘The Goldbach problem for primes that are sums of two squares plus one’, Mathematika, 64(1), pp. 20–70. doi: 10.1112/S0025579317000341.
Teräväinen, J. (2024) ‘Pointwise convergence of ergodic averages with Möbius weight’, arXiv preprint, arXiv:2401.03174.
Wierdl, M. (1988) ‘Pointwise ergodic theorem along the prime numbers’, Israel Journal of Mathematics, 64(3), pp. 315–336. doi: 10.1007/BF02882425.
Walsh, M. (2012) ‘Norm convergence of nilpotent ergodic averages’, Annals of Mathematics, 175(3), pp. 1667–1688. doi: 10.4007/annals.2012.175.3.15.

PROJECT SUMMARY
Given an arbitrary measure preserving system we show that the multilinear ergodic averages sampled along an arbitrary number of sequences coming from a Hardy field converge pointwise almost everywhere. We aim to prove this for as wide a class of Hardy field functions as possible. To do so, we establish a long variational inequality along lacunary sequences which implies a maximal inequality, norm convergence, and pointwise convergence.
By a transference argument it suffices to prove this long variational inequality in the case that the measure-preserving system is the integers. This reduction allows us to use tools from discrete harmonic analysis, additive combinatorics, and analytic number theory. We then give applications in areas such as upcrossings, equidistribution, and combinatorics.

PROJECT LEAD
Dr Seyyed Hamed Mousavi received his PhD from Georgia Tech in 2022, under the supervision of Professor E. Croot. Dr Mousavi’s current academic position, held since autumn 2022, is an EPSRC Postdoctoral Fellowship, funded by Professor Krause’s New Investigator Grant on pointwise ergodic theory.
Dr Mousavi’s interests span prime number theory and pointwise ergodic theory. His initial training was in analytic number theory, which he has complemented over the last four years by developing a facility with techniques and ideas from both harmonic analysis and (additive) combinatorics.

PROJECT CONTACT
Dr Seyyed Hamed Mousavi
Senior Research Associate, School of Mathematics
University of Bristol
Email: gj23799@bristol.ac.uk
Web: Seyyed Hamed Mousavi, University of Bristol

FUNDING
This research is funded by a studentship provided by the Heilbronn Institute for Mathematical Research.