The Positive Mass Theorem and the Reduction to Geroch’s Conjecture
We now explain how the Riemannian positive mass theorem is reduced to Corollary 6.3.2. We state the time-symmetric version, which is the one governed by scalar curvature.
The time-symmetric, or Riemannian, positive mass theorem was first proved by Schoen–Yau by the minimal-hypersurface method [SY79b, SY81]. In the smooth dimension-reduction form of their argument, the regularity theory for area-minimizing hypersurfaces gives the theorem for \(3\leq n\leq7\). Witten then gave a spinorial proof for spin asymptotically flat manifolds, without this regularity dimension restriction [Wit81].
The later history is largely about removing the non-spin dimension restriction. Schoen–Yau proposed an all-dimensional singular minimal-slicing approach [SY19], while Lohkamp’s cut-off/compactification observation relates the Riemannian PMT to Geroch torus rigidity; this is the reduction used below. The generic-regularity work culminating in Chodosh–Mantoulidis–Schulze–Wang pushes the minimal-hypersurface/Lohkamp route to dimension \(11\) [CMSW25]. Bi–Hao–He–Shi–Zhu then proved the Riemannian PMT up to dimension \(19\) [BHH+26], and Brendle–Wang subsequently gave a dimension descent scheme which closes the Riemannian theorem in arbitrary dimension [BW26a]. They also derived the spacetime positive energy theorem in arbitrary dimension from the Riemannian theorem and Jang-type arguments [BW26b].
There are also alternative proofs and stability directions which are useful to keep mentally separate from the reduction below. In dimension three, Bray–Kazaras–Khuri–Stern gave a harmonic-function proof of the Riemannian PMT [BKKS22]. Stability versions ask whether small ADM mass forces the geometry to be close to Euclidean space. Lee–Sormani proved pointed intrinsic-flat stability for rotationally symmetric asymptotically flat manifolds [LS14]. Huang–Lee–Sormani proved pointed intrinsic-flat stability for graphical hypersurfaces in Euclidean space under natural technical hypotheses [HLS17]. Dong–Song proved a three-dimensional stability theorem in the general asymptotically flat setting: if the mass of a chosen end tends to zero, then after removing subsets whose boundary areas tend to zero, the remaining spaces converge to Euclidean 3-space in the pointed measured Gromov–Hausdorff topology [DS25]. The proof below records the classical smooth Schoen–Yau/Lohkamp/Geroch mechanism, because that is the version whose topology is most visible from stable minimal hypersurfaces.
Let \(3\leq n\leq7\). Let \((M^n,g)\) be a complete one-ended asymptotically flat Riemannian manifold with nonnegative scalar curvature. Then the ADM mass is nonnegative. If the mass is zero and the usual rigidity hypotheses hold, then \((M,g)\) is isometric to Euclidean space.
For a manifold with several asymptotically flat ends, the same conclusion is applied to one end at a time after the standard reduction which compactifies or fills the other ends without changing the sign of the chosen mass. We focus on the one-ended case because it contains the topological argument.
Recall the ADM mass of an end with asymptotically flat coordinates \(x\):
The nonnegativity part is the one whose topology is most transparent.
Fix an asymptotically flat end and write its coordinate chart as
The symbol \(\delta\) denotes the Euclidean metric in these coordinates. For a tensor \(T\) on the end and a number \(\tau\gt{}0\), we use
and
Thus \(T\in W^{k,p}_{-\tau}\) means that \(T\) decays like \(r^{-\tau}\) and its \(j\)-th coordinate derivatives decay like \(r^{-\tau-j}\) in weighted \(L^p\) sense.
In particular, an AF metric of Sobolev type \((p,q)\) means
together with the integrability condition \(R_g\in L^1\) when the ADM mass is used. The condition \(p\gt{}n\) gives enough Sobolev embedding to read the metric and first derivatives pointwise, while \(q\gt{}\frac{n-2}{2}\) is the decay threshold which makes the ADM mass stable under the density deformation.
If one starts instead from the classical pointwise AF assumption of order \(\tau\),
then the weighted Sobolev hypothesis follows with every smaller decay rate \(q\lt{}\tau\). Indeed, for \(|\alpha|\leq2\),
Thus a \(C^2\) AF end of order \(\tau\gt{}\frac{n-2}{2}\) gives the Sobolev assumption needed below after choosing \(q\) with \(\frac{n-2}{2}\lt{}q\lt{}\tau\). Many analytic statements of the density theorem take this Sobolev condition as the definition of asymptotic flatness.
There are two different ideas which should not be conflated. The original Schoen–Yau proof of the positive mass theorem uses a noncompact minimal hypersurface produced by a limiting Plateau argument. The more topological shortcut described below—cutting off a negative-mass end, compactifying it to a torus, and then contradicting torus rigidity—is usually attributed to Lohkamp’s compactification observation; see [Loh99]. Modern accounts phrase this as follows: using Lohkamp’s idea, one can reduce the Riemannian positive mass theorem to the impossibility of \(R\gt{}0\) on \(\mathbb{T}^n\#X^n\); see the discussion in [LUY24].
Let \((M^n,g)\) be complete, one-ended, and asymptotically flat, with \(R_g\geq0\) and negative ADM mass \(m\lt{}0\). Assume the usual decay for which the ADM mass and the weighted elliptic theory are valid, including \(R_g\in L^1\); for instance one may work with the Sobolev AF condition in Definition 6.4.2. Then, for every sufficiently small \(\varepsilon\gt{}0\), there is a complete asymptotically flat metric \(g_\varepsilon\) with the following properties:
and on some exterior coordinate region \(\{r\geq R_\varepsilon\}\),
Moreover
Thus, if \(\varepsilon\lt{}-m/2\), then \(a_\varepsilon\lt{}0\). In the positive-mass contradiction argument, we may therefore replace the original metric by one which is conformally flat and scalar-flat near infinity, with negative harmonic mass coefficient.
Here the notation in the conclusion is as follows. The function \(u_\varepsilon\) is a positive harmonic function on the exterior Euclidean coordinate region, and \(a_\varepsilon\) is the coefficient of its leading \(r^{2-n}\) term. The notation \(O_\infty(r^\beta)\) is a shorthand for “big-\(O\) with all derivatives”. More precisely, if \(E=O_\infty(r^\beta)\), then for every multi-index \(\alpha\) there is a constant \(C_\alpha\) such that, in the chosen exterior coordinates,
Thus \(O_\infty(r^{1-n})\) means not only that the error itself is \(O(r^{1-n})\), but also that every coordinate derivative has the corresponding differentiated decay; for example
This is stronger than ordinary \(O(r^{1-n})\) notation and is the form needed when differentiating the expansion in the ADM mass or scalar-curvature computations. The equality \(m_{\operatorname{ADM}}(g_\varepsilon)=2a_\varepsilon\) is the standard ADM normalization for a conformally flat end \(u_\varepsilon^{4/(n-2)}\delta\).
Proof. This is the standard density step in the positive mass theorem; see [Bar86, LLU23]. We spell out the mechanism because it is the analytic part which precedes Lohkamp’s compactification.
Choose a large radial cut-off \(\chi_\lambda\) on the asymptotically flat end, with \(\chi_\lambda=1\) for \(r\leq\lambda\) and \(\chi_\lambda=0\) for \(r\geq2\lambda\). Let \(g_{\mathrm{Euc}}=\delta\) be the Euclidean metric in the AF coordinates and set
on the end, extending it by \(g\) on the compact part. Thus \(g_\lambda=g\) on a large compact set and \(g_\lambda=\delta\) for \(r\geq2\lambda\). The only scalar curvature error is in the annulus \(\{\lambda\leq r\leq2\lambda\}\). Define
The basic estimate is
for every \(q'\lt{}q\) with \(\frac{n-2}{2}\lt{}q'\lt{}n-2\). The point is that the linear part of the scalar curvature at the Euclidean metric is
When \(k_\lambda=\chi_\lambda(g-\delta)\), the terms with two derivatives on \(g-\delta\) are exactly \(\chi_\lambda DR_\delta(g-\delta)\); the remaining terms contain at least one derivative of \(\chi_\lambda\) and are supported in the annulus. Since \(|\nabla^k\chi_\lambda|\leq C\lambda^{-k}\) there, and \(g-\delta\in W^{2,p}_{-q}\), these commutator terms go to zero in the weighted norms above. The nonlinear terms in \(R_{g_\lambda}\) contain either \((g_\lambda-\delta)\partial^2g_\lambda\) or \((\partial g_\lambda)^2\), and have the same decay. This proves (6.4.1).
For \(\lambda\) large, solve the conformal correction equation
where \(c_n=\frac{4(n-1)}{n-2}\). The weighted Fredholm estimate for the Laplacian on an asymptotically flat end, together with the smallness of \(V_\lambda\) in (6.4.1), gives a unique solution with
Equivalently, writing \(w_\lambda=1+\eta_\lambda\), the equation is
The operator on the left is a small perturbation of the AF Laplacian \(-c_n\Delta^{g_\lambda}:W^{2,p}_{-q'}\to L^p_{-q'-2}\), which is an isomorphism for \(0\lt{}q'\lt{}n-2\). Thus a Neumann-series/Fredholm argument solves for \(\eta_\lambda\) and gives the norm estimate above. This is the only analytic input in the deformation step. Since \(p\gt{}n\), weighted Sobolev embedding gives \(\|w_\lambda-1\|_{C^0}\to0\); after increasing \(\lambda\) we therefore have \(\frac12\leq w_\lambda\leq2\). Define
The conformal scalar curvature formula gives
Thus the scalar curvature sign is preserved. Since \(w_\lambda\) stays uniformly bounded above and below and \(g_\lambda\) is uniformly equivalent to \(g\), the new metric is complete.
On the region \(r\geq2\lambda\), we have \(g_\lambda=\delta\) and \(V_\lambda=0\), so (6.4.2) becomes
Therefore \(\tilde g_\lambda\) is harmonically flat there:
Here the “conformal mass formula” is the following elementary consequence of the ADM boundary integral. If an AF metric \(g_0\) has mass \(m_{\rm ADM}(g_0)\) and
then
Indeed, in the flat coordinates of the end, \(u^{4/(n-2)}=1+\frac{4A}{n-2}r^{2-n}+O_\infty(r^{1-n})\), so the extra contribution to the ADM integrand is
and the boundary integral gives \(2A\).
Applying (6.4.3) with \(g_0=g_\lambda\) gives
The equation for \(w_\lambda\) then identifies this coefficient. Integrating
over a large coordinate ball and letting the radius tend to infinity, using that \(g_\lambda=\delta\) near infinity, gives
Since \(\partial_\nu w_\lambda=-(n-2)A_\lambda r^{1-n}+O(r^{-n})\) and \(c_n=\frac{4(n-1)}{n-2}\), this becomes
Hence
Here \(g_\lambda\) is exactly Euclidean at infinity, so \(m_{\operatorname{ADM}}(g_\lambda)=0\); the integral above is precisely what produces the new harmonic mass coefficient.
To see that this new mass is close to the original mass, one uses the standard mass-continuity lemma in the density theorem. The construction gives \(\tilde g_\lambda-g\to0\) in \(W^{2,p}_{-q'}\) for every \(\frac{n-2}{2}\lt{}q'\lt{}n-2\), and the scalar curvatures also converge in \(L^1\):
The first term goes to zero because \(w_\lambda\to1\) uniformly and \(R_g\) is integrable on the AF end; the second goes to zero because its support escapes to infinity. The ADM boundary integral is continuous under this \(W^{2,p}_{-q'}\) convergence together with \(L^1\) scalar-curvature convergence: this is the usual Bartnik density mass lemma, obtained by writing the mass integrand as the Euclidean linearization of scalar curvature plus quadratic terms, whose tails are integrable when \(q'\gt{}\frac{n-2}{2}\). Hence
Choosing \(\lambda\) large, the mass of \(\tilde g_\lambda\) therefore differs from \(m\) by less than \(\varepsilon\).
It remains only to identify the coefficient \(A_\lambda\) with the ADM mass. Set \(v=w_\lambda^{4/(n-2)}\), so \(\tilde g_{\lambda,ij}=v\delta_{ij}\) near infinity. Since
we compute
Substituting in the ADM formula yields
Taking \(g_\varepsilon=\tilde g_\lambda\) and \(a_\varepsilon=A_\lambda\) proves the theorem. ◻
Assume that on an exterior coordinate region \(\{r\geq R_0\}\) the metric is
Then, after increasing \(R_0\) if necessary, one can replace \(u\) outside a large compact set by a smooth positive function \(\bar u\) such that
with strict inequality somewhere. Therefore
has nonnegative scalar curvature on the end, has positive scalar curvature somewhere in the transition region, and is exactly flat near infinity. Since \(\bar u=u\) on a full neighborhood of the inner matching sphere, this replacement glues smoothly to the original metric on the compact part.
Proof. The conformal scalar curvature formula on the flat background gives
Since \(R_g\geq0\), the conformal factor is superharmonic:
The negative coefficient \(a\lt{}0\) says that \(u\) approaches \(1\) from below. More precisely, for \(r\) sufficiently large,
After enlarging the compact core, choose \(\varepsilon\gt{}0\) and radii \(R_1\lt{}R_2\) so large that
and such that \(\partial_r u\gt{}0\) throughout \(R_1\leq r\leq R_2\). Thus the transition set
is contained in a compact annulus in the end, and \(|\nabla u|\neq0\) somewhere inside this transition set.
Choose a smooth function \(\Psi:\mathbb{R}\to\mathbb{R}\) with the following properties:
where \(c\gt{}0\), and
with \(\Psi''\lt{}0\) somewhere in \((1-3\varepsilon,1-\varepsilon)\). Such a function is obtained by choosing a smooth nonincreasing function \(\theta:[1-3\varepsilon,1-\varepsilon]\to[0,1]\) which equals \(1\) near the left endpoint and \(0\) near the right endpoint, and then setting
on the transition interval, with the two constant/identity extensions above.
Now set
on the end, and keep \(\bar u=u\) on the compact core. This is smooth across the inner matching region because \(\Psi(t)=t\) wherever \(u\leq1-3\varepsilon\). It is positive because \(u\gt{}0\) and \(c\gt{}0\), and it is constant equal to \(c\) near infinity because \(\Psi\) is constant for \(t\geq1-\varepsilon\).
By the chain rule,
The inequality is strict somewhere in the transition annulus: there \(|\nabla u|\neq0\) at some point where \(\Psi''(u)\lt{}0\). This proves the desired superharmonic cut-off. Notice that no spherical symmetry is used here; the only inputs are conformal flatness, superharmonicity, and the negative mass asymptotic.
Finally use the conformal scalar curvature formula with flat background metric. For any positive function \(\phi\),
Applying this to \(\phi=\bar u\) gives
and it is positive somewhere. Since \(\bar u\) is constant near infinity, \(\bar g\) is flat there. ◻
If a one-ended asymptotically flat manifold \((M^n,g)\), \(3\leq n\leq7\), has nonnegative scalar curvature and negative ADM mass, then some closed manifold of the form
admits a positive-scalar-curvature metric.
Proof. First apply Theorem 6.4.3, choosing the deformation so that the mass remains negative. Then apply Lemma 6.4.4 to the harmonically flat end. We obtain a new complete metric, still denoted by \(g\), with the following properties: \(R_g\geq0\), \(R_g\gt{}0\) somewhere in a compact annulus in the chosen end, and on \(\{r\geq R_2\}\) the metric is
for a constant \(c\gt{}0\).
Choose a coordinate radius \(R\) with \(R_0\lt{}R\lt{}R_2\), and let \(K_R:=M\setminus\{r\gt{}R\}\) be the compact manifold obtained by cutting the chosen end at the coordinate sphere \(S_R=\partial B_R\). Then choose \(L\gt{}R_2\) and let
Since every point of \(\partial Q_L\) has Euclidean radius at least \(L\gt{}R_2\), a whole collar of \(\partial Q_L\) lies in the exactly flat region. Remove from \(M\) the part of the chosen end outside \(Q_L\). Equivalently, the remaining compact manifold with corners is
where \(Q_L\setminus B_R\) is read in the asymptotic coordinate chart. The outer boundary of this fundamental domain is the boundary of the cube \(Q_L\).
Now identify opposite faces of \(\partial Q_L\) by the translations
These translations are isometries for the constant flat metric \(c^{4/(n-2)}\delta\). Hence the metric descends smoothly across the identified faces. There is no corner singularity: the cube is only a fundamental domain for the standard smooth quotient \(Q_L/\!\sim\,=\mathbb{T}^n\), and the metric is the translation-invariant flat metric in a neighborhood of the boundary faces.
Let \(X\) be the closed quotient. We next identify its topology. Form the closed manifold
On the other hand, after the opposite faces of \(Q_L\) are identified, \(Q_L\) becomes \(\mathbb{T}^n\), and the image of the interior ball \(B_R\) is an embedded ball in this torus. Therefore
This is the promised torus connected sum.
The scalar curvature statement is local, so it survives the quotient. Thus the induced metric on \(X\) has \(R\geq0\) everywhere and \(R\gt{}0\) somewhere. By Lemma 6.1.2, the closed manifold \(X\) admits a metric with \(R\gt{}0\) everywhere. Since \(X\simeq\mathbb{T}^n\#Y\), this proves the proposition. ◻
Proof. Suppose, to the contrary, that the mass is negative. By Proposition 6.4.5, some \(\mathbb{T}^n\#Y\) admits a positive-scalar-curvature metric. This contradicts Corollary 6.3.2. Hence the ADM mass is nonnegative. ◻
The equality case is less topological but fits the same philosophy. If an asymptotically flat manifold with \(R_g\geq0\) has zero mass and is not Euclidean, one uses a conformal/deformation argument to produce a new asymptotically flat metric with \(R\geq0\) and strictly negative mass. This contradicts the nonnegativity just proved. In the original Schoen–Yau proof this is combined with the strong maximum principle and the regularity theory for the minimizing hypersurfaces.
The proof has three moving parts.
- Stability plus the Gauss equation gives
This is the whole reason positive scalar curvature descends to a stable minimal hypersurface.
-
A nonzero-degree map to a torus supplies cohomology classes whose cup product remains nonzero after passing to a Poincare-dual minimizing hypersurface.
-
A negative-mass end can be flattened to make a closed positive-scalar-curvature metric on \(\mathbb{T}^n\#Y\), contradicting Geroch.