Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

1Special Relativity Review

Principle of Special Relativity

At the end of the 19th century, Maxwell’s theory of electromagnetism established itself as the description of electrical and magnetic phenomena. Better yet, it predicted the existence of electromagnetic waves which were discovered in 1888 by Heinrich Hertz. Their study showed that they have all the known properties of light waves (reflection and refraction, interference, polarization and diffraction) and especially the same speed c=1/ϵ0μ0c = 1/\sqrt{\epsilon_0\mu_0} which emerges from Maxwell’s equations. Hertzian and light waves are therefore electromagnetic waves, but Maxwell’s electromagnetism says nothing about the reference frame in which this speed would be defined. Furthermore, the Michelson and Morley experiment (1887) wiki:Michelson–Morley_experiment shows that the speed of light does not seem to compose with the Earth’s velocity around the Sun, while Fizeau’s experiment (1849) Fizeau experiment shows that it partially composes with that of a moving fluid. We therefore understand that electromagnetic theory is poorly aligned with Newtonian mechanics and seems to need patching up according to contradictory experimental results.

Diagram of the Michelson-Morley experiment (1887). Light is split into two perpendicular beams that travel equal distances before recombining at the detector. The absence of interference fringes demonstrates the invariance of the speed of light.

Figure 1:Diagram of the Michelson-Morley experiment (1887). Light is split into two perpendicular beams that travel equal distances before recombining at the detector. The absence of interference fringes demonstrates the invariance of the speed of light.

Rather than undertaking this hazardous path which would have caused the theory to lose its predictive power when bodies are in motion, Albert Einstein proposes a revolutionary approach in his famous 1905 article On the electrodynamics of moving bodies Einstein, 1905. He postulates that the speed of light is the same in every reference frame and thus calls into question the Galilean composition of velocities, and even the notions of space and time. The fundamental principle of this new physics is the principle of special relativity.

If a physical law appears to be true in an inertial reference frame, within the experimental uncertainties defining its domain of validity, then it must remain true in another Galilean reference frame. This is the case, for example, with Newton’s fundamental principle of dynamics. However, imposing this on electromagnetic theory poses a serious problem, as it requires calling into question the Galilean composition of velocities for electromagnetic waves.

Lorentz Transformations

If electromagnetic theory is verified in a Galilean reference frame, what are the spatio-temporal coordinate transformations that can leave this speed invariant? Consider two events with spatio-temporal coordinates (t1,x1,y1,z1)(t_1,x_1,y_1,z_1) and (t2,x2,y2,z2)(t_2, x_2, y_2, z_2) in a reference frame R\mathcal{R} linked by the exchange of a light signal, then:

c2(t2t1)2=(x2x1)2+(y2y1)2+(z2z1)2c^2(t_2-t_1)^2 = (x_2-x_1)^2 + (y_2-y_1)^2 + (z_2-z_1)^2

Imposing the constancy of velocity implies that in another Galilean reference frame R\mathcal{R}' we also verify:

c2(t2t1)2=(x2x1)2+(y2y1)2+(z2z1)2c^2(t'_2-t'_1)^2 = (x'_2-x'_1)^2 + (y'_2-y'_1)^2 + (z'_2-z'_1)^2

It therefore appears judicious to define the spacetime interval for an interval between two infinitely close events:

ds2=c2dt2+dx2+dy2+dz2\dd s^2 = -c^2 \dd t^2 + \dd x^2 + \dd y^2 + \dd z^2

Consider a four-vector of coordinates xαx^\alpha, where component α=0\alpha=0 corresponds to time[1] ctct (with cc the famous maximum speed and tt the time) and components α=1,2,3\alpha=1,2,3 correspond to Cartesian coordinates x,y,zx,y,z. In this course, we will use Greek letters for components ranging from 0 to 3 and Latin letters for spatial components ranging from 1 to 3. We define the Minkowski metric:

ηαβ=(1000010000100001)\eta_{\alpha\beta} = \begin{pmatrix} -1 & 0& 0& 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 &0 & 1 \end{pmatrix}

such that in Cartesian coordinates we can define the spacetime interval between two close spatio-temporal coordinates:

ds2=c2dt2+dx2=ηαβdxαdxβ\dd s^2 = - c^2 \dd t^2 + \dd \vec x^2 = \eta_{\alpha\beta}\dd x^\alpha \dd x^\beta

To switch to another coordinate system xαx'^\alpha, we introduce the Lorentz transformation Λ  βα\Lambda^\alpha_{\;\beta} as follows:

xα=Λ  βαxβ+aα,x'^{\alpha} = \Lambda^\alpha_{\;\beta} x^\beta + a^\alpha,

where aαa^\alpha is a simple temporal and spatial translation.

What form should this linear transformation take[2]? To ensure that the speed of light is invariant under coordinate system changes xαx'^{\alpha}, we must preserve dx/dt=c\vert \dd \vec x' / \dd t'\vert = c for the propagation of a light ray, hence ds2=ds2=0\dd s'^2=\dd s^2 = 0. The Lorentz transformation must therefore ensure conservation of the spacetime interval:

ds2=ηαβdxαdxβ=ηαβΛ  γαΛ  δβdxγdxδ=ηγδdxγdxβ=ds2\dd s'^2 = \eta_{\alpha\beta}\dd x'^\alpha \dd x'^\beta = \eta_{\alpha\beta} \Lambda^{\alpha}_{\;\gamma} \Lambda^{\beta}_{\;\delta} \dd x^\gamma \dd x^\delta = \eta_{\gamma\delta} \dd x^\gamma \dd x^\beta = \dd s^2

Hence the closure relation:

ηαβΛ  γαΛ  δβ=ηγδ\eta_{\alpha\beta} \Lambda^{\alpha}_{\;\gamma} \Lambda^{\beta}_{\;\delta} = \eta_{\gamma\delta}

From the constitutive relation (8), we can demonstrate that Lorentz transformations form a group defined by Λ  001\Lambda^{0}_{\;0}\geqslant 1 and det  Λ=+1\mathrm{det}\;\Lambda=+1. A few calculations later (see Raimond (2006) or Langlois (2013) for example), we show that the coordinate transformation between two reference frames where one moves at velocity v=ve1\vec v = v \vec e_{1} is uniquely written as:

Λ  βα=(γβγ00βγγ0000100001),β=vc,γ=11β2\Lambda^{\alpha}_{\;\beta} = \begin{pmatrix} \gamma & -\beta \gamma & 0 & 0 \\ -\beta \gamma & \gamma & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 &0 & 1 \end{pmatrix}, \quad \beta = \frac{v}{c},\quad \gamma = \frac{1}{\sqrt{1 - \beta^2}}

to guarantee the constancy of the speed of light in all Galilean reference frames.

If we include spatial rotations, with a reference frame moving at constant velocity v\vec v relative to another reference frame, the components of tensor Λ  βα\Lambda^\alpha_{\;\beta} are finally written as:

Λ  00=γ,Λ  0i=γvi/c,Λ  j0=γvj/c,Λ  ji=δij+(γ1)vivjv2\Lambda^0_{\;0} = \gamma,\quad \Lambda^i_{\;0} = -\gamma v_i / c,\quad \Lambda^0_{\;j} = -\gamma v_j / c,\quad \Lambda^i_{\;j} = \delta_{ij} + (\gamma - 1) \frac{v_i v_j}{v^2}

Lorentz transformations impose that not only does the speed of light not compose with the velocity of an observer or source, but also that it is unsurpassable. However, gravitational force propagates instantaneously over infinite distance in Newtonian theory. The displacement of a mass is instantly felt gravitationally throughout the Universe. After having to rewrite the equations of kinematics to preserve Maxwell’s equations, Einstein worked for the following 10 years to reformulate Newtonian gravitational theory so that it would fit into this new framework. The main ingredient of his approach is the observation that gravitational mass and inertial mass are identical, which is an indication that gravitation could be described by a kinematic effect.

2From Newton to General Relativity

General Relativity is the theory of gravitation at the basis of modern cosmology. It provides a geometric explanation for the gravitational force introduced by Newton three centuries earlier. In this theory, falling to the ground is no longer due to a force vector unfortunately oriented towards the soil, but to the deformation of spacetime caused by the Earth. Formulated this way, General Relativity seems quite complicated for little gain. But the general principles at the base of this theory and the richness of its implications (including Newtonian theory) make it the flagship theory for describing gravitation. Throughout this section, we will introduce step by step several concepts of General Relativity such as the geodesic equation, the metric and the covariant derivative, to arrive at Einstein’s equation of General Relativity, the basis of modern cosmology. This introduction is largely inspired by Weinberg (1972) and Gourgoulhon (2013).

The Equivalence Principle

In Newton’s fundamental principle of dynamics, why is the mass involved in the inertia term strictly the same as that involved in Newtonian gravitation? This troubling equality between inertial mass and gravitational mass, validated by centuries of experimentation (Newton’s pendulums, Eötvös’ balance, etc.), sets gravitation apart from other interactions, such as Coulomb’s force, which depends on the electrical charge of the bodies considered. This suggests that gravitation is not a property of the bodies themselves, but of the space in which they move.

Let us consider a point mass subjected to a uniform and constant external gravitational field g\vec g and to non-gravitational forces fng\vec f_{\rm ng}. Then the fundamental principle of dynamics applied in a Galilean reference frame R\mathcal{R} to this object allows us to predict its position x\vec x at an instant tt by solving the differential equation:

mid2xdt2=mgg+fngm_i\frac{\dd^2\vec x}{\dd t^2} = m_g\vec g + \vec f_{\rm ng}

with mgm_g the gravitational mass and mim_i the inertial mass. Let us place ourselves in an accelerated (non-Galilean) reference frame R\mathcal{R}' relative to the initial Galilean reference frame with an entrainment acceleration ae\vec a_e. In R\mathcal{R}' equipped with coordinates (t,x)(t, x'), the fundamental principle of dynamics is written with an inertial force mae-m\vec a_e (also called fictitious force because it comes from a kinematic effect) due to the entrainment acceleration:

mid2xdt2=mggmiae+fngm_i\frac{\dd^2\vec x'}{\dd t^2} = m_g\vec g -m_i\vec a_e + \vec f_{\rm ng}

Thanks to the equality between inertial mass and gravitational mass mi=mg=mm_i=m_g=m, we notice that we can make a particular choice of reference frame R\mathcal{R}' such that g=ae\vec g = \vec a_e, such that the fundamental principle of dynamics is written as if there were neither acceleration nor gravitation:

md2xdt2=fngm\frac{\dd ^2\vec x'}{\dd t^2} = \vec f_{\rm ng}

The laws of physics therefore appear identical for an observer linked to a Galilean reference frame considering that the object undergoes a gravitational force and for an observer linked to a free-falling reference frame considering that the object does not undergo any gravitational force. The gravitational effect has been incorporated into a coordinate change x=xgt2/2\vec x' = \vec x - g t^2/2. What if this approach were possible for any gravitational field, at least locally in a region where g\vec g is quasi-constant and during an experiment duration where g\vec g is quasi-stationary? The Equivalence Principle formulated by Einstein acknowledges the equivalence between gravitation and acceleration due to the equality of inertial and gravitational masses, at least for gravitational fields that vary weakly in time and space, and elevates it to a construction principle for physical laws.

This is therefore a generalization of the principle of special relativity to all reference frames, in the presence of gravitation or not. This principle is verified experimentally with very good precision, notably by Lunar Laser Ranging Williams et al., 2004 and by the MICROSCOPE satellite with a precision of 2.7×10152.7\times 10^{−15} Touboul et al., 2022.

Equations of motion

Let us apply the Equivalence Principle to the problem of a massive object in free fall. For this object, there therefore exists locally a particular coordinate system such that the equation of its trajectory xμx'^\mu can be written in the same way as if the reference frame were non-accelerated and without gravitation:

d2xμdτ2=0,\frac{\dd^2 x'^\mu}{\dd\tau^2}=0,

with dτ\dd\tau the proper time[3]:

dτ2ημνdxμdxν.\dd \tau^2 \equiv -\eta_{\mu\nu} \dd x'^\mu \dd x'^\nu.

The parameter τ\tau will allow us to parameterize the curve xμ(τ)x'^\mu(\tau), like a curvilinear abscissa. In this equation it only plays the role of a label to parameterize the successive positions of the object, but it has the immense advantage of being Lorentz invariant and of being the time measured by the observer in the particle’s reference frame.

According to the Equivalence Principle, this equation is also valid in a certain neighborhood of the object in question with another choice of spatio-temporal coordinates. There therefore exists another arbitrary coordinate system in which we have the right to rewrite the equation of its trajectory xμx^\mu. Let us look for the form it would take for these coordinates xμx^\mu:

0=d2xμdτ2=ddτ(xμxνdxνdτ)=xμxνd2xνdτ2+2xμxνxρdxνdτdxρdτ.0=\frac{\dd^2 x'^\mu}{\dd\tau^2}=\frac{\dd}{\dd\tau}\left(\frac{\partial x'^\mu}{\partial x^\nu} \frac{\dd x^\nu}{\dd\tau}\right) = \frac{\partial x'^\mu}{\partial x^\nu} \frac{\dd^2 x^\nu}{\dd\tau^2} + \frac{\partial^2 x'^\mu}{\partial x^\nu \partial x^\rho}\frac{\dd x^\nu}{\dd\tau}\frac{\dd x^\rho}{\dd\tau}.

After multiplication by xγ/xμ\partial x^\gamma/\partial x'^\mu, we obtain[4] the new equation of motion:

d2xνdτ2+Γ μρνdxμdτdxρdτ=0,\frac{\dd^2x^\nu}{\dd\tau^2} + \Gamma^\nu_{\ \mu\rho}\frac{\dd x^\mu}{\dd\tau}\frac{\dd x^\rho}{\dd\tau}=0,

where Γ μρν\Gamma^\nu_{\ \mu\rho} is the affine connection defined by:

Γ μρνxνxλ2xλxμxρ.\Gamma^\nu_{\ \mu\rho} \equiv \frac{\partial x^\nu}{\partial x'^\lambda}\frac{\partial^2 x'^\lambda}{\partial x^\mu \partial x^\rho}.

The proper time is rewritten:

dτ2=ημνdxμdxν=gμνdxμdxν\dd \tau^2=-\eta_{\mu\nu} \dd x'^\mu \dd x'^\nu = -g_{\mu\nu} \dd x^\mu \dd x^\nu

which thus defines the metric tensor gμνg_{\mu\nu}:

gμν=ηαβxαxμxβxν\boxed{g_{\mu\nu} = \eta_{\alpha\beta} \frac{\partial x'^\alpha}{\partial x^\mu} \frac{\partial x'^\beta}{\partial x^\nu}}

The tensor gμνg_{\mu\nu} describes the geometry of spacetime in the new coordinate system xμx^\mu and replaces the Cartesian metric ημν\eta_{\mu\nu}. It is the fundamental object of General Relativity because it allows describing distances traveled in non-Euclidean (curved) spacetime.

We could subsequently show that Γ μρν\Gamma^\nu_{\ \mu\rho} can be written using only a single coordinate system and the metric tensor:

Γ μρν=12gλν(gλρxμ+gμλxρgμρxλ)\Gamma^\nu_{\ \mu\rho} = \frac{1}{2}g^{\lambda\nu}\left( \frac{\partial g_{\lambda\rho}}{\partial x^\mu} + \frac{\partial g_{\mu\lambda}}{\partial x^\rho} - \frac{\partial g_{\mu\rho}}{\partial x^\lambda} \right)

If present, forces other than gravitation acting on the test particle can be added to the right-hand side of equation (25):

d2xνdτ2+Γ μρνdxμdτdxρdτ=fngμm\boxed{\frac{\dd^2x^\nu}{\dd \tau^2} + \Gamma^\nu_{\ \mu\rho}\frac{\dd x^\mu}{\dd \tau}\frac{\dd x^\rho}{\dd \tau}=\frac{f_{\rm ng}^\mu}{m}}

with mm the mass of the object and fngμf_{\rm ng}^\mu the contravariant vector of non-gravitational forces acting on a massive particle[5]. The correct formulation of the fundamental principle of dynamics according to the Equivalence Principle is therefore equation (24), because we can show that the latter is indeed invariant under local coordinate system transformations (demonstration Weinberg (1972)[p. 102]) and reduces to the fundamental principle of dynamics in the absence of gravitation (null affine connection).

Covariant derivative

The affine connection Γ μρν\Gamma^\nu_{\ \mu\rho} also appears in the definition of the covariant derivative Vν;μV^\nu{}_{;\mu} of a vector VνV^\nu with respect to coordinate xμx'^\mu:

Vν;μμVν+Γ μρνVρ.V^\nu{}_{;\mu} \equiv \partial_\mu V^\nu + \Gamma^\nu_{\ \mu\rho}V^\rho.

The first term corresponds to the ordinary variation of a vector when displaced in its neighborhood. The second term takes into account the changes in the coordinate system that is also displaced, since the Christoffel symbol describes the changes in the reference frame basis vectors.

Illustration of the variation of a vector V^\mu (cyan) in the neighborhood of a basis (e_\mu, e_\nu) of a curved space. Following a displacement in its neighborhood (here along e_\mu), the vector changes size (first term of the covariant derivative) and the basis that defines its projections and thus coordinates also changes. The covariant derivative calculates the variation of the vector components V^\mu due to these two changes.

Illustration of the variation of a vector VμV^\mu (cyan) in the neighborhood of a basis (eμ,eν)(e_\mu, e_\nu) of a curved space. Following a displacement in its neighborhood (here along eμe_\mu), the vector changes size (first term of the covariant derivative) and the basis that defines its projections and thus coordinates also changes. The covariant derivative calculates the variation of the vector components VμV^\mu due to these two changes.

This definition of the derivative in General Relativity correctly expresses the variation of a vector along a coordinate in curved space. This variation vector transforms like a contravariant tensor under coordinate changes (unlike the usual derivative): the variation vector Vν;μV^\nu{}_{;\mu} is therefore correctly defined for any coordinate system.

To illustrate its full depth, here is the definition of the covariant derivative DVμ/DτDV^\mu/D\tau not with respect to a coordinate, but along any curve parameterized by proper time τ\tau (invariant under coordinate changes):

DVμDτdVμdτ+Γ νλμdxλdτVν.\frac{DV^\mu}{D\tau} \equiv \frac{\dd V^\mu}{\dd\tau} + \Gamma^\mu_{\ \nu\lambda}\frac{\dd x^\lambda}{\dd\tau} V^\nu.

Let VμV^\mu be the 4-velocity vector dxμ/dτ\dd x^\mu/\dd\tau. The equation of motion (24) can then be written very simply as:

DVμDτ=fngμm\boxed{\frac{DV^\mu}{D\tau}=\frac{f_{\rm ng}^\mu}{m}}

This equation, written this way, strongly recalls the fundamental principle of dynamics, but places mechanics in a relativistic framework, invariant under any change of coordinate system and in the presence of gravitation. The notion of covariant derivative is therefore well suited to General Relativity calculations and properly replaces the usual derivative in this framework.

In general, replacing the usual derivatives of a physical theory with covariant derivatives leads to a formulation of physical laws that respects the equivalence principle, thus achieving invariance under reference frame changes and in the presence of gravitation. If a theory is true without gravitation and locally in Minkowski space, then it remains true in any reference frame with gravitation. For example, in the absence of gravitational field, Maxwell’s equations are written:

Fαβxα=Jβ\frac{\partial F^{\alpha\beta}}{\partial x^\alpha} = - J^\beta

where JβJ^\beta is the electric current four-vector and FαβF^{\alpha\beta} is the electromagnetic tensor. If we introduce tensors FμνF^{\mu\nu} and JμJ^\mu such that they reduce to FαβF^{\alpha\beta} and JβJ^\beta in an inertial Minkowski reference frame, then electromagnetic theory respects the equivalence principle if we use the covariant derivative:

Fμν;μ=JμF^{\mu\nu}{}_{;\mu} = - J^\mu

and this formulation is valid in any coordinate system since it is true in Minkowski.

Toward Einstein’s equation

Armed with these tools, let us now move toward a simple derivation of Einstein’s equation which summarizes gravitation as a deformation of spacetime by matter. Let us begin by considering a massive particle moving slowly in a weak gravitational field, constant but arbitrary this time. According to the Equivalence Principle, we have seen that there exists an inertial coordinate system (ct,x)\left(ct',\vec x'\right) such that the equation of motion (14) is still valid in another reference frame (ct,x)\left(ct,\vec x\right) but with gravitational field. The assumption of low velocity allows us to neglect dx/dτ\dd\vec x/\dd\tau compared to cdt/dτc\dd t/\dd\tau. We then have to first order in a weak and quasi-stationary gravity field:

d2xμdτ2+Γ 00μ(cdtdτ)2=0,Γ    00μ12ημνg00xν.\frac{\dd^2x^\mu}{\dd\tau^2} + \Gamma^\mu_{\ 00}\left(c\frac{\dd t}{\dd\tau}\right)^2=0, \qquad \Gamma^\mu_{\;\;00} \approx -\frac{1}{2}\eta^{\mu\nu}\frac{\partial g_{00}}{\partial x^\nu}.

Under the assumption of a weak gravitational field, we can adopt an almost Cartesian metric:

gμν=ημν+hμν,hμν1,g_{\mu\nu}=\eta_{\mu\nu}+h_{\mu\nu},\qquad \vert h_{\mu\nu} \vert \ll 1,

and we obtain to first order:

{μ=1,2,3 :d2xdτ2=12(cdtdτ)2h00μ=0 :d2tdτ2=0.\left\lbrace \begin{array}{rl} \mu=1,2,3\ : & \displaystyle{\frac{\dd^2\vec x}{\dd\tau^2} = \frac{1}{2}\left(c\frac{\dd t}{\dd\tau}\right)^2\vec{\nabla} h_{00} } \\ \mu=0\ : & \displaystyle{\frac{\dd^2 t}{\dd\tau^2} = 0.} \end{array} \right.

From the second equation we deduce that dt/dτ\dd t/\dd\tau is a constant. Therefore we can divide the first equation by dt/dτ\dd t / \dd \tau and we obtain:

d2xdt2=12c2h00.\frac{\dd^2\vec x}{\dd t^2} = \frac{1}{2}c^2\vec{\nabla} h_{00}.

But we know that in the Newtonian limit we have:

d2xdt2=ϕ\frac{\dd^2\vec x}{\dd t^2} = -\vec{\nabla} \phi

with ϕ\phi the gravitational potential (i.e. ϕ=GM/r\phi=-\GN M /r if generated by a mass MM at distance rr, G\GN being Newton’s constant). Comparing with (35), we have h00=2ϕ/c2+constanth_{00}=-2\phi/c^2+\text{constant}. But the metric must be Minkowski at infinity (weak perturbation assumption), therefore h00=2ϕ/c2h_{00}=-2\phi/c^2 and:

g00=(1+2ϕc2),g_{00}=-\left(1+\frac{2\phi}{c^2}\right),

Consequently, the spacetime metric will be able to contain gravitational effects. The element g00g_{00} corresponding to the temporal component of the metric, the beating of clocks therefore depends on the intensity of the gravitational field. This corresponds to the Einstein effect, the only consequence of General Relativity currently used technologically (in GPS, see Figure 4).

Illustration of the Einstein effect. A photon falling into a gravitational well gains energy so its frequency increases. Equivalently, we can say that clocks in a gravitational field run slow compared to identical clocks located outside. GPS receivers must take this effect into account to deduce their position relative to satellites.

Figure 4:Illustration of the Einstein effect. A photon falling into a gravitational well gains energy so its frequency increases. Equivalently, we can say that clocks in a gravitational field run slow compared to identical clocks located outside. GPS receivers must take this effect into account to deduce their position relative to satellites.

This exercise on a point particle teaches us that the gravitational field is ultimately contained in the metric, and that this metric therefore depends on the presence of matter. It is therefore possible to imagine a generalization of this observation. The Newtonian potential is determined by Poisson’s equation:

2ϕ=4πGρm\nabla^2\phi = 4\pi \GN \rho_m

where ρm\rho_m is the mass density and G\GN is Newton’s constant. The latter is associated with the energy density ϵ\epsilon of the matter energy-momentum tensor T00=ϵ=ρmc2T_{00} = \epsilon = \rho_m c^2 (see chapter The expanding Universe), so with equation (37) we can obtain:

2g00=8πGc4T00.\nabla^2 g_{00}=-\frac{8\pi \GN}{c^4} T_{00}.

This equation is not invariant under Lorentz transformations, hence the necessity to modify Newtonian gravitational theory if we admit the principle of special relativity. Tensors are the right objects that can allow us to achieve this goal. We can then imagine that there exists a tensor GμνG_{\mu\nu} combining first and second derivatives of the metric gμνg_{\mu\nu} generalizing this last equation to all coordinates such that:

Gμν=8πGc4Tμν.G_{\mu\nu}=-\frac{8\pi \GN}{c^4} T_{\mu\nu}.

This last equation corresponds to a first version of Einstein’s equation. This reasoning only allowed us to intuit its form, but another more rigorous demonstration allows us to obtain the expression of Einstein’s tensor GμνG_{\mu\nu}:

Gμν=Rμν12gμνRG_{\mu\nu} = R_{\mu\nu} - \frac{1}{2} g_{\mu\nu} R

with RμνR_{\mu\nu} the Ricci tensor and RR the scalar curvature (trace of the Ricci tensor R  μμR^\mu_{\;\mu}), themselves obtained from the Riemann tensor R ναβμR^\mu_{\ \nu\alpha\beta}:

R ναβμ=αΓ νβμ+βΓ ναμΓ ασμΓ νβσ+Γ βσμΓ νασRμν=R μανα.\begin{aligned} R^\mu_{\ \nu\alpha\beta} & = -\partial_\alpha \Gamma^\mu_{\ \nu\beta} + \partial_\beta \Gamma^\mu_{\ \nu\alpha} - \Gamma^\mu_{\ \alpha\sigma}\Gamma^\sigma_{\ \nu\beta} + \Gamma^\mu_{\ \beta\sigma}\Gamma^\sigma_{\ \nu\alpha} \\ R_{\mu\nu} & =R^\alpha_{\ \mu\alpha\nu}. \end{aligned}

Since Einstein’s tensor GμνG_{\mu\nu} contains second derivatives of the metric, Einstein’s equation links the curvature of spacetime and thus the trajectories of bodies to its energy and matter content.

Moreover, GμνG_{\mu\nu} appears to have zero divergence. This is Bianchi’s identity:

G      ;μμν=0.G^{\mu\nu}_{\;\;\;;\mu}=0.

By Bianchi’s identity, we also see that Einstein’s equation can be defined up to a constant[6] while keeping energy conservation. This constant is now called the cosmological constant. Here is Einstein’s equation in its final form Einstein, 1917:

GμνΛgμν=8πGc4Tμν\boxed{G_{\mu\nu}-\Lambda g_{\mu\nu} = -\frac{8\pi \GN}{c^4} T_{\mu\nu}}

3References

{bibliography}

Footnotes
  1. Hereafter, the index 0 of tensors will correspond to the temporal coordinate, while the following indices will correspond to spatial coordinates.

  2. Linearity is imposed by the invariance of physics under translation in space and time Raimond, 2006.

  3. Through this definition, we have chosen a metric signature (,+,+,+)(-,+,+,+) which we will keep throughout.

  4. Knowing that xρxμxμxν=δνρ\frac{\partial x'^\rho}{\partial x^\mu} \frac{\partial x^\mu}{\partial x'^\nu} = \delta^\rho_\nu, because xρxμ\frac{\partial x'^\rho}{\partial x^\mu} is the Jacobian of the coordinate transformation xμxνx^\mu \to x'^\nu and the second factor is its inverse.

  5. If studying a massless particle, it suffices to replace fngμ/mf_{\rm ng}^\mu/m by the model of the interaction applying to this particle.

  6. Because we also have gμν;μ=0g^{\mu\nu}{}_{;\mu}=0.

References
  1. Einstein, A. (1905). Zur Elektrodynamik bewegter Körper. Annalen Der Physik, 322(10), 891–921. https://doi.org/10.1002/andp.19053221004
  2. Raimond, J.-M. (2006). Electromagnétisme et relativité. https://cel.hal.science/cel-00092954
  3. Langlois, D. (2013). Relativité générale. Des fondements géométriques aux applications astrophysiques. De Boeck. https://www.deboecksuperieur.com/livre/9782311007190-relativite-generale
  4. Landau, L. D., & Lifshits, E. M. (1989). Théorie des champs. Editions Mir. https://books.google.bi/books?id=99z7QQAACAAJ
  5. Weinberg, S. (1972). Gravitation and cosmology: principles and applications of the general theory of relativity. http://www.lavoisier.fr/livre/notice.asp?ouvrage=1382255
  6. Gourgoulhon, E. (2013). Relativité générale (cours M2 “Astronomie et Astrophysique”).
  7. Von Eötvös, R. (1890). über die Anziehung der Erde auf verschiedene Substanzen. Math. Naturwissenschaft. Ber. Ungarn, 8, S65-68. https://mek.oszk.hu/03200/03286/html/eotvos1/eotv7.html
  8. Williams, J., Turyshev, S., & Boggs, D. (2004). Progress in lunar laser ranging tests of relativistic gravity. Physical Review Letters. http://prl.aps.org/abstract/PRL/v93/i26/e261101
  9. Touboul, P., Métris, G., Rodrigues, M., Bergé, J., Robert, A., Baghi, Q., André, Y., Bedouet, J., Boulanger, D., Bremer, S., Carle, P., Chhun, R., Christophe, B., Cipolla, V., Damour, T., Danto, P., Demange, L., Dittus, H., Dhuicque, O., … Visser, P. (2022). MICROSCOPE Mission: Final Results of the Test of the Equivalence Principle. Phys. Rev. Lett., 129(12), 121102. 10.1103/PhysRevLett.129.121102
  10. Einstein, A. (1917). Kosmologische Betrachtungen zur allgemeinen Relativitätstheorie. Sitzungsberichte Der Königlich Preussischen Akademie Der Wissenschaften, 142–152. https://adsabs.harvard.edu/pdf/1917SPAW.......142E
  11. Lévy-Leblond, J.-M. (1976). One more derivation of the Lorentz transformation. American Journal of Physics, 44(3), 271–277. 10.1119/1.10490