1Special Relativity Review¶
Principle of Special Relativity¶
At the end of the 19th century, Maxwell’s theory of electromagnetism established itself as the description of electrical and magnetic phenomena. Better yet, it predicted the existence of electromagnetic waves which were discovered in 1888 by Heinrich Hertz. Their study showed that they have all the known properties of light waves (reflection and refraction, interference, polarization and diffraction) and especially the same speed which emerges from Maxwell’s equations. Hertzian and light waves are therefore electromagnetic waves, but Maxwell’s electromagnetism says nothing about the reference frame in which this speed would be defined. Furthermore, the Michelson and Morley experiment (1887) wiki:Michelson–Morley_experiment shows that the speed of light does not seem to compose with the Earth’s velocity around the Sun, while Fizeau’s experiment (1849) Fizeau experiment shows that it partially composes with that of a moving fluid. We therefore understand that electromagnetic theory is poorly aligned with Newtonian mechanics and seems to need patching up according to contradictory experimental results.
Figure 1:Diagram of the Michelson-Morley experiment (1887). Light is split into two perpendicular beams that travel equal distances before recombining at the detector. The absence of interference fringes demonstrates the invariance of the speed of light.
Rather than undertaking this hazardous path which would have caused the theory to lose its predictive power when bodies are in motion, Albert Einstein proposes a revolutionary approach in his famous 1905 article On the electrodynamics of moving bodies Einstein, 1905. He postulates that the speed of light is the same in every reference frame and thus calls into question the Galilean composition of velocities, and even the notions of space and time. The fundamental principle of this new physics is the principle of special relativity.
If a physical law appears to be true in an inertial reference frame, within the experimental uncertainties defining its domain of validity, then it must remain true in another Galilean reference frame. This is the case, for example, with Newton’s fundamental principle of dynamics. However, imposing this on electromagnetic theory poses a serious problem, as it requires calling into question the Galilean composition of velocities for electromagnetic waves.
Lorentz Transformations¶
If electromagnetic theory is verified in a Galilean reference frame, what are the spatio-temporal coordinate transformations that can leave this speed invariant? Consider two events with spatio-temporal coordinates and in a reference frame linked by the exchange of a light signal, then:
Imposing the constancy of velocity implies that in another Galilean reference frame we also verify:
It therefore appears judicious to define the spacetime interval for an interval between two infinitely close events:
Consider a four-vector of coordinates , where component corresponds to time[1] (with the famous maximum speed and the time) and components correspond to Cartesian coordinates . In this course, we will use Greek letters for components ranging from 0 to 3 and Latin letters for spatial components ranging from 1 to 3. We define the Minkowski metric:
such that in Cartesian coordinates we can define the spacetime interval between two close spatio-temporal coordinates:
To switch to another coordinate system , we introduce the Lorentz transformation as follows:
where is a simple temporal and spatial translation.
What form should this linear transformation take[2]? To ensure that the speed of light is invariant under coordinate system changes , we must preserve for the propagation of a light ray, hence . The Lorentz transformation must therefore ensure conservation of the spacetime interval:
Hence the closure relation:
From the constitutive relation (8), we can demonstrate that Lorentz transformations form a group defined by and . A few calculations later (see Raimond (2006) or Langlois (2013) for example), we show that the coordinate transformation between two reference frames where one moves at velocity is uniquely written as:
to guarantee the constancy of the speed of light in all Galilean reference frames.
If we include spatial rotations, with a reference frame moving at constant velocity relative to another reference frame, the components of tensor are finally written as:
Lorentz transformations impose that not only does the speed of light not compose with the velocity of an observer or source, but also that it is unsurpassable. However, gravitational force propagates instantaneously over infinite distance in Newtonian theory. The displacement of a mass is instantly felt gravitationally throughout the Universe. After having to rewrite the equations of kinematics to preserve Maxwell’s equations, Einstein worked for the following 10 years to reformulate Newtonian gravitational theory so that it would fit into this new framework. The main ingredient of his approach is the observation that gravitational mass and inertial mass are identical, which is an indication that gravitation could be described by a kinematic effect.
2From Newton to General Relativity¶
General Relativity is the theory of gravitation at the basis of modern cosmology. It provides a geometric explanation for the gravitational force introduced by Newton three centuries earlier. In this theory, falling to the ground is no longer due to a force vector unfortunately oriented towards the soil, but to the deformation of spacetime caused by the Earth. Formulated this way, General Relativity seems quite complicated for little gain. But the general principles at the base of this theory and the richness of its implications (including Newtonian theory) make it the flagship theory for describing gravitation. Throughout this section, we will introduce step by step several concepts of General Relativity such as the geodesic equation, the metric and the covariant derivative, to arrive at Einstein’s equation of General Relativity, the basis of modern cosmology. This introduction is largely inspired by Weinberg (1972) and Gourgoulhon (2013).
The Equivalence Principle¶
In Newton’s fundamental principle of dynamics, why is the mass involved in the inertia term strictly the same as that involved in Newtonian gravitation? This troubling equality between inertial mass and gravitational mass, validated by centuries of experimentation (Newton’s pendulums, Eötvös’ balance, etc.), sets gravitation apart from other interactions, such as Coulomb’s force, which depends on the electrical charge of the bodies considered. This suggests that gravitation is not a property of the bodies themselves, but of the space in which they move.
Eötvös Experiments (1889)
In this experiment performed in 1889, Eötvös showed that a wooden mass and a platinum mass undergo the same gravitational force to within 10-9, by observing the torsion of a balance Von Eötvös, 1890. The balance compares the torque exerted by gravitational force (dependent on gravitational mass) and the torque exerted by centrifugal force due to Earth’s rotation (dependent on inertial mass). See a description of the experiment here wiki:Eötvös_experiment.
Let us consider a point mass subjected to a uniform and constant external gravitational field and to non-gravitational forces . Then the fundamental principle of dynamics applied in a Galilean reference frame to this object allows us to predict its position at an instant by solving the differential equation:
with the gravitational mass and the inertial mass. Let us place ourselves in an accelerated (non-Galilean) reference frame relative to the initial Galilean reference frame with an entrainment acceleration . In equipped with coordinates , the fundamental principle of dynamics is written with an inertial force (also called fictitious force because it comes from a kinematic effect) due to the entrainment acceleration:
Thanks to the equality between inertial mass and gravitational mass , we notice that we can make a particular choice of reference frame such that , such that the fundamental principle of dynamics is written as if there were neither acceleration nor gravitation:
The laws of physics therefore appear identical for an observer linked to a Galilean reference frame considering that the object undergoes a gravitational force and for an observer linked to a free-falling reference frame considering that the object does not undergo any gravitational force. The gravitational effect has been incorporated into a coordinate change . What if this approach were possible for any gravitational field, at least locally in a region where is quasi-constant and during an experiment duration where is quasi-stationary? The Equivalence Principle formulated by Einstein acknowledges the equivalence between gravitation and acceleration due to the equality of inertial and gravitational masses, at least for gravitational fields that vary weakly in time and space, and elevates it to a construction principle for physical laws.
This is therefore a generalization of the principle of special relativity to all reference frames, in the presence of gravitation or not. This principle is verified experimentally with very good precision, notably by Lunar Laser Ranging Williams et al., 2004 and by the MICROSCOPE satellite with a precision of Touboul et al., 2022.
Equations of motion¶
Let us apply the Equivalence Principle to the problem of a massive object in free fall. For this object, there therefore exists locally a particular coordinate system such that the equation of its trajectory can be written in the same way as if the reference frame were non-accelerated and without gravitation:
with the proper time[3]:
The parameter will allow us to parameterize the curve , like a curvilinear abscissa. In this equation it only plays the role of a label to parameterize the successive positions of the object, but it has the immense advantage of being Lorentz invariant and of being the time measured by the observer in the particle’s reference frame.
According to the Equivalence Principle, this equation is also valid in a certain neighborhood of the object in question with another choice of spatio-temporal coordinates. There therefore exists another arbitrary coordinate system in which we have the right to rewrite the equation of its trajectory . Let us look for the form it would take for these coordinates :
After multiplication by , we obtain[4] the new equation of motion:
where is the affine connection defined by:
The proper time is rewritten:
which thus defines the metric tensor :
The tensor describes the geometry of spacetime in the new coordinate system and replaces the Cartesian metric . It is the fundamental object of General Relativity because it allows describing distances traveled in non-Euclidean (curved) spacetime.
The metric tensor in simple cases
Consider a 2D Euclidean space with metric and coordinates . We perform a rotation of space by an angle , passing to coordinates by:
Let us calculate for the example:
After calculating the other terms, we have:
so the space remains Euclidean after rotation. Equivalently, we could have obtained the metric by studying the conservation of a length element under rotation:
Consider a 2D spherical space of radius . A position on the sphere is given by two angles . An elementary vector on the sphere has length:
Hence the metric on this curved space:
whose curvature is half the Ricci scalar (see Scalar curvature) and equals as expected for a sphere.
Consider a 2D Euclidean spacetime with metric and coordinates . We perform a Galilean translation of space with constant velocity , passing to coordinates by:
Inverse metric
We could subsequently show that can be written using only a single coordinate system and the metric tensor:
If present, forces other than gravitation acting on the test particle can be added to the right-hand side of equation (25):
with the mass of the object and the contravariant vector of non-gravitational forces acting on a massive particle[5]. The correct formulation of the fundamental principle of dynamics according to the Equivalence Principle is therefore equation (24), because we can show that the latter is indeed invariant under local coordinate system transformations (demonstration Weinberg (1972)[p. 102]) and reduces to the fundamental principle of dynamics in the absence of gravitation (null affine connection).
Equation of motion for massless particles
Covariant derivative¶
The affine connection also appears in the definition of the covariant derivative of a vector with respect to coordinate :
The first term corresponds to the ordinary variation of a vector when displaced in its neighborhood. The second term takes into account the changes in the coordinate system that is also displaced, since the Christoffel symbol describes the changes in the reference frame basis vectors.
Illustration of the variation of a vector (cyan) in the neighborhood of a basis of a curved space. Following a displacement in its neighborhood (here along ), the vector changes size (first term of the covariant derivative) and the basis that defines its projections and thus coordinates also changes. The covariant derivative calculates the variation of the vector components due to these two changes.
This definition of the derivative in General Relativity correctly expresses the variation of a vector along a coordinate in curved space. This variation vector transforms like a contravariant tensor under coordinate changes (unlike the usual derivative): the variation vector is therefore correctly defined for any coordinate system.
To illustrate its full depth, here is the definition of the covariant derivative not with respect to a coordinate, but along any curve parameterized by proper time (invariant under coordinate changes):
Let be the 4-velocity vector . The equation of motion (24) can then be written very simply as:
This equation, written this way, strongly recalls the fundamental principle of dynamics, but places mechanics in a relativistic framework, invariant under any change of coordinate system and in the presence of gravitation. The notion of covariant derivative is therefore well suited to General Relativity calculations and properly replaces the usual derivative in this framework.
In general, replacing the usual derivatives of a physical theory with covariant derivatives leads to a formulation of physical laws that respects the equivalence principle, thus achieving invariance under reference frame changes and in the presence of gravitation. If a theory is true without gravitation and locally in Minkowski space, then it remains true in any reference frame with gravitation. For example, in the absence of gravitational field, Maxwell’s equations are written:
where is the electric current four-vector and is the electromagnetic tensor. If we introduce tensors and such that they reduce to and in an inertial Minkowski reference frame, then electromagnetic theory respects the equivalence principle if we use the covariant derivative:
and this formulation is valid in any coordinate system since it is true in Minkowski.
Toward Einstein’s equation¶
Armed with these tools, let us now move toward a simple derivation of Einstein’s equation which summarizes gravitation as a deformation of spacetime by matter. Let us begin by considering a massive particle moving slowly in a weak gravitational field, constant but arbitrary this time. According to the Equivalence Principle, we have seen that there exists an inertial coordinate system such that the equation of motion (14) is still valid in another reference frame but with gravitational field. The assumption of low velocity allows us to neglect compared to . We then have to first order in a weak and quasi-stationary gravity field:
Under the assumption of a weak gravitational field, we can adopt an almost Cartesian metric:
and we obtain to first order:
From the second equation we deduce that is a constant. Therefore we can divide the first equation by and we obtain:
But we know that in the Newtonian limit we have:
with the gravitational potential (i.e. if generated by a mass at distance , being Newton’s constant). Comparing with (35), we have . But the metric must be Minkowski at infinity (weak perturbation assumption), therefore and:
Consequently, the spacetime metric will be able to contain gravitational effects. The element corresponding to the temporal component of the metric, the beating of clocks therefore depends on the intensity of the gravitational field. This corresponds to the Einstein effect, the only consequence of General Relativity currently used technologically (in GPS, see Figure 4).
Figure 4:Illustration of the Einstein effect. A photon falling into a gravitational well gains energy so its frequency increases. Equivalently, we can say that clocks in a gravitational field run slow compared to identical clocks located outside. GPS receivers must take this effect into account to deduce their position relative to satellites.
This exercise on a point particle teaches us that the gravitational field is ultimately contained in the metric, and that this metric therefore depends on the presence of matter. It is therefore possible to imagine a generalization of this observation. The Newtonian potential is determined by Poisson’s equation:
where is the mass density and is Newton’s constant. The latter is associated with the energy density of the matter energy-momentum tensor (see chapter The expanding Universe), so with equation (37) we can obtain:
This equation is not invariant under Lorentz transformations, hence the necessity to modify Newtonian gravitational theory if we admit the principle of special relativity. Tensors are the right objects that can allow us to achieve this goal. We can then imagine that there exists a tensor combining first and second derivatives of the metric generalizing this last equation to all coordinates such that:
This last equation corresponds to a first version of Einstein’s equation. This reasoning only allowed us to intuit its form, but another more rigorous demonstration allows us to obtain the expression of Einstein’s tensor :
with the Ricci tensor and the scalar curvature (trace of the Ricci tensor ), themselves obtained from the Riemann tensor :
Since Einstein’s tensor contains second derivatives of the metric, Einstein’s equation links the curvature of spacetime and thus the trajectories of bodies to its energy and matter content.
Moreover, appears to have zero divergence. This is Bianchi’s identity:
By Bianchi’s identity, we also see that Einstein’s equation can be defined up to a constant[6] while keeping energy conservation. This constant is now called the cosmological constant. Here is Einstein’s equation in its final form Einstein, 1917:
3References¶
{bibliography}
Hereafter, the index 0 of tensors will correspond to the temporal coordinate, while the following indices will correspond to spatial coordinates.
Linearity is imposed by the invariance of physics under translation in space and time Raimond, 2006.
Through this definition, we have chosen a metric signature which we will keep throughout.
Knowing that , because is the Jacobian of the coordinate transformation and the second factor is its inverse.
If studying a massless particle, it suffices to replace by the model of the interaction applying to this particle.
Because we also have .
- Einstein, A. (1905). Zur Elektrodynamik bewegter Körper. Annalen Der Physik, 322(10), 891–921. https://doi.org/10.1002/andp.19053221004
- Raimond, J.-M. (2006). Electromagnétisme et relativité. https://cel.hal.science/cel-00092954
- Langlois, D. (2013). Relativité générale. Des fondements géométriques aux applications astrophysiques. De Boeck. https://www.deboecksuperieur.com/livre/9782311007190-relativite-generale
- Landau, L. D., & Lifshits, E. M. (1989). Théorie des champs. Editions Mir. https://books.google.bi/books?id=99z7QQAACAAJ
- Weinberg, S. (1972). Gravitation and cosmology: principles and applications of the general theory of relativity. http://www.lavoisier.fr/livre/notice.asp?ouvrage=1382255
- Gourgoulhon, E. (2013). Relativité générale (cours M2 “Astronomie et Astrophysique”).
- Von Eötvös, R. (1890). über die Anziehung der Erde auf verschiedene Substanzen. Math. Naturwissenschaft. Ber. Ungarn, 8, S65-68. https://mek.oszk.hu/03200/03286/html/eotvos1/eotv7.html
- Williams, J., Turyshev, S., & Boggs, D. (2004). Progress in lunar laser ranging tests of relativistic gravity. Physical Review Letters. http://prl.aps.org/abstract/PRL/v93/i26/e261101
- Touboul, P., Métris, G., Rodrigues, M., Bergé, J., Robert, A., Baghi, Q., André, Y., Bedouet, J., Boulanger, D., Bremer, S., Carle, P., Chhun, R., Christophe, B., Cipolla, V., Damour, T., Danto, P., Demange, L., Dittus, H., Dhuicque, O., … Visser, P. (2022). MICROSCOPE Mission: Final Results of the Test of the Equivalence Principle. Phys. Rev. Lett., 129(12), 121102. 10.1103/PhysRevLett.129.121102
- Einstein, A. (1917). Kosmologische Betrachtungen zur allgemeinen Relativitätstheorie. Sitzungsberichte Der Königlich Preussischen Akademie Der Wissenschaften, 142–152. https://adsabs.harvard.edu/pdf/1917SPAW.......142E
- Lévy-Leblond, J.-M. (1976). One more derivation of the Lorentz transformation. American Journal of Physics, 44(3), 271–277. 10.1119/1.10490