Hide code cell content

import mmf_setup; mmf_setup.nbinit()
import os
from pathlib import Path
FIG_DIR = Path(mmf_setup.ROOT) / '../Docs/_build/figures/'
os.makedirs(FIG_DIR, exist_ok=True)
import logging; logging.getLogger("matplotlib").setLevel(logging.CRITICAL)
%matplotlib inline
import numpy as np, matplotlib.pyplot as plt
try: from myst_nb import glue
except: glue = None

This cell adds /home/docs/checkouts/readthedocs.org/user_builds/physics-571-math-methods/checkouts/latest/src to your path, and contains some definitions for equations and some CSS for styling the notebook. If things look a bit strange, please try the following:

  • Choose "Trust Notebook" from the "File" menu.
  • Re-execute this cell.
  • Reload the notebook.

4. Tensors and Manifolds#

This is my attempt to unify the geometric picture of tensors and manifolds with the practical aspects of “index pushing” needed to do physics. I am attempting to get to the point here, so rigor will be lacking: please see [Hassani, 2013] for careful definitions. This largely replaces the first part of Chapter 4 of [Arfken et al., 2013].

Essential Definitions#

An \(n\)-dimensional (differentiable) manifold \(M\) is a set that every locally looks like Euclidian space \(\mathbb{R}^{n}\). To work with manifolds, we need a collection of charts called an atlas which maps parts of the manifold to subsets of \(\mathbb{R}^{n}\).

For the purposes of this discussion, we will consider manifolds that are embedded in a higher dimensional Euclidian space \(M\subset \mathbb{R}^{m}\). This embedding will allow use to determine the metric \(\op{g}\), but it is not necessary that manifolds be embedded (or even that they can be embedded, though most can). These manifolds are call Riemannian manifolds.

Important

All of the information about the nature of the manifold is contained in the metric \(\op{g}\). Thus: even if we don’t have (or know) the embedding, we can still work with the manifold if we know \(\op{g}\).

We will start with embedded manifolds, but once we develop the formalism, we will be able to describe, e.g., spacetime, where we cannot “get out” to see the embedding. For example, consider a sphere like the surface of the earth. From space we can see that the surface is curved, and we can calculate anything we like using Euclidian geometry, but as people living on the earth, we must infer properties like the curvature by making local measurements within the manifold itself.

https://upload.wikimedia.org/wikipedia/commons/9/9a/Nondifferentiable_atlas.png

Fig. 1 Example of different charts for describing the surface of the earth – an \(m=2\)-dimensional manifold embedded in \(n=3\)-dimensional space \(\mathbb{R}^3\).#

Embedded Manifolds#

To be concrete, we start with an embedded \(m\)-dimensional manifold \(M\subset \mathbb{R}^{n}\) where \(n\geq m\). This allows us to use Cartesian coordinates \(x \in \mathbb{R}^{n}\) to describe points on the manifold. To work on the manifold, however, we would like to use a different set of coordinates \(X \in \mathbb{R}^{m}\).

Notation

Since there are \(m\) coordinates \(X^{\alpha}\), it is tempting to use the notation \(\vect{X}\) to describe the points. I will try to refrain from doing this, however, since these points are not vectors in the sense of the manifold – it make no sense to “add” two points on a sphere for example. Instead I will just use \(x\) and \(X\) to represent these, and reserve vector notation for the vector spaces like the tangent space.

For the actual vectors and covectors I will use bra-ket notation:

\[\begin{gather*} \newcommand{\uket}[1]{\ket{\hat{#1}}} \newcommand{\ubra}[1]{\bra{\hat{#1}}} \newcommand{\pddiff}[3]{\frac{\partial^2{#1}}{\partial{#2}\partial{#3}}} \ket{v} = \ket{X_\alpha}V^{\alpha}, \qquad \bra{v} = V_{\alpha}\bra{X^\alpha}, \end{gather*}\]

where \(\{\ket{X_\alpha}\}\) is a set of basis vectors and \(\{\bra{X^\alpha}\}\) is a basis of covectors (forms). Here an everywhere unless explicitly stated otherwise, we will use the Einstein summation convention of summing over repeated indices.

In this discussion, we will consider primarily two sets of coordinates: Euclidean coordinates in the embedding space \(x^{i}\), and general coordinates in the chart \(X^{\alpha}\). To help distinguish between these, we will use lower-case letters \(x\) and latin indices \(i\), \(j\), \(k\), \(l\), etc. for the Euclidean coordinates, and upper case letters \(X\) with greek indices \(\alpha\), \(\beta\), \(\gamma\), \(\delta\) etc. for the chart coordinates \(X^{\alpha}\). This choice is so that the usual Euclidean coordinates \(x^1\), \(x^2\) look familiar.

To describe the manifold, we can define the \(n\) functions

\[\begin{gather*} x^{i}(X^{\alpha}) = x^{i}(X^1, X^2, \dots, X^{m}). \end{gather*}\]

Working Example: Polar coordinates

To keep things straight, I recommend thinking about polar coordinates on the unit sphere:

\[\begin{gather*} x^{i} = \begin{pmatrix} x\\ y\\ z \end{pmatrix} = R\begin{pmatrix} \cos\phi\sin\theta\\ \sin\phi\sin\theta\\ \cos\theta \end{pmatrix}, \qquad X^{\alpha} = \begin{pmatrix} \phi\\ \theta \end{pmatrix}. \end{gather*}\]

Note that it is natural to express \(x^{i}(X^{\alpha})\) since this is an embedding: Use this as the basis for the Jacobian \(J^{i}{}_{\alpha} = \partial x^{i}/\partial X^{\alpha}\). Conversely, it is somewhat unnatural to express the inverse \(X^{\alpha}(x^{i})\) which is not uniquely defined.

This provides a complete description of the manifold and its properties everywhere these functions are not singular.

Jacobian Matrix#

In particular, properties of the embedding space can be converted to properties of the manifold through the Jacobian matrix \(\mat{J}\) and its inverse \(\mat{J}^{-1}\):

\[\begin{gather*} \mat{J}^{-1} = \pdiff{X}{x} = \pdiff{(X^1, X^2, \dots, X^m)}{(x^1, x^2, \dots, x^n)}, \qquad [\mat{J}^{-1}]^{i}{}_{\alpha} = \pdiff{X^i}{x^{\alpha}} = X^{i}{}_{,\alpha}\\ \mat{J} = \pdiff{x}{X} = \pdiff{(x^1, x^2, \dots, x^n)}{(X^1, X^2, \dots, X^m)}, \qquad J^{i}{}_{\alpha} = \pdiff{x^i}{X^\alpha} = x^{i}{}_{,\alpha}. \end{gather*}\]

Note that if we think of \(x^i = f(X)\) as a vector valued function, then \(\mat{J} = \mat{\d} f\) is the differential such that

\[\begin{gather*} x(X+\epsilon) = x(X) + \mat{J}\epsilon + O(\epsilon^2). \end{gather*}\]

In matrix form:

\[\begin{gather*} \mat{J} = \begin{pmatrix} \pdiff{x^1}{X^1} & \pdiff{x^1}{X^2} & \dots & \pdiff{x^1}{X^{m}}\\ \pdiff{x^2}{X^1} & \pdiff{x^2}{X^2} & \dots & \pdiff{x^2}{X^{m}}\\ \pdiff{x^3}{X^1} & \pdiff{x^3}{X^2} & \dots & \pdiff{x^3}{X^{m}}\\ \vdots & \vdots & & \vdots\\ \pdiff{x^{n}}{X^1} & \pdiff{x^{n}}{X^2} & \dots & \pdiff{x^{n}}{X^{m}} \end{pmatrix},\qquad \mat{J}^{-1} = \begin{pmatrix} \pdiff{X^1}{x^1} & \pdiff{X^1}{x^2} & \pdiff{X^1}{x^3} &\dots & \pdiff{X^1}{x^{n}}\\ \pdiff{X^2}{x^1} & \pdiff{X^2}{x^2} & \pdiff{X^2}{x^3} & \dots & \pdiff{X^2}{x^{n}}\\ \vdots & \vdots & \vdots & & \vdots\\ \pdiff{X^{m}}{x^1} & \pdiff{X^{m}}{x^2} & \pdiff{X^{m}}{x^3} & \dots & \pdiff{X^{m}}{x^{n}} \end{pmatrix}. \end{gather*}\]

Note that \(\mat{J}^{-1}\mat{J} = \mat{1}\), but that \(\mat{J}\mat{J}^{-1}\) may be rank deficient if \(m < n\).

Vectors#

Consider a particle moving on the manifold. This motion can be described by the curve with coordinates \(x\bigl(X(t)\bigr)\) through the embedded space. This curve defines the velocity vector \(\ket{v}\):

\[\begin{align*} \ket{v} &= \ket{\hat{e}_i}\overbrace{\diff{x^i(t)}{t}}^{v^i} = \underbrace{\ket{\hat{e}_i}\overbrace{\pdiff{x^i}{X^\alpha}}^{[\mat{J}]^{i}{}_{\alpha}}}_{\ket{X_\alpha}}\overbrace{\diff{X^\alpha}{t}}^{V^\alpha}\\ &= \ket{\hat{e}_i}v^i = \ket{X_\alpha} V^\alpha. \end{align*}\]

Note: the vector \(\ket{v}\) represents a “physical” object – something real (as defined by the embedding). By contrast, the components \(v^{i}\) and \(V^{\alpha}\) are the “columns of numbers” often associated with vectors.

\[\begin{gather*} \ket{v} = \Bigl(\begin{matrix} \ket{X_1} & \ket{X_2} & \cdots & \ket{X_m} \end{matrix} \Bigr) \underbrace{\begin{pmatrix} V^1\\ V^2\\ \vdots\\ V^{m} \end{pmatrix}}_{V^\alpha} \end{gather*}\]

Unlike the actual vector \(\ket{v}\), these numbers depend on the coordinate choice.

The Jacobian matrix \(\mat{J}\) plays two roles here:

\[\begin{gather*} v^i = J^{i}{}_{\alpha}V^{\alpha} = \pdiff{x^{i}}{X^{\alpha}}V^{\alpha} , \qquad \ket{X_\alpha} = \ket{\hat{e}_i} J^{i}{}_{\alpha}. \end{gather*}\]

Acting to the right, it transforms the components of a vector in the chart \(V^{\alpha}\) into the coordinates in the embedding space \(v^i\). Acting to the left, (or equivalently, the transpose \(\mat{J}^T\) acting to the right), it transforms the basis vectors \(\ket{\hat{e}_i}\) from the embedding space into the coordinate basis vectors \(\ket{X_\alpha}\) that we can use to express vectors like the velocity in terms of its components \(V^{\alpha} = \dot{X}^\alpha\) in the chart coordinates.

Important

  • The basis vectors \(\ket{X_\alpha}\) need not be orthonormal in the embedding space. It is often convenient to choose coordinates where the basis is at least orthogonal as this makes the metric nice, but it is not necessary or assumed here.

  • However, the basis vectors are orthonormal in the sense that \(\braket{X^\alpha|X_\beta}=\delta^{\alpha}_{\beta}\). In fact, as we shall see, the metric is exactly the thing that allows us to define such an inner product, and is precisely defines so that these basis vectors remain orthonormal.

  • We are being careful about our index location. The components of the contravariant vectors are expressed with an upper index: \(V^{\alpha}\) (expressed in the chart) and \(v^{i}\) (expressed in the embedding space). To keep the index convention, we label the actual basis vectors with a lower index \(\ket{X_{\alpha}}\).

_images/Covector.svg

Fig. 2 Visual depiction of a vector \(\ket{a}\) (top) and a covector \(\bra{v}\) (middle). The bottom shows the covector \(\bra{v}\) acting on the vector \(\ket{a}\) to give \(\braket{v|a} = 2\). Notice that the vector would pierce the stacked covector exactly two times.#

Covectors (Forms)#

In addition to vectors, we also have covectors or 1-forms: these are linear operators which take vectors into numbers, and should be thought of as the manifold generalization of row vectors \(\bra{v}\):

\[\begin{gather*} \bra{v}\bigl(a\ket{a} + b\ket{b}\bigr) = a\braket{v|a} + b\braket{v|b} \in \mathbb{R}. \end{gather*}\]

Given a basis of covectors \(\{\bra{X^{\alpha}}\}\), we can express a covector \(\bra{v}\) as

\[\begin{gather*} \bra{v} = V_\alpha\bra{X^\alpha}, \qquad \ket{v} = \ket{X_\alpha}V^\alpha. \end{gather*}\]

Here we use lower indices for the coefficients \(V_\alpha\) which called covariant in contradistinction to the contravariant \(V^\alpha\) with raised indices that describe the vector (or contravariant vector) \(\ket{v}\):

Important

Without a metric, there is no way to relate vectors and covectors. Geometrically they are different types of objects. (See Fig. 2.) Thus, the contravariant coefficients \(V^{\alpha}\) of the vector bear no relationship with the covariant coefficients \(V_\alpha\) of the covector.

We can choose the basis of covectors \(\{\bra{X^{\alpha}}\}\) to be dual to the basis of vectors \(\{\ket{X_{\alpha}}\}\):

\[\begin{gather*} \braket{X^{\alpha}|X_{\beta}} = \delta^{\alpha}_{\beta},\qquad \braket{v|g} = V_{\alpha}\braket{X^\alpha|X_\beta}G^{\beta} = V_{\alpha}G^{\alpha}. \end{gather*}\]

For this to be invariant under coordinate transforms, the covariant coefficients \(V_{\alpha}\) and dual basis vectors must transform with the inverse Jacobian \(\mat{J}^{-1}\):

\[\begin{gather*} v_{i} = V_{\alpha}[\mat{J}^{-1}]^{\alpha}{}_{i} = V_{\alpha}\pdiff{X^{\alpha}}{x^{i}}, \qquad \bra{X^{\alpha}} = [\mat{J}^{-1}]^{\alpha}{}_{i}\bra{\hat{e}^{i}}. \end{gather*}\]

Completeness#

Armed with these dual bases, we can express the identity operator as

\[\begin{gather*} \op{1} = \ket{X_\alpha}\bra{X^\alpha}, \qquad \op{1}\ket{v} = \ket{v}, \qquad \bra{v}\op{1} = \bra{v}. \end{gather*}\]

Transformations#

The key property of vectors and covectors, and the defining properties of tensors, is that their components transform linearly under coordinate transformation:

\[\begin{gather*} A^{\alpha} \rightarrow \Lambda^{\alpha}{}_{\beta}A^{\beta},\qquad B_{\alpha} \rightarrow \Lambda_{\alpha}{}^{\beta}B_{\beta}. \end{gather*}\]

Invariance of the action of forms on vectors implies

\[\begin{gather*} \braket{b|a} = B_{\alpha}A^{\alpha} \rightarrow B_{\alpha}A^{\alpha} = B_{\beta}\underbrace{\Lambda_{\alpha}{}^{\beta} \Lambda^{\alpha}{}_{\gamma}}_{\delta^{\beta}_{\gamma}}A^{\gamma} \implies \Lambda_{\alpha}{}^{\beta}\Lambda^{\alpha}{}_{\gamma} = \delta^{\beta}_{\gamma}. \end{gather*}\]

In matrix form:

\[\begin{gather*} \Lambda^{\alpha}{}_{\gamma} = [\mat{\Lambda}]^{\alpha}{}_{\gamma}, \qquad \Lambda_{\alpha}{}^{\beta} = [(\mat{\Lambda}^{-1})^T]_{\alpha}{}^{\beta}. \end{gather*}\]

If you have a collection of numbers that does not transform appropriately, then it cannot be considered to be a tensor.

Tensors#

Let \(\mathcal{V}\) denote a vector space (i.e., the space of tangent vectors at a fixed point in the manifold), and let \(\mathcal{V}^*\) denote the dual space of covectors. A tensor \(\op{T}^{r}_{s}\) in \(\mathcal{T}^{r}_{s}(\mathcal{V})\) on \(\mathcal{V}\) is a multilinear map

\[\begin{gather*} \op{T}^{r}_{s}: \underbrace{\mathcal{V}^{*}\times\mathcal{V}^{*}\times\cdots\times\mathcal{V}^{*}}_{r} \times \underbrace{\mathcal{V}\times\mathcal{V}\times\cdots\times\mathcal{V}}_{s} \rightarrow \mathbb{R}. \end{gather*}\]

Thus, covectors are tensors in \(\mathcal{T}_{1}^{0}(\mathcal{V})\) since they take vectors into numbers, and vectors are tensors in \(\mathcal{T}^{1}_{0}(\mathcal{V})\) since they take covectors into numbers. Given a basis \(\ket{X_{\alpha}}\) for \(\mathcal{V}\) and a basis \(\bra{X^{\alpha}}\) for \(\mathcal{V}^*\), the components of the tensor are

\[\begin{gather*} A^{\alpha_1\dots\alpha_r}_{\beta_1\dots\beta_s} = \op{A}\Bigl(\bra{X^{\alpha_1}},\dots,\bra{X^{\alpha_r}}, \ket{X^{\beta_1}},\dots,\ket{X^{\beta_s}}\Bigr), \end{gather*}\]

and we can write

\[\begin{gather*} \op{A} = A^{\alpha_1\dots\alpha_r}_{\beta_1\dots\beta_s} \underbrace{ \ket{X_{\alpha_1}}\otimes\dots\otimes\ket{X_{\alpha_r}}\otimes \bra{X^{\beta_1}}\otimes\dots\otimes\bra{X^{\beta_s}} }_{\ket{X_{\alpha_1}}\dots\ket{X_{\alpha_r}} \bra{X^{\beta_1}}\dots\bra{X^{\beta_s}} }. \end{gather*}\]

Metric Tensor#

A metric tensor (or metric) \(\op{g}\in \mathcal{T}^{0}_{2}(\mathcal{V})\) acts as an inner product on the vector space: it is bilinear operation that takes two vectors into a real number, satisfying all of the properties of an inner product (a.k.a. dot product):

\[\begin{gather*} \op{g}\bigl(\ket{a}, \ket{b}\bigr) = \braket{a|b}. \end{gather*}\]

Thus, given a vector \(\ket{v}\), the metric defines a linear operator that acts on vectors returning numbers. In other words, it maps the vector \(\ket{v}\) to a covector \(\bra{v}\):

\[\begin{gather*} \bra{v} \equiv \op{g}\bigl(\ket{v}, \cdot\bigr), \qquad \braket{v|a} = \op{g}\bigl(\ket{v}, \ket{a}\bigr). \end{gather*}\]

Note that the metric may be different at different points in the manifold: thus we should think of metric tensor field. If the manifold is not embedded in another space, then the metric provides all of the information about the manifold’s structure.

If the manifold is embedded, however, then we can use the Jacobian matrix \(\mat{J}\) to induce the metric from the embedding because the embedding space \(\mathbb{R}^n\) has a nature metric – the Euclidean metric – which we can use to induce the metric on the manifold.

Consider the dot product of two vectors \(\ket{a}\) and \(\ket{b}\):

\[\begin{gather*} \ket{a} = \ket{X_\alpha}A^\alpha = \uket{e_i}J^{i}{}_{\alpha}A^{\alpha} = \uket{e_i}a^{i},\\ \ket{b} = \ket{X_\alpha}B^\alpha = \uket{e_i}J^{i}{}_{\alpha}B^{\alpha} = \uket{e_i}b^{i}. \end{gather*}\]

The inner product can now be induced from the embedding space:

\[\begin{gather*} \braket{a|b} = a^{i}\delta_{ij}b^{j} = a_{i}b^{i} = \Bigl(\begin{matrix} a^1 & \cdots & a^n \end{matrix}\Bigr) \begin{pmatrix} b^1 \\ \vdots \\ b^n \end{pmatrix}. \end{gather*}\]

Note that the metric in the embedding space is just the identity \(\delta_{ij}\), so the covariant components equal the contravariant components: \(a_i = a^i\). This is generally not true in the chart \(A_\alpha \neq A^\alpha\). Equating the inner product in the chart and the embedding space, we have

\[\begin{gather*} \braket{a|b} = \overbrace{a_{i}}^{J_{i\alpha}A^{\alpha}}\underbrace{b^{i}}_{J^{i}{}_{\alpha}B^\alpha} = A^{\alpha}\underbrace{J_{i\alpha}J^{i}{}_{\beta}}_{g_{\alpha\beta}}B^{\beta}. \end{gather*}\]

Thus, we have

\[\begin{gather*} \braket{a|b} = A^{\alpha}g_{\alpha\beta}B^{\beta} \quad \text{where} \quad g_{\alpha\beta} = J_{i\alpha}J^{i}{}_{\beta} = [\mat{J}^T\mat{J}]_{\alpha\beta}. \end{gather*}\]

Note that \(g_{\alpha\beta} = g_{\beta\alpha}\) is symmetric.

Stated more directly

\[\begin{gather*} g_{\alpha\beta}= \op{g}\Bigl(\overbrace{\ket{X_{\alpha}}}^{\ket{\hat{e}_{i}}J^{i}{}_{\alpha}}, \overbrace{\ket{X_{\beta}}}^{\ket{\hat{e}_{j}}J^{j}{}_{\beta}}\Bigr) = J^{i}{}_{\alpha}\;\overbrace{ \op{g}\Bigl(\ket{\hat{e}_{i}}, \ket{\hat{e}_{j}}\Bigr) }^{\braket{\hat{e}_i|\hat{e}_j}=\delta_{ij}} J^{j}{}_{\beta} = J^{i}{}_{\alpha}\delta_{ij}J^{j}{}_{\beta} = [\mat{J}^{T}\mat{J}]_{\alpha\beta}. \end{gather*}\]

Armed with these coefficients, we can compute the covariant coefficients \(A_\alpha\):

\[\begin{gather*} A_\alpha = g_{\alpha \beta}A^\beta \end{gather*}\]

where we say that the metric lowers the index. Note that these are the coefficients of the covector \(\bra{a}\):

\[\begin{gather*} \bra{a} = A_\alpha\bra{X^\alpha} = A^{\alpha}g_{\alpha\beta}\bra{X^\beta}. \end{gather*}\]

With this notation, we can write the contraction

\[\begin{gather*} \braket{A|B} = A_\alpha B^\alpha. \end{gather*}\]

It is also useful to define a related set of numbers \(g^{\alpha\beta}\) which raise the index, taking covariant components back into contravariant components:

\[\begin{gather*} A^{\alpha} = g^{\alpha\beta}A_{\beta} = g^{\alpha\beta}g_{\beta \gamma}A^{\gamma}. \end{gather*}\]

This must be the matrix inverse of \(g_{\alpha\beta}\):

\[\begin{gather*} g^{\alpha\beta}g_{\beta \gamma} = \delta^{\alpha}_{\gamma}, \qquad g_{\alpha\beta} = [\mat{g}]_{\alpha\beta}, \qquad g^{\alpha\beta} = [\mat{g}^{-1}]_{\alpha\beta}. \end{gather*}\]

“The Metric” (Geodesics)#

How do we relate the metric tensor \(\op{g}\) with the idea of a metric that allows us to compute distances? This is expressed in terms of the line element \(\d{s}\):

\[\begin{gather*} \d{s}^2 = g_{\alpha\beta}\d{X}^{\alpha}\d{X}^{\beta}. \end{gather*}\]

One can then integrate a curve to obtain the distance between two points

\[\begin{gather*} \int_{\gamma(s)}\d{s}. \end{gather*}\]

A geodesic is a path \(\gamma(s)\) that minimizes the distance between two points. Using calculus of variations, we can derive the geodesic equations satisfied by such a path (see below).

Derivatives#

Complications arrive in the notion of derivatives because, by their nature, they compare the behavior of vectors in different spaces – at neighboring points. In the Euclidean embedded space, this is not an issue since all vectors live in the same space, but once we introduce coordinate transforms, there is no guarantee about how the tangent space at neighboring points are related.

Important

The idea of covariance is we consider only quantities that are well defined within the manifold, or embedding space. For example, we can consider quantities like the number returned by a form acting on a vector \(\braket{u|v}\) or the number returned by the metric \(\op{g}\bigl(\ket{u}, \ket{v}\bigr)\) (which is the same if the space has a metric to related \(\ket{u}\) and \(\bra{u}\).) These values should not change if we change our coordinate system: in other words, they are invariant under coordinate transformations.

In contrast, the “numbers” like \(V^{\alpha}\), which are the components of the vector in a particular chart, depend on the choice of coordinates – specifically through the Jacobian matrix \(J_{\alpha}{}^{i} = \partial_{\alpha}x^{i}\). Since physicists like to work with numbers, much attention is given to these coordinates, and how they transform – so much so that some books refer to the numbers themselves as tensors. The emphasis on how the components transform is so that physicists can ensure that they are forming invariant quantities by contracting indices in covariant-contravariant pairs: \(\braket{u|v} = U_{\alpha}V^{\alpha}\).

To ensure that you are studying physics rather than numerology, I recommend that you try to express all of your final results in terms of fundamental properties of the manifold – or at least in terms of properties computed in the embedding space. Of course, work with coordinates because they are useful, but don’t trust that something is coordinate invariant just because the indices are properly paired – make sure you understand what the quantity means physically.

To see how one might get into difficulty, consider a function \(f(x)\) on the manifold where \(x(X)\) depend on some chart coordinates \(X\). The gradient in the coordinate chart is

\[\begin{gather*} \pdiff{f}{X^{\alpha}} = \pdiff{f}{x^{i}}\pdiff{x^{i}}{X^{\alpha}} = \pdiff{f}{x^{i}}J^{i}{}_{\alpha}. \end{gather*}\]

We thus see that that coefficients of the gradient behave naturally as a covector

\[\begin{gather*} \pdiff{f}{x^{i}} = \pdiff{f}{X^{\alpha}}[\mat{J}^{-1}]^{\alpha}{}_{i}. \end{gather*}\]

Notation

This motivates the following notation to emphasize the covariant nature of derivatives.

\[\begin{gather*} \pdiff{}{X^{\alpha}} = \partial_{\alpha}, \qquad \pdiff{}{X_{\alpha}} = \partial^{\alpha} = g^{\alpha\beta}\partial_{\beta}. \end{gather*}\]

We will use this notation going forward:

We can now consider various derivative operations constructed from

\[\begin{gather*} \bra{\nabla} = \bra{X^{\alpha}}\partial_{\alpha}, \qquad \ket{\nabla} = \ket{X_{\alpha}}\partial^{\alpha}. \end{gather*}\]

Unless specified otherwise, the derivative should act to the right.

Warning

If you carefully use \(\bra{\nabla}\) and \(\ket{\nabla}\) as we do below, you will obtain properly invariant quantities, but you can run into difficulty if you blindly use \(\partial_{\alpha}\). For example, you might think that

\[\begin{gather*} \partial_{\alpha}V^{\beta} \end{gather*}\]

are the components of a tensor in \(\mathcal{T}^{1}_{1}\), but it is not. In particular,

\[\begin{gather*} \partial_{\alpha}V^{\alpha} \neq \braket{\nabla|v} \end{gather*}\]

is not the proper divergence in general, and is not invariant under coordinate transforms.

Christoffel Symbols#

In general, the basis vectors \(\ket{X_j}\) for the chart might change as one moves along the manifold. This change can re-expressed in terms of the basis:

\[\begin{gather*} \partial_{\beta}\ket{X_\alpha} = \ket{X_\gamma}\Gamma^{\gamma}_{\alpha\beta}, \qquad \Gamma^{\gamma}_{\alpha\beta} = \bra{X^\gamma}\partial_{\beta}\ket{X_\alpha} = g^{\gamma\delta}\underbrace{\bra{X_{\delta}} \partial_{\beta}\ket{X_\alpha}}_{[\alpha\beta,\delta]}. \end{gather*}\]

The coefficient \([\alpha\beta,\gamma]\) are known as the Christoffel symbols of the first kind and \(\Gamma^{\gamma}_{\alpha\beta}\) are known as the Christoffel symbols of the second kind. They can be computed by differentiating the basis vectors:

\[\begin{gather*} \partial_{\alpha}g_{\beta \gamma} + \partial_{\beta}g_{\alpha \gamma} - \partial_{\gamma}g_{\alpha\beta} = 2[\alpha \beta, \gamma],\\ [\alpha \beta, \gamma] = \frac{1}{2}\left( \partial_{\alpha}g_{\beta\gamma} + \partial_{\beta}g_{\alpha\gamma} - \partial_{\gamma} g_{\alpha\beta} \right),\\ \Gamma^{\gamma}_{\alpha\beta} = \frac{g^{\gamma\delta}}{2}\left( \partial_{\alpha}g_{\beta\delta} + \partial_{\beta}g_{\alpha\delta} - \partial_{\delta}g_{\alpha\beta} \right). \end{gather*}\]

These formulae should make it clear that, for the tangent space, \(\Gamma^{\gamma}_{\alpha\beta} = \Gamma^{\gamma}_{\beta\alpha}\) is symmetric in the lower indices.

Important

Although we use index notation, \(\Gamma\) is not a tensor. To see this, note that in Euclidean coordinates, \(g_{ij} = \delta_{ij}\) is constant, so \(\Gamma^{\gamma}_{\alpha\beta} = 0\). If this were to transform as a tensor by multipliction with Jacobian factors, then it would have to be zero in all frames, which is clearly not.

Geodesics#

The geodesic equations can be found by minimizing the “energy” functional

\[\begin{gather*} E(\gamma) = \frac{1}{2}\int_{a}^{b} \norm{\dot{\gamma}}_g^2\d{t} \end{gather*}\]

where \(\gamma(t)\) is a path from \(a\) and \(b\) whose parameterization is chosen so that the speed is constant over the whole path:

\[\begin{gather*} \norm{\dot{\gamma}}_{g} = \frac{L(\gamma)}{b-a}, \qquad L(\gamma) = \int_{a}^{b} \norm{\dot{\gamma}}_{g} \d{t}, \end{gather*}\]

where \(L(\gamma)\) is the length of the path. Recall that the path is described by the coordinates \(X^i(t)\), so

\[\begin{gather*} \dot{\gamma} = \ket{X^{i}}\diff{X^i}{t}, \qquad \norm{\dot{\gamma}}_{g}^2 = g_{ij}\diff{X^i}{t}\diff{X^j}{t}. \end{gather*}\]

The energy can be minimized using calculus of variations, and which gives the geodesic equation

\[\begin{gather*} \diff[2]{X^k}{t} + \Gamma^{k}_{ij}\diff{X^i}{t}\diff{X^j}{t} = 0. \end{gather*}\]

The same Christoffel symbols (of the second kind) enter.

Gradient#

The gradient of a function is straightforward:

\[\begin{gather*} \bra{\nabla}\psi = (\partial_{\alpha}\psi)\bra{X^{\alpha}}. \end{gather*}\]

Note that this is naturally a form (covector) since the gradient of a function takes a vector (curve) and computes the directional derivative of the function along that direction:

\[\begin{gather*} \ket{v} = \ket{X_{\beta}}\dot{X}^{\beta}, \\ \bigl(\bra{\nabla}\psi\bigr)\ket{v} = (\partial_{\alpha}\psi) \underbrace{\braket{X^{\alpha}|X_{\beta}}}_{\delta^{\alpha}_{\beta}} \dot{X}^{\beta} = \dot{X}^{\alpha}\partial_{\alpha}\psi = \nabla_{\ket{v}}\psi. \end{gather*}\]

Furthermore, note that, for this to be invariant, the gradient must act as a covariant tensor, hence justifying the compact notation with a lower index.

Covariant Derivative#

Differentiating a vector \(\ket{v} = \ket{X_\alpha}V^{\alpha}\) is more complicated because the basis elements may also change, as described by the Christoffel symbols. We start by considering the following, where the where the derivative acts to the left:

\[\begin{gather*} \ket{v}\bra{\overset{\leftharpoonup}{\nabla}} = \partial_{\alpha}(\ket{X_{\beta}}V^{\beta})\bra{X^{\alpha}} = \bigl( V^{\beta}\partial_{\alpha}\ket{X_{\beta}} + \ket{X_{\beta}}\partial_{\alpha}V^{\beta}\bigr)\bra{X^{\alpha}}. \end{gather*}\]

Inserting the identity \(\op{1} = \ket{X_\alpha}\bra{X^{\alpha}}\) we recover the Christoffel symbols:

\[\begin{gather*} \ket{v}\bra{\overset{\leftharpoonup}{\nabla}} = \bigl( V^{\beta}\ket{X_\gamma}\underbrace{\braket{X^{\gamma}|\partial_{\alpha}|X_{\beta}}} _{\Gamma^{\gamma}_{\beta\alpha}} + \ket{X_{\beta}}\partial_{\alpha}V^{\beta}\bigr)\bra{X^{\alpha}}\\ = \ket{X_\gamma} \underbrace{\bigl( \partial_{\alpha}V^{\gamma} + V^{\beta}\Gamma^{\gamma}_{\beta\alpha}\bigr) }_{V^{\gamma}_{;\alpha}}\bra{X^{\alpha}}. \end{gather*}\]

This is the covariant derivative of a vector:

\[\begin{gather*} V^{\gamma}_{;\alpha} = \partial_{\alpha}V^{\gamma} + V^{\beta}\Gamma^{\gamma}_{\beta\alpha}. \end{gather*}\]

Unlike \(\partial_{\beta}V^{\alpha} \equiv V^{\alpha}_{,\beta}\), this transforms properly and deserves the shorthand notation.

Divergence#

We can also act the form on the vector to obtain the divergence:

\[\begin{align*} \braket{\nabla|v} &= \bra{X^{\beta}}\partial_{\beta}(\ket{X_{\alpha}}V^{\alpha}) = \underbrace{(\braket{X^{\beta}|\partial_{\beta}|X_{\alpha}})} _{\Gamma^{\beta}_{\alpha\beta}}V^{\alpha} + \underbrace{\braket{X^{\beta}|X_{\alpha}}} _{\delta^{\alpha}_{\beta}} \partial_{\beta}V^{\alpha}\\ &= \partial_{\alpha}V^{\alpha} + \Gamma^{{\beta}}_{\beta\alpha}V^{\alpha} = V^{\alpha}_{;\alpha}. \end{align*}\]

The contraction on the two indices of the Christoffel symbol has a nice form in terms of the determinant of the metric

\[\begin{gather*} \Gamma^{\beta}_{\beta\alpha} = \frac{1}{\sqrt{\det(g)}}\pdiff{\sqrt{\det(g)}}{X^{\alpha}}. \end{gather*}\]

where \(\det(g)\) is the determinant of \(g_{\alpha\beta}\):

\[\begin{gather*} \braket{\nabla|v} = \frac{\partial_{\alpha}\left(\sqrt{\det(g)} V^{\alpha}\right)} {\sqrt{\det(g)}}. \end{gather*}\]

Laplacian#

Finally, the Laplacian can be computed by taking the divergence of the gradient after first using inverse metric to convert the gradient to a vector:

\[\begin{gather*} \bra{\nabla}\psi = \bra{\nabla\psi} = (\partial_{\alpha}\psi)\bra{X^{\alpha}},\\ \ket{\nabla\psi} = \ket{X_{\alpha}}\partial^{\alpha}\psi = \ket{X_{\alpha}}g^{\alpha\beta}\partial_{\beta}\psi\\ \nabla^2\psi = \braket{\nabla|\nabla\psi} = \frac{\partial_{\alpha}\left(\sqrt{\det(g)}\;\partial^{\alpha}\psi\right)} {\sqrt{\det(g)}}. \end{gather*}\]

Expanded fully, this gives:

\[\begin{gather*} \nabla^2\psi = \frac{\partial_{\alpha}\left(\sqrt{\det(g)}\;g^{\alpha\beta}\partial_{\beta}\psi\right)} {\sqrt{\det(g)}}. \end{gather*}\]