Article snapshot taken from Wikipedia with creative commons attribution-sharealike license.
Give it a read and then ask your questions in the chat.
We can research this topic together.
"Tensor index notation" redirects here. For a summary of tensors in general, see Glossary of tensor theory.
In mathematics, Ricci calculus is the modern name for what used to be called the absolute differential calculus (the foundation of tensor calculus), developed by Gregorio Ricci-Curbastro in 1887–96 and subsequently popularized in a paper written with his pupil Tullio Levi-Civita at the turn of 1900.
This article summarizes the rules of index notation and manipulation for tensors and tensor fields.
Introduction
Tensors and tensor fields can be expressed in terms of their components (meaning scalar coefficients of a tensor basis), and operations on tensors and tensor fields can be expressed in terms of operations on their components. Basis-independent tensor fields and operations are the focus of the Ricci calculus, and the notation leverages this to allow compact expressions of such tensor fields and operations.
While much of the notation may be applied with any tensors, operations relating to a differential structure are only applicable to tensor fields. Where needed, the notation extends to components of non-tensors, particularly multidimensional arrays.
A tensor may be expressed as a linear sum of the tensor product of vector and covector basis elements. The resulting tensor components are labelled by indices of the basis. Each index has one possible value per dimension of the underlying vector space. The number of indices equals the order of the tensor.
For compactness and convenience, the notational convention implies certain things, notably that of summation over repeated indices and of universal quantification of free indices.
Expressions in the notation of the Ricci calculus may generally be interpreted as a set of simultaneous equations relating the components functions over a manifold, usually more specifically as functions of the coordinates on the manifold. This allows intuitive manipulation of expressions with familiarity of only a limited set of rules.
Where a distinction is to be made between the space-like basis elements and a time-like element, this is conventionally done through indices as follows:
The lowercase Latin alphabeta, b, c... is used to indicate restriction to 3-dimensional Euclidean space, which take values 1, 2, 3 for the spatial components, and the time-like element is shown separately.
The lowercase Greek alphabet α, β, γ... is used for 4-dimensional spacetime, which typically take values 0 for time components and 1, 2, 3 for the spatial components.
else in general mathematical contexts, any symbol can be used, independent of the dimension of the vector space.
Coordinate and index notation
The author(s) will usually make it clear whether a subscript is intended as an index or as a label.
For example, in 3-D Euclidean space and using Cartesian coordinates; the coordinate vectorA = (A1, A2, A3) = (Ax, Ay, Az) shows a direct correspondence between the subscripts 1, 2, 3 and the labels x, y, z. In the expression Ai, i is interpreted as an index ranging over the values 1, 2, 3, while the x, y, z subscripts are not indices, more like "names" for the components.
Reference to coordinate systems
Indices themselves may be labelled using diacritic-like symbols, such as a hat (^), bar (¯), tilde (), or prime (′)
If a tensor has a list of indices all raised or lowered, one shorthand is to use a capital letter for the list:
where I = i1i2 ... in and J = j1j2 ... jm.
Sequential summation
Two vertical bars | | around a set of indices (with a contraction):
denotes the summation in which each preceding index is counted up to (and not including) the value of the next index:
Only one group of the repeated set of indices has the vertical bars around them (the other contracted indices do not). More than one group can summed in this way:
where
This is useful to prevent over-counting in some summations, when tensors are symmetric or antisymmetric.
Alternatively, using the capital letter convention for multi-indices, an underarrow is placed underneath the block of indices:
By contracting an index with a non-singular metric tensor, the type of a tensor can be changed, converting a lower index to an upper index or vice versa:
and
The base symbol in many cases is retained (e.g. using A where B appears here), and when there is no ambiguity, repositioning an index may be taken to imply this operation.
General outlines for index notation and operations
Tensors are equal if and only if every corresponding component is equal, e.g. tensor A equals tensor B if and only if
for all α, β and γ. Consequently, there are facets of the notation that are useful in checking that an equation makes sense (an analogous procedure to dimensional analysis).
Indices not in contractions are called free indices.
Indices in contractions are termed dummy indices, or summation indices.
If a tensor equality has n free indices, and if the dimensionality of the underlying vector space is m, the equality represents m equations: each has a specific set of index values.
For instance, if
is in 4-dimensions (that is, each index runs from 0 to 3 or 1 to 4), then because there are three free indices (α, β, δ), there are 4 = 64 equations:
(followed by 61 more equations, each with various other choices of α, β, δ)...
This illustrates the compactness and efficiency of using index notation: many equations which all share a similar structure can be collected into one simple tensor equation.
Indices are replaceable labels
Replacing any index symbol throughout by another leaves the tensor equation unchanged (provided there is no conflict with other symbols already used). This can be useful when manipulating indices, such as using index notation to verify vector calculus identities or identities of the Kronecker delta and Levi-civita symbol (see also below). An example of a correct change is:
as for a possible invalid change:
In the first replacement, λ replaced α and γ replaced μ everywhere, so the expression still has the same meaning. In the second, λ did not fully replace α and similarly for γ and μ, and is entirely inconsistent for reasons shown next.
Indices are the same in every term
The same indices on each side of a tensor equation always appear in the same (upper or lower) position throughout every term, except for indices repeated in a term (which implies a summation over that index), for example:
as for a possible invalid expression:
In other words, non-repeated indices must be of the same type in every term of the equation. In the above identity α, β, δ line up throughout and γ occurs twice in one term due to a contraction (correctly once as an upper index and once as a lower index), so it's a valid as an expression. In the invalid expression, while β lines up, α and δ do not, and γ appears twice in one term (contraction) and once in another term, which is inconsistent.
Brackets and punctuation used once where implied
When applying a rule to a number of indices (differentiation, symmetrization etc., shown next), the bracket or punctuation symbols denoting the rules are only shown on one group of the indices to which they apply.
If the brackets enclose covariant indices - the rule applies only to all covariant indices enclosed in the brackets, not to any contravariant indices which happen to be placed intermediately between the brackets.
Similarly if brackets enclose contravariant indices - the rule applies only to all enclosed contravariant indices, not to intermediately placed covariant indices.
Round brackets ( ) around some or all indices denotes the symmetrized part of the tensor: the sum of the tensor components with those indices permuted, then divided by the number of permutations.
For two symmetrizing indices, there are two indices to sum over and permute:
as an example,
For three symmetrizing indices, there are three indices to sum over and permute:
For p symmetrizing indices – sum over all components with those indices permuted:
Square brackets around some or all indices denotes the antisymmetrized part of the tensor. To antisymmetrize, a total is formed with one term per permutation, components arising from an even permutation of the indices are added, while components arising from an odd permutation of the indices are subtracted, then the total is divided by the number of permutations.
For two antisymmetrizing indices:
For three antisymmetrizing indices:
as an example,
For p antisymmetrizing indices – sum over the permutations of those indices multiplied by the signature of the permutation, divided by the number of permutations:
where n is the dimensionality of the underlying vector space.
Symmetry and antisymmetry sum
Any tensor can be written as the sum of its symmetric and antisymmetric parts on two indices:
To indicate partial differentiation of a tensor field with respect to a coordinate variable , a comma is placed before an added lower index of the coordinate variable. This may repeated (without adding further commas):
These components do not transform covariantly. This derivative is characterized by the product rule and the derivatives of the coordinates
To indicate covariant differentiation of any tensor field, a semicolon ( ; ) or a forward slash ( / , less common) is placed before an added lower (covariant) index.
For a contravariant vector: where is a Christoffel symbol of the second kind.
For a covariant vector:
For an arbitrary tensor:
Defining the multi-indices (using the capital-letter convention above):
the covariant derivative can be written
and this is not the summation convention, so for this case the summation signs are explicitly stated. The components of this derivative of a tensor field transform covariantly, and hence form another tensor field. This derivative is characterized by the product rule and the fact that the derivative of the metric is zero:
The covariant formulation of the directional derivative of any tensor field along a vector may be expressed as its contraction with the covariant derivative, e.g.:
The Lie derivative is another derivative that is covariant, but which should not be confused with the covariant derivative. It is defined even in the absence of a metric. The Lie derivative of a type-(r,s) tensor field along (the flow of) a contravariant vector field may be expressed as
or defining the multi-indices
the Lie derivative can be written
analagous to the covariant derivative. This derivative is characterized by the product rule and the fact that the derivative of the given contravariant vector field is zero.
Given a tensor field and a basis (of linearly independent vector fields),
the coefficients of the tensor field in a basis can be determined by evaluating a suitable combination of the basis and dual basis, and inherits the correct indexing. Some examples are listed below.
Throughout, let ei be vector fields that constitute a vector basis at each point (a moving frame) for the tangent space, and ε be the dual basis (also called the cobasis) of the cotangent space, so that
A tensor T as a multilinear function of 1-forms α, β, γ... and vectors a, b, c... corresponds to a contraction between the components of the tensor and the input vectors and 1-forms, resulting in another tensor; the notational correspondence is:
Covariant tensor:
Contravariant tensor:
Mixed tensor:
hence the specific components of a tensor can be found by inserting the basis and dual basis vectors:
Covariant tensor:
Contravariant tensor:
Mixed tensor:
The tensor product of basis vectors gives the basis tensor elements, contracting with the components gives the tensor:
Covariant tensor:
Contravariant tensor:
Mixed tensor:
analogous to vectors and 1-forms presented above.
This kind of calculation, i.e. "algebraic", also applies for some geometric operations in which appear coefficients that are not tensorial, for instance:
when multiplied and contracted. The components are the same in any basis and form an invariant tensor of type (1,1), i.e. the identity of the tangent bundle over the identity mapping of the base manifold, and so its trace is an invariant.
The dimensionality of spacetime is its trace: