Chapter 4: Symmetry in Quantum Mechanics


Summary of this chapter in "Modern Quantum Mechanics" (J.J. Sakurai), by Patrick Van Esch
Last revision November, 22, 2003.

There are some brilliant parts in this chapter (for example, the part on time inversion), and there are some extremely poor explanations (the lattice symmetry).

1. Symmetries, conservation laws and degeneracies.

Symmetries in Classical Physics

Symmetry in classical physics is here displayed in the case we have managed to have one of the configuration space coordinates to be the symmetry, as given in (4.1.1).  Although this may seem to be a very special case, it isn't, because any continuous symmetry can be formatted in that way using canonical transformations (at least, in principle !).  This is probably the most simplistic derivation of Noether's theorem that exists!  In essence, it states that to any single-parameter symmetry of the dynamics, corresponds a conserved quantity during the time evolution.

Symmetry in Quantum Mechanics

A symmetry of a quantum system, or a candidate symmetry, depending on a single parameter, corresponds to  a unitary transformation of Hilbert space.  (well, we'll see that there  is also another possibility called an anti-unitary transformation).  It is pointed out that to any continuous parametrisation of a unitary transformation (we're considering a one-parameter family of unitary transformations), we can associate a Hermitian generator of the family, which can be extracted by looking at an infinitesimal value of the parameter.  Although we can play that game with just any one-parameter family of unitary transformations, the name of symmetry is deserved in the case the generator commutes with the Hamiltonian.  In that case, the generator corresponds to an observable which is a constant of motion.  Note that the infinitesimal transformation determined by the generator determines the entire parametrized set of transformations by exponentiation.

Degeneracies

A very basic observation is made: if a system has a certain symmetry, then this doesn't mean that every "solution", for example, energy eigenstate, of that system, possesses that symmetry: only the set of solutions has to posses the symmetry.  In the case of energy eigenkets, if there is more than one (up to a factor) eigenket for a given value, we call that energy value degenerate, and we have a linear subspace of kets that are eigenkets of this value.  This means that the symmetry operator applied to any ket of this set has to be (another or the same) ket of this set.  The example of rotational symmetry and the set of spin-l eigenkets is given.

2. Discrete symmetries, parity or space inversion

Apart from symmetries related to a continuous parameter (such as rotations, translations etc..), there can also be a set of discrete symmetries, which belong to a finite, or a countable set of operations.  In this section, we study the parity or space inversion operation.    We start to get used to the way of implementing a symmetry in quantum mechanics: to the abstract idea of symmetry corresponds a unitary operator acting in Hilbert space, and we have to find it.
As was the case with other symmetry operators, like translation and rotation, we have to start from the interpretation of certain observables, and define what we mean by the symmetry operation we want to implement, and what it should do on expectation values.  That gives us then mathematical contraints on the form the unitary operator corresponding to the symmetry.  If we have enough constraints, it is uniquely specified, but usually we need to put in some extra ad hoc "phase conventions".  This can be simple or tricky !
In the case of a Hilbert space where a position operator for a particle is defined (so our quantum system is a spinless single particle in 3-dim Euclidean space) we say that the action of parity changes the sign of the expectation value of the position operator for any state.  From this can be deduced equation (4.2.4).  But that doesn't fix the operator pi completely.  Indeed, the best we can do is equation (4.2.5), but that still leaves an arbitrary phase factor to choose.  There is then an extra convention, that this phase factor equals one.  From that follows that pi-squared equals one, and that the eigenvalues of pi can only be 1 or -1.  Note how tricky this is!  A basic property, namely (4.2.7), follows from what seems to be a convention. The reason is of course that we take it to be fundamental for parity to be such that twice applying a parity operation brings us back to the original state.  This is a stronger statement than rotation over 360 degrees for spin-1/2 particles, where we were back to the original state with a minus sign, but that's because for spin-1/2 particles, we were mathematically forced to do so (representation of SU(2) is simply that way).
From the known properties of parity working on position, is deduced how parity behaves with momentum.  The defining property of momentum, namely that it is the generator for space translations, is used to find this out. 
The effect of parity on angular momentum is worked out for orbital angular momentum from the properties of position and momentum, but that has nothing fundamental.  For angular momentum in general, it is postulated that parity should commute with the rotation operator (and hence with angular momentum).  Note that this is extra input, because we cannot use the properties deduced from what happens to a featureless point particle to uniquely fix what should happen to particles with spin.

Parity is a discrete symmetry that gets its defining action from something that happens in 3-dim Euclidean space, just as rotations (but that was a continuous symmetry).  We can hence consider the combined group of symmetries, with rotations and parity.  It turns out that quantities that behaved in a vectorial or scalar way under rotations, now can have an additional property: namely they can be odd, or they can be even under parity.  This is what makes the difference between scalars, and pseudoscalars, and vectors and pseudovectors.

Wave Functions under Parity

The wave function is nothing else but the representation of kets in the position basis.  Parity eigenkets then have even or odd wavefunctions.  The parity of angular momentum eigenkets is discussed.
A theorem (which will come back under many different forms) is stated for parity.  It is a variant of a more general theorem which states that non-degenerate eigenkets of a hamiltonian that satisfies a certain symmetry (that commutes with a certain symmetry operator) are also eigenkets of the symmetry.  In this specific case, the symmetry is the parity operator.
We already pointed out that if there is degeneracy, then this theorem doesn't hold, and only the set of eigenkets (and not the individual eigenkets) satisfies the symmetry.

Symmetrical double well potential

A very instructive example is qualitatively discussed, and shows how coupling (interaction in the broad sense) of two systems (the individual wells) leads to breaking of degeneracy (spitting of energy levels) on one hand, and gives an idea of perturbative interactions on the other hand.  Let's make this clearer.
If the individual wells are completely separated, then we have of course degeneracy: the particle can be in the left well, or in the right well, and these solutions are equivalent.  If there is interaction (the barrier is finite), then the energy eigenstates are not degenerate (but unlocalized).  If we work in localized states, then these are not stationary, and the oscillation frequency between both localized states is given by Bohr's formula (4.2.39).

Parity Selection Rule

It is quite simple to establish that the matrix element of the position operator between two parity eigenstates of same parity vanishes.  This is the parity extension of the Wigner-Eckart theorem if one wants to.  This property will play a role in radiative transitions, and will suppress certain radiative transitions.  As such, it gives rise to a selection rule (known before the advent of quantum mechanics) called Laporte's rule.

Parity non conservation

Parity is an important operation because it often is a symmetry.  Until 1956, it was thought that the fundamental interactions in nature were symmetric under parity  But then it was established that weak nuclear interactions are not symmetrical under parity.

3. Lattice translation as a discrete symmetry

A symmetry that is extremely important in solid-state physics, is a lattice symmetry.  In true solid-state physics, this has to be a 3-dimensional lattice of course, but we will limit ourselves here to a toy model in 1 dimension.  The hamiltonian is supposed to commute with translations over the lattice distance a. 
Then there comes a confusing discussion.  I think that Sakurai's aim is to deduce Bloch's theorem (for mathematicians: Flouquet's theorem).  The first part is understandable, when the ket labeled with n corresponds to a localized state and there is no "coupling" between adjacent lattice sites due to an infinite potential barrier.  The combination given in equation (4.3.6) and the fact that this is an eigenket of the translation operator over the lattice spacing, is fundamental.  Here, each ket with label n is to be understood as coming from a set of kets that obeys equation (4.3.5).  In this first part, this set is well-defined: it are the eigenkets of the Hamiltonian, given infinite barriers.  But what comes next is totally confused in my opinion: once you lower the barriers, what is now the definition of our set of kets with label n ?  Clearly, they are not eigenkets of the hamiltonian.  Also, they are not completely localized.  So what are they ?
Of course, one can try to understand what Sakurai was at: he was probably thinking of the state of an electron in a free atom at a lattice position, and taking |n) as that solution at lattice point n, and |n+1) as the same solution, but shifted to lattice point n+1.   Taking into account then the potentials from the other atoms, at other lattice positions, as a perturbation, is exactly what the tight binding method is about.  But personally, I'm really sorry he's confusing this specific technique in solid state physics with the more general property derivable from the lattice symmetry. 

If we want to progress, I think we have to see this tight binding approximation as an example.  We take, in equation 4.3.12, a general state which obeys the lattice symmetry.  He then derives, from that, Bloch's theorem, stated in 4.3.18.  We also know that, because of the commutation between the hamiltonian and the lattice translation operator, we can find a set of simultaneous eigenkets.  So we look for the energy eigenkets which take on the form specified in Bloch's theorem.  Now Bloch's theorem is interesting, because in it appears a "modulated" plane wave with wave number k, and this k will have a lot of properties that really make us think of an actual wave number and associated momentum, but that's not worked out here.

Going back to our example of tight binding, we can establish a kind of "dispersion relation" where we have the energy eigenvalue as a function of k.  It turns out that not all k-values show up: the values that do show up are called the Brillouin zone (and not, as Sakurai says, the energy level!).  Well, in fact, this is much more complicated in true 3-dim solid state physics !  The first Brillouin zone is the Wigner-Seitz primitive cell of the reciprocal lattice, defined as the geometrical place of points that can be reached from a reciprocal lattice point without crossing a single bragg plane.  The second brillouin zone is similar, except that you have to cross exactly one Bragg plane etc...  Reminder: the reciprocal space is the space of k-vectors (up to factors of 2 pi...).

Anyway, I think this section in Sakurai is a bit confusing, but the main result is that a lattice symmetry gives rise to eigenstates that take on the form specified by Bloch's theorem.

4. The time-reversal discrete symmetry

Contrary to the previous section, this one is in my opinion brilliant !
Time reversal is a very subtle symmetry, and plays in fact a much bigger role in quantum field theory than in NRQM, but it is, as a preparation, a very good idea to treat it here, and can be very illuminating for the rest.
As Sakurai explains, one should better use the term "reversal of motion".  The importance of this operation is explained in classical physics first.  For a potential depending on position, Newton's equation of motion satisfies the symmetry of reversal of motion.  Next, this symmetry is also explained to be valid in classical electromagnetism, although at first sight the magnetic part doesn't seem to obey this symmetry.  If the reversal of the currents generating the field is taken into account, however, it is clear that classical electrodynamics is symmetrical under reversal of motion.
Then, an important point is raised: when looking at the Schrodinger equation, it is seen that the time-reversed solution is not a solution, but its complex conjugate is.  So reversal of motion and complex conjugation seem to have something to do with eachother in quantum mechanics.  That's why we need the following discussion.

Digression on Symmetry Operations

The problem is tackled what form needs to take on the operator in Hilbert space that represents a symmetry.  Before delving into what can be a symmetry of a specific system, we should ask ourselves what are the formal isomorphisms within the Hilbert space that conserve the formalism of quantum mechanics.  By this, I mean: we have attached physical meanings to all the states in a Hilbert space: to the ket |x) corresponds the meaning Mx.  Now, the question is, what kind of transformation T on Hilbert space can be performed, such that we could associate this time T|x) with meaning Mx, and keep the same formalism.  A simple example is the multiplication of all kets with a phase factor exp(i a).  It is clear that we can do that, and no physics changes.  If you think about it, the only measurable thing that comes out of a quantum mechanical theory are probabilities, which are always taken to be absolute values of bracketsSo a transformation T that conserves the absolute value of brackets is ok.  We know that unitary transformations conserve values of the brackets, so these are already ok.  But that's too severe.  It turns out that also anti-unitary operators (which are not linear operators) conserve the absolute value of brackets.
So transformations T which are unitary or anti unitary possess the property that quantum mechanics as a physical theory gives the same predictions whether we associate |x) or T|x) to the meaning Mx.  But that doesn't mean that any such T is a symmetry of the theory (= the dynamics).  In order for an operator S to be a symmetry, we do the following.  Interpretation Mx is associated to ket |x), and interpretation Sx, corresponding to the symmetry applied to situation Mx, is associated to the ket S|x).  If S is a true symmetry of the dynamics, then the evolution of S|x) after a time t comes down to S of the situation which corresponds to the evolution of |x) after time t. 
Maybe a small discussion can be added here.  The candidate symmetries S here are NOT symmetries of the physical situation, in the sense that the physical situation Mx and the physical situation Sx are physically distinct situations (which, if S turns out to be a symmetry, have similar behaviour under the dynamics).  One shouldn't confuse this with something like gauge symmetry.  Here, in the formalism, we talk about different kets that correspond to exactly the same physical situation.  In fact, this is an abuse of the quantum formalism, because to each physical situation should only correspond one ray.

Next, it is worked out what is an antiunitary operator.  It is a bit subtle: in a given basis, an antiunitary operator can be represented by a unitary matrix and the operation "complex conjugation".  However, this splitting up is not geometrical, in that if we choose another basis, then the unitary matrix will not be the one of the unitary operator corresponding to the first unitary matrix after the coordinate transformation.  So there is no unitary operator associated to an antiunitary operator: depending on the choice of basis, it is different.  This makes it a headache to define what it means for an anti-unitary operator to act on a bra.  So we will always consider that it can only act on kets.

Time-Reversal Operator

It is first argued that in order to define an operator that implements motion reversal (usually called the time reversal operator), then this operator cannot be a linear operator.  So it has to be an antiunitary operator.  If time reversal is a symmetry of the dynamics, then equation (4.4.31) has to hold.  The important property (4.4.36) is worked out.  This allows us to relate an observable and its "time reversed" version.  Interesting operators have a certain behaviour under this transformation: they are even or odd.  For example, the momentum operator is expected to be odd under time reversal.  Position is even, and angular momentum is odd.

Wave function

If we take the phase convention (4.4.47), then it is worked out that the wave function of the time-reversed state is the complex conjugate of the original wave function.  This is in the case of a spinless particle.  As with parity, we have the theorem that the non-degenerate eigenstate of the hamiltonian should satisfy the symmetry ; in this case, this comes down to saying that its wave function is real.

Time reversal for a Spin-1/2 System

The aim is to find an explicit form of the time reversal operator in the 2-dim Hilbert space of a spin-1/2 particleAngular momentum should flip sign under time reversal.  So applying time reversal to the general spin-up ket in direction n will have to give the spin down ket in that direction, up to a phase factor.  If we now write the time reversal operator as a unitary matrix and a complex conjugation (we have to choose a basis for that to have a meaning, and we take the up and down ket for the z-direction as the basis) we find an expression (4.4.65), again, under the assumption that we are going to write it out in matrix notation in the basis of the spin along the z-axis.  The "symmetry breaking" appearing of S_y is only the result of the fact that we have chosen this basis.  As said before, the splitting-up of an antiunitary operator in a complex conjugation followed by a unitary matrix is not a geometrically well defined action.
A very remarkable property is derived: the square of the time reversal operator is -1 for a spin-1/2 system.  It is easily verified that it is +1 for a spinless system.  This is a similar weirdness as was the case when we rotated a spin-1/2 system over 360 degrees and also found -1.   In fact, both weirdnesses are related !
It is worked out what the time reversal operator does on spin-j systems.  In fact, the reasoning applied to spin-1/2 systems didn't really use the fact that it was spin-1/2 up to the last equal sign in 4.4.65, so we have equation 4.4.73 in all generality, and in equation 4.4.75 we note the link between the -1 in rotation over 360 degrees for half integer spins, and the -1 sign in the square of the time reversal operator.  All this is independent of the phase convention (eta) one takes, because that phase is (due to the antilinearity) taken in absolute square. 
Some remarks on what is a good choice for eta closes this section, together with some properties of tensor operators that have a definite parity under time reversal.

Interactions with Electric and Magnetic Fields: Kramers Degeneracy

Because of the fact that the time reversal operator is anti-unitary, the fact that the Hamiltonian is invariant under time reversal doesn't mean that there is a conserved quantity, as is usually the case with such symmetries.
However, the property remains that the set of eigenkets belonging to an energy eigenvalue has to be symmetric under time reversal.  It is then worked out that in the case of half-integer spin, the fact that the time reversal operator squared equals -1 leads to the conclusion that an energy eigenket, and its time reversal, cannot be related by a simple phase factor (and so have to span a 2-dim space).  So half-integer spin systems with a time reversal symmetry have to have degenerate energy eigenvalues.  That's Kramer's degeneracy.

Chapter overview

In this chapter, discrete changes which can turn out to be (or not) symmetries of the dynamics, are discussed.  The relationship between the physical meaning of the change, and  the form it takes as an operator on Hilbert space, is worked out.  The main results concern parity (space reflection) and time reversal.  In order to introduce time reversal, we needed to discuss anti-unitary (and anti-linear) operators.

Back to main course page