Chapter 3: Theory of Angular Momentum
Summary of this chapter in "Modern Quantum Mechanics" (J.J. Sakurai),
by Patrick Van Esch
Last revision November, 14, 2003.
1. Rotations and angular momentum commutation relations.
Finite versus Infinitesimal Rotations
It is pointed out that finite rotations around different axes do
not
commute in 3-dimensional Euclidean space. The finite rotation
matrix for an active rotation around the z-axis is written down in
equation (3.1.3), and the infinitesimal rotations around the z, x and y
axes are written down next (up to order 2). Manipulation of these
inifinitesimal rotations leads to a very important commutation relation
given in equation (3.1.9). Although this may seem bizarre, this
equation contains all the information needed about what we call
"rotations".
Infinitesimal rotations in Quantum Mechanics
In equation (3.1.10) a most important concept is introduced: to a
rotation in 3-dimensional Euclidean space must correspond an operator
(will turn out to be unitary) acting on the state kets that does to the
states what is considered a rotation in space. The same structure
as for translations in space and time evolution is used: an
infinitesimal change corresponds to a hermitian operator that is the
generator for the finite change. For translations in space, this
generator was momentum, for time evolution, it was the hamiltonian, and
we now define angular momentum as the generator of rotations in
3-dim
space. As in those other cases, this is just a matter of
definition, and the correspondence with classical variables will turn
out to be the case (or not ! In this case, orbital angular
momentum will, and spin angular momentum will not, correspond to the
classical concept). To go from the infinitesimal generator
(3.1.15) to the finite rotation operator, we use operator
exponentiation in (3.1.16). The rotation operator in Hilbert
space is a representation of the rotations in 3-dim Euclidean
space. Hence, the commutation relations for infinitesimal
operators should be respected as worked out in equation (3.1.18).
This leads to the commutation relations (3.1.20). They are called
the fundamental commutation relations of angular momentum
because they
specify completely the algebra of angular momentum.
2. Spin 1/2 systems and finite rotations
Rotation Operator for Spin 1/2
In the two-dimensional Hilbert space of spin-1/2 particles, the
operators defined in (3.2.1) satisfy the fundamental commutation
relations of angular momentum. If we postulate that they are the
angular momentum, then we have the rotation operators, which work out
to several interesting relations. Equation (3.2.6) confirms
for example the property we expect of a rotation operator.
However, equation (3.2.15) is a surprise: a rotation over 2 pi gives
minus the state !
Spin Precession Revisited
It is noted in equation (3.2.18) that the time evolution operator is
nothing else but a rotation about the axis of the field with the
rotation angle being proportional to time.
Neutron Interferometry Experiment to Study 2 pi rotations
In order to expose this sign flip of the state of a spin-1/2 particle
after a 2 pi rotation, neutron interferometry has been applied: we
compare the phase of a particle with the phase of the same particle
when it went through a magnetic field. It turns out that we need
indeed a rotation of 4 pi to get back to the same interference pattern,
confirming the sign flip after 2 pi.
Pauli two-component Formalism
The two-dimensional Hilbert space for spin-1/2 particles is written out
in matrix form. The elements are complex 2-tuples, and the Pauli
matrices (3.2.32) play an important role. Several properties of
these matrices are worked out.
Rotations in the two-component formalism
Taking S in the 2-component system as the angular momentum, the
rotation operator follows, as written down in equation (3.2.42).
Working this out in the form of a unit vector along the rotation axis,
together with the rotation angle, one obtains equation (3.2.44), which
is explicitly given in equation (3.2.45) as a 2x2 matrix.
It is argued that the expression in (3.2.47) behaves as a 3-vector
under rotations. Looking for the eigenvector with
eigenvalue +1 of the rotation operator we expect this to correspond to
the state with spin up in the rotation axis direction. Working
this out, one finds the result in (3.2.52).
3 O(3), SU(2) and Euler rotations
Orthogonal Group
The orthogonal group is the group represented by the 3x3 real,
orthogonal matrices. It's called O(3).
I'm a bit confused here: I
thought the group we're talking about is SO(3) ! What happens
with the disconnected piece ?
Unitary unimodular group
The unitary unimodular group is the group of unitary 2x2 matrices with
determinant 1. A correspondence between two complex numbers
satisfying the condition (3.3.8) and the group elements is
established. Next, these complex numbers are put into
correspondence with the explicit rotation matrix in the 2-dimensional
Hilbert space (which is also a unitary unimodular matrix in the basis
in which (3.2.45) was worked out. Working things backward, we can
associate a direction (in 3-dim Euclidean space) and a rotation angle
to this matrix (in 2-dim Hilbert space). The relations are worked
out in equation (3.3.10). This relationship between rotations in
Euclidean space in 3 dimensions, and two complex numbers, was in fact
already known long before any quantum mechanical treatment, in
classical mechanics. Interpreted as such, the two complex numbers
are called the Cayley-Klein parameters of the rotation.
The group
(as an abstract entity, whether we take them as 2x2 matrices,
Cayley-Klein parameters, or other) is called SU(2).
It is pointed out that there is a 2 to 1 relationship
Euler Rotations
An arbitrary rotation in Euclidean 3-dim space can be resolved in 3
rotations about special axes, and these rotations are called Euler
rotations. This is well known from classical mechanics (used a
lot in rigid body motion). However, there is no unique convention
about what exactly are Euler rotations ! In classical mechanics,
it is usually taken to be rotations around axes fixed to the rigid
body
we're rotating. In this part, it is shown that there is an
equivalence with rotations around axes fixed in space (the
coordinate
axes z and y). In fact, the rotation angles in the two cases are
the same, but the order is opposite.
One should carefully read what is written in this part to follow the
argument. Once formula (3.3.19) is established, writing a general
rotation (specified by "mechanical" Euler angles) as a product of
rotations around space-fixed axes, the representation of this property
by the rotation operators in Hilbert space is written down in
(3.3.20). This is easily translated in the case of 2-dim Hilbert
space, and results in the 2x2 matrix (3.3.21).
4. Density Operators and pure versus mixed ensembles
Polarized Versus Unpolarized beams
And now for something completely
different ! I have to say that I don't understand why this
important item has been set out in the "angular momentum" section, but
that doesn't really matter.
It is first explained that _every_ state in the 2-dimensional Hilbert
space corresponds to a preferred direction in 3-dim Euclidean space,
namely the one that has this state as an "up" state. If we want to say
that there "is no preferred direction" we don't have a ray in Hilbert
space corresponding to that. In fact, it turns out that we do not
have to take a bigger Hilbert space to do this, we need to take a
statistical ensemble of Hilbert space states to express this
idea. What is introduced here cannot be of more importance.
It turns out that what we called "states" up to now were special kinds
of systems, called "pure states". We are now considering
more
general kinds of states called "mixtures". At first sight,
mixtures are bunches of "pure states" mixed together with "classical
probability". No big deal, one would say. We have the same
in classical physics: individual points in phase space correspond to
pure states, but we can of course define a density function over phase
space, and then we have a statistical mixture. In a certain way,
what we are doing here is the quantum equivalent of that. But it
will turn out that these quantum mixtures are much more intertwined
with the quantum formalism than the classical notion of a density
function over phase space, and this for two reasons. First of
all, in the statistical interpretation (Copenhagen School, which we
take here) of quantum mechanics, we always need an ensemble of states
to be able to define a probability. In mixtures we have a
combination of quantum probabilities and statistical mixture
probabilities. But second, where a statistical mixture of
different classical states in phase space defined uniquely the weight
of each state in the mixture, in quantum mixtures we have the choice
! Mixing different Hilbert rays together with different
weights
can give rise to the same mixtures, which are indistinguishable in
every sense. So from the mixture we cannot go back to the precise
composition, as there are many different compositions giving the same
result. All this makes that "quantum mixtures" are much more
intertwined with quantum mechanics than is classical statistical
mechanics with classical physics.
Ensemble Averages and Density Operator
For starters, we consider classical ensembles of quantum states,
meaning: we consider states (rays in Hilbert space) with classical
probabilities which sum up to 1. If we take the ensemble
average
of any operator, we first calculate the expectation value of that
operator with each of the quantum states present in the ensemble, and
then weight it with the classical weight. That's expressed in
equation (3.4.6). Note that we can put as many or as few quantum
states in the ensemble as we want. The states do not have to be
independent, and hence we can have more states than dimensions of the
Hilbert space. The case of only one single state in the ensemble
(with probability 1) is then called a "pure state". And then
something remarkable happens: in equation (3.4.7), we work in an
explicit basis (the b kets), and it turns out that we can separate
the
part containing on one hand the classical probabilities and the Hilbert
state kets, and on the other hand the representation of the operator
we're taking the expectation of. The first part is
represented by
a construction, called the density operator, as given in
(3.4.8).
The formula for the expectation values then reduces to (3.4.10).
So all ensemble information is included in the density operator,
because all physical quantities we'll ever measure can be expressed as
expectation values of operators. So two ensembles with identical
density operators are in all respects physically equal. The fact
that total probability has to be equal to 1 is expressed by the trace
condition (3.4.11).
We stress again that the remarkable fact is that different compositions
of hilbert space vectors and weights can give rise to the same density
operator, and are hence physically equal situations.
Next one analyses what is the density operator of a pure state.
Some examples in the 2-dimensional hilbert space are given.
Time evolution of Ensembles
One can think of the evolution of a statistical ensemble of states if
one keeps the classical probabilities the same, and evolves the states
in time (Schroedinger picture). Of course, because different
mixtures give rise to the same density operator, we'd better find that
they evolve in the same way too ! But it turns out that the
evolution of the density operator can be written in equation (3.4.29)
in a way that is only dependent on the density operator and the
hamiltonian. So the evolution is not dependent on the precise
mixture we took to build the density operator. Note, that the
time evolution of the density operator looks a lot like the Heisenberg
equation of motion. But there are two things wrong: first we're
in the Schroedinger picture and second, there's this minus sign.
In fact, both remarks are equivalent if one thinks about it !
There's also a striking equivalence with the classical evolution
equation of the density function in phase space, as written in equation
(3.4.30).
Continuum generalizations
There's nothing special here, we already get used to the idea that when
we work in a continuous basis, sums become integrals, so the trace
becomes an integral too. In fact, most of the work of
mathematical physics is to make these heuristic formulations
rigorous. It then turns out that the results are in the
overwhelming majority the same as with the heuristic techniques used
here.
Quantum Statistical mechanics
The density operator opens the gate to statistical physics in quantum
mechanics. It actually turns out that quantum statistical physics
is a lot easier and more coherent than classical statistical
physics.
The first point is the introduction of a quantity which is the entropy
; a quite mysterious quantity in classical physics, it turns out to
have a very simple and precise definition in quantum physics: equation
(3.4.35). There's actually still a unit conversion factor needed,
called Boltzmann's constant, so entropy is given by equation
(3.4.41). In thermodynamical equilibrium, the density operator
doesn't change in time anymore (so we can diagonalise the density
operator and the Hamiltonian together). The internal energy is
the expectation value of the hamiltonian. We can now maximise
entropy with the constraint of constant internal energy (using a
langrange multiplier) and then we find the equation (3.4.48) for the
elements of the density matrix in the energy eigenbasis. This is
the canonical ensemble.
The partition function can be written as given in equation
(3.4.51). As it is known from statistical physics that from the
partition function one can deduce a lot of interesting thermodynamic
quantities.
All this is illlustrated by a simple example in 2-dim Hilbert space,
using spins subjected to a magnetic field and a finite temperature.
5 Eigenvalues and Eigenstates of Angular Momentum
This is a very calculational part. We only give a brief overview
of the ideas.
Commutation Relations and the Ladder Operators
It is worked out that J^2 commutes with the three components of angular
momentum, so we can work in a basis where both J^2 and one of the
components (Jz) are diagonal. Two helper operators are
introduced
in equation (3.5.5), and lots of interesting stuff is derived
concerning those.
Eigenvalues of J2 and Jz
It is first worked out that there must be kets which obey (3.5.17) and
(3.5.23). From these extremal states on, the whole finite set of
independent eigenstates of Jz is worked out, climbing the ladder.
The result is summarized in equations (3.5.33) and (3.5.34).
Matrix Elements of Angular Momentum Operators
In the above basis of J2 and Jz, these operators take on of course
quite simple representations (diagonal). Next, the
representations in that basis of the other angular momentum operators
is worked out.
Representations of the Rotation Operator
Once we have the representation of the angular momentum operators in a
certain basis, by exponentiation, we can find of course the rotation
operator representation that goes with it. This can be done using
polar representation of the rotation, or using Euler angles. The
case j=1 is worked out for the y-rotation.
6. Orbital Angular Momentum
Orbital angular momentum is defined by the vector product of
operators
r x p. This is a priori different from the generator of
rotations, which is the definition of angular momentum, but on
functions depending on position (wave functions in wave mechanics), it
turns out to be equivalent.
Orbital Angular Momentum as Rotation Generator
One can easily verify that the three operators of r x p satisfy the
commutation relations of angular momentum. That by itself is of
course not sufficient to be called "angular momentum", but when a state
is characterised only by a point in space (the basis functions
in wave
mechanics), then one shows,in equation (3.6.5), that these operators
_are_ the generators of what is reasonably taken to be a
rotation. We used the fact that p is the generator of translation
to find this out. Once again, this worked because our basis
states in Hilbert space were supposed to be completely defined by a
point in 3-dim Euclidean space.
Next, the same game is worked out in spherical coordinates, and one
finds representations in this basis for the 3 operators of orbital
angular momentum, as given in equations (3.6.9), (3.6.11) and
(3.6.12). Also the ladder operators are worked out (equation
3.6.13) and the L2 operator, in (3.6.14). It is then observed
that this L2 operator corresponds to the angular part of the Laplacian
operator in 3-dim vector analysis, expressed in spherical
coordinates. Because this is so funny, this is worked out in two
different ways.
Spherical Harmonics
If we can assume that the functions on which we operate (the wave
functions) factorise as written down in equation (3.6.22), which is
known to be the case if we have a spinless particle in a spherically
symmetrical potential, then the angular part always takes on the form
of spherical harmonics. An equivalence between the eigenstates
(abstract) of Lz and L2 as we worked them out above, and the spherical
harmonics is established. From the representation by spherical
harmonics of the eigenstates of Lz and L2 on one hand, and the explicit
representations of the L-operators in terms of spherical coordinates on
the other hand, one builds up the explicit functional expressions of
the spherical harmonics.
Because spherical harmonics have to be single valued functions in 3-dim
Euclidean space, it is next argued that only integer values of l
can
appear.
To give a short overview of what happened here: in the case of orbital
angular momentum - which only makes sense if we have a Hilbert space in
which there is a basis of position kets in 3-dim Euclidean space - and
its associated operators can be written in that basis ; actually even
in the angular part of that basis. The eigenkets of L2 and Lz
then become spherical harmonics, and the operators take on a
representation as worked out.
Spherical Harmonics as Rotation Matrices
The link between the rotation operator in Hilbert space, and spherical
harmonics is worked out here. Perspectives get complicated if one
doesn't pay attention. We want to write the rotation operator as
represented in the basis of the eigenstates of L2 and Lz. As
such, we don't care whether the original angular momentum is "orbital
angular momentum" or not. But in the case of orbital angular
momentum, we can use a trick to sneak in the 3-dim coordinates (which
normally don't have anything to do with the rotation operator in
Hilbert space). We can consider the rotation of the ket
representing the z-axis in 3-dim Euclidean space onto the ket
representing just any direction n. With Euler angles, this can
always be achieved as given in (3.6.47). Inserting a complete
set, and projecting onto an eigenket of L2 and Lz, we find a
relationship between two spherical harmonics (the l,m projections of
the kets corresponding to the z and the n direction respectively), and
elements of the rotation operator in the l,m basis. Given the
fact that the l,m representation of a rotation operator has nothing to
do with the origin of the angular momentum (whether this is coming from
orbital or other angular momentum), we find the universal relationship
in (3.6.52).
The calculation is not difficult, the subtlety resides in keeping
distinct what is valid in which case...
7. Addition of Angular Momenta
Simple Examples of Angular Momentum Addition
An implicitly introduced but very important concept here is the
direct
product of Hilbert spaces. For finite dimensional complex
vector
spaces, its construction is simple: take a basis in each space,
construct (set) product of the two bases, and build a new complex
vector space with as a basis this product. So if the first one is
n-dimensional, and the second is m-dimensional, the direct product
space is n.m dimensional. The basis vectors of this new space are
then noted as the "direct product" of the basis vector of the first and
the basis vector of the second space that gave rise to it. This
"direct product" notation is bilinear, in that you can write in general
a direct product of a vector of the first space, and a vector of the
second space. However, most elements in the product space cannot
be written in that way (but only as a superposition of vectors that can
be written that way). One shouldn't confuse the direct product
with the direct sum of two complex vector spaces. The direct
sum
can be handled in a similar matter, except that the basis of the new
space is now the union (and not the set product) of the two bases, so
the dimension of the direct sum space is n + m.
For infinite-dimensional spaces (although this is not rigorous), the
product space is given by the direct product of individual kets from
each of the spaces, assuming bilinearity, and we also consider all
possible linear superpositions of these products to be an element of
the product space.
When we consider the "whole" of two quantum systems, each of them
having their Hilbert space, then the Hilbert space of the "whole" is
the direct product of those hilbert spaces. If you think
about
it, this is the logical way to consider "taking the respective degrees
of freedom together".
All this to explain what's written down in equation (3.7.1): the
hilbert space of a featureless point particle in 3-dim Euclidean
space (spanned by the position eigenkets) is multiplied with the 2-dim
hilbert space of the spin state of a spin-1/2 particle, to give us the
hilbert space of a spin-1/2 particle in 3-dim euclidean space.
Product states are then a general ket of the hilbert space of
featureless point particles and a spin-1/2 ket. But the most
general ket in this space is not to be written as such a product state,
but as a linear superposition of such states. So the most general
state of a spin-1/2 particle is NOT a wave function in space and a
spin-1/2 state ! That's a very peculiar case.
Operators acting in one of the Hilbert spaces being a factor in the
direct product have their natural extension in the following way:
imagine we have an operator that acts on the first space. In that
case, we define its action on a product state as the product state
which is the product of the action of the original operator on the
first part, and simply a copy of the second part. Now, a general
ket of the product space cannot be written as a product state !
So we have to expand this general state into a superposition of product
states, and apply our natural extension of the operator to each of the
terms individually, making the superposition afterwards (which is no
problem because we're still dealing with a linear operator). This
is what one tries to explain in equation (3.7.3).
The other example worked out in this part is the composition of two
spin-1/2 systems. This is an interesting example, because it is
the easiest example of the addition of angular momentum, which will be
worked out in all generality in the next section.
Formal Theory of Angular Momentum Addition
One really needs to sit down, take pencil and paper and follow the
calculations, at least once in ones life, in order to understand
this. So we limit ourselves here to some very general
overview. The product of the Hilbert space of a spin-j1 system
and the Hilbert space of a spin-j2 system can have two natural
bases. We can work with bases, because the Hilbert spaces are
finite dimensional. One way of taking a basis is by the
product
states of the eigenstates of the angular momentum in each of the two
component spaces ; but the only thing that matters there is the
eigenvalues of J1z and J2z, because J1^2 and J2^2 are fixed (j1 and
j2). So we have a basis which is determined by m1 and m2, as
written in equations (3.7.27x). The other possible basis is
the
eigenvalues of J^2 and Jz, which are based on the sum angular momentum
(the sum of the natural extensions of the angular momenta in the two
component spaces). Here, J^2 is NOT fixed, so this basis is
determined by j and m, as written down in equations (3.7.30x).
We are looking for the unitary transformation from one basis into
the
other. All the elements of this unitary transformation can be
determined (up to a few arbitrary sign conventions) by using the
identies of the ladder operators of the sum being the sum of the ladder
operators, which gives recursion relations between the different basis
kets. The elements of these unitary transformation matrix are
called the Clebsch-Gordan coefficients.
Recursion Relations for the Clebsch-Gordan Coefficients
The fundamental (but trivial) relation (3.7.45), together with the
relations giving us the effect of a ladder operator on an angular
momentum base ket, are all we need to establish the Clebsch-Gordan
coefficients. They are written down explicitly in equation
(3.7.49). The nitty-gritty details should be worked out with
pencil and paper...
Clebsch-Gordan Coefficients and Rotation Matrices
Here, a new notion is introduced: the
direct product of two
operators. It is a natural extension of
the definition of the operator in a factor space to the whole product
space: the direct product of two operators, the first one acting on the
first factor space, and the second acting on the second factor space,
is the operator acting on product states such that the first part is
transformed under the first operator and the second part is transformed
under the second operator. For a general ket in the product
space, which is not a product state, but a superposition of product
states, we have to apply the rule to the composing product states in
the superposition.
A direct sum of operators,
acting in orthogonal subspaces of the final
space, is something else: a general vector of the final space can be
written in a unique way as a superposition of vectors in the different
subspaces. The direct sum operator (acting on the final space) is
then nothing else but the superposition of the actions of the original
operators, each on 'their' component in 'their' subspace.
It should now be clear that the
rotation operator in a direct product
space is the direct product of the rotation operators in the individual
spaces (think about it). On the other hand, we now also
know that
this direct product operator is reducible (as a representation of the
rotation group), and hence can be put in block-diagonal form using a
unitary transformation, which is nothing else but what is written down
in equation (3.7.68). This unitary transformation is of course again
nothing else but the transformation containing the Clebsch-Gordan
coefficients ! The similarity transformation is written out
componentwise in equation (3.7.69), and is called the Clebsch-Gordan
series.
This is then applied for the case where we can write elements of the
rotation matrix as spherical harmonics.
8. Schwinger's oscillator model of Angular Momentum
Angular Momentum and Uncoupled Oscillators
A very remarkable model is worked out here: two uncoupled harmonic
oscillators (+ and -, for a name) are shown to span a Hilbert space
that can also be seen as the direct
sum of one copy of each spin l
(integer and half integer). The link is given in equations
(3.8.8) concerning the main operators, and (3.8.14) concerning the
basis kets. Equation (3.8.18) then gives the normalized basis
kets of the angular momentum system as a function of the "vacuum" ket.
Explicit Formula for Rotation Matrices
The only non-trivial rotation operator, when considering Euler angles,
is given by the rotation around the y-axis, so if we can find a general
expression for that operator in the basis of angular momentum, we have
an explicit expression for just any rotation operator in that basis
(and solved the problem of the representation of the rotation group in
arbitrary dimensions). One starts by applying the rotation
operator on both sides of equation (3.8.18) and inserting the identity
(rotation and its inverse) in between each operator product.
Applying the Bakers-Haussdorff lemma to the expressions in between
brackets, we have the result in (3.8.25) and (3.8.26), and using the
binomial theorem to write out the powers of these expressions needed,
one arrives at the expression (3.8.29). Although quite involved,
it gives us an explicit expression
for the rotated ket j,m as a
function of the creation and annihilation operators in Schwinger's
model applied to the vacuum. The last thing to do is to
write
these creation and annihilation operators applied to the vacuum as
basis kets j,m. That's done and the result is Wigner's formula
(3.8.33), which gives the explicit expression for the rotation matrix
for arbitrary spin (rotation around y).
It is probably not clear immediately what is the power of Schwinger's
model here. In fact, using the creation and annihilation of
different harmonic oscillators (here we used two of them) is such a
generic scheme, that it can be used to model almost any quantum system
; in fact it may almost be taken as the definition of quantisation, an
oscillator being nothing else but a "counter of quanta". We can
almost forget about the underlying "oscillators". In quantum
field theory, this method is used in general to describe "particles"
which turn out to be nothing else but quanta of harmonic
oscillators. Each particle type, with each possible momentum, is
associated with an "oscillator" and the quanta are the number of
particles of that type and with that momentum. In that case, the
number of oscillators considered is infinite, while in this example,
there were only two, so this is a great way to see the mechanism at
work in a simple case without being bothered by all the other
difficulties (infinities everywhere) of quantum field theory at once.
9. Spin correlation measurements and Bell's Inequality
This part is really worth the effort because it settles an extremely
important issue, fixing for ever the weirdness of quantum
mechanics. Long thought to be in the realm of metaphysical
considerations, it is about the opinion Einstein (and many others) had
about quantum mechanics: is the
probabilistic aspect of quantum
mechanics due to a kind of underlying "statistical mechanics",
with parameters we don't know about, which are distributed according to
certain laws, and which give the impression of randomness in the
outcomes
of quantum mechanical measurements, or is this probabilistic nature
intrinsic, with no underlying mechanism ? It turns out
that this
question can partly be turned into a scientific question, much to the
surprise of people, in the following way: Bell showed that if there is
an underlying statistical mechanics, that has to obey certain
conditions which seem reasonable, then
this imposes
conditions on the probabilities of experiments which are not always
satisfied by quantum mechanics. So it is sufficient to
place
oneself in these conditions (where the predictions of quantum mechanics
cannot follow from an underlying statistical mechanics), do the
experiment and we know that the outcome then has to decide as to
whether quantum mechanics or an underlying mechanism is correct.
We now have already all the tools available to do this, so we will work
out such a case explicitly in this section.
Correlations in Spin-Singlet States
The systems which have been studied and can show the above remarkable
property of quantum mechanics (namely, that no underlying statistical
mechanics can result in the same measurement results) are correlated
spin systems, meaning, two particles (of which the motion in space can
be considered classical) with correlated spins. A simple way to
prepare these states is by using two spin-1/2 particles that result
from a total spin-0 state. We then know that the individual spins
have to make up a singlet state, as written down in (3.9.1). The
particles travel in opposite directions, and after they are a distance
apart (and there is no reasonable way to assume they can still
interact), one can measure a spin component on each side. The
correlations of the outcomes of measurement will turn out to be the
important quantity that will defy "statistical mechanics" explanations
within certain conditions. What is of course typically
"quantum" is that we can have correlations between two measurements,
for example, the measurement of the x-component of the spin of the
first particle and the measurement of the z-component of the spin of
the second particle ; it doesn't make sense to speak of correlations
between the measurement of the x-component of the first particle and
the z-component of the first particle, for example, because these are
incompatible measurements. In order to be able to talk about
correlations of measurements (on the same system) we have to be able to
measure them simultaneously.
Einstein's Locality Principle and Bell's Inequality
The way Einstein and his followers saw the probabilistic nature of
quantum mechanics, was that what people called a pure state in quantum
mechanics (a well-defined state in Hilbert space) corresponded in fact
to an ensemble of different systems, and the observables we measure are
then drawn from functions over this ensemble. Einstein's view was
that in fact, when the two particles get separated, they constitute
by
themselves independent entities, and all measurements on them are
independent. The correlation between measurements comes from an
initial correlation of the "hidden variables" in each of the
independent entities. So this comes down to saying that
every
possible measurement on each of the particles is "in advance"
determined by a set of hidden variables each of the particles carry
with them. What exactly are these hidden variables is not
specified and can be the object of different theories. The only
thing that matters here is that we have an initial ensemble of "couples
of hidden variables", each particle taking one of them, in such a way
that that hidden variable specifies all possible outcomes of all
measurements on that particle, and that the case of spin-0 is
respected. It is then worked out in the case we consider only 3
possible directions of measurement of the spin, which means that each
particle has to carry with it in advance the outcome to three binary
(up or down) measurements in its hidden variables. This means
that there are 8 categories of particles to be considered ; because of
the compatibility with spin-0, if the hidden variables say that
particle 1 has spin up in the x direction, then the hidden variables of
particle 2 have to say that it has spin down in the x direction.
The ensemble that corresponds to our superposed quantum state (remember
that in hidden variable theories, to one pure quantum state corresponds
an ensemble of hidden variables) is then specified completely if we
know the probabilities of the 8 different cases. Equation (3.9.6)
is then evident, and from it follows inequality (3.9.9).
Let us retrace what we've done: we're NOT working with a quantum
theory. We're making the assumption that each particle has
'hidden variables' that determine in advance what will be the outcome
to 3 different spin direction measurements. We know that we can
only perform one of them in quantum theory, but that doesn't
matter. Then there
exists an innocent-sounding inequality concerning the combined
probabilities of two outcomes (which, on one single particle,
cannot be
measured). The trick is of course that these combined
probabilities of two outcomes CAN be determined if we look at the two
particles. Indeed, for example, if particle 1 is measured along
the x-axis, and particle 2 is measured along the y-axis, then the
result of the second measurement, inverted, is what the first particle
would have given if we had measured it along the y-axis. So the
two correlated particles allow us in fact to measure spin along two
different directions at the same time.
Quantum Mechanics and Bell's Inequality
If we now come back to quantum mechanics, then it is not difficult to
calculate these probabilities if we choose 3 axes in the plane along
which we will perform the measurements, and the result is given in
equation (3.9.11) for such a probability. Filling it in in Bell's
inquality, we see that it is not satisfied for certain angles.
This already indicates that quantum mechanics cannot find its
probabilistic origin in an underlying statistical mechanics using
hidden variables as explained above. But then, quantum mechanics
can be wrong. Experiment has to decide whether quantum mechanics is
right or not. Several experiments indicate that quantum mechanics
is right, hence excluding the kind of "statistical hidden variable
mechanics" in the way it has been used to deduce Bell's inequalities.
10. Tensor Operators
Vector Operator
In classical physics, a vector is somehow a quantity that has
"magnitude and direction". Analytically, vectors are represented
by 3 coordinates in Euclidean space, but that doesn't mean that 3
numbers form a vector. The essential property of a vector is
that
it transforms under a rotation just as one expects: the same
magnitude,
and the direction is "rotated" as required. Mathematically, this
means that the 3 coordinates of a vector before and after rotation have
to undergo a transformation, which is nothing else but the matrix
multiplication with the 3-dimensional rotation matrix that corresponds
to the rotation under consideration.
In quantum mechanics, things are the same. Only, now a vector is
not something that is described with 3 numbers, giving "magnitude and
direction", it consists of 3 operators on hilbert space.
But not
just any three operators form a vector. In order for them to form
a vector, their transformation properties must be such, that the
expectation value of the 3 operators under a rotated state must
transform as a classical 3-dim vector from the expectation value of the
3 operators under the unrotated state. The funny thing here
is
that the 3 operators stay the same, and it are the state kets that
are
rotated. Working the condition out for arbitrary states, one
arrives at equation (3.10.3) for a vector operator. Using the
infinitesimal representation of the rotation operator as a function of
angular momentum, this condition is equivalent to (3.10.8).
Cartesian Tensors versus Irreducible Tensors
In classical physics, cartesian tensors are nothing else but "multiple
index vectors", and nothing stops us from defining the equivalent in
quantum mechanics. However, looking at it from
another point of view, we can see that a cartesian tensor of rank r
consists then of 3^r operators, and then these operators should form a
3^r dimensional representation of the rotation group if we apply the
generalisation of equation (3.10.3). But we already know all
irreducible representations of the rotation group ! It turns
out
that cartesian tensors (their components) can be written as linear
combinations of things that correspond to irreducible representations.
This is actually no surprise: all representations of a group are
irreducible representations, or can be written as a linear combination
of irreducible representations. This is illustrated in equations
(3.10.12) and (3.10.13), for a specific case of a rank 2 cartesian
tensor. We can write it as the sum of a scalar (spin 0), a vector
(spin 1) and a thing that will turn out to be a spin-2 representation
with 5 components. So we see that what the group
representation properties is concerned, these cartesian tensors are
quite messy objects. We therefore define spherical tensors
to be
things that are irreducible (fixed l) representations of the rotation
group.
First some illustrations are used (such as putting a vector as argument
of the set of spherical harmonics of given l) to suggest the definition
of a spherical tensor, which is a set of operators. That definition is
formulated in equation (3.10.22). So, in a similar way as for a
cartesian tensor, a spherical tensor is defined by its properties under
rotation ; it turns out that the components of a spherical tensor under
rotation transform in a very similar way as do transform the
eigenstates (j,m) (look at equation 3.5.49). In fact, they
transform in the opposite way (inverse rotation). The
infinitesimal condition for a spherical tensor is then given in
equation (3.10.25).
Product of Tensors
The rather complicated-sounding theorem in fact says that we can make
"products of spherical tensors" by using a similar linear superposition
as those that occur when we combine two sets of spin kets into
the "sum of spins".
Matrix Elements of Tensor Operators: The Wigner-Eckart Theorem
Finally, a famous theorem: The Wigner-Eckart theorem. It states
that the matrix representation of a spherical tensor in a set of basis
kets of angular momentum takes on a specific form: each block between a
set of j' and j (and alpha' and alpha, standing for other operators
defining a complete set of commuting operators together with angular
momentum) consists of elements that are the "sum of spin" elements as
if we added together the spin j with the spin of the tensor to obtain
the spin j', and a scale factor which depends on the particularity of
the tensor, but which is the same in the entire block. This
theorem is not very surprising, but it is practical when having to
calculate matrix representations of tensor operators.
Back to main course page