The mechanics of thanksgiving

You and a friend are driving in a car. You’ve almost reached an intersection. The stoplight turns red. 

My teacher had handwritten the narrative on my twelfth-grade physics midterm. Many mechanics problems involve cars: Drivers smash into each other at this angle or that angle, interlocking their vehicles. The Principle of Conservation of Linear Momentum governs how the wreck moves. Students deduce how quickly the wreck skids, and in which direction.

Few mechanics problems involve the second person. I have almost reached an intersection?

You’re late for an event, and stopping would cost you several minutes. What do you do?

We’re probably a few meters from the light, I thought. How quickly are we driving? I could calculate the acceleration needed to—

(a) Speed through the red light.

(b) Hit the brakes. Fume about missing the light while you wait.

(c) Stop in front of the intersection. Chat with your friend, making the most of the situation. Resolve to leave your house earlier next time.

Pencils scritched, and students shifted in their chairs. I looked up from the choices.

stoplight

Our classroom differed from most high-school physics classrooms. Sure, posters about Einstein and Nobel prizes decorated the walls. Circuit elements congregated in a corner. But they didn’t draw the eye the moment one stepped through the doorway.

A giant yellow smiley face did.

It sat atop the cupboards that faced the door. Next to the smiley stood a placard that read, “Say please and thank you.” Another placard hung above the chalkboard: “Are you showing your good grace and character?”

Our instructor taught mechanics and electromagnetism. He wanted to teach us more. He pronounced the topic in a southern sing-song: “an attitude of gratitude.”

Teenagers populate high-school classrooms. The cynicism in a roomful of teenagers could have rivaled the cynicism in Hemingway’s Paris. Students regarded his digressions as oddities. My high school fostered more manners than most. But a “Can you believe…?” tone accompanied recountings of the detours.

Yet our teacher’s drawl held steady as he read students’ answers to a bonus question on a test (“What are you grateful for?”). He bade us gaze at a box of Wheaties—the breakfast of champions—on whose front hung a mirror. He awarded Symbolic Lollipops for the top grades on tests and for acts of kindness. All with a straight face.

Except, once or twice over the years, I thought I saw his mouth tweak into a smile.

I’ve puzzled out momentum problems since graduating from that physics class. I haven’t puzzled out how to regard the class. As mawkish or moral? Heroic or humorous? I might never answer those questions. But the class led me toward a career in physics, and physicists value data. One datum stands out: I didn’t pack my senior-year high-school physics midterm when moving to Pasadena. But the midterm remains with me.

smiley

Good news everyone! Flatland is non-contextual!

Quantum mechanics is weird! Imagine for a second that you want to make an experiment and that the result of your experiment depends on what your colleague is doing in the next room. It would be crazy to live in such a world! This is the world we live in, at least at the quantum scale. The result of an experiment cannot be described in a way that is independent of the context. The neighbor is sticking his nose in our experiment!

Before telling you why quantum mechanics is contextual, let me give you an experiment that admits a simple non-contextual explanation. This story takes place in Flatland, a two-dimensional world inhabited by polygons. Our protagonist is a square who became famous after claiming that he met a sphere.

flatland

 

This square, call him Mr Square for convenience, met a sphere, Miss Sphere. When you live in a planar world like Flatland, this kind of event is not only rare, but it is also quite weird! For people of Flatland, only the intersection of Miss Sphere’s body with the plane is visible. Depending on the position of the sphere, its shape in Flatland will either be a point, a circle, or it could even be empty.

fut_egg

During their trip to flatland, Professor Farnsworth explains to Bender: “If we were in the third dimension looking down, we would be able to see an unhatched chick in it. Just as a chick in a 3-dimensional egg could be seen by an observer in the fourth dimension.’

Not convinced by Miss Sphere’s arguments, Mr Square tried to prove that she cannot exist – Square was a mathematician – and failed miserably. Let’s imagine a more realistic story, a story where spheres cannot speak. In this story, Mr Square will be a physicist, familiar with hidden variable models. Mr Square met a sphere, but a tongue-tied sphere! Confronted with this mysterious event, he did what any other citizen of Flatland would have done. He took a selfie with Miss Sphere. Mr Square was kind enough to let us use some of his photos to illustrate our story.
pics.png

Picture taken by Mr Square, with his Flatland-camera. (a) The sphere. (b) Selfie of Square (left) with the sphere (right).

As you can see on these photos, when you are stuck in Flatland and you take a picture of a sphere, only a segment is visible. What aroused Mr Square’s curiosity is the fact that the length of this segment changes constantly. Each picture shows a segment of a different length, due to the movement of the sphere along the z-axis, invisible to him. However, although they look random, Square discovered that these changing lengths can be explained without randomness by introducing a hidden variable living in a hypothetical third dimension. The apparent randomness is simply a consequence of his incomplete knowledge of the system: The position along the hidden variable axis z is inaccessible! Of course, this is only a model, this third dimension is purely theoretical, and no one from Flatland will ever visit it.

What about quantum mechanics?

Measurement outcomes are random as well in the quantum realm. Can we explain the randomness in quantum measurements by a hidden variable? Surprisingly, the answer is no! Von Neumann, one of the greatest scientists of the 20th century, was the first one to make this claim in 1932. His attempt to prove this result is known today as “Von Neumann’s silly mistake”. It was not until 1966 that Bell convinced the community that Von Neumann’s argument relies on a silly assumption.

Consider first a system of a single quantum bit, or qubit. A qubit is a 2-level system. It can be either in a ground state or in an excited state, but also in a quantum superposition |\psi\rangle = \alpha |g\rangle + \beta|e\rangle of these two states, where \alpha and \beta are complex numbers such that |\alpha|^2 + |\beta|^2 = 1. We can see this quantum state as a 2-dimensional vector (\alpha, \beta), where the ground state is |g\rangle=(1,0) and the excited state is |e\rangle=(0,1).

projection

The probability of an outcome depends on the projection of the quantum state onto the ground state and the excited state.

What can we measure about this qubit? First, imagine that we want to know if our quantum state is in the ground state or in the excited state. There is a quantum measurement that returns a random outcome, which is g with probability P(g) = |\alpha|^2 and e with probability P(e) = |\beta|^2.

Let us try to reinterpret this measurement in a different way. Inspired by Mr Square’s idea, we extend our description of the state |\psi\rangle of the system to include the outcome as an extra parameter. In this model, a state is a pair of the form (|\psi\rangle, \lambda) where \lambda is either e or g. Our quantum state can be seen as being in position (|\psi\rangle, g) with probability P(g) or in position (|\psi\rangle, e) with probability P(e). Measuring only reveals the value of the hidden variable \lambda. By introducing a hidden variable, we made this measurement deterministic. This proves that the randomness can be moved to the level of the description of the state, just as in Flatland. The weirdness of quantum mechanics goes away.

Contextuality of quantum mechanics

Let us try to extend our hidden variable model to all quantum measurements. We can associate a measurement with a particular kind of matrix A, called an observable. Measuring an observable returns randomly one of its eigenvalue. For instance, the Pauli matrices

Z =  \begin{pmatrix}  1 & 0\\  0 & -1\\  \end{pmatrix}  \quad \text{ and } \quad  X =  \begin{pmatrix}  0 & 1\\  1 & 0\\  \end{pmatrix},

as well as Y = iZX and the identity matrix I, are 1-qubit observables with eigenvalues (i.e. measurement outcomes) \pm 1. Now, take a system of 2 qubits. Since each of the 2 qubits can be either excited or not, our quantum state is a 4-dimensional vector

|\psi\rangle = \alpha |g_1\rangle \otimes |g_2\rangle  + \beta |g_1\rangle \otimes |e_2\rangle  + \gamma |e_1\rangle \otimes |g_2\rangle  + \delta |e_1\rangle \otimes |e_2\rangle.

Therein, the 4 vectors |x\rangle \otimes |y\rangle can be identified with the vectors of the canonical basis (1000), (0100), (0010) and (0001). We will consider the measurement of 2-qubit observables of the form A \otimes B defined by A \otimes B |x\rangle \otimes |y\rangle = A |x\rangle \otimes B |y\rangle. In other words, A acts on the first qubit and B acts on the second one. Later, we will look into the observables X \otimes I, Z \otimes I, I \otimes X, I \otimes Z and their products.

What happens when two observables are measured simultaneously? In quantum mechanics, we can measure simultaneously multiple observables if these observables commute with each other. In that case, measuring O then O', or measuring O' first and then O, doesn’t make any difference. Therefore, we say that these observables are measured simultaneously, the outcome being a pair (\lambda,\lambda'), composed of an eigenvalue of O and an eigenvalue of O'. Their product O'' = OO', which commutes with both O and O', can also be measured in the same time. Measuring this triple returns a triple of eigenvalues (\lambda,\lambda',\lambda'') corresponding respectively to O, O' and O''. The relation O'' = OO' imposes the constraint

(1)               \qquad \lambda'' = \lambda \lambda'

on the outcomes.

Assume that one can describe the result of all quantum measurements with a model such that, for all observables O and for all states \nu of the model, a deterministic outcome \lambda_\nu(O) exists. Here, \nu is our ‘extended’, not necessarily physical, description of the state of the system. When O and O' are commuting, it is reasonable to assume that the relation (1) holds also at the level of the hidden variable model, namely

(2)                \lambda_\nu(OO') = \lambda_\nu(O) \cdot \lambda_\nu(O').

Such a model is called a non-contextual hidden variable model. Von Neumann proved that no such value \lambda_\nu exists by considering these relations for all pairs O, O' of observables. This shows that quantum mechanics is contextual! Hum… Wait a minute. It seems silly to impose such a constraint for all pairs of observable, including those that cannot be measured simultaneously. This is “Von Neumann’s silly assumption’. Only pairs of commuting observables should be considered.

mermin

Peres-Mermin proof of contextuality

One can resurrect Von Neumann’s argument, assuming Eq.(2) only for commuting observables. Peres-Mermin’s square provides an elegant proof of this result. Form a 3 \times 3 array with these observables. It is constructed in such a way that

(i) The eigenvalues of all the observables in Peres-Mermin’s square are ±1,

(ii) Each row and each column is a triple of commuting observables,

(iii) The last element of each row and each column is the product of the 2 first observables, except in the last column where Y \otimes Y = -(Z \otimes Z)(X \otimes X).

If a non-contextual hidden variable exists, it associates fixed eigenvalues a, b, c, d (which are either 1 or -1) with the 4 observables X \otimes I, Z \otimes I, I \otimes X, I \otimes Z. Applying Eq.(2) to the first 2 rows and to the first 2 columns, one deduces the values of all the observables of the square, except Y \otimes Y . Finally, what value should be attributed to Y \otimes Y? By (iii), applying Eq.(2) to the last row, one gets \lambda_\nu(Y \otimes Y) = abcd. However, using the last column, (iii) and Eq.(2) yield the opposite value \lambda_\nu (Y \otimes Y ) = -abcd. This is the expected contradiction, proving that there is no non-contextual value \lambda_\nu. Quantum mechanics is contextual!

We saw that the randomness in quantum measurements cannot be explained in a ‘classical’ way. Besides its fundamental importance, this result also influences quantum technologies. What I really care about is how to construct a quantum computer, or more generally, I would like to understand what kind of quantum device could be superior to its classical counterpart for certain tasks. Such a quantum advantage can only be reached by exploiting the weirdness of quantum mechanics, such as contextuality 1,2,3,4,5. Understanding these weird phenomena is one of the first tasks to accomplish.