Quantum mechanics is weird! Imagine for a second that you want to make an experiment and that the result of your experiment depends on what your colleague is doing in the next room. It would be crazy to live in such a world! This is the world we live in, at least at the quantum scale. The result of an experiment cannot be described in a way that is independent of the context. The neighbor is sticking his nose in our experiment!
Before telling you why quantum mechanics is contextual, let me give you an experiment that admits a simple non-contextual explanation. This story takes place in Flatland, a two-dimensional world inhabited by polygons. Our protagonist is a square who became famous after claiming that he met a sphere.
This square, call him Mr Square for convenience, met a sphere, Miss Sphere. When you live in a planar world like Flatland, this kind of event is not only rare, but it is also quite weird! For people of Flatland, only the intersection of Miss Sphere’s body with the plane is visible. Depending on the position of the sphere, its shape in Flatland will either be a point, a circle, or it could even be empty.
As you can see on these photos, when you are stuck in Flatland and you take a picture of a sphere, only a segment is visible. What aroused Mr Square’s curiosity is the fact that the length of this segment changes constantly. Each picture shows a segment of a different length, due to the movement of the sphere along the z-axis, invisible to him. However, although they look random, Square discovered that these changing lengths can be explained without randomness by introducing a hidden variable living in a hypothetical third dimension. The apparent randomness is simply a consequence of his incomplete knowledge of the system: The position along the hidden variable axis z is inaccessible! Of course, this is only a model, this third dimension is purely theoretical, and no one from Flatland will ever visit it.
What about quantum mechanics?
Measurement outcomes are random as well in the quantum realm. Can we explain the randomness in quantum measurements by a hidden variable? Surprisingly, the answer is no! Von Neumann, one of the greatest scientists of the 20th century, was the first one to make this claim in 1932. His attempt to prove this result is known today as “Von Neumann’s silly mistake”. It was not until 1966 that Bell convinced the community that Von Neumann’s argument relies on a silly assumption.
Consider first a system of a single quantum bit, or qubit. A qubit is a 2-level system. It can be either in a ground state or in an excited state, but also in a quantum superposition of these two states, where and are complex numbers such that . We can see this quantum state as a 2-dimensional vector , where the ground state is and the excited state is .
What can we measure about this qubit? First, imagine that we want to know if our quantum state is in the ground state or in the excited state. There is a quantum measurement that returns a random outcome, which is with probability and with probability .
Let us try to reinterpret this measurement in a different way. Inspired by Mr Square’s idea, we extend our description of the state of the system to include the outcome as an extra parameter. In this model, a state is a pair of the form where is either or . Our quantum state can be seen as being in position with probability or in position with probability . Measuring only reveals the value of the hidden variable . By introducing a hidden variable, we made this measurement deterministic. This proves that the randomness can be moved to the level of the description of the state, just as in Flatland. The weirdness of quantum mechanics goes away.
Contextuality of quantum mechanics
Let us try to extend our hidden variable model to all quantum measurements. We can associate a measurement with a particular kind of matrix , called an observable. Measuring an observable returns randomly one of its eigenvalue. For instance, the Pauli matrices
as well as and the identity matrix , are 1-qubit observables with eigenvalues (i.e. measurement outcomes) . Now, take a system of 2 qubits. Since each of the 2 qubits can be either excited or not, our quantum state is a 4-dimensional vector
Therein, the 4 vectors can be identified with the vectors of the canonical basis and . We will consider the measurement of 2-qubit observables of the form defined by . In other words, acts on the first qubit and acts on the second one. Later, we will look into the observables , , , and their products.
What happens when two observables are measured simultaneously? In quantum mechanics, we can measure simultaneously multiple observables if these observables commute with each other. In that case, measuring then , or measuring first and then , doesn’t make any difference. Therefore, we say that these observables are measured simultaneously, the outcome being a pair , composed of an eigenvalue of and an eigenvalue of . Their product , which commutes with both and , can also be measured in the same time. Measuring this triple returns a triple of eigenvalues corresponding respectively to , and . The relation imposes the constraint
(1)
on the outcomes.
Assume that one can describe the result of all quantum measurements with a model such that, for all observables and for all states of the model, a deterministic outcome exists. Here, is our ‘extended’, not necessarily physical, description of the state of the system. When and are commuting, it is reasonable to assume that the relation (1) holds also at the level of the hidden variable model, namely
(2)
Such a model is called a non-contextual hidden variable model. Von Neumann proved that no such value exists by considering these relations for all pairs , of observables. This shows that quantum mechanics is contextual! Hum… Wait a minute. It seems silly to impose such a constraint for all pairs of observable, including those that cannot be measured simultaneously. This is “Von Neumann’s silly assumption’. Only pairs of commuting observables should be considered.
One can resurrect Von Neumann’s argument, assuming Eq.(2) only for commuting observables. Peres-Mermin’s square provides an elegant proof of this result. Form a array with these observables. It is constructed in such a way that
(i) The eigenvalues of all the observables in Peres-Mermin’s square are ±1,
(ii) Each row and each column is a triple of commuting observables,
(iii) The last element of each row and each column is the product of the 2 first observables, except in the last column where .
If a non-contextual hidden variable exists, it associates fixed eigenvalues , , , (which are either 1 or -1) with the 4 observables , , , . Applying Eq.(2) to the first 2 rows and to the first 2 columns, one deduces the values of all the observables of the square, except . Finally, what value should be attributed to ? By (iii), applying Eq.(2) to the last row, one gets . However, using the last column, (iii) and Eq.(2) yield the opposite value . This is the expected contradiction, proving that there is no non-contextual value . Quantum mechanics is contextual!
We saw that the randomness in quantum measurements cannot be explained in a ‘classical’ way. Besides its fundamental importance, this result also influences quantum technologies. What I really care about is how to construct a quantum computer, or more generally, I would like to understand what kind of quantum device could be superior to its classical counterpart for certain tasks. Such a quantum advantage can only be reached by exploiting the weirdness of quantum mechanics, such as contextuality 1,2,3,4,5. Understanding these weird phenomena is one of the first tasks to accomplish.