| This document explains potential effects of speculation, and how undesirable |
| effects can be mitigated portably using common APIs. |
| |
| =========== |
| Speculation |
| =========== |
| |
| To improve performance and minimize average latencies, many contemporary CPUs |
| employ speculative execution techniques such as branch prediction, performing |
| work which may be discarded at a later stage. |
| |
| Typically speculative execution cannot be observed from architectural state, |
| such as the contents of registers. However, in some cases it is possible to |
| observe its impact on microarchitectural state, such as the presence or |
| absence of data in caches. Such state may form side-channels which can be |
| observed to extract secret information. |
| |
| For example, in the presence of branch prediction, it is possible for bounds |
| checks to be ignored by code which is speculatively executed. Consider the |
| following code:: |
| |
| int load_array(int *array, unsigned int index) |
| { |
| if (index >= MAX_ARRAY_ELEMS) |
| return 0; |
| else |
| return array[index]; |
| } |
| |
| Which, on arm64, may be compiled to an assembly sequence such as:: |
| |
| CMP <index>, #MAX_ARRAY_ELEMS |
| B.LT less |
| MOV <returnval>, #0 |
| RET |
| less: |
| LDR <returnval>, [<array>, <index>] |
| RET |
| |
| It is possible that a CPU mis-predicts the conditional branch, and |
| speculatively loads array[index], even if index >= MAX_ARRAY_ELEMS. This |
| value will subsequently be discarded, but the speculated load may affect |
| microarchitectural state which can be subsequently measured. |
| |
| More complex sequences involving multiple dependent memory accesses may |
| result in sensitive information being leaked. Consider the following |
| code, building on the prior example:: |
| |
| int load_dependent_arrays(int *arr1, int *arr2, int index) |
| { |
| int val1, val2, |
| |
| val1 = load_array(arr1, index); |
| val2 = load_array(arr2, val1); |
| |
| return val2; |
| } |
| |
| Under speculation, the first call to load_array() may return the value |
| of an out-of-bounds address, while the second call will influence |
| microarchitectural state dependent on this value. This may provide an |
| arbitrary read primitive. |
| |
| ==================================== |
| Mitigating speculation side-channels |
| ==================================== |
| |
| The kernel provides a generic API to ensure that bounds checks are |
| respected even under speculation. Architectures which are affected by |
| speculation-based side-channels are expected to implement these |
| primitives. |
| |
| The array_index_nospec() helper in <linux/nospec.h> can be used to |
| prevent information from being leaked via side-channels. |
| |
| A call to array_index_nospec(index, size) returns a sanitized index |
| value that is bounded to [0, size) even under cpu speculation |
| conditions. |
| |
| This can be used to protect the earlier load_array() example:: |
| |
| int load_array(int *array, unsigned int index) |
| { |
| if (index >= MAX_ARRAY_ELEMS) |
| return 0; |
| else { |
| index = array_index_nospec(index, MAX_ARRAY_ELEMS); |
| return array[index]; |
| } |
| } |