Contents |
Cryptography, from the Greek kryptós (secret) and graphein (writting) is the technique of transmitting secret information that a recipient can access without other parties being able to. Terminology is now established that we call Alice ("A") the sender, Bob ("B") the recipient and Eve ("E") the eavesdropper. The message is called plaintext and its encrypted version is called cyphertext. Cryptography consists in using a cypher (an algorithm) to turn the plaintext into cyphertext and back:
plaintext 🡒 cyphertext.
Both the destination and a Hacker will reverse this process, one legitimately and accessing the intended information, the other in violation of the intended secrecy.
An example of a cipher is, for instance, the following sequence:
which is a naively encrypted version of the original sentence, using the rot13 algorithm which rotates by 13 the Latin alphabet, i.e., a🡒n, b🡒o, c🡒p, etc. Because 13 is half the alphabet, rotating it twice brings us back to the original message: a🡒n🡒a, b🡒o🡒b, etc. This is a 13-shift version of a cipher actually used by Caesar, who was rotating the alphabet by -3 letter:
a🡒x, b🡒y, c🡒z, d🡒a, e🡒b
Such operations are described mathematically by modular arithmetics, $\oplus$, which equates two numbers $a$ and $b$ modulo $m$, written
$$a\equiv b \pmod m$$
if they differ by a multiple of the modulo $m$, i.e., iff there exists $k\in\mathbb{N}$ such that
$$a=b+km$$
Here is a text Caesar might have encoded:
in rot13, this would have given:
in rot1
One problem of such ciphers is that one can notice patterns and break them. Caesar's cipher is particularly fragile because one can try all rotations of the alphabet and see which soup of letter makes sense. This is called a "brute-force attack". Even for such simple algorithms, one would not use brute force but more subtle technique, like frequency analysis, i.e., the fact that the letter 'e' occurs on average 12.702% in the English language, as compared to 'z' which occurs only 0.077% of the time. Between these two extremes, on has for frequent letters 't' (9.356%), 'a' (8.167%), 'o' (7.507%), etc., and, on the rare end, 'j' (0.153%), 'x' (0.150%), 'q' (0.095%). So by counting how many times a letter occurs, and matching it with its frequency, one can make good guesses as to which letter is which. Of course, frequency analysis is not limited to occurences of single letters, but also of combinations of two successive lettes (digrams) and three (trigrams), etc. Note that one can use letters, numbers or basically anything for the encrypted message...
While Caesar's cipher had only one number, the $x$ in rot$x$, to encrypt and decrypt, one can easily make the cipher more robust by using a "key", which encodes the information that rotates the alphabet in a varying way, so that the same letter in the cyphertext can correspond to different letter of the plaintext, and vice-versa. The simplest variation is to decide on a key, for instance:
which translates in arithmetics as:
and that can be repeated to fit the message's length:
This turns Jule Vernes' message
into
since the plaintext in alphabet arithmetics reads:
which yields:
which evaluates to
since 13$\oplus$19=32=6 mod 26 and 18$\oplus$19=11 mod 26, or, back to alphabet:
This is better but still vulnerable. In particular, if one knows the length of the key, one can then break the cyphertext into as many sub-texts and perform frequency analysis on these, thus reconstructing the key. One way to guess the length of the key is called Kasiski's examination and consists in finding lucky repetitions of patterns, e.g.:
cryptography shall be shortened to crypto secretsecret secre ts ecretsecr et secret
This encodes by modular arithmetic to:
where one can see the odd repetition of vwbhyi in vwbhyizwdhms lmddq vx xkgwnxshv yi vwbhyi. Since there are 24 letters between these two repetitions, this suggests that the key is a divisor of 24, which mean it has length 1 (Caesar's cipher), 2, 3, 4, 6, 8, 12 or 24. That's a lot of possibilities. In a long text, other occurences would allow to narrow it down, and one does not count on such generous repetitions but look at repetitions even of digrams. In the example above, although a very small sample, we have yi appearing three times (it corresponds to to, which is the 15th most occuring digram in English, after "th", "he", "in", "an", "er", "on", etc.)
There are other tricks, for instance, since modular arithmetics is commutative, subtracting the cipher text from itself offset by the key length produces the plain text encrypted with itself. If any "probable word" in the plain text is known or can be guessed, its self-encryption can be recognized, which allows recovery of the key by subtracting the known plaintext from the cipher text. Key elimination is especially useful against short messages.
It is easy to see what is the problem of the previous method. The key itself has redundancy, which eventually leaks into the cyphertext. Therefore, it should be enough, actually, to simply use a much longer key (one can also use several short keys which relative lenghts are prime and successively encrypt the message). This actually provides perfect cryptography, in the sense that it cannot be broken even with unlimited computing power (this was proven by Claude Shannon). In fact it is easy to show that any cyphertext can be decoded to any plaintext whatsoever using the appropriate key. For instance:
decrypts to
with the key XXX but it decrypts to
with the key XXX. So it's easy to ensure perfect security, just use a unique key, as long as the message. The problem with that, though, is that both Alice and Bob must share the key and make sure nobody else has access to it. How do they share this secret? If they exchange long texts, and often, they need a lot of such keys. It is difficult to secure such a fresh, disposable information, so they need a way to generate cryptographic keys, securely. Here comes the concept of "asymmetric keys" (the schemes above are using "symmetric" keys), which involves two keys: a public one, that everybody has access to, even the eavesdropper, and a secret one. The concept works as follows. If you, Bob, want to send me (Alice) a secret message, I give you a key that, in the most popular implementation (which we detail next), comes in the form of two numbers:
$$(e,n)$$
Your message $m$ is encrypted into a cyphertext $c=m^e\pmod n$. I can then decrypt it with a private key $d$ which gives me back $m$ by exponentiating it with $d$: $m=c^d\pmod n$. We now look in detail at the most popular way to achieve this.
This brings us to the mid-1970s. The concept of public-key cryptography was put forward by someone (James H. Ellis) who could not however find how to implement it. This was done shortly after (1973), but classified, by his colleague Clifford Cocks, in a scheme later re-discovered in 1977 (published next year), this time openly (academia) by Ron Rivest, Adi Shamir, and Leonard Adleman, to become the RSA algorithm, one of the most famous schemes of computer science (which keeps little trace of Cocks' earlier insight by several years: such is the price of secrecy!) Interestingly, in the classified undersworld, Malcolm J. Williamson had developed after Cocks (in 1974) what is now known as the Diffie–Hellman key exchange, which in the open world preceded RSA by one year!
The RSA scheme works as follows. The person who generates the public key does so by picking up two large primes numbers, $p$ and $q$. Computing their product is easy:
$$n=pq$$
but from $n$, finding $p$ and $q$ is computationally hard.
Jevons wrote in his 1874 book Principles of Science: Can the reader say what two numbers multiplied together will produce the number 8 616 460 799? I think it unlikely that anyone but myself will ever know. The idea was basically correct although in this case, this so-called Jevons's number was quickly factored (by Charles J. Busk) in 1889 as $89\,681\times 96\,079$.
From $p$ and $q$, it is easy to compute the so-called totient $\varphi(n)$ of $n=pq$, which is the number of co-prime numbers with $n$. Since $n$ is prime, and since the function is multiplicative, i.e. $\varphi(pq)=\varphi(p)\varphi(q)$, then:
$$\varphi(n)=(p-1)(q-1)$$
since for a prime number $p$, all smaller integers are co-prime by definition and therefore $\varphi(p)=p-1$.
The encryption works as follows. The person who publishes the public key $n$ also publishes a coprime $e$ of $\varphi(n)$, which is typically taken as $e=2 ^{16}+1=65537$ (which has little chance to factor $\varphi(n)$ but this should be checked).
The encryption is then achieved by Bob, by taking the $e$ power of the message $m$ modulo $n$:
$$c=m^e\pmod n$$.
Now Alice can decrypt it using her private key $d$ which she computes by finding the inverse of $e$ modulo the totient:
$$ed\equiv 1\pmod{\varphi(n)}$$
In which case, modular arithmetics shows that:
$$m=c^d\pmod n=m^{ed}\pmod n=m\pmod n=m$$
provided that $m<n$, which can always be made the case by breaking the message in different substreams (generating new keys if necessary). Actually, RSA is used to transmit not a message, but a symmetric key to be used in a one-time pad cipher, which is much faster (and fully secure). The details of the above line relies on Fermat's little theorem which yields $m^{ed}\pmod n=m\pmod p$ and $m^{ed}\pmod n=m\pmod q$ so that $m^{ed}\pmod n=m\pmod{pq}=m\pmod{n}$. A reverse application of RSA makes it useful for signing or authentificating a plaintext.