The Bijection Between Natural Numbers and Binary Strings

05 Jun, 2025

Computers fundamentally operate on bits and sequences of bits. It is therefore useful to study the associated mathematical structure of the concept of sequences of bits. In this blogpost we give a brief introduction to the mathematical concept of finite binary strings and it's relation to the natural numbers.

We write a binary string of length n as $x = x_{1} x_{2} . . . x_{n}$ where $x_{i} \in {0, 1}$ .

We define the concatenation of two binary string $x = x_{1} . . . x_{n}$ and $y = y_{1} . . . y_{m}$ as $x y = x_{1} . . . x_{n} y_{1} . . . y_{m}$ .

Let $B^{*} = {ϵ, 0, 1, 00, 01, 11, 000, . . .}$ the set of all finite binary strings with $ϵ$ the empty string. We call $l$ the length of a binary string, i.e. $l (00) = 2$ .

Let $B : N^{+} \to B^{*}$ be the function that takes a number and returns its binary representation, i.e. $B (4) = 100$ .

$B$ by itself won't give us a bijection because we can't match binary strings like $ϵ$ or $00$ .

To get a bijection we have to use the following function from $N_{0}$ to $B^{*}$

⟨ n ⟩ : = B (n + 1)_{2 : l} where l = ⌊ \log_{2} (n + 1) ⌋ + 1

This function simply adds one to the number $n$ , than represents $n$ as a binary and takes the substring from the second character on. The first character is always a one and we drop it.

For example we have $⟨ 3 ⟩ = B (4)_{2 : 3} = 00$ or $⟨ 0 ⟩ = B (1)_{2 : 1} = ϵ$ where we use the convention that a slice $x_{i : j}$ with $i > j$ is empty.

We now proof that the mapping above is indeed bijective:

Injective: Suppose $⟨ x ⟩ = ⟨ y ⟩$ , this would mean $B (x + 1)_{2 : l_{x}} = B (y + 1)_{2 : l_{y}}$ . This can only be the case when $l (x) = l (y)$ . But than equality would mean that $B (x + 1) = B (y + 1)$ because both have the same leading one as first char. That implies $x + 1 = y + 1$ because the binary representation of a number is unique. Therefore we can conclude that $⟨ n ⟩$ is injective.

Surjective: Let $x$ be an arbitrary binary string. Concatenate this string with $1$ from the left side and call this string $y$ . The natural number which fulfils $B (n + 1) = y$ is the number that fulfils $⟨ n ⟩ = x$ .

More formally we may define the inverse as

⟨ x ⟩^{- 1} = b (1 x) - 1 where b (x) = \sum_{i = 1}^{l (x)} 2^{l (x) - i} [[x_{i} = 1]]

$b : B^{*} \to N^{+}$ is the natural mapping that interprets the binary string $1 x$ as a natural number.

Example:

$x = 101$ , $l (101) = 3$

$b (101) = 2^{3 - 1} \cdot 1 + 2^{3 - 2} \cdot 0 + 2^{3 - 3} \cdot 1 = 5$ .

Note that $0 \leq b (x)$ because the min value is obtained iff all $x_{i}$ are $0$

Note that $b (x) \leq 2^{l (x)} \sum_{i = 1}^{l (x)} 2^{- i} = 2^{l (x)} (1 - 2^{- l (x)}) = 2^{l (x)} - 1$ because the maximum value is obtained iff all $x_{i}$ are $1$

We have $b (1 x) = 2^{l (x)} + b (x)$ and therefore $2^{l (x)} - 1 \leq b (1 x) - 1$ . Furthermore we have $b (1 x) - 1 \leq 2^{l (x)} + 2^{l (x)} - 1 - 1 = 2^{l (x) + 1} - 2$ .

This gives us an effective bound for $⟨ x ⟩^{- 1}$ :

2^{l (x)} - 1 \leq ⟨ x ⟩^{- 1} \leq 2^{l (x) + 1} - 2

This shows that the range in which a number associated with a binary string is determined solely by the length of the binary string.

Note that the above bijection is often used in literature such that we write $l (n)$ for $l (⟨ n ⟩)$ and $x + 1$ instead of $⟨ x ⟩^{- 1} + 1$ if clear from the context. It also gives us a way to enumerate all binary strings.