Introduction

If you look at all the numbers from $1$ to $10$, you’ll notice that $10\%$ of them contain the digit $7$: only the number $7$ itself does.

Well, if you look at all the numbers from $1$ to $100$ , how many contain the digit $7$ ? Here we need to work a bit. Obviously, all numbers with a $7$ in the ones place contain $7$ , so that’s $10$ numbers already:

07, 17, 27, 37, 47, 57, 67, 77, 87, 97

Also, all numbers that have $7$ in their tens place contain the digit, which gives us $10$ more numbers, but we already counted $77$ , so we only have $9$ numbers to account for here. This gives us a total of $19$ numbers that contain $7$ , or $19\%$ of the numbers.
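Both counts are small enough to check by brute force. Here is a minimal sketch; the helper names `contains_digit` and `count_containing` are my own and serve only this illustration.

```python
def contains_digit(number: int, digit: int = 7) -> bool:
    """Return True if the decimal representation of `number` includes `digit`."""
    return str(digit) in str(number)


def count_containing(limit: int, digit: int = 7) -> int:
    """Count the integers in 1..limit (inclusive) that contain `digit`."""
    return sum(contains_digit(i, digit) for i in range(1, limit + 1))


if __name__ == "__main__":
    for limit in (10, 100):
        hits = count_containing(limit)
        print(f"1..{limit}: {hits} numbers contain 7 ({hits / limit:.0%})")
```

For the two ranges above this reports $1$ and $19$ hits, matching the hand count.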

What’s interesting is that the higher up you go, the higher the percentage becomes, until you get to the infinite case in which you get that $100\%$ of numbers contain $7$ .

This is the reasoning I saw in pop media, explained by “science people”. And while it may look like that, we know in mathematics not to trust what patterns or sequences look like at first glance, but to rigorously show that the infinite case behaves as claimed.

You might have an intuition that this statement is correct, but my goal with this article is to prove (or disprove) it formally, while guiding you (the reader) along.

How Do We Formally Prove This?

From here on out we’ll use tools from Calculus I. If you haven’t studied limits, you can skip to the conclusion for the answer.

To formally prove a statement, we need to formally state it. Here’s my interpretation of the statement, as inferred from the videos (and my “proof”) above.

Theorem:

Let $s_n$ be a sequence such that $s_i$ is the number of integers containing $7$ as a digit in the set
$[i] \coloneqq \{1, 2, \dots, i\}.$
Then:
$\lim_{n \to \infty} \frac{s_n}{n} = 1.$

Hopefully you’re convinced this captures the intended claim. If you think there’s a better interpretation, feel free to reach out - I’m happy to revise.

The Proof

Let $s_n$ be the sequence counting how many numbers in $\left[ n \right] \coloneqq \{1, \dots, n\}$ contain the digit $7$ . Let $a_n = \dfrac{s_n}{n}$ .

The plan is to compute the limit along the subsequence $a_{10^n}$, where the count can be done exactly, and then extend the result to the whole sequence with a squeeze argument.

For that we need two facts: $a_n$ is bounded above by $1$, and the counts behind it never decrease as $n$ grows.

$a_n$ is Bounded by $1$

Note:

This might feel obvious, but we’ll include it for completeness.

Assume, for contradiction, that $a_n$ is not bounded above by $1$ . Then there exists $n$ with $a_n > 1$ :

$a_n > 1 \iff \frac{s_n}{n} > 1 \iff s_n > n,$

meaning there are more numbers containing the digit $7$ in $\left[n\right]$ than there are numbers in the set (since $n = |\left[n\right]|$ ). Contradiction. Therefore, $a_n \le 1$ .

$s_n$ and $n - s_n$ are Monotone Increasing

Note that $a_n$ itself is not monotone: $a_7 = \frac{1}{7}$ but $a_8 = \frac{1}{8}$. What is monotone is the count $s_n$, and likewise the count $n - s_n$ of integers in $\left[n\right]$ that do not contain $7$.

Let $n\in\mathbb{N}$. Since $\left[n\right] \subseteq \left[n+1\right]$, every integer counted in $\left[n\right]$ is still counted in $\left[n+1\right]$, so $s_n \le s_{n+1}$ and $n - s_n \le (n+1) - s_{n+1}$.
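It can help to look at actual values. The sketch below (the helper names `s` and `a` are mine, chosen to mirror $s_n$ and $a_n$) tabulates both for small $n$. In it the count $s_n$ never decreases, whereas the ratio $a_n$ dips whenever $n$ grows without a new hit, for example from $a_7 = \frac{1}{7}$ to $a_8 = \frac{1}{8}$, so any monotonicity has to be argued at the level of the counts.

```python
from fractions import Fraction


def s(n: int, digit: int = 7) -> int:
    """s_n: how many integers in 1..n contain `digit`."""
    return sum(str(digit) in str(i) for i in range(1, n + 1))


def a(n: int, digit: int = 7) -> Fraction:
    """a_n = s_n / n, kept as an exact fraction."""
    return Fraction(s(n, digit), n)


if __name__ == "__main__":
    # Print a short table around the first two hits (7 and 17).
    for n in range(5, 19):
        print(f"n={n:2d}  s_n={s(n)}  a_n={a(n)}")
```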

$a_n$ Converges to $1$

We first compute the limit along the subsequence $a_{10^n}$. Define

$b_n = a_{10^n} = \frac{s_{10^n}}{10^n}.$

To count how many numbers from $1$ to $10^n$ contain the digit $7$, consider the set of $n$-digit strings

$A_n = \left\{ (d_{n-1}, d_{n-2}, \dots, d_1, d_0) \ \middle|\ \forall\,0\le i<n:\ d_i\in\{0,1,\dots,9\} \right\},$

where $d_k$ is the digit in the $10^k$ place. Clearly $|A_n| = 10^n$ .

Note:

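These strings are the integers $0$ through $10^n - 1$, written with leading zeros. We want $1$ through $10^n$ instead, so the all-zeros string is treated as the representative for $10^n$. Neither $0$ nor $10^n$ contains a $7$, so the swap doesn’t change the count.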

When we see “at least once” in combinatorics, it’s natural to use complements. The number of strings with no $7$ at all is $9^n$, since each of the $n$ positions can hold any of the $9$ other digits. Therefore, the number of strings (hence numbers) containing at least one $7$ is $10^n - 9^n$.

Check against our earlier examples:

For $1$ to $10$ ( $n=1$ ): $10^1 - 9^1 = 1$ , matching the example.
For $1$ to $100$ ( $n=2$ ): $10^2 - 9^2 = 100 - 81 = 19$ , also matching.
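The same check can be pushed a few powers of ten further by brute force. This is only a sketch for small exponents (the loop visits every integer up to $10^n$, so it gets slow quickly), and the helper name `s` is my own.

```python
def s(n: int, digit: int = 7) -> int:
    """s_n: how many integers in 1..n contain `digit` (brute force)."""
    return sum(str(digit) in str(i) for i in range(1, n + 1))


if __name__ == "__main__":
    for n in range(1, 6):
        brute = s(10**n)          # direct count over 1..10^n
        formula = 10**n - 9**n    # complement count from the text
        print(f"n={n}: brute force={brute}, formula={formula}, equal={brute == formula}")
```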

Thus $s_{10^n} = 10^n - 9^n$ , and

$\lim_{n\to\infty} \frac{s_{10^n}}{10^n} = \lim_{n\to\infty} \left(1 - \left(\frac{9}{10}\right)^n\right) = 1 - 0 = 1.$

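Hence $b_n \to 1$. To pass from this subsequence to the whole sequence, take any $n$ and let $k$ be the integer with $10^k \le n < 10^{k+1}$. The integers in $\left[n\right]$ that avoid $7$ are among those in $\left[10^{k+1}\right]$ that avoid $7$, and by the count above there are $9^{k+1}$ of the latter, so

$0 \le 1 - a_n = \frac{n - s_n}{n} \le \frac{9^{k+1}}{10^k} = 9\left(\frac{9}{10}\right)^k.$

As $n \to \infty$ we have $k \to \infty$, so the right-hand side tends to $0$, and therefore $a_n \to 1$.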
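To get a feel for how quickly $1 - \left(\frac{9}{10}\right)^n$ approaches $1$, here is a small sketch using exact fractions. The function names are hypothetical and exist only for this illustration.

```python
from fractions import Fraction


def proportion_at_power_of_ten(n: int) -> Fraction:
    """b_n = 1 - (9/10)^n: the share of 1..10^n that contains a 7."""
    return 1 - Fraction(9, 10) ** n


def first_power_exceeding(threshold: Fraction) -> int:
    """Smallest n >= 1 with b_n strictly greater than `threshold` (threshold < 1)."""
    n = 1
    while proportion_at_power_of_ten(n) <= threshold:
        n += 1
    return n


if __name__ == "__main__":
    for n in (1, 2, 10, 50, 100):
        print(f"n={n:3d}  b_n={float(proportion_at_power_of_ten(n)):.6f}")
    print("first n with b_n > 99%:", first_power_exceeding(Fraction(99, 100)))
```

Under these definitions, `first_power_exceeding(Fraction(99, 100))` returns `44`: the share of numbers containing a $7$ first passes $99\%$ for the range $1$ to $10^{44}$.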

The Conclusion

From the above, the limit is $1$ . That is, as we consider larger and larger initial segments ( $1$ to $1,000,000,000,000$ , etc.), the proportion of numbers containing the digit $7$ tends to $100\%$ .

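This phenomenon doesn’t rely on $7$ specifically; the same argument works for any fixed nonzero digit, and for $0$ it needs only a small adjustment because leading zeros don’t count as digits of a number. So, in that asymptotic sense, “all numbers contain that digit”.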

But of course, not every individual number contains every digit - for example, $5$ doesn’t contain $7$ . That’s the paradoxical charm of infinities: the set of counterexamples is infinite, yet vanishingly rare in the limiting proportion. If you pick a large number uniformly at random from a vast range, it’s very unlikely to be missing a given digit.

Extra

To sanity-check the result, here’s a small Python script, sampler.py:

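from random import randint

n = 100     # sample from the range [0, 10**n - 1]
k = 10_000  # number of samples

count = sum(
    len(set(str(randint(0, 10**n - 1)))) < 10  # True if the sample is missing at least one digit
    for _ in range(k)
)
print(f"\t{count} / {k} => {count / k * 100}%")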

Running it ( $n$ refers to the range $[0, 10^n - 1]$ and $k$ to the number of samples; the printed count is how many samples are missing at least one of the ten digits) yields:

❯ python3 sampler.py
    9 / 10000 => 0.09%
❯ python3 sampler.py
    0 / 10000 => 0.0%
❯ python3 sampler.py
    2 / 10000 => 0.02%

These are very small percentages, which gives us some numerical validation.
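For comparison, the expected count can be estimated directly. The sketch below is my own addition and rests on a simplifying assumption: each sample is modelled as a string of $n$ independent, uniformly random digits, which ignores the leading zeros that `str()` drops. Under that assumption, inclusion-exclusion over the set of missing digits gives the probability that at least one digit is absent.

```python
from math import comb


def prob_missing_some_digit(n: int) -> float:
    """Probability that a uniformly random string of n decimal digits
    omits at least one of the ten digits (inclusion-exclusion).

    Assumes independent uniform digits; leading zeros are not treated specially.
    """
    return sum(
        (-1) ** (j + 1) * comb(10, j) * ((10 - j) / 10) ** n
        for j in range(1, 10)  # the j = 10 term is zero for n >= 1
    )


if __name__ == "__main__":
    n, k = 100, 10_000
    p = prob_missing_some_digit(n)
    print(f"P(missing a digit) = {p:.3e}, expected count in {k} samples = {k * p:.2f}")
```

With $n = 100$ and $k = 10{,}000$ this model predicts only a few such samples per run (between two and three on average), so counts in the single digits are what we should expect to see.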

All Numbers Contain the Digit 7