Retro Computer

Language can tell us a lot about societal trends, technological advances, and cultural shifts. Tools like Google Ngrams have provided a way to see these trends and explore the frequency of words and phrases across centuries of literature.

Inspired by this, I wanted to explore something similar but across multiple languages and then be able to visualize it. I found an Ngrams API (based off of Google Ngrams) and created some charts across English, German, and Russian.

I started off by exploring some historical trends of the term “internet” across languages using charts which revealed some interesting insights.

eng

“Internet” usage started growing fast in the early 1990s in the English corpus, peaking around 2000. This aligns with the global expansion of internet access and its integration into everyday life. Interestingly, the usage began to decline after this peak, possibly reflecting the emergence of new technologies and terminologies.

ger

Similar to the English data, the German corpus shows a sharp increase in the usage of “Internet” starting in the 1990s.

rus

Russian data has a unique pattern. Significant usage of the term “Интернет” (internet) only began after 2000, with a dramatic increase after 2010. This suggests a later but more intense adoption of the term in Russian.

treemap

To put it all together and compare all three languages, I created a treemap that shows the usage of “Internet” in 2019 across English, German, and Russian. With the historical context in mind, I chose to focus on the year 2019 because by this time, the term “Internet” had established itself across all three languages, making it an ideal year to analyze how each language’s usage contributes to the whole.