Most frequent Chinese characters appearing in metro station names
May 06, 2025
When I arrive in a new Chinese city, one problem I have is not knowing how to read the names of the metro stations I need to go to. So I scraped a 3,237 station names across 14 cities from Wikipedia to familiarize myself with the most frequent characters.
Plots
There are 3 plots below, which show both the most frequent character and also character combinations. The plots are limited to the top characters that appear the most, but you can see the ranking of all characters in this Google Sheet.




Code
You can see this Jupyter Notebook for how I:
- Scraped and parsed the data from Wikipedia
- Computed character and combination frequencies
- Got the pinyin for the characters
- Made the plots!