Information Visualization · University of Bologna · 2025
A data-driven portrait of 849 universities, from 1879 to the present
The development of China's university system has been shaped by distinct historical phases, including political transitions, institutional reforms, and economic expansion. From a handful of colonial-era institutions in the 1880s, the system grew into a network spanning every corner of a continent-sized country. This project uses Wikidata records to trace four dimensions of that story: temporal rhythm, provincial inequality, regional sequencing, and geographic drift.
Annual count of newly founded universities (bars) overlaid with cumulative total (line). Four political inflection points are annotated. The x-axis begins at 1895 — only 3 institutions predate this.
The surges above were not distributed evenly across the country. But simply counting universities hides a deeper question — are the biggest provinces really the "densest" in higher education?
Left: a bubble map showing absolute university counts per province (bubble size proportional to number of universities). Right: a ranked bar chart — toggle between absolute count and universities per million inhabitants to see how the ranking shifts when province size is taken into account.
👇 Click a button — watch the ranking rearrange
Whether measured in absolute counts or per capita, today's distribution is a snapshot. But how did it come to be? The next chart reveals when each region's stock of universities was actually built — and why the per-capita ranking looks the way it does.
If the East has more universities today, did it always? This heatmap reveals when each region's stock of universities was actually built.
Color intensity shows what share of a region's total universities were founded in each decade.
▶ Pattern: Coastal regions show a slow, organic trickle over 100 years.
▶ Pattern: Interior regions show a sudden, massive spike concentrated almost entirely in the 1950s.
If the balance of university founding shifted over time, did the geographic center of gravity shift too? The final map traces this movement across 150 years.
Mean geographic center of universities founded in each decade, plotted as a trajectory on an interactive map. Circle size reflects the number of universities with valid GPS coordinates in that decade. The trajectory is confined to a surprisingly small region — the middle and lower Yangtze basin.
Circle size = universities with valid coordinates · Color gradient: blue (1890s) → red (2020s) · Dotted line = trajectory · Scroll to zoom
Limitations & Data Quality
Six known issues that affect interpretation of the findings above
849 records represent roughly 65% of China's ~1,300 bachelor-degree-granting institutions. The dataset is biased toward historically prominent or internationally visible universities, underrepresenting newer regional colleges and vocational schools.
Only ~63% of records include valid GPS coordinates. Inner Mongolia (11%), Shanxi (43%), and Henan (50%) are most affected, introducing systematic bias into the RQ4 mean-center trajectory.
Some records lack an inceptionYear; others record reorganization dates rather than true founding years. Post-2010 data is sparse — recent institutions are likely under-documented in Wikidata.
Analysis is restricted to mainland China and Hong Kong SAR. Taiwan is excluded on historical grounds (independent system post-1949); Macau is excluded because Wikidata returns no universities for it. Per-capita figures in RQ2 use 2020 census population, so they reflect current density — not historical density at the time each institution was founded.
The SPARQL query targets Wikidata class Q3918 ("university"), which includes military academies, party schools, and research institutes that may not qualify as conventional higher education institutions.
Data reflects Wikidata's state at query time (2025). The apparent drop-off after 2010 should not be interpreted as a real slowdown — it reflects documentation lag, not declining institutional growth.