Hakka Chinese linguistic data

Last updated ·

Select languages above to compare their features side by side

Common questions about Hakka Chinese

What linguistic data does this Hakka Chinese page show?
Word order, tone system, gender count, case marking, adposition direction, syllable structure (including final consonants), consonant inventory traits, vowel system, morphological alignment, script, register stratification, speaker count, and geographic area. Each row is one feature with Hakka's value visible; you can add other languages to read the same feature side by side.
Where do the Hakka Chinese data points come from?
Typological features are merged from URIEL+ (Mortensen et al.) and a curated set authored against descriptive grammars. Speaker counts come from Ethnologue and Glottolog. Geographic area is computed from the Asher 2007 world language atlas. Similarity scores combine genetic distance, typological overlap, and lexical-borrowing data.
Why is Hakka so geographically scattered?
The Hakka people (literally 'guest families') migrated south from northern China in waves between the 4th and 13th centuries, settling in mountainous interior regions of Guangdong, Fujian, and Jiangxi. From the 19th century onward, large numbers emigrated to Taiwan, Southeast Asia, and the Americas. The diaspora pattern explains why Hakka is found in unconnected speech communities across two continents.
How do Hakka tones compare to other Sinitic varieties?
Hakka has six contour tones (the exact count varies by sub-variety). Like Cantonese and Min Nan, Hakka kept the Middle Chinese tone categories more conservatively than Mandarin, which collapsed the entering tones. Tone sandhi is less extensive in Hakka than in Min Nan but still active.
Why does Hakka share a high similarity score with Cantonese?
Both are conservative Sinitic varieties that retained Middle Chinese final stops and voicing contrasts that Mandarin lost. They share most syntax (SVO, classifiers, isolating morphology) and a chunk of cognate vocabulary, even though they aren't mutually intelligible. The factor breakdown chip on the row tells you which dimensions contributed most.

Sources for Hakka Chinese

The grammatical descriptions on this page are informed by the following published reference and descriptive grammars. Grammatical facts themselves are not subject to copyright; the scholars who documented them deserve attribution.

  1. Hashimoto, Mantaro J. (1973). The Hakka Dialect: A Linguistic Study of its Phonology, Syntax and Lexicon. Cambridge University Press.
  2. Chappell, Hilary (2001). "Synchrony and diachrony of Sinitic languages: A brief history of Chinese dialects." In Sinitic Grammar: Synchronic and Diachronic Perspectives, Oxford University Press.
  3. Lee, Wai-Sum & Zee, Eric (2009). "Hakka Chinese." Journal of the International Phonetic Association, 39(1), 107–111.
  4. 臺灣客家語拼音方案 (Taiwanese Hakka Romanization System), Ministry of Education, Republic of China, 2012 revision.
  5. Lau, Chun-fat (2001). A Grammar and Lexicon of Hakka: Historical Materials from the Basel Mission Library. HAL / INALCO.
  6. 客家語常用詞辭典 (Hakka Common Words Dictionary), Ministry of Education, Taiwan. https://hakkadict.moe.edu.tw/

See all data sources and dataset-level citations for the broader bibliography.

enzhesfrpt