Malay linguistic data

Last updated ·

Select languages above to compare their features side by side

Common questions about Malay

What linguistic data does this Malay page show?
Word order, tone, gender count, case marking, adposition direction, syllable structure, consonant inventory traits, vowel system, morphological alignment, script, register stratification, speaker count, and geographic area. Each row is one feature with Malay's value visible; you can add other languages to read the same feature side by side.
Where do the Malay data points come from?
Typological features are merged from URIEL+ (Mortensen et al.) and a curated set authored against descriptive grammars. Speaker counts come from Ethnologue and Glottolog. Geographic area is computed from the Asher 2007 world language atlas. Similarity scores combine genetic distance, typological overlap, and lexical-borrowing data.
What's the relationship between Malay and Indonesian?
Both are standardized varieties of the Malay language, mutually intelligible but with diverging orthography, vocabulary, and loanword sources. Malaysian Malay leans toward English and Arabic borrowings; Indonesian leans toward Dutch and Javanese. Brunei Malay is a third standard. At the everyday-spoken level the differences are smaller than the standards make them look.
How does Malay reduplication work?
Full reduplication (orang 'person' → orang-orang 'people', many people') marks plurals or genericity; partial reduplication (lelaki 'man' from laki) is lexicalized. Reduplication can also signal intensification (cepat 'fast' → cepat-cepat 'very fast / hurriedly') or reciprocal/repeated action.
Why is Malay's similarity score with Tagalog only moderate despite both being Austronesian?
They sit in different sub-branches: Malay is Malayic (Western Malayo-Polynesian), Tagalog is in the Philippine sub-branch with its famous voice/focus marking system Malay doesn't replicate. Lexical overlap is real but not heavy; typology agrees on the broad strokes but not the morphology. The factor breakdown chip on the row tells you which dimensions contributed most.

Sources for Malay

The grammatical descriptions on this page are informed by the following published reference and descriptive grammars. Grammatical facts themselves are not subject to copyright; the scholars who documented them deserve attribution.

  1. Mintz, Malcolm W. 1994. A Student's Grammar of Malay and Indonesian. Singapore: EPB Publishers.
  2. Sneddon, James Neil et al. 2010. Indonesian Reference Grammar (2nd ed.). Crows Nest: Allen & Unwin.
  3. Sneddon, James Neil. 2000. Understanding Indonesian Grammar. Crows Nest: Allen & Unwin.
  4. Nik Safiah Karim et al. 2008. Tatabahasa Dewan (3rd ed.). Kuala Lumpur: Dewan Bahasa dan Pustaka.
  5. Jong, Serena P. W. & Cheung, Lawrence Y. L. 2022. Syntactic Analysis of -kah in Malay Polar Questions. PACLIC 36.

See all data sources and dataset-level citations for the broader bibliography.

enzhesfrpt