Glossary

This glossary lists preferred terms and terms to avoid when writing about languages, communities, and identities in the context of community language documentation and archiving.

Language terms

Terms to use Terms to avoid
Adivasi language Tribal language
Local language Vernacular
Mother tongue Minority language
Low-resource language
Dialect / बोली

Note on “dialect” and बोली: These terms carry a hierarchy that implies the language is lesser or incomplete. Prefer “language”, “mother tongue”, or “local language” unless the speaker themselves uses these words to describe their own speech.

Community and identity terms

Terms to use Terms to avoid
Dalit Caste slurs such as “chamar”, “chahar”, “Harijan” — do not use at all unless directly quoting someone; ensure community dignity is maintained
Savarna Upper-caste
Women Female
Men Male

Note on caste slurs: Words such as “chamar”, “chahar”, and “Harijan” are deeply harmful. Do not use them in documentation, captions, metadata, or descriptions. If you must quote someone who used such a word, make clear it is a direct quotation and add a note about the term’s impact.

Note on “female” / “male”: These are biological/medical classifications. When referring to people in social or community contexts, use “women” and “men”.