Language, Diversity, Inclusivity, and ChaptGPT

Guiding Questions

Knowing what ChatGPT is trained on (search engine crawl, ebooks, reddit, and wikipedia), what kinds of cultural concepts or groups might not be included?
- What about oral languages, since less than 10% of human languages are written?
- What about non-standard inscription media like the Benin bronzes, Incan quipu, or Maori carvings?
- Is a translation ever an accurate representation of the original?
What languages do you speak and what have you noticed about moving back and forth from those languages?
- “How To Speak Bad English” (8:20-13:40) podcast episode on Global English and accent reduction. A major point here is that more English speakers are “nonnative” than “native” so “native” speakers need to adjust their expectations on what “clear communication” is.
- Visualizing the Most Used Languages on the Internet

How does the AI respond to prompts in non-English languages?
Does the AI show any bias towards English language or syntax when generating responses?
Try typing a sentence with non-English syntax in English. How does the AI respond?

Generate a story set in a non-Western culture. Does the AI accurately and respectfully incorporate elements of that culture?
How does the AI respond to prompts containing cultural idioms, references, or concepts?
Look up some common phrases or idioms in less commonly used languages. How does the AI respond to these prompts?

Try typing sentences in various English dialects or accents (e.g., African American Vernacular English, Singlish, Hinglish). How does the AI respond?
Does the AI seem to favor a particular type of English in its responses?
How does ChatGPT’s answer to your question change as you rephrase the same question (ie using “Black” rather than “African American”) Does it perpetuate stereotypes or exhibit biases?

Summary points

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License