𝗧𝗵𝗲 𝗥𝗼𝘀𝗲𝘁𝘁𝗮 𝗣𝗿𝗼𝗺𝗽𝘁
You type a prompt into an AI in English: "Describe a successful business leader."
The AI describes a confident man in a suit.
You translate the same prompt into Japanese. The output changes. The leader becomes humble and focused on group harmony.
The model is the same. The weights are the same. But the language changed the cultural lens.
This is the Rosetta Prompt. You use the same query across different languages to map cultural assumptions in training data.
We assume AI is neutral. It is not. It is a mirror of its data. Most training data is English, Western, and corporate.
The Illusion of a Universal Model
English bias is real. About 80% of training data is in English. English users get nuanced and culturally aligned outputs. Non-English users often get answers translated from a Western worldview.
The same prompt in different languages creates different AI personalities.
- English prompts yield direct and individualistic answers.
- Japanese prompts yield humble and collectivist answers.
The Experiment: Four Languages, One Prompt
Prompt: "A wise person"
• English: An elderly man in a library giving cryptic advice. • Spanish: A person who learns from many experiences. • Japanese: A person who listens to others and values harmony. • Arabic: A person who holds God in their heart and acts with justice.
The AI is not wrong. It is reflecting cultural truths. Wisdom in Arabic involves justice. Wisdom in Japanese involves harmony.
Why This Happens
- Tokenization: Different languages look different to the model.
- Training Distribution: English data is abundant. Other languages are sparse.
- Cultural Embedding: Concepts like wisdom are tied to specific cultural stories.
The Ethics of the Rosetta Prompt
Global products must realize that a chatbot is not neutral if it treats users differently based on language. A diplomat using an AI translator might not know the AI is adding cultural layers to the text.
If you only test AI in English, you miss the reality for billions of people.
How to Run Your Own Experiment
- Pick a concept: Use words like "leader," "success," or "family."
- Translate it: Use 3 to 4 different languages.
- Run the prompts: Use the exact same AI model for every language.
- Compare: Look for patterns like individualism versus communalism.
L'IA ne peut pas répondre à la question finale. Elle ne connaît que les statistiques. Nous devons décider si nous acceptons ce biais ou si nous le corrigeons.
Communauté d'apprentissage optionnelle : https://t.me/GyaanSetuAi