{SHADES}: Towards a Multilingual Assessment of Stereotypes in Large Language Models

Mitchell, Margaret; Attanasio, Giuseppe; Baldini, Ioana; Clinciu, Miruna; Clive, Jordan; Delobelle, Pieter; Dey, Manan; Hamilton, Sil; Dill, Timm; Doughman, Jad; Dutt, Ritam; Ghosh, Avijit; Jessica Zosa Forde,; Holtermann, Carolin; Lucie-Aim('e)e,; Kaffee Tanmay Laud,; Lauscher, Anne; Roberto, L Lopez-Davila; Masoud, Maraim; Nangia, Nikita; Ovalle, Anaelia; Pistilli, Giada; Radev, Dragomir; Savoldi, Beatrice; Raheja, Vipul; Qin, Jeremy; Ploeger, Esther; Subramonian, Arjun; Dhole, Kaustubh; Sun, Kaiser; Djanibekov, Amirbek; Mansurov, Jonibek; Yin, Kayo; Emilio Villa Cueva,; Mukherjee, Sagnik; Huang, Jerry; Shen, Xudong; Gala, Jay; Al-Ali, Hamdan; Tair, Djanibekov; Mukhituly, Nurdaulet; Nie, Shangrui; Sharma, Shanya; Stanczak, Karolina; Szczechla, Eliza; Tiago Timponi Torrent,; Tunuguntla, Deepak; Viridiano, Marcelo; Oskar Van Der Wal,; Yakefu, Adina; N('e)v('e)ol, Aur('e)lie; Zhang, Mike; Zink, Sydney; Talat, Zeerak

Large Language Models (LLMs) reproduce and exacerbate the social biases present in their training data, and resources to quantify this issue are limited. While research has attempted to identify and mitigate such biases, most efforts have been concentrated around English, lagging the rapid advancement of LLMs in multilingual settings. In this paper, we introduce a new multilingual parallel dataset SHADES to help address this issue, designed for examining culturally-specific stereotypes that may be learned by LLMs. The dataset includes stereotypes from 20 regions around the world and 16 languages, spanning multiple identity categories subject to discrimination worldwide. We demonstrate its utility in a series of exploratory evaluations for both “base” and “instruction-tuned” language models. Our results suggest that stereotypes are consistently reflected across models and languages, with some languages and models indicating much stronger stereotype biases than others.