Pierre Le Jeune

StereoTales: Multilingual Framework for Open-Ended Stereotype Discovery in LLMs
Blog

Every frontier LLM generates harmful stereotypes in open-ended generation

When given the freedom to write stories, do frontier LLMs fall back on harmful stereotypes? Giskard's R&D team prompted 23 leading models to generate over 650,000 open-ended stories across 10 languages, then analyzed the demographic associations they produced. Every single model generated harmful stereotypes, many of which the models themselves recognized as harmful.

Pierre Le Jeune - Lead Machine Learning Researcher
Pierre Le Jeune
View post