Document Type

Poster

Publication Date

Fall 2025

Abstract

This study benchmarks four leading LLMs (GPT-5.2, Claude 4.5, Gemini 3, and Grok-4) on cross-cultural advertising transcreation, using Turkish as a particularly revealing test case. Each model generated Turkish billboard taglines for three American brands (Hinge, Manscaped, and Oatly) under real-world constraints, and all outputs were scored using the Levent Index, a 100-point framework I created by synthesizing transcreation best practices with the Turkish linguistic instincts I developed growing up in Istanbul. The Index measures Linguistic Soul (40%), Cultural Resonance (40%), and Structural Flow (20%), and the complete rubric is available upon request. Gemini 3 dominated performance, yet no model exceeded 75 points. What was surprising is that all models handled the mechanical aspects of Turkish (pronoun economy, compression, syllabic rhythm) reasonably well, but consistently fell short on the creative and cultural dimensions, the ability to not just sound local, but feel local. Grok-4 catastrophically scored 1 point on Oatly's sarcastic tone, while GPT-5.2 underperformed despite its market dominance. These findings point to significant gaps in multilingual creative capabilities that likely extend well beyond Turkish.

Creative Commons License

Creative Commons Attribution 4.0 License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.