OpenAI's recent models, starting with GPT‑5.1, developed an unusual linguistic tic: a growing tendency to use metaphors involving goblins, gremlins, and other fantastical creatures. Unlike typical bugs, which performance metrics tend to flag, this shift crept into responses gradually and at first looked like a harmless quirk.
The prevalence of these creature metaphors became impossible to ignore across model generations. The "goblin problem" first became clearly identifiable after the GPT‑5.1 launch in November 2025, when user complaints about overfamiliarity prompted an investigation.
Use of the word "goblin" in ChatGPT responses rose by 175% after the GPT‑5.1 launch, while "gremlin" rose by 52%. The increase was not considered alarming at first, but the issue resurfaced more intensely with GPT‑5.4.
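
To make those percentages concrete, here is a minimal Python sketch of the arithmetic behind a relative increase. The function names and the sample rates are hypothetical and chosen only for illustration. They are not OpenAI's measurement method or data.

```python
def percent_increase(before: float, after: float) -> float:
    """Relative change from `before` to `after`, expressed as a percentage."""
    if before <= 0:
        raise ValueError("baseline must be positive")
    return (after - before) / before * 100.0


def multiplier_from_percent(increase_pct: float) -> float:
    """Convert a percentage increase into a 'times the baseline' factor."""
    return 1.0 + increase_pct / 100.0


# Hypothetical usage rates (occurrences per million words), for illustration only.
baseline_rate = 4.0
later_rate = 11.0

print(percent_increase(baseline_rate, later_rate))  # 175.0
print(multiplier_from_percent(175.0))               # 2.75
```

Read this way, a 175% increase means the word appears at 2.75 times its earlier rate, and a 52% increase corresponds to roughly 1.52 times the earlier rate.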