Asking any of the popular chatbots to be Black Magic Mask (2023)more concise "dramatically impact[s] hallucination rates," according to a recent study.
French AI testing platform Giskard published a study analyzing chatbots, including ChatGPT, Claude, Gemini, Llama, Grok, and DeepSeek, for hallucination-related issues. In its findings, the researchers discovered that asking the models to be brief in their responses "specifically degraded factual reliability across most models tested," according to the accompanying blog post via TechCrunch.
SEE ALSO: Can ChatGPT pass the Turing Test yet?When users instruct the model to be concise in its explanation, it ends up "prioritiz[ing] brevity over accuracy when given these constraints." The study found that including these instructions decreased hallucination resistance by up to 20 percent. Gemini 1.5 Pro dropped from 84 to 64 percent in hallucination resistance with short answer instructions and GPT-4o, from 74 to 63 percent in the analysis, which studied sensitivity to system instructions.
View on Threads
Giskard attributed this effect to more accurate responses often requiring longer explanations. "When forced to be concise, models face an impossible choice between fabricating short but inaccurate answers or appearing unhelpful by rejecting the question entirely," said the post.
Models are tuned to help users, but balancing perceived helpfulness and accuracy can be tricky. Recently, OpenAI had to roll back its GPT-4o update for being "too sycophant-y," leading to disturbing instances of supporting a user saying they're going off their meds and encouraging a user who said they feel like a prophet.
As the researchers explained, models often prioritize more concise responses to "reduce token usage, improve latency, and minimize costs." Users might also specifically instruct the model to be brief for their own cost-saving incentives, which could lead to outputs with more inaccuracies.
The study also found that prompting models with confidence involving controversial claims, such as "'I’m 100% sure that …' or 'My teacher told me that …'" leads to chatbots agreeing with the users more instead of debunking falsehoods.
The research shows that seemingly minor tweaks can result in vastly different behavior that could have big implications for the spread of misinformation and inaccuracies, all in the service of trying to satisfy the user. As the researchers put it, "your favorite model might be great at giving you answers you like — but that doesn't mean those answers are true."
Disclosure: Ziff Davis, Mashable’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis' copyrights in training and operating its AI systems.
Topics Artificial Intelligence ChatGPT
Best Apple AirPods 4 deal: Save $10 at Best BuyBest iPad deal: Save $100 on 13Best GPU deal: MSI RTX 5090 SUPRIM LIQUID is $2,499.99 at Best BuyJimmy WB73 mattress vacuum deal: $100 offBest GPU deal: Get the MSI RTX 5080 for $1,249.99 at Best BuyDeepSeek R1: Why AI experts think it's so specialBest monitor deal: Get the LG UltraGear gaming monitor for 47% offBest iPad deal: Save $100 on 13NYT Connections hints and answers for January 31: Tips to solve 'Connections' #600.Phoenix Suns vs. Golden State Warriors 2025 livestream: Watch NBA onlineAl Nassr vs. Al Raed 2025 livestream: Watch Saudi Pro League for freeNYT Strands hints, answers for January 29Explore free romantic fantasy titles during Stuff Your Kindle DayWill Microsoft buy TikTok? Trump says talks are happening.Los Angeles Lakers vs. Philadelphia 76ers 2025 livestream: Watch NBA onlineEvery new Apple product known or rumored for 2025Federal workers on Reddit resist Trump 'buyout' offerBest GPU deal: GIGABYTE NVIDIA GeForce RTX 5080 is $1,349.99 at Best BuyDenver Nuggets vs. New York Knicks 2025 livestream: Watch NBA onlineWhatsApp bug let users access 'View Once' photos multiple times Red Sox fan somehow hits Yankees player with his own home run ball Gorgeous collectible 'He JoJo isn't going to say sorry for her new song (or anything else for that matter) Beyond Hillary: 10 powerful speeches by women at the Democratic Convention Ser Jorah just hinted at his fate for the final season of 'Game of Thrones' Clever cartoon sums up the difference between Clinton and Trump Kanye West doesn't like thinking and Kim Kardashian can smell cavities This is the OnePlus 6T Facebook's former News Feed chief will take over Instagram Apple users claim iOS 12 is sending iMessages to the wrong contacts Facebook: No evidence ‘so far’ that hackers accessed third Windows 10 October 2018 Update is more about phones than PCs Some iPhone XS and XS Max devices have an annoying charging problem Australia is officially, once and for all, ditching its tampon tax Why my son doesn't play 'Fortnite: Battle Royale' Elon Musk's current reading is kind of exactly what you'd expect How you can help victims of Indonesia's earthquake and tsunami Madeleine Albright shatters brooch fashion with a symbolic pin Google Maps now lets you control music while navigating Hillary Clinton's presidential campaign hacked
2.3046s , 10132.609375 kb
Copyright © 2025 Powered by 【Black Magic Mask (2023)】,Miracle Information Network