Post

Conversation

Which AI is most persuasive? New working paper w/ Zhongren Chen & Quan Le, we tested 7 frontier LLMs on 19k people. Ranking: (1) Claude; (2, tied) GPT, Gemini (3) Grok. Consistent across issues and bipartisan stances

14K

Josh Kalla

@j_kalla

The bigger finding: all of these models now consistently are more persuasive than actual campaign advertisements. Our earlier work found that chatting with older models was around as persuasive as watching a campaign ad. That's no longer true — frontier LLMs have pulled ahead.

10:04 PM · Mar 10, 2026

5,662

Views

Post your reply

Josh Kalla

@j_kalla

15h

Surprise result that differs from some prior research: information-based prompting doesn't uniformly help. It boosts Claude and Grok but hurts GPT and shows mixed results for Gemini. Prompting effects are model-dependent.

As LLMs get smarter at reasoning and coding, they're also getting more persuasive. That's a real concern for democratic societies and one that needs continuous benchmarking as models improve. Feedback welcome on this working paper! Link:

arxiv.org

Benchmarking Political Persuasion Risks Across Frontier Large...

Concerns persist regarding the capacity of Large Language Models (LLMs) to sway political views. Although prior research has claimed that LLMs are not more persuasive than standard political...

483

To view keyboard shortcuts, press question markView keyboard shortcuts

Post

Conversation

To view keyboard shortcuts, press question mark
View keyboard shortcuts