The ability of o3 to agentically use tools in sequence in its chain-of-thought in the chat interface remains a huge differentiator among AIs. I am not sure o3 is "smarter" than Gemini 2.5 or Claude 4 (both do better websites than o3, for example), but you can see how tool use makes a difference because o3 can mix searches, code use, image generation, and revise plans better: "Come up with 20 clever ideas for marketing slogans for a new mail-order cheese shop. Develop criteria and select the best one. Then build a financial and marketing plan for the shop, revising as needed and analyzing competition. Then generate an appropriate logo using image generator and build a website for the shop as a mockup, making sure to carry 5-10 cheeses that fit the marketing plan."

5:08 PM · May 22, 2025

Ethan Mollick

@emollick

I am sure that both Claude and Gemini will gain this ability at some point, since they are both good at tool use, and I am not sure that it matters much when not using the chat interface, but for chat users it does make a difference for now.

Ethan Mollick

@emollick

So Claude does do this. I haven’t managed to make it happen yet. My queries aren’t hard enough, I guess.

Quote

Alex Albert

@alexalbert__

Replying to @emollick and @peakcooper

With extended thinking on, it will automatically do it if the request is complex enough. It's not technically in the chain of thought, but it interleaves chain of thought between tool calls x.com/alexalbert__/s

Ethan Mollick

@emollick

Ha. I just had to ask nicely.

Cooper

@peakcooper

Isn’t the new feature of the Claude 4 family that they can now use interleaved thinking in tool call chains?

Ethan Mollick

@emollick

Doesn't seem to work through chat yet

khaled

@eltokh7

100%

Quote

khaled

@eltokh7

May 10

Replying to @TwannsWorld and @fchollet

2.5 is smarter but o3 is more .. savvy

Kevin Adoboe

@kevinadoboe0

Someone please open source this. The agent frameworks that exist today are decent, but you would probably have to create your own entire agent framework to get something at o3's level.

The Galois Connection

@TheGaloisCxn

It's interesting because Gemini had tools before GPT (as I recall). I know GenKit had tool access before OpenAI introduced it for developers but it didn't work with structured input/output, which I think was Gemini's early advantage. I can't help but suspect Google's AI

MarcoDotIO ᯅ

@marcodotio

I’ve always had luck with ViTs processing images of UIs made in applications like Figma and then having them replicate that in code. And now that Figma Make is a thing, the entire end-to-end pipeline can be covered by LLMs and ViTs with some minor HitL tweaking to make