Post

Conversation

I am not impressed by o3-mini, OpenAI's latest reasoning model. It's better than previous reasoning models at some things and worse at others. I remain unconvinced we're going to see further big gains from throwing more compute at o1-style models.

11:22 AM · Feb 5, 2025

5,604

Views

Post your reply

Timothy B. Lee

@binarybits

Feb 5

More details are here, exclusively for paying subscribers:

o3-mini is a mixed bag

From understandingai.org

2.3K

@ekmplsrw

Feb 5

I'm not keeping track of all of the different models. Trying to keep track of the various players... Open AI, Claude, Deepseek, etc.... Are regular computer users expected to keep track of these different models?

No. Regular computer users will mostly just use whatever ChatGPT defaults to.

153

Jamie Munro

@munro_research

AI is the future. You're early. Most people rely on one AI model. Unlock the power of multiple. One subscription, AnyModel: ChatGPT, Claude, Gemini, Llama and more

But o3 mini is less compute not more

Less inference compute, but I would expect OpenAI used more training compute—at least relative to o1-mini and possibly compared to o1.

why update on o3 mini but not deep research? and isn't ARC result enough to show we aren't hitting a wall? respectfully, I feel like you're staying committed to your prior rather than updating based on the available evidence

I started testing o3 before deep research came out. Also deep research is a product not a model. I will get there.

are you saying reasoning model is also hitting a wall?

I think it’s too soon to say that but certainly no proof that it isn’t happening.

Looks like the o-series models are still an LLM, just producing more tokens that allow it to explore the problem space with more self-reflection and trying different things. I think they should add more invocation of honest external tools, but that won't be cheap.

With all this money, these assholes are better off hiring some best humansnto try create real knowledge instead of permutating the stuff that has been written.

To view keyboard shortcuts, press question markView keyboard shortcuts

Post

Conversation

To view keyboard shortcuts, press question mark
View keyboard shortcuts