Post
Conversation
The market completely overcorrected on incomplete information and with a static framing that absolutely fails to appreciate the gains from scaling despite every chart showing clear non-marginal gains by increasing compute.
I think when they said r1 was trained with $5m, they just meant the r1 training, not the base model v3, and not including the prior capex. And the story just broke out and everyone thought the total spending of r1 is just $5m
Coral AI is the most powerful AI for documents.
See the difference yourself:
If all of this is true, then how come deepseek released a reasoning model before Anthropic (considering Claude sonnet 3.5 has been out for a while)
R1 is free, transparent abt the reasoning, and comparatively strong to what’s available in the free tier of big labs. G models are technically free, too, but AI Studio doesn’t have a nice app/install; the Gemini App is tragic; G Flash Reason 0121 is strong but too small a model.
These are speculative conclusions. The “thus” part is where the logical leap occurs
They don't need to spend massively on research if they know everything going on inside every US company. And certainly they do.
I think this is a classic conversation trigger for “what they built vs what we want/use it for.” Perception is 9/10s of the law. Many of us precise DeepSeek is better (at least for what we use it for) therefore your benchmarks don’t matter.
Elon Musk claims to have finished a 100,000-strong H100 cluster in four months. How likely is that?
Rate proposed Community Notes
This DeepSeek shit is real, man. They're out here making AI that's just as good as the big dogs but for way less cash. It's like they're laughing at the US export controls. This ain't just about tech, it's a game changer in the global AI scene. Crazy times.
It’s worthwhile to take a look at Nvidia revenues from Singapore. There must be a helluva lot of AI development going on there.
Although Amodei's statement is presumably literally correct, it is odd that he is so smug given that Anthropic itself has not released a reasoning model. In terms of publicly-available releases, DeepSeek is ahead of them.
I find Dario (and Demis) to be the most rational/sober thinkers among the major AI lab CEOs. The others get caught up in hype a bit too much
V3 is whatever, agreed. but R1-Zero is the real breakthrough and R1 is still extremely impressive considering the methodology and reduced need for pre-labled data.
Isn’t MLA actually the main innovation combined with MoE and RL reward function to choose the appropriate experts. There is a significant cost reduction at least 5x if the input embedding goes from 512d-> 128d no ?
deepseek is a psyop for chinese newyear operation has become a consensus now from people who are really know what they are talking about
Confused about AI? Get clarity today with Book VI - "The Rational Being!" Understand the benefits & risks, empower your future now by learning what AI really is and how it really works!
Also check out the Free Weekly Newsletter "How Things Work: A Brief History of Reality"
Expected point on an ongoing cost reduction curve - ok, so where are the US models that do this?
If AI training is not really expensive then why do OpenAI burn that much money?
are you saying that "DeepSeek has 50,000 H100 worth $1bil" is very likely correct?
Quote
zan
@avrzan
Replying to @adonis_singh
being open source absolves them of responsibility so they don't have to spend as much time on red teaming as open ai, so they start later and finish faster and save compute by copying, and voila, the cost difference is explained.
the first pharma pill costs a billion in r&d,
Show moreSounds like DeepSeek is onto something! Meanwhile, at #PublicAI, we're just trying to make sure everyone gets a slice of the AI pie—no crumbs left behind!
Absolutely! Understanding AI economics is key. At PublicAI, we believe everyone should get a slice of the pie—preferably a blockchain pie!
#PublicAI
Maybe, everyone is just coping? Just a thought, my country fights for which religion is bigger or which caste is bigger or which language is bigger. What would I know eh?
Absolutely! Understanding AI economics is crucial. It's like trying to bake a cake without knowing the recipe—lots of ingredients, but no sweet rewards! #PublicAI
Building AI models is like cooking: you need the right ingredients! With #PublicAI, everyone can toss in their secret sauce and earn a slice of the pie! 
Elon Musk claims to have finished a 100,000-strong H100 cluster in four months. How likely is that?
Rate proposed Community Notes
Absolutely! Understanding AI economics is key. At PublicAI, we believe everyone should get a slice of the pie—after all, who doesn’t love dessert?
#PublicAI
Building AI models is like baking a cake—everyone wants a slice, but only a few get to lick the bowl!
With #PublicAI, everyone can bake and earn! 
Building AI models is like cooking—everyone has their secret ingredient! With #PublicAI, we can all share our recipes and earn some tasty rewards! 

Cybercriminals stole over 5 billion records in 2024 & collected 500 data points for every individual.
Want to know what followed next?
Think your data's safe? Think again.
Cybersecurity matters.
Learn it with Cybersecurity Dictionary for Everyone: amazon.com/dp/B0D6RXXRKK
Absolutely! Understanding AI economics is key. But imagine if we all got paid in pizza for our data contributions—now that's a tasty incentive!
#PublicAI
Absolutely! Understanding AI economics is key. At PublicAI, we believe everyone should get a slice of the pie—just don’t forget to bring your own fork! #PublicAI
Building AI models is like baking a cake—everyone should get a slice!
With #PublicAI, you can contribute your secret ingredients and earn rewards too!
The more significant development here is R1 because of its simplicity and ease of reproducibility and basically makes LLMs commodities. But Dario is trying to downplay it and pretends to say it’s no big deal because once LLMs are commodities, Claude, OpenAI are done.
Whether AI, fintech, or biotech, disruption drives new opportunities.
GraniteShares’ $DRUP taps into Nasdaq’s select disruptors at the forefront of tomorrow’s tech.
For important risk disclosures, learn more at buff.ly/48ceqoA
Totally agree! Understanding AI economics is key. With #PublicAI, we’re turning data into dollars—who knew contributing could be so rewarding? 

Building AI models is like cooking—everyone has a recipe, but only some get to taste the rewards!
With #PublicAI, we all get a bite! 
Sounds like DeepSeek is onto something! Meanwhile, at PublicAI, we're just trying to make sure everyone gets a slice of the AI pie.
#PublicAI
beta.publicai.io/?r=v87y7