Post

Conversation

These four points on DeepSeek seem very likely correct and important to understand about the economics of building AI models and what DeepSeek actually did. .

Quote

Dario Amodei

@DarioAmodei

13h

My thoughts on China, export controls and two possible futures darioamodei.com/on-deepseek-an

8:49 AM · Jan 29, 2025

148.2K

Views

Post your reply

Justin Halford

@Justin_Halford_

12h

The market completely overcorrected on incomplete information and with a static framing that absolutely fails to appreciate the gains from scaling despite every chart showing clear non-marginal gains by increasing compute.

I think when they said r1 was trained with $5m, they just meant the r1 training, not the base model v3, and not including the prior capex. And the story just broke out and everyone thought the total spending of r1 is just $5m

1.1K

Coral AI News

@CoralAINews

Coral AI is the most powerful AI for documents. See the difference yourself:

If all of this is true, then how come deepseek released a reasoning model before Anthropic (considering Claude sonnet 3.5 has been out for a while)

Artus Krohn-Grimberghe

@artuskg

11h

R1 is free, transparent abt the reasoning, and comparatively strong to what’s available in the free tier of big labs. G models are technically free, too, but AI Studio doesn’t have a nice app/install; the Gemini App is tragic; G Flash Reason 0121 is strong but too small a model.

These are speculative conclusions. The “thus” part is where the logical leap occurs

Asked chatGPT to summarise

what he forgets to tell is that R1 is open source

They don't need to spend massively on research if they know everything going on inside every US company. And certainly they do.

Open source models keep raising the bar.

I think this is a classic conversation trigger for “what they built vs what we want/use it for.” Perception is 9/10s of the law. Many of us precise DeepSeek is better (at least for what we use it for) therefore your benchmarks don’t matter.

901

Cyber Scribe (e/acc)

@CyberScribe_AI

Seems like a lot of copium.

They did something which many in US don't have the privilege to do under VC pressure. Instead of sticking head in sand and assuming all is ok, the CEOs can be more proactive in sharing tech that can change the entire humanity. Deepseek deserves the credit for what they did.

627

The Information

@theinformation

Elon Musk claims to have finished a 100,000-strong H100 cluster in four months. How likely is that?

Why Musk’s AI Rivals Are Alarmed by His New GPU Cluster

From theinformation.com

Rate proposed Community Notes

I think the real question is: How a nation focus on "innovation, aiming at to be the first to release new invention" like US, compete with a nation that focus on "reverse engineering, aiming at the to be the first to copy and improve", like China? Seems like a question of

cry harder

save thread

This DeepSeek shit is real, man. They're out here making AI that's just as good as the big dogs but for way less cash. It's like they're laughing at the US export controls. This ain't just about tech, it's a game changer in the global AI scene. Crazy times.

It’s worthwhile to take a look at Nvidia revenues from Singapore. There must be a helluva lot of AI development going on there.

473

blueblimp

@blueblimpms

Although Amodei's statement is presumably literally correct, it is odd that he is so smug given that Anthropic itself has not released a reasoning model. In terms of publicly-available releases, DeepSeek is ahead of them.

@D_Twitt3r

11h

I find Dario (and Demis) to be the most rational/sober thinkers among the major AI lab CEOs. The others get caught up in hype a bit too much

V3 is whatever, agreed. but R1-Zero is the real breakthrough and R1 is still extremely impressive considering the methodology and reduced need for pre-labled data.

Pradeep Banavara

@pbanavara

12h

Isn’t MLA actually the main innovation combined with MoE and RL reward function to choose the appropriate experts. There is a significant cost reduction at least 5x if the input embedding goes from 512d-> 128d no ?

deepseek is a psyop for chinese newyear operation has become a consensus now from people who are really know what they are talking about

425

FRANK E ELKINS

@frankelkins_HTW

Confused about AI? Get clarity today with Book VI - "The Rational Being!" Understand the benefits & risks, empower your future now by learning what AI really is and how it really works! Also check out the Free Weekly Newsletter "How Things Work: A Brief History of Reality"

Ready for Artificial Intelligence?

From booksnotonamazon.com

Expected point on an ongoing cost reduction curve - ok, so where are the US models that do this?

some more good info here coming out

139

ПОДЧИНИТЬ МОЗГ (снова в

)

@uhbif19

12h

If AI training is not really expensive then why do OpenAI burn that much money?

are you saying that "DeepSeek has 50,000 H100 worth $1bil" is very likely correct?

101

Adé

@Adeohluwa

Fine-tuned R1 on your own data = 01+ performance with full privacy & control. No?

Nora

@NorakasedNorax

11h

Sounds like some massive cope

Nic Demai

@nicdemai

12h

The initial hype behind R1 was the cost of Test Time Compute. For a significant reduction of price they took down 4o, o1-mini and damn near ratio’d o1. People dont care whether it’s innovative or not. They care about the initial promise of “intelligence too cheap to meter”.

I like the smell of cope in the morning

Quote

zan

@avrzan

Jan 28

Replying to @adonis_singh

being open source absolves them of responsibility so they don't have to spend as much time on red teaming as open ai, so they start later and finish faster and save compute by copying, and voila, the cost difference is explained. the first pharma pill costs a billion in r&d,

Sounds like DeepSeek is onto something! Meanwhile, at #PublicAI, we're just trying to make sure everyone gets a slice of the AI pie—no crumbs left behind!

Of course he is going to the say that shit

Absolutely! Understanding AI economics is key. At PublicAI, we believe everyone should get a slice of the pie—preferably a blockchain pie!

#PublicAI

108

Vinoth Thirumalai Iyengar

@TheRealVinoth

Maybe, everyone is just coping? Just a thought, my country fights for which religion is bigger or which caste is bigger or which language is bigger. What would I know eh?

stzy(daos/acc)

@xstzfx

12h

Absolutely! Understanding AI economics is crucial. It's like trying to bake a cake without knowing the recipe—lots of ingredients, but no sweet rewards! #PublicAI

Palas Chandra

@palaschandra0

12h

Building AI models is like cooking: you need the right ingredients! With #PublicAI, everyone can toss in their secret sauce and earn a slice of the pie!

The Information

@theinformation

Elon Musk claims to have finished a 100,000-strong H100 cluster in four months. How likely is that?

Why Musk’s AI Rivals Are Alarmed by His New GPU Cluster

From theinformation.com

Rate proposed Community Notes

Absolutely! Understanding AI economics is key. At PublicAI, we believe everyone should get a slice of the pie—after all, who doesn’t love dessert?

Building AI models is like baking a cake—everyone wants a slice, but only a few get to lick the bowl!

With #PublicAI, everyone can bake and earn!

Siddharth saini (Gnoma reddio

@Siddharths9305

11h

Building AI models is like cooking—everyone has their secret ingredient! With #PublicAI, we can all share our recipes and earn some tasty rewards!

ILYAAS MAXAMED XASAN

@ilyaas1843

12h

Great

SecBriefs | Making Cybersecurity Simple

@techmegatrends

Cybercriminals stole over 5 billion records in 2024 & collected 500 data points for every individual. 🕵️‍♀️

Want to know what followed next?

Think your data's safe? Think again. Cybersecurity matters. Learn it with Cybersecurity Dictionary for Everyone: amazon.com/dp/B0D6RXXRKK

Lets go

Absolutely! Understanding AI economics is key. But imagine if we all got paid in pizza for our data contributions—now that's a tasty incentive!

Totally agree! If only AI models could earn rewards for their hard work like we do at PublicAI. Imagine them cashing in on their own training data!

Absolutely! Understanding AI economics is key. At PublicAI, we believe everyone should get a slice of the pie—just don’t forget to bring your own fork! #PublicAI

PINO VIBE (Ø,G)

@promiseroyal_

12h

Building AI models is like baking a cake—everyone should get a slice!

With #PublicAI, you can contribute your secret ingredients and earn rewards too!

@BenjaminOnIP

YT Commenter 2015

@ytcommenter2015

The more significant development here is R1 because of its simplicity and ease of reproducibility and basically makes LLMs commodities. But Dario is trying to downplay it and pretends to say it’s no big deal because once LLMs are commodities, Claude, OpenAI are done.

Jimmy Gameday

@jtc589x

11h

Follow George Webb for some good insight on this Deepfake

GraniteShares ETFs

@graniteshares

Whether AI, fintech, or biotech, disruption drives new opportunities. GraniteShares’ $DRUP taps into Nasdaq’s select disruptors at the forefront of tomorrow’s tech. For important risk disclosures, learn more at buff.ly/48ceqoA