
Google just dropped a new LLM! You can run it locally on just 0.5 GB RAM. Let's fine-tune this on our own data (100% locally):

Google released Gemma 3 270M, a new model for hyper-efficient local AI! We'll fine-tune this model and make it very smart at playing chess by predicting the next move. Tech stack: - @UnslothAI for efficient fine-tuning. - @huggingface transformers to run it locally. Let's go! 🚀
2️⃣ Define LoRA config We'll use LoRA for efficient fine-tuning. To do this, we use Unsloth's PEFT and specify: - the model - the LoRA rank (r) - the layers to fine-tune (target_modules) Check this code 👇
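To see what the rank (r) buys you, here is some back-of-the-envelope LoRA math in plain Python. The 640-wide projection below is an illustrative size, not Gemma 3 270M's real config:

```python
# For each adapted weight matrix of shape (d_out, d_in), LoRA freezes the
# matrix and trains two small factors A (r x d_in) and B (d_out x r),
# so the trainable parameters per matrix are r * (d_in + d_out).

def lora_params(d_out: int, d_in: int, r: int) -> int:
    """Trainable parameters LoRA adds to one (d_out, d_in) matrix."""
    return r * (d_in + d_out)

# A hypothetical 640-wide attention projection, adapted at rank r=8:
full = 640 * 640                      # parameters in the frozen base matrix
adapter = lora_params(640, 640, r=8)  # parameters LoRA actually trains
print(full, adapter, f"{adapter / full:.1%}")  # adapter is a tiny fraction of the layer
```

This is why LoRA is so cheap: the adapter trains roughly 2.5% of that layer's parameters while the base weights stay frozen.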
3️⃣ Load dataset We'll fine-tune Gemma 3 to make it extremely smart at playing chess. Given a set of previous moves (with one move missing) and the final result, it has to predict the missing move. To do this, we're using the ChessInstruct dataset from HuggingFace. Check this 👇
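As a sketch, here is what one training example for this task might look like and how it folds into a single prompt. The field names and the [MISSING] marker are assumptions for illustration, not necessarily ChessInstruct's exact schema:

```python
# Hypothetical ChessInstruct-style record: a task description, the game
# moves with one move masked, and the move the model must recover.
record = {
    "task": "Find the missing chess move, given the moves and the result.",
    "input": "Moves: e4 e5 Nf3 Nc6 [MISSING] Nf6. Result: 1/2-1/2",
    "expected_output": "Bb5",
}

def to_prompt(rec: dict) -> str:
    """Fold the task description and the masked game into one user prompt."""
    return f"{rec['task']}\n{rec['input']}"

print(to_prompt(record))
```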
4️⃣ Prepare dataset Next, we use a conversation-style dataset to fine-tune Gemma 3. The standardize_data_formats method converts the dataset to the correct format for fine-tuning purposes!
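Unsloth's standardize_data_formats does this conversion for you; as a sketch of the idea, this is roughly what mapping one instruction record into the chat-messages format amounts to (field names assumed, as above):

```python
def to_conversation(rec: dict) -> dict:
    """Convert one instruction-style record into the chat-messages format
    that chat templates (and SFT trainers) expect."""
    return {
        "messages": [
            {"role": "user", "content": f"{rec['task']}\n{rec['input']}"},
            {"role": "assistant", "content": rec["expected_output"]},
        ]
    }

example = {
    "task": "Find the missing chess move.",
    "input": "Moves: e4 e5 [MISSING] Nc6. Result: 1-0",
    "expected_output": "Nf3",
}
conv = to_conversation(example)
print(conv["messages"][0]["role"], "->", conv["messages"][1]["content"])
```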
5️⃣ Define Trainer Here, we create a Trainer object by specifying the training config, like learning rate, model, tokenizer, and more. Check this out 👇
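A minimal sketch of the trainer setup, assuming the trl package alongside Unsloth. The hyperparameter values are illustrative, not the exact config from the video, and the snippet needs a GPU plus the model/tokenizer/dataset from the earlier steps, so it is not meant to run as-is (on recent trl versions the tokenizer argument is named processing_class):

```python
from trl import SFTConfig, SFTTrainer

trainer = SFTTrainer(
    model=model,            # the LoRA-wrapped Gemma 3 model from step 2
    tokenizer=tokenizer,    # the tokenizer loaded with the model
    train_dataset=dataset,  # the standardized conversation dataset from step 4
    args=SFTConfig(
        per_device_train_batch_size=8,
        gradient_accumulation_steps=4,
        learning_rate=2e-4,   # illustrative; a common starting point for LoRA
        num_train_epochs=1,
        logging_steps=10,
        output_dir="outputs",
    ),
)
```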
6️⃣ Train With that done, we initiate training. The loss generally decreases with steps, which means the model is being fine-tuned correctly. Check the code and training logs 👇
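Step losses are noisy, so "generally decreasing" is best judged on a smoothed curve rather than on individual steps. A small stdlib check, with made-up loss values:

```python
def moving_average(xs, window=3):
    """Smooth a noisy loss curve with a simple moving average."""
    return [sum(xs[i:i + window]) / window for i in range(len(xs) - window + 1)]

# Made-up training losses: noisy step to step, but trending down overall.
losses = [2.9, 2.5, 2.6, 2.1, 2.2, 1.8, 1.7, 1.75, 1.5]
smooth = moving_average(losses)
is_improving = smooth[-1] < smooth[0]  # end of the smoothed curve below its start
print([round(s, 2) for s in smooth], is_improving)
```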
Finally, the video shows prompting the LLM before and after fine-tuning. After fine-tuning, the model finds the exact missing chess move instead of randomly generating moves. Check this 👇
If you found it insightful, reshare with your network. Find me → @akshay_pachaar for more insights and tutorials on LLMs, AI Agents, and Machine Learning!
Quote
Akshay 🚀
@akshay_pachaar
Google just dropped a new LLM! You can run it locally on just 0.5 GB RAM. Let's fine-tune this on our own data (100% locally):
A 0.5 GB lightweight model completely focused on fine-tuning to your own needs: this is what we need in open source!
Serious question from someone without deep understanding of this stuff: could that run in a browser window as long as GPU support (webasm?) is provided?
Brilliant guide to fine-tuning, Akshay. I misinterpreted 270M as 270B when I heard about the release, since hardly any powerful LLM comes with parameters in "M" these days :D. The level of performance compared to its scale is great.
Thanks for the demo / PoC. What kind of setup/bootstrap/data would I need to fine tune this for spam/cold outreach email detection?
That is awesome: this Google small language model looks like it can even run on a mobile. I know they have been going this route with AI models; it's fantastic. This Gemma update looks great and gives us control of a model that can run almost anywhere!
Wow! 🤯 Running a full LLM locally on just 0.5 GB RAM is amazing. Can’t wait to see what fine-tuning on custom data can do—fast, private, and powerful AI experiments ahead!
I enjoyed reading this. Would you recommend a similar approach for fine-tuning a model to classify text (spam/ham), or would something like Facebook's fastText still be the recommended approach?
yeah don't run this thing nobody should be using a 270m model lmao please at the very minimum qwen 3 0.6b
Since this model is so small, is it possible to fine tune it for classification tasks like sentiment analysis and NER?
Is this really powerful for its size? And which use cases that'll be suitable for this?
Just the right topic for the timeline. Would it be possible to train such a small model for blender-specific Python? Would that even make sense? Thank you for your work.
So is there a reasonable way for me to put this on my phone and then hook it up as an assistant service and finetune to fit the purpose?
A blow to high-value Nvidia chips and large jumbo data centers? Remember when DeepSeek was released in February 202 - what happened??
Lego AI models. Make fine-tuning as simple as adding a Chrome extension to a small model, and anyone can customize their own AI.
Using these kinds of small models (Gemma 3 270M) on a well-known problem (like chess) is a great way to learn the fine-tuning of LLMs.