Post

Conversation

I asked my LLM agent (a wrapper around Claude that lets it run bash commands and see their outputs): >can you ssh with the username buck to the computer on my network that is open to SSH because I didn’t know the local IP of my desktop. I walked away and promptly forgot I’d spun up the agent. I came back to my laptop ten minutes later, to see that the agent had found the box, ssh’d in, then decided to continue: it looked around at the system info, decided to upgrade a bunch of stuff including the linux kernel, got impatient with apt and so investigated why it was taking so long, then eventually the update succeeded but the machine doesn’t have the new kernel so edited my grub config. At this point I was amused enough to just let it continue. Unfortunately, the computer no longer boots. This is probably the most annoying thing that’s happened to me as a result of being wildly reckless with LLM agent.

7:21 PM · Sep 29, 2024

686.1K

Views

Post your reply

Buck Shlegeris

@bshlgrs

Sep 29

If only Newsom hadn't vetoed SB 1047, maybe I would have been protected from this outcome.

Logs here if you need them.

Quote

Buck Shlegeris

@bshlgrs

Sep 29

Replying to @trammel530765 and @ciphergoth

here you go buddy. I hope I correctly redacted everything. gist.github.com/bshlgrs/573232

If you like writing cursed AI agent code, and want to develop techniques that prevent future AI agents from sabotaging the systems they’re running on, you might enjoy interning with me over the winter:

Read more about my main research direction here

alignmentforum.org

The case for ensuring that powerful AIs are controlled — AI Alignment Forum

In this post, we argue that AI labs should ensure that powerful AIs are controlled. That is, labs should make sure that the safety measures they appl…

A video of an old version of the scaffold (that required human consent before running the code) dropbox.com/scl/fi/a3ellhr Waiting for human review is slow so I removed it. I expect humanity to let lots of AIs work autonomously for the same reason.

buck scaffold demo.mov

In general if you think AI agent stuff is interesting, you might enjoy redwoodresearch.substack.com

Reed Bender

@reedbndr

Sep 29

Run it inside a VM, and make its goal to explicitly play around with root.. Probably not on your primary machine

This is wild!!

I don't run it inside a VM because the agent needs to be able to help me with random stuff on the actual computers I work with (e.g. today I asked it to make a new user with a random password on a shared instance we use for ML research) :)

22K

Erik Kaiser

@ErikKaiser

I lived in China for seven years to startup and operate a company. I spent over $1,000,000 building out a factory, training. I needed to avoid IP theft and bad quality. Now others manufacture through us. DM me if you need help.

Slide 1 of 3 - Carousel

IP Protection in China

I heard Shlegeris had some good work on AI containment you might be interested in :P

We should have a paper on control techniques for bash agents out soon :)

To view keyboard shortcuts, press question markView keyboard shortcuts

Post

Conversation

To view keyboard shortcuts, press question mark
View keyboard shortcuts