Discussion about this post

User's avatar
Chris Willis's avatar

I’m a bit of an LLM skeptic for real-world applications, but I have to say Claude building that color app from a single prompt was extremely impressive.

(I can’t remember if it was you or ACX who was pondering the question of why there’s such a mismatch in different people’s attitudes to AI, and I agree that it has a lot to do with coding. If you only use LLMs for help with real-world tasks, and if those tasks are niche enough that you couldn’t just get the same answer by googling, then LLM performance still lags quite a bit behind the hype.)

Daniel Reeves's avatar

PS: Let me clarify a key distinction between level 2 and level 3 autonomy.

In the official levels of self-driving, both levels 2 and 3 require you, the human, to be ready to take control in real time. The difference is that at level 2 it's your responsibility to decide when to take control. You're supervising and it's up to you to disengage the AI if it's about to screw up. At level 3 you no longer have to supervise everything it does. You have to be ready to take over at any moment but the AI will get your attention if it needs you. You can read a book or otherwise do your own thing much of the time.

I advocate maintaining that distinction to whatever else we're applying these autonomy levels to.

Writing: Level 2 means you're considering every word the AI generates and using that word if and only if it's what you endorse saying, in your own voice. (Better yet, the plagiarism litmus test: don't use anyone's or anything's exact words without explicitly quoting them.) At level 3 you're still the one in charge and should read every word before publishing since you're vouching for the finished product, but you're not supervising the writing word by word as it's written.

Coding: Level 2 means you're in the integrated development environment (IDE) with all the code. At level 3 you ditch the IDE and just talk to the AI in English. You don't worry about the literal code but you're involved in implementation decisions. At least some of them.

In short, level 2 means the AI is assisting you and level 3 means you're directing the AI.

2 more comments...

No posts

Ready for more?