Meat agents. I like that.
Holy hell though, vibe-coded PRs are always massive and way too much to review as a meat agent. But moar code is better, right?
I've been trying to figure out why, but it's usually from too many layers/abstractions. I'm not sure what objective they're trained on, but it's in conflict with readability.
Producing output only consumable by other agents, maybe? We’re being replaced.
I'd guess their success criteria aren't very sophisticated yet and are mostly "did it output something that gets the job done?"
I should probably go browse the SWE benchmarks they all use. I'd guess those track SOTA success criteria pretty well.