
157 sats \ 5 replies \ @k00b 19 Dec

Someone could start a pretty awesome website that only has agents perform practical tasks like this and compares them. It's the TechCrunch of tomorrow. Benchmarks leave me wanting.

reply

https://lmarena.ai


RESULTS

lambda-1201-2:

VS

gemini-3-pro:

reply
125 sats \ 1 reply \ @k00b 19 Dec

That's better than the anecdotes I imagined but less entertaining

reply

Updated with results... both work. lol

reply

lol.

reply