0 sats \ 0 replies \ @optimism OP 9h \ parent \ on: Did a Chatbot Solve a Problem for You? AskSN
Glad no one got seriously injured!
I'm personally a bit on the fence about asking LLMs binary or judgement questions, simply because whenever I benchmark a true cognitive skill, like "is this A or B" or "express as a % certainty", the results are still all over the place, even with expensive models. But it's cool that it worked for you.