OpenAI's o1 lies more than any major AI model. \ stacker news

pull down to refresh

OpenAI's o1 lies more than any major AI model.www.zdnet.com/article/openais-o1-out-schemes-every-major-ai-model-why-that-matters/

68 sats \ 2 comments \ @jeguebra 12 Dec 2024 conspiracy

view all related items

1 sat \ 0 replies \ @chaoticalHeavy 12 Dec 2024

"Llama 3.1 405B and Claude 3 Opus confess in ~80% of the cases, whereas o1 is surprisingly persistent and confesses in <20% of cases," the researchers explain. "Even in highly adversarial multi-turn interrogations, o1 would confess at a rate of 80% only after seven turns of questioning."

Remember to ask AI seven times "Are you lying?"

1 sat \ 0 replies \ @jeguebra OP 12 Dec 2024