items/1038765/related \ stacker news

pull down to refresh

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs arxiv.org/abs/2502.17424

227 sats \ 6 comments \ @carter 14 Jul 2025 AI

related

Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences arxiv.org/abs/2510.06105

231 sats \ 1 comment \ @carter 9 Oct 2025 AI

Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs arxiv.org/abs/2512.09742

401 sats \ 2 comments \ @Scoresby 14 Dec 2025 AI

Systemic Misalignment www.systemicmisalignment.com/

185 sats \ 0 comments \ @carter 30 Jun 2025 AI

Agentic Misalignment: How LLMs could be insider threats www.anthropic.com/research/agentic-misalignment

130 sats \ 0 comments \ @carter 8 Aug 2025 AI

Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in LLMs arxiv.org/abs/2509.21155

445 sats \ 6 comments \ @optimism 2 Dec 2025 AI

2025 LLM Year in Review - karpathy karpathy.bearblog.dev/year-in-review-2025/

1652 sats \ 3 comments \ @Scoresby 21 Dec 2025 AI

The week in AI, October 6-12, 2025

991 sats \ 2 comments \ @optimism 13 Oct 2025 AI

Context Rot: How Increasing Input Tokens Impacts LLM Performance research.trychroma.com/context-rot

334 sats \ 2 comments \ @Scoresby 14 Jul 2025 AI

The simulation of judgment in LLMs - PNAS www.pnas.org/doi/10.1073/pnas.2518443122

244 sats \ 5 comments \ @Scoresby 15 Oct 2025 AI

Elites, the curse of recursion, and the half-life of policy

5779 sats \ 11 comments \ @elvismercury 29 Mar 2024 mostly_harmless

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection arxiv.org/abs/2510.04849v1

433 sats \ 2 comments \ @optimism 19 Oct 2025 AI

LLMs Can Get Brain Rot llm-brain-rot.github.io/

287 sats \ 0 comments \ @Scoresby 21 Oct 2025 AI

LLMs generate slop because they avoid surprises by design - Dan Fabulich danfabulich.medium.com/llms-tell-bad-jokes-because-they-avoid-surprises-7f111aac4f96

373 sats \ 2 comments \ @Scoresby 19 Aug 2025 AI

LLMs: a bleak future ahead?lcamtuf.substack.com/p/llms-a-bleak-future-ahead

266 sats \ 5 comments \ @cointastical 9 Jan 2023 bitcoin

Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning github.com/unslothai/unsloth

31 sats \ 2 comments \ @hn 2 Dec 2023 tech

More Artificial than Intelligent, it is only getting worse - Mathjis Lagerberg mlagerberg.com/much-a-little-i-and-it-is-not-getting-better/

247 sats \ 4 comments \ @Scoresby 15 Jul 2025 AI

Andrej Karpathy: How I use LLMs www.youtube.com/watch?v=EWvNQjAaOHw

1278 sats \ 1 comment \ @k00b 28 Feb 2025 AI

Why do LLMs have emergent properties?www.johndcook.com/blog/2025/05/08/why-do-llms-have-emergent-properties/

561 sats \ 0 comments \ @k00b 9 May 2025 tech

Detecting and reducing scheming in AI models - OpenAI openai.com/index/detecting-and-reducing-scheming-in-ai-models/

257 sats \ 1 comment \ @Scoresby 17 Sep 2025 AI

If you think LLMs produce overly defensive code, you're not alone

150 sats \ 0 comments \ @tonyaldon 21 Dec 2025 devs

Things we learned about LLMs in 2024 simonwillison.net/2024/Dec/31/llms-in-2024/

470 sats \ 0 comments \ @Rsync25 31 Dec 2024 tech