sign up
sign up
sign up
sign up
pull down to refresh
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
arxiv.org/abs/2510.04721
210 sats
\
1 comment
\
@jakoyoh629
25 Oct 2025
AI
related
Mathematics in the Library of Babel — Daniel Litt
www.daniellitt.com/blog/2026/2/20/mathematics-in-the-library-of-babel
2760 sats
\
2 comments
\
@Scoresby
22 Feb
AI
-2757 sats
To Make Language Models Work Better, Researchers Sidestep Language
www.quantamagazine.org/to-make-language-models-work-better-researchers-sidestep-language-20250414/
210 sats
\
0 comments
\
@0xbitcoiner
15 Apr 2025
AI
Vibe physics
www.math.columbia.edu/~woit/wordpress/?p=15012
2355 sats
\
4 comments
\
@south_korea_ln
1 Aug 2025
science
Hallucination Stations On Some Basic Limitations of Transformer-Based LM
arxiv.org/pdf/2507.07505
213 sats
\
0 comments
\
@0xbitcoiner
23 Jan
AI
Meet the new biologists treating LLMs like aliens
www.technologyreview.com/2026/01/12/1129782/ai-large-language-models-biology-alien-autopsy/
580 sats
\
1 comment
\
@winteryeti
14 Jan
AI
How large are large language models?
gist.github.com/rain-1/cf0419958250d15893d8873682492c3e
231 sats
\
0 comments
\
@carter
14 Jul 2025
AI
Scalable MatMul-Free Language Modeling — 10x Reduction On LLMs Computation
arxiv.org/abs/2406.02528
110 sats
\
1 comment
\
@0xbitcoiner
10 Jun 2024
science
freebie
Why language models hallucinate - OpenAI
openai.com/index/why-language-models-hallucinate/
438 sats
\
4 comments
\
@Scoresby
6 Sep 2025
AI
The ORCA Benchmark Evaluates How Well AIs Deal with Everyday Math
www.omnicalculator.com/reports/omni-research-on-calculation-in-ai-benchmark
260 sats
\
0 comments
\
@0xbitcoiner
27 Feb
AI
Why OpenAI’s solution to AI hallucinations would kill ChatGPT tomorrow
theconversation.com/why-openais-solution-to-ai-hallucinations-would-kill-chatgpt-tomorrow-265107
618 sats
\
25 comments
\
@south_korea_ln
17 Sep 2025
AI
Google releases VaultGemma, its first privacy-preserving LLM
arstechnica.com/ai/2025/09/google-releases-vaultgemma-its-first-privacy-preserving-llm/
253 sats
\
0 comments
\
@0xbitcoiner
15 Sep 2025
AI
Experimental evidence of the effects of LLMs vs web search on depth of learning
academic.oup.com/pnasnexus/article/4/10/pgaf316/8303888
176 sats
\
1 comment
\
@0xbitcoiner
20 Jan
AI
Financial Statement Analysis with Large Language Models
papers.ssrn.com/sol3/papers.cfm?abstract_id=4835311&fbclid=IwY2xjawIJNupleHRuA2FlbQIxMAABHWJxn71ESvZCS0FxEF_31oro1rwtk4rlgOst5Q4A6tuxDhxB9cgZBPizAg_aem_OAMNHiz7Vyv2bb2vt2yM0Q
222 sats
\
2 comments
\
@scatman
31 Jan 2025
AI
LLMs and the Specter of the Cognitive Black Hole
www.psychologytoday.com/us/blog/the-digital-self/202403/llms-and-the-specter-of-the-cognitive-black-hole
200 sats
\
0 comments
\
@ch0k1
22 Mar 2024
science
Large Language Models Pass the Turing Test
arxiv.org/pdf/2503.23674
374 sats
\
11 comments
\
@south_korea_ln
15 Apr 2025
AI
Prime Fields, Text Manglers and Progress Report on Indra
6263 sats
\
0 comments
\
@l0k18
1 May 2023
bitcoin
Debate May Help AI Models Converge on Truth
www.quantamagazine.org/debate-may-help-ai-models-converge-on-truth-20241108/
258 sats
\
0 comments
\
@0xbitcoiner
8 Nov 2024
science
How to turn LLM Pinocchio into a real boy
12.7k sats
\
10 comments
\
@Scoresby
7 Oct 2025
AI
On the Inevitability of Left-Leaning Political Bias in Aligned Language Models
arxiv.org/abs/2507.15328
366 sats
\
17 comments
\
@0xbitcoiner
12 Oct 2025
AI
LLMs Can Get Brain Rot
llm-brain-rot.github.io/
287 sats
\
0 comments
\
@Scoresby
21 Oct 2025
AI
Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples
arxiv.org/abs/2510.07192
130 sats
\
0 comments
\
@0xbitcoiner
9 Oct 2025
AI
more