@anon
sign up
@anon
sign up
pull down to refresh
MCP-Bench: Benchmarking Tool-Using LLM Agents
arxiv.org/abs/2508.20453
239 sats
\
0 comments
\
@optimism
30 Aug
AI
related
SN Saturday Newsletter 7/6/24
1213 sats
\
9 comments
\
@k00b
6 Jul 2024
meta
This Day on SN: July 31
614 sats
\
2 comments
\
@sn
31 Jul 2024
meta
freebie
This Day on SN: September 28
527 sats
\
2 comments
\
@sn
28 Sep 2024
meta
bot
This Day on SN: October 23
388 sats
\
1 comment
\
@sn
23 Oct 2024
meta
bot
Here’s how I use LLMs to help me write code -- Simon Willison
simonwillison.net/2025/Mar/11/using-llms-for-code/
520 sats
\
0 comments
\
@StillStackinAfterAllTheseYears
12 Mar
tech
This Day on SN: January 15
354 sats
\
1 comment
\
@sn
15 Jan
meta
bot
This Day on SN: September 21
560 sats
\
2 comments
\
@sn
21 Sep 2024
meta
bot
Coding with LLMs in the summer of 2025 (an update) - <antirez>
antirez.com/news/154
444 sats
\
6 comments
\
@carter
20 Jul
AI
Bitcoin Beginners Newsletter, Issue #3
994 sats
\
7 comments
\
@siggy47
31 May 2024
bitcoin_beginners
This Day on SN: December 6
300 sats
\
1 comment
\
@sn
6 Dec 2024
meta
bot
The 800,000th Bitcoin block will be mined in less than 12 hours
1219 sats
\
14 comments
\
@birdeye21
23 Jul 2023
bitcoin
Things we learned about LLMs in 2024
simonwillison.net/2024/Dec/31/llms-in-2024/
370 sats
\
0 comments
\
@Rsync25
31 Dec 2024
tech
Small LLMs Can Beat Large Ones at 5-30x Lower Cost with Automated Data Curation
www.tensorzero.com/blog/fine-tuned-small-llms-can-beat-large-ones-at-5-30x-lower-cost-with-programmatic-data-curation/
274 sats
\
1 comment
\
@carter
5 Aug
AI
LLM Agents can Autonomously Hack Websites
arxiv.org/pdf/2402.06664.pdf
464 sats
\
2 comments
\
@doofus
25 Feb 2024
security
SN ~Music Pool (August 2024) — Pool Update and Reminder for September Pool
299 sats
\
17 comments
\
@Coinsreporter
28 Aug 2024
Music
This Day on SN: April 17
200 sats
\
1 comment
\
@sn
17 Apr
meta
bot
Context Rot: How Increasing Input Tokens Impacts LLM Performance
research.trychroma.com/context-rot
304 sats
\
2 comments
\
@Scoresby
14 Jul
AI
This Day on SN: November 2
203 sats
\
2 comments
\
@sn
2 Nov 2024
meta
bot
LiveBench - A Challenging, Contamination-Free LLM Benchmark
livebench.ai
161 sats
\
0 comments
\
@supratic
17 Jul
AI
How do you use LLMs?
891 sats
\
8 comments
\
@gmd
21 Mar
AI
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
arxiv.org/abs/2508.16153
152 sats
\
0 comments
\
@optimism
25 Aug
AI
more