MCP-Bench: Benchmarking Tool-Using LLM Agents
arxiv.org/abs/2508.20453
269 sats
\
0 comments
\
@optimism
30 Aug 2025
AI
related
The week in AI, August 4-10, 2025
2353 sats
\
12 comments
\
@optimism
11 Aug 2025
AI
"Benchwashing" - how do you defend against this?
1748 sats
\
10 comments
\
@optimism
9 Aug 2025
AskSN
Opti's Claude 4.5 Sonnet "vibe coding" report
1155 sats
\
13 comments
\
@optimism
5 Oct 2025
AI
Jan v3 4B: great in instruction following
huggingface.co/janhq/Jan-v3-4B-base-instruct
519 sats
\
0 comments
\
@optimism
2 Feb
AI
GDPval: Measuring the performance of our models on real-world tasks - OpenAI
openai.com/index/gdpval/
388 sats
\
8 comments
\
@Scoresby
2 Oct 2025
AI
OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model
www.searchenginejournal.com/openai-secretly-funded-frontiermath-benchmarking-dataset/537760/
441 sats
\
0 comments
\
@frostdragon
21 Jan 2025
tech
Gemini 3 and Antigravity: Why Google's latest AI releases are a big deal
fortune.com/2025/11/19/google-gemini-3-antigravity-ai-explained/?utm_source=flipboard&utm_content=fortune/magazine/Personal+finance
161 sats
\
1 comment
\
@DrBrader99
19 Nov 2025
AI
Alibaba has released its flagship Qwen3-Max model with a trillion parameters
chat.qwen.ai/
197 sats
\
0 comments
\
@lunin
25 Sep 2025
AI
Not every user owns an iPhone
calendar.perfplanet.com/2024/not-every-user-owns-an-iphone/
490 sats
\
2 comments
\
@nym
9 Jan 2025
Design
The Week AI Shook Things — and Nvidia Showed Who's Boss
372 sats
\
1 comment
\
@economy
24 Nov 2025
Stacker_Stocks
AI agents find $4.6M in blockchain smart contract exploits
red.anthropic.com/2025/smart-contracts/
289 sats
\
2 comments
\
@0xbitcoiner
2 Dec 2025
AI
Laura Nursky's presentation on AI/job friction
440 sats
\
3 comments
\
@optimism
26 Sep 2025
AI
Cypher Update #2: No Nuts no Bolts, No Guts no Glory
55k sats
\
5 comments
\
@cypherspace
1 Apr 2024
builders
freebie
Launching Sovereign Chat - FOSS AI that answers all questions about Privacy
geyser.fund/project/sovereignchat/
52.8k sats
\
9 comments
\
@Marconius_Solidus
23 Apr 2024
privacy
RGB consensus layer released to production
rgb.tech/blog/release-v0-12-consensus/
144.2k sats
\
52 comments
\
@dr_orlovsky
10 Jul 2025
rgb
Episodes 183 & 184: Zero Base and Hello Tauri
187 sats
\
1 comment
\
@AtlantisPleb
24 Jul 2025
openagents
Stacker News Roundtable #2 - LSPs
87.6k sats
\
81 comments
\
@sn
13 Oct 2023
bitcoin
AI is actually bad at math, ORCA shows
www.theregister.com/2025/11/17/ai_bad_math_orca/
197 sats
\
4 comments
\
@0xbitcoiner
18 Nov 2025
AI
Launching Sovereign Chat 0.1 - FOSS AI that answers all questions about Privacy
sovereignoutcomes.com/launching-sovereign-chat-01
63.9k sats
\
7 comments
\
@Marconius_Solidus
4 Apr 2024
privacy
[New Pod] Open Source AI & Confidential Compute w/ Anthony & Marks (OpenSecret)
fountain.fm/episode/YSwEaV5aXBH0aMcliUm5
20.4k sats
\
0 comments
\
@MaxAWebster
8 May 2025
ideasfromtheedge
Introducing PayPerQ (ppq.ai), your new default GPT4 experience, powered by LN
35.2k sats
\
53 comments
\
@MattAhlborg0
28 Feb 2024
bitcoin