items/1202986/related \ stacker news

pull down to refresh

Communication Efficient LLM Pre-training with SparseLoCo arxiv.org/abs/2508.15706

130 sats \ 0 comments \ @carter 1 Sep 2025 AI

related

1-Bit LLM: The Most Efficient LLM Possible?www.youtube.com/watch?v=7hMoz9q4zv0

563 sats \ 1 comment \ @carter 24 Jun 2025 AI

Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing arxiv.org/abs/2508.12631

187 sats \ 9 comments \ @carter 22 Aug 2025 AI

LLaDA2.0: Scaling Up Diffusion Language Models to 100B arxiv.org/abs/2512.15745

299 sats \ 0 comments \ @optimism 19 Dec 2025 AI

LLM evaluation at scale with the NeurIPS Efficiency Challenge blog.mozilla.ai/exploring-llm-evaluation-at-scale-with-the-neurips-large-language-model-efficiency-challenge/

210 sats \ 0 comments \ @localhost 22 Feb 2024 tech

LLMs Can Get Brain Rot llm-brain-rot.github.io/

287 sats \ 0 comments \ @Scoresby 21 Oct 2025 AI

Meta Open-Sources MEGALODON LLM for Efficient Long Sequence Modeling www.infoq.com/news/2024/06/meta-llm-megalodon/

125 sats \ 0 comments \ @TheWildHustle 11 Jun 2024 opensource

Efficient LLM Inference arxiv.org/abs/2507.14397

151 sats \ 0 comments \ @carter 3 Oct 2025 AI

LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)github.com/hiyouga/LLaMA-Factory

187 sats \ 0 comments \ @carter 19 Sep 2025 AI

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs arxiv.org/abs/2508.16153

182 sats \ 0 comments \ @optimism 25 Aug 2025 AI

Elites, the curse of recursion, and the half-life of policy

5779 sats \ 11 comments \ @elvismercury 29 Mar 2024 mostly_harmless

Deep Dive into LLMs like ChatGPT www.youtube.com/watch?v=7xTGNNLPyMI

630 sats \ 1 comment \ @k00b 8 Feb 2025 AI

LLMs generate slop because they avoid surprises by design - Dan Fabulich danfabulich.medium.com/llms-tell-bad-jokes-because-they-avoid-surprises-7f111aac4f96

373 sats \ 2 comments \ @Scoresby 19 Aug 2025 AI

The Future of LLM Models: Real-Time Collaboration & Micropayments w/ BTC and LN

238 sats \ 0 comments \ @Rsync25 27 Jun 2024 lightning

2025 LLM Year in Review - karpathy karpathy.bearblog.dev/year-in-review-2025/

1652 sats \ 3 comments \ @Scoresby 21 Dec 2025 AI

Scalable MatMul-Free Language Modeling — 10x Reduction On LLMs Computation arxiv.org/abs/2406.02528

110 sats \ 1 comment \ @0xbitcoiner 10 Jun 2024 science freebie

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection arxiv.org/abs/2510.04849v1

433 sats \ 2 comments \ @optimism 19 Oct 2025 AI

LLMs generate ‘fluent nonsense’ when reasoning outside their training zone venturebeat.com/ai/llms-generate-fluent-nonsense-when-reasoning-outside-their-training-zone/

166 sats \ 0 comments \ @carter 21 Aug 2025 AI

Extracting memorized pieces of (copyrighted) books from open-weight llm models arxiv.org/pdf/2505.12546

2372 sats \ 2 comments \ @carter 24 Jun 2025 AI

Building with LLM's with Marcus Workman youtu.be/ZNNP7rQI-xA?feature=shared

3109 sats \ 0 comments \ @Car 20 Jan 2024 builders

12,000+ API Keys and Passwords Found in Public Datasets Used for LLM Training thehackernews.com/2025/02/12000-api-keys-and-passwords-found-in.html

858 sats \ 3 comments \ @aljaz 28 Feb 2025 security

Masking private information on the fly when using cloud LLMs

233 sats \ 0 comments \ @m0wer 26 May 2025 tech