sign up
sign up
sign up
sign up
pull down to refresh
Communication Efficient LLM Pre-training with SparseLoCo
arxiv.org/abs/2508.15706
130 sats
\
0 comments
\
@carter
1 Sep 2025
AI
related
1-Bit LLM: The Most Efficient LLM Possible?
www.youtube.com/watch?v=7hMoz9q4zv0
563 sats
\
1 comment
\
@carter
24 Jun 2025
AI
Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing
arxiv.org/abs/2508.12631
187 sats
\
9 comments
\
@carter
22 Aug 2025
AI
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
arxiv.org/abs/2512.15745
299 sats
\
0 comments
\
@optimism
19 Dec 2025
AI
LLM evaluation at scale with the NeurIPS Efficiency Challenge
blog.mozilla.ai/exploring-llm-evaluation-at-scale-with-the-neurips-large-language-model-efficiency-challenge/
210 sats
\
0 comments
\
@localhost
22 Feb 2024
tech
LLMs Can Get Brain Rot
llm-brain-rot.github.io/
287 sats
\
0 comments
\
@Scoresby
21 Oct 2025
AI
Meta Open-Sources MEGALODON LLM for Efficient Long Sequence Modeling
www.infoq.com/news/2024/06/meta-llm-megalodon/
125 sats
\
0 comments
\
@TheWildHustle
11 Jun 2024
opensource
Efficient LLM Inference
arxiv.org/abs/2507.14397
151 sats
\
0 comments
\
@carter
3 Oct 2025
AI
LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
github.com/hiyouga/LLaMA-Factory
187 sats
\
0 comments
\
@carter
19 Sep 2025
AI
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
arxiv.org/abs/2508.16153
182 sats
\
0 comments
\
@optimism
25 Aug 2025
AI
Elites, the curse of recursion, and the half-life of policy
5779 sats
\
11 comments
\
@elvismercury
29 Mar 2024
mostly_harmless
Deep Dive into LLMs like ChatGPT
www.youtube.com/watch?v=7xTGNNLPyMI
630 sats
\
1 comment
\
@k00b
8 Feb 2025
AI
LLMs generate slop because they avoid surprises by design - Dan Fabulich
danfabulich.medium.com/llms-tell-bad-jokes-because-they-avoid-surprises-7f111aac4f96
373 sats
\
2 comments
\
@Scoresby
19 Aug 2025
AI
The Future of LLM Models: Real-Time Collaboration & Micropayments w/ BTC and LN
238 sats
\
0 comments
\
@Rsync25
27 Jun 2024
lightning
2025 LLM Year in Review - karpathy
karpathy.bearblog.dev/year-in-review-2025/
1652 sats
\
3 comments
\
@Scoresby
21 Dec 2025
AI
Scalable MatMul-Free Language Modeling — 10x Reduction On LLMs Computation
arxiv.org/abs/2406.02528
110 sats
\
1 comment
\
@0xbitcoiner
10 Jun 2024
science
freebie
When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection
arxiv.org/abs/2510.04849v1
433 sats
\
2 comments
\
@optimism
19 Oct 2025
AI
LLMs generate ‘fluent nonsense’ when reasoning outside their training zone
venturebeat.com/ai/llms-generate-fluent-nonsense-when-reasoning-outside-their-training-zone/
166 sats
\
0 comments
\
@carter
21 Aug 2025
AI
Extracting memorized pieces of (copyrighted) books from open-weight llm models
arxiv.org/pdf/2505.12546
2372 sats
\
2 comments
\
@carter
24 Jun 2025
AI
Building with LLM's with Marcus Workman
youtu.be/ZNNP7rQI-xA?feature=shared
3109 sats
\
0 comments
\
@Car
20 Jan 2024
builders
12,000+ API Keys and Passwords Found in Public Datasets Used for LLM Training
thehackernews.com/2025/02/12000-api-keys-and-passwords-found-in.html
858 sats
\
3 comments
\
@aljaz
28 Feb 2025
security
Masking private information on the fly when using cloud LLMs
233 sats
\
0 comments
\
@m0wer
26 May 2025
tech
more