@anon
sign up
@anon
sign up
pull down to refresh
TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training
arxiv.org/abs/2501.04765
100 sats
\
0 comments
\
@carter
19 Aug
AI
related
PixNerd: Pixel Neural Field Diffusion
arxiv.org/abs/2507.23268
302 sats
\
2 comments
\
@optimism
4 Aug
AI
Exploring Conditions for Diffusion models in Robotic Control
arxiv.org/abs/2510.15510
242 sats
\
0 comments
\
@optimism
31 Oct
AI
Transformers are Graph Neural Networks
arxiv.org/pdf/2506.22084
110 sats
\
0 comments
\
@carter
1 Jul
AI
Reticulum network routing
15 sats
\
0 comments
\
@BallLightning
27 Oct 2024
devs
Hacking Diffusion into Qwen3 for the ARC Challenge
www.matthewnewton.com/blog/arc-challenge-diffusion
121 sats
\
0 comments
\
@carter
13 Aug
AI
AI Diffusion Report
1304 sats
\
0 comments
\
@Tony
5 Nov
AI
Deep researcher with test-time diffusion
research.google/blog/deep-researcher-with-test-time-diffusion/
100 sats
\
0 comments
\
@carter
24 Sep
AI
AI Diffusion Report | SNL #197
333 sats
\
3 comments
\
@Car
7 Nov
meta
Diffusion Language Models are Super Data Learners
jinjieni.notion.site/Diffusion-Language-Models-are-Super-Data-Learners-239d8f03a866800ab196e49928c019ac
152 sats
\
0 comments
\
@carter
11 Aug
AI
tinygrad: A simple and powerful neural network framework
tinygrad.org/
10 sats
\
1 comment
\
@premitive1
15 Aug 2023
tech
Having fun with diffusion.io
30 sats
\
0 comments
\
@Danny_stacks_sats
21 Oct 2023
tech
Diffusion Language Models Know the Answer Before Decoding
arxiv.org/abs/2508.19982
244 sats
\
0 comments
\
@optimism
28 Aug
AI
Context Rot: How Increasing Input Tokens Impacts LLM Performance
research.trychroma.com/context-rot
304 sats
\
2 comments
\
@Scoresby
14 Jul
AI
TinyML: Ultra-low power Machine Learning
www.ikkaro.net/what-tinyml-is/
105 sats
\
1 comment
\
@hn
16 Jan 2024
tech
The Bitter Lesson is coming for Tokenization
lucalp.dev/bitter-lesson-tokenization-and-blt/
110 sats
\
0 comments
\
@carter
25 Jun
AI
Better and Faster Large Language Models via Multi-Token Prediction
arxiv.org/abs/2404.19737
21 sats
\
0 comments
\
@hn
1 May 2024
tech
How Perplexity optimized 1T parameter AI models for AWS EFA
www.theregister.com/2025/11/05/perplexity_1t_parameter_models_aws_efa/
100 sats
\
0 comments
\
@0xbitcoiner
6 Nov
AI
Hidet: A Deep Learning Compiler for Efficient Model Serving
pytorch.org/blog/introducing-hidet/
110 sats
\
1 comment
\
@hn
28 Apr 2023
tech
The Gentleman's Guide To Routing Nodes
docs.megalithic.me/the-gentlemans-guide-to-routing-nodes/a-node-for-a-gentleman
10.2k sats
\
2 comments
\
@megalithic
17 Apr 2024
lightning
How Meta trains large language models at scale
engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/
227 sats
\
0 comments
\
@hn
13 Jun 2024
tech
Machine Learning and Rust
www.arewelearningyet.com/
20 sats
\
0 comments
\
@Rsync25
24 Oct 2023
tech
more