TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training
arxiv.org/abs/2501.04765
100 sats
\
0 comments
\
@carter
19 Aug
AI
related
PixNerd: Pixel Neural Field Diffusion
arxiv.org/abs/2507.23268
302 sats
\
2 comments
\
@optimism
4 Aug
AI
Transformers are Graph Neural Networks
arxiv.org/pdf/2506.22084
110 sats
\
0 comments
\
@carter
1 Jul
AI
Reticulum network routing
15 sats
\
0 comments
\
@BallLightning
27 Oct 2024
devs
Hacking Diffusion into Qwen3 for the ARC Challenge
www.matthewnewton.com/blog/arc-challenge-diffusion
121 sats
\
0 comments
\
@carter
13 Aug
AI
Deep researcher with test-time diffusion
research.google/blog/deep-researcher-with-test-time-diffusion/
100 sats
\
0 comments
\
@carter
24 Sep
AI
Diffusion Language Models are Super Data Learners
jinjieni.notion.site/Diffusion-Language-Models-are-Super-Data-Learners-239d8f03a866800ab196e49928c019ac
152 sats
\
0 comments
\
@carter
11 Aug
AI
tinygrad: A simple and powerful neural network framework
tinygrad.org/
10 sats
\
1 comment
\
@premitive1
15 Aug 2023
tech
Having fun with diffusion.io
30 sats
\
0 comments
\
@Danny_stacks_sats
21 Oct 2023
tech
Diffusion Language Models Know the Answer Before Decoding
arxiv.org/abs/2508.19982
244 sats
\
0 comments
\
@optimism
28 Aug
AI
Context Rot: How Increasing Input Tokens Impacts LLM Performance
research.trychroma.com/context-rot
304 sats
\
2 comments
\
@Scoresby
14 Jul
AI
TinyML: Ultra-low power Machine Learning
www.ikkaro.net/what-tinyml-is/
105 sats
\
1 comment
\
@hn
16 Jan 2024
tech
The Bitter Lesson is coming for Tokenization
lucalp.dev/bitter-lesson-tokenization-and-blt/
110 sats
\
0 comments
\
@carter
25 Jun
AI
Better and Faster Large Language Models via Multi-Token Prediction
arxiv.org/abs/2404.19737
21 sats
\
0 comments
\
@hn
1 May 2024
tech
Hidet: A Deep Learning Compiler for Efficient Model Serving
pytorch.org/blog/introducing-hidet/
110 sats
\
1 comment
\
@hn
28 Apr 2023
tech
The Gentleman's Guide To Routing Nodes
docs.megalithic.me/the-gentlemans-guide-to-routing-nodes/a-node-for-a-gentleman
10.2k sats
\
2 comments
\
@megalithic
17 Apr 2024
lightning
How Meta trains large language models at scale
engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/
227 sats
\
0 comments
\
@hn
13 Jun 2024
tech
Machine Learning and Rust
www.arewelearningyet.com/
20 sats
\
0 comments
\
@Rsync25
24 Oct 2023
tech
Rnostr: A high-performance and scalable nostr relay written in Rust.
github.com/rnostr/rnostr
2478 sats
\
1 comment
\
@Rsync25
5 Jul 2023
nostr
PlebDevs: Live Coding / deep dive on hedgehog protocol
www.youtube.com/watch?v=B8mesP3Xqyg
200 sats
\
0 comments
\
@TheWildHustle
27 Mar 2024
devs
Show HN: Wordllama – Things you can do with the token embeddings of an LLM
github.com/dleemiller/WordLlama
131 sats
\
0 comments
\
@hn
15 Sep 2024
tech
Apple just released an interesting diffusion based coding language model
9to5mac.com/2025/07/04/apple-just-released-a-weirdly-interesting-coding-language-model/
131 sats
\
1 comment
\
@carter
8 Jul
AI