uGMM-NN: Univariate Gaussian Mixture Model Neural Network
arxiv.org/abs/2509.07569
100 sats
\
0 comments
\
@carter
11 Sep
AI
related
The Math Behind Batch Normalization
towardsdatascience.com/the-math-behind-batch-normalization-90ebbc0b1b0b
100 sats
\
0 comments
\
@ch0k1
9 May 2024
science
Normalizing Flows are Capable Generative Models
machinelearning.apple.com/research/normalizing-flows
100 sats
\
0 comments
\
@carter
30 Jun
AI
Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity
arxiv.org/pdf/2505.21411
100 sats
\
1 comment
\
@carter
3 Jul
AI
Deep Learning Illustrated, Part 1: How Does A Neural Network Work?
towardsdatascience.com/neural-networks-illustrated-part-1-how-does-a-neural-network-work-c3f92ce3b462
100 sats
\
0 comments
\
@ch0k1
2 Feb 2024
science
FAMO: A Fast Optimization Method for Multitask Learning (MTL)
www.marktechpost.com/2024/05/05/famo-a-fast-optimization-method-for-multitask-learning-mtl-that-mitigates-the-conflicting-gradients-using-o1-space-and-time/
111 sats
\
0 comments
\
@ch0k1
7 May 2024
science
Deep Learning with Python
62 sats
\
0 comments
\
@devJack
30 Dec 2024
BooksAndArticles
PixNerd: Pixel Neural Field Diffusion
arxiv.org/abs/2507.23268
302 sats
\
2 comments
\
@optimism
4 Aug
AI
tinygrad: A simple and powerful neural network framework
tinygrad.org/
10 sats
\
1 comment
\
@premitive1
15 Aug 2023
tech
Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory
arxiv.org/abs/2310.20360
24 sats
\
1 comment
\
@hn
1 Jan 2024
tech
XLSTM: Extended Long Short-Term Memory
arxiv.org/abs/2405.04517
21 sats
\
0 comments
\
@hn
8 May 2024
tech
Understanding Machine Learning: From Theory to Algorithms
www.cs.huji.ac.il/~shais/UnderstandingMachineLearning/copy.html
10 sats
\
0 comments
\
@hn
4 Apr
tech
Arraymancer – Deep Learning Nim Library
github.com/mratsim/Arraymancer
138 sats
\
1 comment
\
@hn
31 Mar 2024
tech
Mixtral 8x7B: A Sparse Mixture of Experts language model
arxiv.org/abs/2401.04088
51 sats
\
1 comment
\
@hn
9 Jan 2024
tech
NeuralOS Demo
neural-os.com/
265 sats
\
2 comments
\
@carter
30 Jul
AI
The OpenAI Keynote
stratechery.com/2023/the-openai-keynote/
1047 sats
\
3 comments
\
@kr
7 Nov 2023
tech
Kolmogorov-Arnold networks may make neural networks more understandable
www.quantamagazine.org/novel-architecture-makes-neural-networks-more-understandable-20240911/
20 sats
\
0 comments
\
@hn
12 Sep 2024
tech
Tversky Neural Networks
gonzoml.substack.com/p/tversky-neural-networks
207 sats
\
0 comments
\
@carter
21 Aug
AI
Implementing Neural Networks in TensorFlow (and PyTorch)
towardsdatascience.com/implementing-neural-networks-in-tensorflow-and-pytorch-3c1f097e412a
55 sats
\
1 comment
\
@ch0k1
9 Jul 2024
devs
Google’s DeepMind Tackles Weather Forecasting, With Great Performance
arstechnica.com/science/2024/12/googles-deepmind-tackles-weather-forecasting-with-great-performance/
156 sats
\
2 comments
\
@0xbitcoiner
5 Dec 2024
science
Advancements in machine learning for machine learning
blog.research.google/2023/12/advancements-in-machine-learning-for.html
162 sats
\
1 comment
\
@hn
16 Dec 2023
tech
Why Deep Learning Works Unreasonably Well
www.youtube.com/watch?v=qx7hirqgfuU
100 sats
\
1 comment
\
@carter
11 Aug
AI