uGMM-NN: Univariate Gaussian Mixture Model Neural Network
arxiv.org/abs/2509.07569
100 sats
\
0 comments
\
@carter
11 Sep 2025
AI
related
The Math Behind Batch Normalization
towardsdatascience.com/the-math-behind-batch-normalization-90ebbc0b1b0b
100 sats
\
0 comments
\
@ch0k1
9 May 2024
science
Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity
arxiv.org/pdf/2505.21411
100 sats
\
1 comment
\
@carter
3 Jul 2025
AI
Deep Learning Illustrated, Part 1: How Does A Neural Network Work?
towardsdatascience.com/neural-networks-illustrated-part-1-how-does-a-neural-network-work-c3f92ce3b462
100 sats
\
0 comments
\
@ch0k1
2 Feb 2024
science
FAMO: A Fast Optimization Method for Multitask Learning (MTL)
www.marktechpost.com/2024/05/05/famo-a-fast-optimization-method-for-multitask-learning-mtl-that-mitigates-the-conflicting-gradients-using-o1-space-and-time/
111 sats
\
0 comments
\
@ch0k1
7 May 2024
science
Deep Learning with Python
62 sats
\
0 comments
\
@devJack
30 Dec 2024
BooksAndArticles
What Is Machine Learning?
134 sats
\
0 comments
\
@0xbitcoiner
8 Jul 2024
science
tinygrad: A simple and powerful neural network framework
tinygrad.org/
10 sats
\
1 comment
\
@premitive1
15 Aug 2023
tech
Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory
arxiv.org/abs/2310.20360
24 sats
\
1 comment
\
@hn
1 Jan 2024
tech
XLSTM: Extended Long Short-Term Memory
arxiv.org/abs/2405.04517
21 sats
\
0 comments
\
@hn
8 May 2024
tech
Mixture of Experts Explained
huggingface.co/blog/moe
29 sats
\
0 comments
\
@Rsync25
27 Dec 2024
tech
Understanding Machine Learning: From Theory to Algorithms
www.cs.huji.ac.il/~shais/UnderstandingMachineLearning/copy.html
10 sats
\
0 comments
\
@hn
4 Apr 2025
tech
The most important machine learning equations: A comprehensive guide
chizkidd.github.io//2025/05/30/machine-learning-key-math-eqns/
120 sats
\
1 comment
\
@hn
28 Aug 2025
tech
Arraymancer – Deep Learning Nim Library
github.com/mratsim/Arraymancer
138 sats
\
1 comment
\
@hn
31 Mar 2024
tech
Mixtral 8x7B: A Sparse Mixture of Experts language model
arxiv.org/abs/2401.04088
51 sats
\
1 comment
\
@hn
9 Jan 2024
tech
Normalizing Flows are Capable Generative Models
machinelearning.apple.com/research/normalizing-flows
100 sats
\
0 comments
\
@carter
30 Jun 2025
AI
NeuralOS Demo
neural-os.com/
265 sats
\
2 comments
\
@carter
30 Jul 2025
AI
The OpenAI Keynote
stratechery.com/2023/the-openai-keynote/
1047 sats
\
3 comments
\
@kr
7 Nov 2023
tech
Kolmogorov-Arnold networks may make neural networks more understandable
www.quantamagazine.org/novel-architecture-makes-neural-networks-more-understandable-20240911/
20 sats
\
0 comments
\
@hn
12 Sep 2024
tech
GDPval: Measuring the performance of our models on real-world tasks - OpenAI
openai.com/index/gdpval/
358 sats
\
8 comments
\
@Scoresby
2 Oct 2025
AI
Tversky Neural Networks
gonzoml.substack.com/p/tversky-neural-networks
207 sats
\
0 comments
\
@carter
21 Aug 2025
AI
OpenAI's GPT-5 is a cost cutting exercise
www.theregister.com/2025/08/13/gpt_5_cost_cutting
217 sats
\
1 comment
\
@Coinsreporter
13 Aug 2025
AI