@anon
sign up
@anon
sign up
pull down to refresh
Hidet: A Deep Learning Compiler for Efficient Model Serving
pytorch.org/blog/introducing-hidet/
110 sats
\
1 comment
\
@hn
28 Apr 2023
tech
related
Experimental model release: DeepSeek-V3.2-Exp
github.com/deepseek-ai/DeepSeek-V3.2-Exp
167 sats
\
0 comments
\
@carter
29 Sep 2025
AI
At NeurIPS, NVIDIA Advances Open Model Development for Digital and Physical AI
blogs.nvidia.com/blog/neurips-open-source-digital-physical-ai/
157 sats
\
0 comments
\
@0xbitcoiner
2 Dec 2025
AI
Scalable MatMul-Free Language Modeling — 10x Reduction On LLMs Computation
arxiv.org/abs/2406.02528
110 sats
\
1 comment
\
@0xbitcoiner
10 Jun 2024
science
freebie
Hermes 4 outperforms OpenAI models with minimal content restrictions
121 sats
\
0 comments
\
@lunin
1 Sep 2025
AI
NVIDIA: Transforming LLM Alignment with Efficient Reinforcement Learning
www.marktechpost.com/2024/05/05/nvidia-ai-open-sources-nemo-aligner-transforming-large-language-model-alignment-with-efficient-reinforcement-learning/
20 sats
\
0 comments
\
@ch0k1
7 May 2024
tech
Raspberry Pi AI Kit Update: Dataflow Compiler Now Available
64 sats
\
0 comments
\
@0xbitcoiner
10 Jul 2024
DIY
Making The Smallest And Dumbest LLM With Extreme Quantization
hackaday.com/2025/10/23/making-the-smallest-and-dumbest-llm-with-extreme-quantization/
200 sats
\
0 comments
\
@0xbitcoiner
24 Oct 2025
AI
ATLAS: A New Paradigm in LLM Inference via Runtime-Learning Accelerators
www.together.ai/blog/adaptive-learning-speculator-system-atlas
100 sats
\
0 comments
\
@carter
14 Oct 2025
AI
Large Language Models for Compiler Optimization
arxiv.org/abs/2309.07062
20 sats
\
1 comment
\
@hn
18 Sep 2023
tech
Compiling LLMs into a MegaKernel: A path to low-latency inference
zhihaojia.medium.com/compiling-llms-into-a-megakernel-a-path-to-low-latency-inference-cf7840913c17
10 sats
\
0 comments
\
@hn
19 Jun 2025
tech
Apple releases CoreNet, a library for training deep neural networks
github.com/apple/corenet
10 sats
\
0 comments
\
@hn
24 Apr 2024
tech
Nvidia Shows Off GPU for Ultra-Long Context Models
developer.nvidia.com/blog/nvidia-rubin-cpx-accelerates-inference-performance-and-efficiency-for-1m-token-context-workloads/
157 sats
\
1 comment
\
@lunin
14 Sep 2025
AI
OpenAI o3-mini model release
openai.com/index/openai-o3-mini/
196 sats
\
0 comments
\
@ch0k1
3 Feb 2025
news
Gemma3 – The current strongest model that fits on a single GPU
ollama.com/library/gemma3
46 sats
\
0 comments
\
@hn
12 Mar 2025
tech
Episode 113: Agent Memories & Reflections
110 sats
\
1 comment
\
@AtlantisPleb
25 Jul 2024
openagents
Practical Deep Learning for Coders - Practical Deep Learning
course.fast.ai/
163 sats
\
0 comments
\
@Chep
29 Jul 2024
builders
Design an Easy-to-Use Deep Learning Framework
towardsdatascience.com/design-an-easy-to-use-deep-learning-framework-52d7c37e415f
71 sats
\
1 comment
\
@ch0k1
11 Apr 2024
tech
LL3M: Large Language 3D Modelers
threedle.github.io/ll3m/
40 sats
\
0 comments
\
@hn
17 Aug 2025
tech
Nvidia's bombshell: Its new AI model is open, massive, and ready to rival GPT-4
venturebeat.com/ai/nvidia-just-dropped-a-bombshell-its-new-ai-model-is-open-massive-and-ready-to-rival-gpt-4/
100 sats
\
0 comments
\
@beorange
3 Oct 2024
alter_native
OpenAI’s hunger for data is coming back to bite it
www.technologyreview.com/2023/04/19/1071789/openais-hunger-for-data-is-coming-back-to-bite-it/
50 sats
\
0 comments
\
@shadowymartian
20 Apr 2023
bitcoin
Code Llama, a state-of-the-art large language model for coding
ai.meta.com/blog/code-llama-large-language-model-coding/
35 sats
\
1 comment
\
@hn
24 Aug 2023
tech
more