items/170881/related \ stacker news

Hidet: A Deep Learning Compiler for Efficient Model Serving pytorch.org/blog/introducing-hidet/

110 sats \ 1 comment \ @hn 28 Apr 2023 tech

related

Experimental model release: DeepSeek-V3.2-Exp github.com/deepseek-ai/DeepSeek-V3.2-Exp

167 sats \ 0 comments \ @carter 29 Sep AI

Maple Proxy – the Maple AI API that brings encrypted LLMs to your OpenAI apps blog.trymaple.ai/introducing-maple-proxy-the-maple-ai-api-that-brings-encrypted-llms-to-your-openai-apps/

297 sats \ 1 comment \ @0xbitcoiner 3 Sep AI

At NeurIPS, NVIDIA Advances Open Model Development for Digital and Physical AI blogs.nvidia.com/blog/neurips-open-source-digital-physical-ai/

157 sats \ 0 comments \ @0xbitcoiner 2 Dec AI

Hermes 4 outperforms OpenAI models with minimal content restrictions

121 sats \ 0 comments \ @lunin 1 Sep AI

Making The Smallest And Dumbest LLM With Extreme Quantization hackaday.com/2025/10/23/making-the-smallest-and-dumbest-llm-with-extreme-quantization/

200 sats \ 0 comments \ @0xbitcoiner 24 Oct AI

Large Language Models for Compiler Optimization arxiv.org/abs/2309.07062

20 sats \ 1 comment \ @hn 18 Sep 2023 tech

Compiling LLMs into a MegaKernel: A path to low-latency inference zhihaojia.medium.com/compiling-llms-into-a-megakernel-a-path-to-low-latency-inference-cf7840913c17

10 sats \ 0 comments \ @hn 19 Jun tech

Apple releases CoreNet, a library for training deep neural networks github.com/apple/corenet

10 sats \ 0 comments \ @hn 24 Apr 2024 tech

Gemma3 – The current strongest model that fits on a single GPU ollama.com/library/gemma3

46 sats \ 0 comments \ @hn 12 Mar tech

Episode 113: Agent Memories & Reflections

110 sats \ 1 comment \ @AtlantisPleb 25 Jul 2024 openagents

Practical Deep Learning for Coders - Practical Deep Learning course.fast.ai/

163 sats \ 0 comments \ @Chep 29 Jul 2024 builders

Design an Easy-to-Use Deep Learning Framework towardsdatascience.com/design-an-easy-to-use-deep-learning-framework-52d7c37e415f

71 sats \ 1 comment \ @ch0k1 11 Apr 2024 tech

LL3M: Large Language 3D Modelers threedle.github.io/ll3m/

40 sats \ 0 comments \ @hn 17 Aug tech

Nvidia's bombshell: Its new AI model is open, massive, and ready to rival GPT-4 venturebeat.com/ai/nvidia-just-dropped-a-bombshell-its-new-ai-model-is-open-massive-and-ready-to-rival-gpt-4/

100 sats \ 0 comments \ @beorange 3 Oct 2024 alter_native

OpenAI’s hunger for data is coming back to bite it www.technologyreview.com/2023/04/19/1071789/openais-hunger-for-data-is-coming-back-to-bite-it/

50 sats \ 0 comments \ @shadowymartian 20 Apr 2023 bitcoin

Code Llama, a state-of-the-art large language model for coding ai.meta.com/blog/code-llama-large-language-model-coding/

35 sats \ 1 comment \ @hn 24 Aug 2023 tech

Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm

306 sats \ 1 comment \ @nullama 13 Apr 2023 bitcoin

Bend: a high-level language that runs on GPUs (via HVM2) github.com/HigherOrderCO/Bend

51 sats \ 0 comments \ @hn 17 May 2024 tech

Meet PowerInfer: A Fast LLM on a Single Consumer-Grade GPU www.marktechpost.com/2023/12/23/meet-powerinfer-a-fast-large-language-model-llm-on-a-single-consumer-grade-gpu-that-speeds-up-machine-learning-model-inference-by-11-times/

10 sats \ 2 comments \ @ch0k1 24 Dec 2023 AI

Candle: Minimalist ML framework for Rust github.com/huggingface/candle

10 sats \ 0 comments \ @Rsync25 29 Jun 2024 rust

DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2

21 sats \ 0 comments \ @hn 11 Feb tech