Hidet: A Deep Learning Compiler for Efficient Model Serving
pytorch.org/blog/introducing-hidet/
110 sats
\
1 comment
\
@hn
28 Apr 2023
tech
related
Maple Proxy – the Maple AI API that brings encrypted LLMs to your OpenAI apps
blog.trymaple.ai/introducing-maple-proxy-the-maple-ai-api-that-brings-encrypted-llms-to-your-openai-apps/
297 sats
\
1 comment
\
@0xbitcoiner
3 Sep
AI
Hermes 4 outperforms OpenAI models with minimal content restrictions
121 sats
\
0 comments
\
@lunin
1 Sep
AI
OpenAI offers free GPT-4o Mini fine-tuning to counter Meta’s Llama 3.1 release
venturebeat.com/ai/ai-arms-race-escalates-openai-offers-free-gpt-4o-mini-fine-tuning-to-counter-metas-llama-3-1-release/
21 sats
\
0 comments
\
@ch0k1
25 Jul 2024
news
Large Language Models for Compiler Optimization
arxiv.org/abs/2309.07062
20 sats
\
1 comment
\
@hn
18 Sep 2023
tech
Compiling LLMs into a MegaKernel: A path to low-latency inference
zhihaojia.medium.com/compiling-llms-into-a-megakernel-a-path-to-low-latency-inference-cf7840913c17
10 sats
\
0 comments
\
@hn
19 Jun
tech
Apple releases CoreNet, a library for training deep neural networks
github.com/apple/corenet
10 sats
\
0 comments
\
@hn
24 Apr 2024
tech
Nvidia Shows Off GPU for Ultra-Long Context Models
developer.nvidia.com/blog/nvidia-rubin-cpx-accelerates-inference-performance-and-efficiency-for-1m-token-context-workloads/
157 sats
\
1 comment
\
@lunin
14 Sep
AI
Gemma3 – The current strongest model that fits on a single GPU
ollama.com/library/gemma3
46 sats
\
0 comments
\
@hn
12 Mar
tech
Episode 113: Agent Memories & Reflections
110 sats
\
1 comment
\
@AtlantisPleb
25 Jul 2024
openagents
Practical Deep Learning for Coders - Practical Deep Learning
course.fast.ai/
163 sats
\
0 comments
\
@Chep
29 Jul 2024
builders
Design an Easy-to-Use Deep Learning Framework
towardsdatascience.com/design-an-easy-to-use-deep-learning-framework-52d7c37e415f
71 sats
\
1 comment
\
@ch0k1
11 Apr 2024
tech
LL3M: Large Language 3D Modelers
threedle.github.io/ll3m/
40 sats
\
0 comments
\
@hn
17 Aug
tech
Nvidia's bombshell: Its new AI model is open, massive, and ready to rival GPT-4
venturebeat.com/ai/nvidia-just-dropped-a-bombshell-its-new-ai-model-is-open-massive-and-ready-to-rival-gpt-4/
100 sats
\
0 comments
\
@beorange
3 Oct 2024
alter_native
OpenAI’s hunger for data is coming back to bite it
www.technologyreview.com/2023/04/19/1071789/openais-hunger-for-data-is-coming-back-to-bite-it/
50 sats
\
0 comments
\
@shadowymartian
20 Apr 2023
bitcoin
Code Llama, a state-of-the-art large language model for coding
ai.meta.com/blog/code-llama-large-language-model-coding/
35 sats
\
1 comment
\
@hn
24 Aug 2023
tech
Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM
www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm
306 sats
\
1 comment
\
@nullama
13 Apr 2023
bitcoin
Bend: a high-level language that runs on GPUs (via HVM2)
github.com/HigherOrderCO/Bend
51 sats
\
0 comments
\
@hn
17 May 2024
tech
Meet PowerInfer: A Fast LLM on a Single Consumer-Grade GPU
www.marktechpost.com/2023/12/23/meet-powerinfer-a-fast-large-language-model-llm-on-a-single-consumer-grade-gpu-that-speeds-up-machine-learning-model-inference-by-11-times/
10 sats
\
2 comments
\
@ch0k1
24 Dec 2023
AI
Candle: Minimalist ML framework for Rust
github.com/huggingface/candle
10 sats
\
0 comments
\
@Rsync25
29 Jun 2024
rust
DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL
pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2
21 sats
\
0 comments
\
@hn
11 Feb
tech
AI for All: Powering APIs and Large Language Models with Lightning ⚡🤖
lightning.engineering/posts/2023-07-05-l402-langchain/
1967 sats
\
1 comment
\
@Rsync25
6 Jul 2023
bitcoin