items/878588/related \ stacker news

pull down to refresh

Train your own R1 reasoning model locally unsloth.ai/blog/r1-reasoning

214 sats \ 1 comment \ @aljaz 7 Feb 2025 AI

related

Don't Overthink It: A Survey of Efficient R1-style LRMs arxiv.org/abs/2508.02120

162 sats \ 2 comments \ @optimism 10 Aug 2025 AI

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning arxiv.org/abs/2509.07980

248 sats \ 0 comments \ @optimism 10 Sep 2025 AI

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning www.nature.com/articles/s41586-025-09422-z

151 sats \ 0 comments \ @carter 19 Sep 2025 AI

Alibaba Qwen-32B - open source ~o1/R1 reasoning that can run on your laptop

208 sats \ 6 comments \ @gmd 6 Mar 2025 AI

Less is More: Recursive Reasoning with Tiny Networks arxiv.org/html/2510.04871v1

65 sats \ 1 comment \ @byzantine 9 Oct 2025 AI

Rabbit sells out two batches of 10,000 R1 pocket AI companions over two days www.theverge.com/2024/1/10/24033498/rabbit-r1-sold-out-ces-ai

944 sats \ 4 comments \ @TheWildHustle 13 Jan 2024 tech

sapientinc/HRM: Hierarchical Reasoning Model Official Release github.com/sapientinc/HRM

191 sats \ 1 comment \ @m0wer 5 Aug 2025 AI

Less is More: Recursive Reasoning w/ Tiny Networks - Alexia Jolicoeur-Martineau github.com/SamsungSAILMontreal/TinyRecursiveModels

338 sats \ 1 comment \ @Scoresby 8 Oct 2025 AI

What if... someone optimized a model for taking action

1915 sats \ 1 comment \ @optimism 3 Jul 2025 AI

Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL arxiv.org/abs/2508.07976

213 sats \ 0 comments \ @optimism 13 Aug 2025 AI

OpenAI releases o1, its first model with ‘reasoning’ abilities www.theverge.com/2024/9/12/24242439/openai-o1-model-reasoning-strawberry-chatgpt?utm_source=tldrnewsletter

272 sats \ 0 comments \ @plebone 13 Sep 2024 tech

Open models by OpenAI openai.com/open-models/

196 sats \ 2 comments \ @carter 5 Aug 2025 AI

I Trained a Custom Model…

2111 sats \ 12 comments \ @antic 30 Dec 2023 art

SkyThought: Train your own o1 github.com/NovaSky-AI/SkyThought

183 sats \ 0 comments \ @kehiy 24 Jul 2025 AI

Deep Dive into LLMs like ChatGPT www.youtube.com/watch?v=7xTGNNLPyMI

630 sats \ 1 comment \ @k00b 8 Feb 2025 AI

Forget DeepSeek. Large language models are getting cheaper still www.economist.com/science-and-technology/2025/02/12/forget-deepseek-large-language-models-are-getting-cheaper-still?utm_campaign=a.io-btl_fy2526_all_conversion-aiasc-sub_prospecting_global-global_auction_facebook-instagram&utm_medium=social-media.content.pd&utm_source=facebook-instagram&utm_content=discovery.content.non-subscriber.content_staticlinkad_np-automatedForgetDeepSeek.Largelanguagemodelsaregettingcheaperstill-n-jul_na-na_article_na_na_na_na&utm_term=sa.int-all&utm_id=120221663881300437&fbclid=IwY2xjawLqYvtleHRuA2FlbQEwAGFkaWQBqySe7MDKFWJyaWQRMTAyM1EwT0JYdlpHWWo4Yk4BHg-nt473P7_Lyq4MGmV0jc-H886-XU5cDq9CLU4gj_JUDF7AjvIZQJawSEoQ_aem_IumIV7lTgY1T2YPLurCoaQ

251 sats \ 1 comment \ @south_korea_ln 21 Jul 2025 AI

Reasoning Is Not Model Improvement manidoraisamy.com/reasoning-not-ai.html

151 sats \ 1 comment \ @carter 23 Oct 2025 AI

Agentic Reinforced Policy Optimization arxiv.org/abs/2507.19849

171 sats \ 0 comments \ @optimism 29 Jul 2025 AI

Tongyi-DeepResearch

301 sats \ 0 comments \ @optimism 29 Oct 2025 AI

Is Chain-of-Thought Reasoning of LLMs a Mirage?arxiv.org/abs/2508.01191

427 sats \ 9 comments \ @optimism 7 Aug 2025 AI

Raspberry Pi AI Kit Review | HackSpace #80

243 sats \ 0 comments \ @0xbitcoiner 28 Jun 2024 tech