@anon
sign up
@anon
sign up
pull down to refresh
Train your own R1 reasoning model locally
unsloth.ai/blog/r1-reasoning
204 sats
\
1 comment
\
@aljaz
7 Feb
AI
related
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
arxiv.org/abs/2509.07980
218 sats
\
0 comments
\
@optimism
10 Sep
AI
Google releases its own 'reasoning' AI model
techcrunch.com/2024/12/19/google-releases-its-own-reasoning-ai-model/
11 sats
\
0 comments
\
@ch0k1
20 Dec 2024
news
sapientinc/HRM: Hierarchical Reasoning Model Official Release
github.com/sapientinc/HRM
161 sats
\
1 comment
\
@m0wer
5 Aug
AI
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning
arxiv.org/abs/2508.05405
182 sats
\
0 comments
\
@optimism
10 Aug
AI
Chinese researchers just built an open-source rival to ChatGPT in 2 months
www.livescience.com/technology/artificial-intelligence/china-releases-a-cheap-open-rival-to-chatgpt-thrilling-some-scientists-and-panicking-silicon-valley
21 sats
\
0 comments
\
@ch0k1
25 Jan
news
Replit - How to train your own LLM Models
blog.replit.com/llm-training
11 sats
\
1 comment
\
@hn
20 Apr 2023
tech
R-Zero: Self-Evolving Reasoning LLM from Zero Data
arxiv.org/abs/2508.05004
198 sats
\
1 comment
\
@carter
10 Sep
AI
Teachable Machine
teachablemachine.withgoogle.com/
121 sats
\
1 comment
\
@hn
7 Jan 2024
tech
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
www.nature.com/articles/s41586-025-09422-z
121 sats
\
0 comments
\
@carter
19 Sep
AI
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL
arxiv.org/abs/2501.12948
9 sats
\
0 comments
\
@hn
25 Jan
tech
Emerging Reasoning with Reinforcement Learning
hkust-nlp.notion.site/simplerl-reason
9 sats
\
0 comments
\
@hn
26 Jan
tech
OpenAI releases new simulated reasoning models with full tool access
arstechnica.com/ai/2025/04/openai-releases-new-simulated-reasoning-models-with-full-tool-access/
20 sats
\
0 comments
\
@Coinsreporter
17 Apr
AI
Orca 2: Teaching Small Language Models How To Reason
arxiv.org/pdf/2311.11045.pdf
21 sats
\
0 comments
\
@kr
21 Nov 2023
tech
OpenAI releases o1, its first model with ‘reasoning’ abilities
www.theverge.com/2024/9/12/24242439/openai-o1-model-reasoning-strawberry-chatgpt?utm_source=tldrnewsletter
172 sats
\
0 comments
\
@plebone
13 Sep 2024
tech
Agentic Reinforced Policy Optimization
arxiv.org/abs/2507.19849
141 sats
\
0 comments
\
@optimism
29 Jul
AI
OpenAI o3-mini model release
openai.com/index/openai-o3-mini/
196 sats
\
0 comments
\
@ch0k1
3 Feb
news
OpenAI O1 Model
openai.com/index/learning-to-reason-with-llms/
22 sats
\
0 comments
\
@hn
12 Sep 2024
tech
RubyLLM: A delightful Ruby way to work with AI
github.com/crmne/ruby_llm
54 sats
\
0 comments
\
@hn
15 Mar
tech
What if... someone optimized a model for taking action
1885 sats
\
1 comment
\
@optimism
3 Jul
AI
Train Your Own O1 Preview Model Within $450
sky.cs.berkeley.edu/project/sky-t1/
31 sats
\
0 comments
\
@hn
21 Feb
tech
How One AI Model Creates a Physical Intuition of Its Environment
www.quantamagazine.org/how-one-ai-model-creates-a-physical-intuition-of-its-environment-20251003/
199 sats
\
0 comments
\
@0xbitcoiner
3 Oct
AI
more