items/1228490/related \ stacker news

pull down to refresh

LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)github.com/hiyouga/LLaMA-Factory

187 sats \ 0 comments \ @carter 19 Sep 2025 AI

related

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs arxiv.org/abs/2508.16153

182 sats \ 0 comments \ @optimism 25 Aug 2025 AI

Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing arxiv.org/abs/2508.12631

187 sats \ 9 comments \ @carter 22 Aug 2025 AI

1-Bit LLM: The Most Efficient LLM Possible?www.youtube.com/watch?v=7hMoz9q4zv0

563 sats \ 1 comment \ @carter 24 Jun 2025 AI

Scalable MatMul-Free Language Modeling — 10x Reduction On LLMs Computation arxiv.org/abs/2406.02528

110 sats \ 1 comment \ @0xbitcoiner 10 Jun 2024 science freebie

LLM evaluation at scale with the NeurIPS Efficiency Challenge blog.mozilla.ai/exploring-llm-evaluation-at-scale-with-the-neurips-large-language-model-efficiency-challenge/

210 sats \ 0 comments \ @localhost 22 Feb 2024 tech

Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm

307 sats \ 1 comment \ @nullama 13 Apr 2023 bitcoin

Efficient LLM Inference arxiv.org/abs/2507.14397

151 sats \ 0 comments \ @carter 3 Oct 2025 AI

LLaDA2.0: Scaling Up Diffusion Language Models to 100B arxiv.org/abs/2512.15745

299 sats \ 0 comments \ @optimism 19 Dec 2025 AI

Meta Open-Sources MEGALODON LLM for Efficient Long Sequence Modeling www.infoq.com/news/2024/06/meta-llm-megalodon/

125 sats \ 0 comments \ @TheWildHustle 11 Jun 2024 opensource

Communication Efficient LLM Pre-training with SparseLoCo arxiv.org/abs/2508.15706

130 sats \ 0 comments \ @carter 1 Sep 2025 AI

Forget DeepSeek. Large language models are getting cheaper still www.economist.com/science-and-technology/2025/02/12/forget-deepseek-large-language-models-are-getting-cheaper-still?utm_campaign=a.io-btl_fy2526_all_conversion-aiasc-sub_prospecting_global-global_auction_facebook-instagram&utm_medium=social-media.content.pd&utm_source=facebook-instagram&utm_content=discovery.content.non-subscriber.content_staticlinkad_np-automatedForgetDeepSeek.Largelanguagemodelsaregettingcheaperstill-n-jul_na-na_article_na_na_na_na&utm_term=sa.int-all&utm_id=120221663881300437&fbclid=IwY2xjawLqYvtleHRuA2FlbQEwAGFkaWQBqySe7MDKFWJyaWQRMTAyM1EwT0JYdlpHWWo4Yk4BHg-nt473P7_Lyq4MGmV0jc-H886-XU5cDq9CLU4gj_JUDF7AjvIZQJawSEoQ_aem_IumIV7lTgY1T2YPLurCoaQ

251 sats \ 1 comment \ @south_korea_ln 21 Jul 2025 AI

DoubleAgents: Fine-Tuning LLMs for Covert Malicious Tool Calls pub.aimind.so/doubleagents-fine-tuning-llms-for-covert-malicious-tool-calls-b8ff00bf513e

151 sats \ 0 comments \ @carter 13 Aug 2025 AI

pylint MCP provider

2428 sats \ 6 comments \ @optimism 4 Jun 2025 builders

No More Floating Points, The Era of 1.58-bit Large Language Models medium.com/ai-insights-cobet/no-more-floating-points-the-era-of-1-58-bit-large-language-models-b9805879ac0a

100 sats \ 1 comment \ @0xbitcoiner 11 Mar 2024 science freebie

The Future of LLM Models: Real-Time Collaboration & Micropayments w/ BTC and LN

238 sats \ 0 comments \ @Rsync25 27 Jun 2024 lightning

The Best Way of Running GPT-OSS Locally - KDnuggets www.kdnuggets.com/the-best-way-of-running-gpt-oss-locally

148 sats \ 0 comments \ @optimism 25 Aug 2025 AI

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity arxiv.org/abs/2510.01171

196 sats \ 0 comments \ @Scoresby 17 Oct 2025 AI

Deep Dive into LLMs like ChatGPT www.youtube.com/watch?v=7xTGNNLPyMI

630 sats \ 1 comment \ @k00b 8 Feb 2025 AI

"Benchwashing" - how do you defend against this?

1748 sats \ 10 comments \ @optimism 9 Aug 2025 AskSN

LLMs Can Get Brain Rot llm-brain-rot.github.io/

287 sats \ 0 comments \ @Scoresby 21 Oct 2025 AI

Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in LLMs arxiv.org/abs/2509.21155

445 sats \ 6 comments \ @optimism 2 Dec 2025 AI