pull down to refresh

I too self-host all my production applications that use LLM and all of it is RAG (in the most basic form where I just feed documents to a single-shot summarizer / categorizer / NLP chain.)