Im a computer amateur with no deep knowledge on servers. I run my own next cloud instance, but i don't know so much about this. I want to have my own self hosted LLM. Ive read is possible? Im i dreaming, or Is it doable for an amateur?
I offer a 1000 sat bounty to the person who can generates the best "Top 3" list of Self Hosted LLMs.
In order to securew the bounty, I suggest to consider that the following considerations are important in my search :
- Must be able to run in a home desktop computer from an ubuntu system.
- Must be able to be able to be hosted in my home server so it can be accessed remotely.
- Optional but ideal (multi language??... i need it in spanish / english / russian)
I dont have time to do this research, so Id like to outsorce it!
Thanks!
1,000 sats paid
It's gotta be Mistral.
Free & open source & you can download it via torrent and run it offline and locally.
It also might be the best LLM period. Beats GPT-4 in many benchmarks and Gemini isn't really out.
Edit: I haven't installed it myself yet, maybe somebody else can help you with that :)
I can definitely help assist you in this search, but importantly, in order to give you the right kind of answers, I'll need to know the following info:
I'll also point out in advance that while you can do this, depending upon what kind of information you might be specifically desiring (i.e. do you want an LLM model that is more focused on helping with coding tasks and answers, or something that's more "generalized" in it's responses for typical chatbot conversations?), and more importantly, based on your expectations as far as performance is concerned, you may not want to do it.
While there are certainly LLM models out there which are both reasonable in model size for your downloading to set it up and with enough parameters to train the model (some of the LLAMA models are ideal in this respect), unfortunately the performance you're going to get running these on the average computer, like a gaming PC, is going to be pretty piss-poor. The response speed/output generation will be excruciatingly slow in most cases.
Thanks a lot for this answer. I have to go to sleep for now. I'll write specs tomorrow!
GPT4All is all you need.
Easy to use, and to update. Everything is simply one click away, available for Linux, macOS, and Windows.
If you want more powerful stuff, there is oobabooga's text generation web ui, which is a web based application that you could run on your server.
Top 3 Self-Hosted LLMs ConsiderationsTop 3 Self-Hosted LLMs Considerations
How much RAM do you have available? Does the server have a GPU?
The choice of model would depend on that.
Guys all answers are amazing. But I must give the bounty to only one person. I'm gonna go to the answer provided by @02dc6a50c1 as it fulfills all the requirements.
Will also give another round of zaps cuz all answers are great.
@dieselbaby answer is also quite comprehensive and I'd like to continue asking questions to this user. Zaps will flow in that direction as well!
This whole bounty business is hard!
Hello everyone! I have seen all answers and they are great! I will give each a jiucy zap for your texts and I will also give the bounty to the best answer I find. Im super grateful for everyones participation! I think you all helped the internet to make a good database on something important.
Im gonna go to sleep for now and I will give the bounty tomorrow when I wake up and can analyse the answers better!
Thanks a lot!
deleted by author
Hello! Oh! I didnt expeceted that you will delete the anser :(... I have not backed it up :(... can you help me with that? Could you send it over again... PLEASE!