Update docs.

This commit is contained in:
retoor 2025-01-20 13:36:59 +01:00
parent d27325beae
commit cde66da983

View File

@ -40,7 +40,10 @@ It's also possible to give `api_key` as parameter to the initiation of the `rage
--- ---
## Costs ## For free!
But there is a small catch. It's very easy to replace Open AI with a local hosted LLM like Olama. Olama is installed within minutes using a one liner. You will figure the replacement of the URL out in 20 minutes or so. So, with a good hour + time to download your favorite Olama LLM you have a chatbot for free in a few hours wherefrom most is waiting. I recommend models above 3b of even above 7b. My personal experience with Olama LLM's is that llama models / qwen (3b+) / gemma2 works the best. Gemma2 is made by Google and is only 2b and 4gb or so. Gemma2 is probably the most for less. You can try it out with the `python -m ragent.demo_olama` command. Just kiddin'. You really have to do this small thing yourself. I don't have the right hardware to run a decent LLM so i just didn't implement it. Don't be cheap AND lazy. It's worth it.
## Costs if you use Open AI
You can chat the whole day with the bots for just ten cents or so. See here how much it costed to test it extensively. You can chat the whole day with the bots for just ten cents or so. See here how much it costed to test it extensively.
![Intense usage costs.](costs.png) ![Intense usage costs.](costs.png)