Hi all,
After some weeks spent playing with "toys" like ChatGPT and similar, made available both as online web pages and as APIs by their owners, I came to the conclusion that the real business is creating specialized LLMs augmented by RAG (Retrieval-Augmented Generation).
I mean, chatting for a while with a generic chatbot on the most diverse themes is fascinating. Letting a tool create an image (or even a short video) based on given prompts is awesome. So is having TTS functionality or Assistant-like tools.
But what about a system tailored to a specific task? Something that knows almost everything about a product or the often-changing timetables of buses/trains/planes.
From what I have read so far, we need a good LLM coupled with a RAG "module".
Now, training a proprietary LLM is out of the question due to the computing power and costs involved. A RAG, on the other hand, could be feasible (mainly if limited in scope).
So, the goal of this thread is to open a discussion on how to build a system based on the above concepts (and others that may emerge here).
Which tools to use? Any hands-on experience to share?
Is there already a system that loops over the documents (PDFs, text, spreadsheets...) in a folder and builds a RAG on those? Can it work in conjunction with a free (or low-cost) LLM in order to deliver a complete, professional-looking solution on a specific subject?
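To make the question more concrete, here is a minimal sketch of the "loop over a folder and retrieve" idea, written in Python with the standard library only. Everything in it is an assumption made for illustration: the function names (load_chunks, retrieve, build_prompt) are hypothetical, it only reads plain .txt files (PDFs and spreadsheets would first need a text-extraction step), and it ranks chunks by simple word-overlap cosine similarity, whereas real RAG setups typically use embeddings and a vector store. The resulting prompt would then be sent to whichever LLM API is chosen; that call is left out on purpose.

```python
import math
import re
from collections import Counter
from pathlib import Path


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def load_chunks(folder, chunk_words=100):
    """Read every .txt file in `folder` and split it into chunks of words.

    Returns a list of (file_name, chunk_text) tuples.
    """
    chunks = []
    for path in sorted(Path(folder).glob("*.txt")):
        words = path.read_text(encoding="utf-8").split()
        for i in range(0, len(words), chunk_words):
            chunks.append((path.name, " ".join(words[i:i + chunk_words])))
    return chunks


def cosine(a, b):
    """Cosine similarity between two word-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, chunks, top_k=2):
    """Return up to `top_k` chunks that share words with the question."""
    q = Counter(tokenize(question))
    scored = [(cosine(q, Counter(tokenize(text))), name, text)
              for name, text in chunks]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(name, text) for score, name, text in scored[:top_k] if score > 0]


def build_prompt(question, chunks, top_k=2):
    """Assemble the text to send to an LLM: retrieved context plus question."""
    context = "\n".join(f"[{name}] {text}"
                        for name, text in retrieve(question, chunks, top_k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"
```

Typical usage would be chunks = load_chunks("docs") once at startup (the folder name "docs" is just a placeholder), then build_prompt(user_question, chunks) for every incoming question. Since the documents are re-read from the folder, often-changing data such as timetables only requires updating the files, not the LLM.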
Can we host the final "object" on a dedicated VPS, making it available to a (more or less) large audience by means of APIs?
As said, this thread is intended as a 360-degree collection of ideas and experiences. How does B4X come into play? Well, don't underestimate B4X's power and flexibility...