Hi all,
after some weeks spent playing with "toys" like ChatGPT and similar, made available both as online webpages and APIs by their owners, I came to the conclusion that the real business is creating specialized LLMs augmented by RAGs (Retrieval Augmented Generation).
I mean, chatting for a while with a generic chatbot on the most diverse themes is fascinating. Letting a tool create an image (or even a short video) based on given prompts is awesome. So to having TTS functionality or Assistant-like tools.
But what about a system tailored to a specific task? Something that knows almost everything about a product or the often-changing timetables of buses/trains/planes.
For what I read so far, we need a good LLM coupled with a RAG "module".
Now, generating a proprietary LLM is out of question due to power and costs associated. Maybe a RAG could be feasible (mainly if limited in scope).
So, the goal of this thread is to open a discussion on how to build a system based on the above concepts (and others that may emerge here).
Which tools to use? Any hands-on experience to share?
Is there already a system that loops on documents (PDFs, text, spreadsheets..) in a folder and build a RAG on those? Can it work in conjunction with a free (or low cost) LLM in order to bring up a complete, professionally looking solution about a specific matter?
Can we host the final "object" on a dedicated VPS making it available to a (more or less) large audience by the means of APIs ?
As said, this thread is intented as a 360 degrees collection of ideas and experiences. How B4X comes in to play? Well, don't underestimate B4X power and flexibility...
after some weeks spent playing with "toys" like ChatGPT and similar, made available both as online webpages and APIs by their owners, I came to the conclusion that the real business is creating specialized LLMs augmented by RAGs (Retrieval Augmented Generation).
I mean, chatting for a while with a generic chatbot on the most diverse themes is fascinating. Letting a tool create an image (or even a short video) based on given prompts is awesome. So to having TTS functionality or Assistant-like tools.
But what about a system tailored to a specific task? Something that knows almost everything about a product or the often-changing timetables of buses/trains/planes.
For what I read so far, we need a good LLM coupled with a RAG "module".
Now, generating a proprietary LLM is out of question due to power and costs associated. Maybe a RAG could be feasible (mainly if limited in scope).
So, the goal of this thread is to open a discussion on how to build a system based on the above concepts (and others that may emerge here).
Which tools to use? Any hands-on experience to share?
Is there already a system that loops on documents (PDFs, text, spreadsheets..) in a folder and build a RAG on those? Can it work in conjunction with a free (or low cost) LLM in order to bring up a complete, professionally looking solution about a specific matter?
Can we host the final "object" on a dedicated VPS making it available to a (more or less) large audience by the means of APIs ?
As said, this thread is intented as a 360 degrees collection of ideas and experiences. How B4X comes in to play? Well, don't underestimate B4X power and flexibility...