
Deployable On-Premises RAG


Source: dev.to

I’m excited to introduce Minima, an open-source Retrieval-Augmented Generation (RAG) solution designed to work seamlessly on-premises or with integrations like ChatGPT and the Model Context Protocol (MCP). Whether you’re looking for a fully local RAG setup or prefer to integrate with external LLMs, Minima has you covered.

Minima is a containerized RAG solution that prioritizes security, flexibility, and simplicity. You can run it fully locally or integrate it with external AI services, depending on your needs.

Key Features

Minima currently supports three modes of operation:

Isolated Installation

• Fully on-premises operation with no external dependencies (e.g., ChatGPT or Claude).

• All neural networks (LLM, reranker, and embedding model) run on your own cloud or local PC.

• Ensures your data stays secure and private.
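To make the three local components concrete, here is a minimal sketch of how an embedding step, a reranker, and an LLM typically fit together in a RAG pipeline. It is not Minima's code: every function name below is hypothetical, and each model is replaced by a trivial stand-in (word counts for embeddings, term overlap for reranking, a prompt builder for the LLM) so the example runs with no external dependencies.

```python
import math
from collections import Counter


def embed(text):
    """Stand-in embedding (assumption): a bag-of-words term-count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, docs, k=2):
    """Return the k documents most similar to the query."""
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]


def rerank(query, candidates):
    """Stand-in reranker (assumption): order by distinct query terms present."""
    terms = set(query.lower().split())
    return sorted(
        candidates,
        key=lambda d: len(terms & set(d.lower().split())),
        reverse=True,
    )


def generate(query, context):
    """Stand-in for a local LLM: returns the prompt the model would receive."""
    joined = "\n".join(context)
    return f"Context:\n{joined}\n\nQuestion: {query}"


def answer(query, docs, k=2):
    """Retrieve, rerank, then generate, all without leaving the machine."""
    return generate(query, rerank(query, retrieve(query, docs, k)))
```

In an isolated setup, each stand-in would be replaced by a model running on your own hardware, so neither the documents nor the query leave your infrastructure.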

Custom GPT

• Query your local documents directly through the ChatGPT app or web interface via custom GPTs.

• The indexer runs on your local PC or cloud, while ChatGPT serves as the primary LLM.

Anthropic Claude

• Use the Claude app to query your local documents.

• The indexer operates on your local PC, with Anthropic Claude as the primary LLM.

With Minima, you can enjoy a flexible RAG solution that adapts to your infrastructure and security preferences.

Would love to hear your feedback, thoughts, or ideas! Check it out, and let me know what you think.

Cheers!

https://github.com/dmayboroda/minima

