
🚀 AtoRAG - Universal RAG Knowledge Base Extension

by AtonomyAI

Transform any content into a searchable RAG (Retrieval-Augmented Generation) knowledge base through Claude Desktop

AtoRAG is a self-contained desktop extension by AtonomyAI that turns Claude Desktop into a RAG-enabled knowledge management system. Store documents, CSV data, policies, research papers, meeting notes, or any other text content, then search and retrieve it in natural language. Results are ranked with a hybrid of TF-IDF and string similarity.

✨ Key Features

🎯 Universal Content Storage

  • Documents: PDFs, text files, research papers, manuals
  • Structured Data: CSV files with automatic column detection
  • Policies & Procedures: Company documents, guidelines, SOPs
  • Meeting Notes: Agendas, minutes, action items
  • Research: Studies, reports, analysis documents
  • Contracts: Agreements, invoices, legal documents
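
As a rough illustration of what "automatic column detection" for CSV data could look like, here is a minimal TypeScript sketch. It is not AtoRAG's implementation: the `detectColumns` function, the `Column` type, and the two-way number/text classification are assumptions, and the parser ignores quoted fields.

```typescript
type ColumnType = "number" | "text";

interface Column {
  name: string;
  type: ColumnType;
}

// Hypothetical helper: read the header row, then mark a column as numeric
// only if every non-empty value in it parses as a number.
// Assumes simple CSV with no quoted or escaped commas.
function detectColumns(csv: string): Column[] {
  const rows = csv
    .trim()
    .split(/\r?\n/)
    .map((line) => line.split(",").map((cell) => cell.trim()));
  const [header, ...data] = rows;
  return header.map((name, i) => {
    const values = data.map((row) => row[i] ?? "").filter((v) => v !== "");
    const numeric =
      values.length > 0 && values.every((v) => !Number.isNaN(Number(v)));
    return { name, type: numeric ? "number" : "text" };
  });
}

// Made-up sample data for illustration.
const columns = detectColumns("region,sales\nNorth,1200\nSouth,950");
console.log(columns);
```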

🔍 RAG-Powered Intelligent Search

  • Enhanced Similarity: TF-IDF + string similarity hybrid for superior accuracy
  • Natural Language: Search using everyday language queries
  • Semantic Search: Finds contextually relevant content, not just keyword matches
  • Smart Filtering: Filter by type, tags, date ranges, and partitions
  • Advanced Scoring: Multiple relevance factors with proper similarity ranking
  • Real-time Results: Instant search with optimized performance
  • Retrieval-Augmented Generation: Perfect for feeding Claude with relevant context

🏷️ Automatic Organization

  • Auto-tagging: Extracts meaningful tags from content
  • Type Detection: Automatically categorizes documents
  • Smart Summaries: Generates concise summaries
  • Collections: Organize related documents together
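
Auto-tagging can be approximated by counting frequent terms. The sketch below is only an illustration under that assumption: the `extractTags` function, the stop-word list, and the frequency ranking are hypothetical and not taken from AtoRAG's source.

```typescript
// Small illustrative stop-word list; a real one would be much longer.
const STOP_WORDS = new Set([
  "the", "a", "an", "and", "or", "of", "to", "in", "for", "on",
  "is", "are", "with", "this", "that",
]);

// Hypothetical helper: return the most frequent non-stop-words as tags.
// Ties are broken alphabetically so the output is deterministic.
function extractTags(text: string, maxTags = 5): string[] {
  const counts = new Map<string, number>();
  const words = text.toLowerCase().match(/[a-z][a-z0-9-]{2,}/g) ?? [];
  for (const word of words) {
    if (STOP_WORDS.has(word)) continue;
    counts.set(word, (counts.get(word) ?? 0) + 1);
  }
  return Array.from(counts.entries())
    .sort((a, b) => b[1] - a[1] || a[0].localeCompare(b[0]))
    .slice(0, maxTags)
    .map(([word]) => word);
}

// Made-up sample text for illustration.
const tags = extractTags(
  "Budget review: the budget for security training and security audits. Budget approved.",
  2,
);
console.log(tags);
```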

💻 Zero Dependencies & Production Ready

  • Self-Contained: Complete RAG system with no Python, no external installations
  • Enhanced Similarity: TF-IDF + string similarity hybrid for accurate results
  • SQLite Smart Database: Efficient local content storage with semantic search
  • Built-in Node.js: Uses Claude Desktop's runtime (Node.js v23.11.0+ supported)
  • Local Storage: All data and content analysis stay on your machine
  • Drag & Drop: Install complete RAG system with a single .dxt file
  • Fully Tested: 100% tool validation through comprehensive pipeline testing
  • Production Quality: Robust error handling and optimized performance

🏢 Enterprise RAG Solutions

  • Shared Corporate Knowledge Bases: PostgreSQL-powered RAG servers with advanced similarity algorithms
  • Multi-User RAG Access: Role-based permissions and audit trails for knowledge retrieval
  • Centralized RAG Management: All employees access the same intelligently indexed knowledge base
  • Custom RAG Integrations: SSO, API development, custom similarity models, etc.

Enterprise Architecture:

┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   Claude Desktop│    │   AtoRAG Client │    │  PostgreSQL RAG │
│   (Employee)    │◄──►│   (MCP Server)  │◄──►│   (Corporate)   │
└─────────────────┘    └─────────────────┘    └─────────────────┘
                              │
                              ▼
                       ┌─────────────────┐
                       │   Redis Cache   │
                       │   (TF-IDF)      │
                       └─────────────────┘

Interested in an enterprise MCP solution? Contact us at info@atonomy.ai

🚀 Quick Start

Installation

Option 1: Download Pre-built Extension

  1. Download the latest release for a pre-built .dxt file that can be uploaded directly to Claude Desktop.

Option 2: Build from Source

  1. Clone the repository:

    git clone https://github.com/Atonomy-AI/AtoRAG.git
    cd AtoRAG
  2. Build the extension:

    ./scripts/build.sh
  3. Install in Claude Desktop:

    • Open Claude Desktop
    • Go to Settings → Extensions
    • Drag and drop the generated AtoRAG.dxt file
    • Click 'Install'
  4. Optional: Configure backup directory path (recommended)

  5. Start using: The extension is now available in your Claude conversations!

⚙️ Configuration

Backup Directory Path (Recommended)

To preserve your backups when uninstalling the extension, set a custom backup path:

  1. During installation, set Backup Directory Path to a location outside the extension directory
  2. Recommended paths:
    • macOS/Linux: ~/Documents/AtoRAG-Backups/
    • Windows: C:\Users\YourName\Documents\AtoRAG-Backups\

If not configured, backups will be stored in ~/.atorag/backups/ (may be lost when uninstalling).
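
The fallback described above can be sketched as a small pure function. This is an illustration only: `resolveBackupDir` and its parameters are hypothetical names, and the only value taken from this README is the default `~/.atorag/backups/` location. A real extension would more likely use Node's `os.homedir()` and `path.join` than string concatenation.

```typescript
// Default location named in this README, relative to the home directory.
const DEFAULT_BACKUP_SUBDIR = ".atorag/backups";

// Hypothetical helper: prefer the configured path, expand a leading "~",
// and fall back to the default directory when nothing is configured.
function resolveBackupDir(
  configuredPath: string | undefined,
  homeDir: string,
): string {
  const trimmed = (configuredPath ?? "").trim();
  if (trimmed === "") {
    return `${homeDir}/${DEFAULT_BACKUP_SUBDIR}`;
  }
  if (trimmed === "~" || trimmed.startsWith("~/")) {
    return homeDir + trimmed.slice(1);
  }
  return trimmed;
}

// "/home/ana" is a made-up home directory for illustration.
const defaultDir = resolveBackupDir(undefined, "/home/ana");
const customDir = resolveBackupDir("~/Documents/AtoRAG-Backups/", "/home/ana");
console.log(defaultDir, customDir);
```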

Basic Usage

Store documents (multiple ways):

# Upload files directly in Claude Desktop, then:
Save these uploaded documents to AtoRAG

# Or paste content:
Store this policy document: [paste your content]

# Or import CSV data:
Import this CSV file: [paste CSV content]

Search your knowledge base:

Find all documents about data privacy
Search for information about budget planning
What do we have on customer feedback?

Browse by type:

Show me all policy documents
List all meeting notes from this month

🤖 What is RAG (Retrieval-Augmented Generation)?

RAG combines the power of information retrieval with AI generation to provide more accurate, contextual responses. Instead of relying solely on the AI's training data, RAG systems:

  1. 📚 Store Knowledge: Documents are processed and stored with TF-IDF analysis
  2. 🔍 Retrieve Relevant Info: When you ask a question, the system finds the most relevant stored content
  3. 🧠 Generate Responses: AI uses the retrieved information to provide accurate, up-to-date answers

🎯 How AtoRAG Implements RAG

AtoRAG transforms Claude Desktop into a complete RAG system:

Your Question → Enhanced Similarity Search → Retrieved Context → Claude + Context → Enhanced Answer

Example RAG Flow:

  1. Store: "Our company policy states employees can work remotely 3 days per week"
  2. Ask: "What's our remote work policy?"
  3. Retrieve: AtoRAG finds the relevant policy document using semantic search
  4. Generate: Claude uses the retrieved policy text to give you an accurate answer
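
The store, retrieve, and generate steps above can be sketched in a few lines of TypeScript. This is a minimal illustration, not AtoRAG's code: the `Doc` type, the `retrieve` and `buildPrompt` helpers, and the plain word-overlap score are assumptions standing in for the real similarity search, and the second document is made-up sample data.

```typescript
// Hypothetical in-memory store; AtoRAG itself persists documents in SQLite.
interface Doc {
  id: number;
  text: string;
}

// Lowercase the text and split it into alphanumeric word tokens.
function tokenize(text: string): string[] {
  return text.toLowerCase().match(/[a-z0-9]+/g) ?? [];
}

// Fraction of distinct query words that also appear in the document.
function overlapScore(query: string, doc: Doc): number {
  const queryWords = Array.from(new Set(tokenize(query)));
  const docWords = new Set(tokenize(doc.text));
  if (queryWords.length === 0) return 0;
  const hits = queryWords.filter((word) => docWords.has(word)).length;
  return hits / queryWords.length;
}

// Retrieve: rank documents by score and keep the best `topK` matches.
function retrieve(query: string, docs: Doc[], topK: number): Doc[] {
  return docs
    .map((doc) => ({ doc, score: overlapScore(query, doc) }))
    .filter((entry) => entry.score > 0)
    .sort((a, b) => b.score - a.score)
    .slice(0, topK)
    .map((entry) => entry.doc);
}

// Generate: hand the retrieved text to the model alongside the question.
function buildPrompt(question: string, context: Doc[]): string {
  const contextBlock = context.map((doc) => `- ${doc.text}`).join("\n");
  return `Context:\n${contextBlock}\n\nQuestion: ${question}`;
}

// Store: the first entry is the policy sentence from the example above.
const docs: Doc[] = [
  { id: 1, text: "Our company policy states employees can work remotely 3 days per week" },
  { id: 2, text: "Quarterly budget review is due in March" },
];

const question = "What's our remote work policy?";
const prompt = buildPrompt(question, retrieve(question, docs, 1));
console.log(prompt);
```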

🔧 RAG Technical Implementation

  • 🧮 Enhanced Similarity: TF-IDF + string similarity hybrid (70% TF-IDF + 30% string matching)
  • 🗄️ Smart Database: SQLite with efficient content storage and enhanced similarity search
  • ✂️ Smart Chunking: Automatically splits large documents (>500 words) into searchable chunks
  • 🔍 Semantic Search: Finds contextually relevant content with accurate similarity scoring
  • ⚡ Real-time: Instant retrieval and context injection into Claude conversations
  • 🎯 Production Quality: Comprehensive testing with 100% tool validation
  • 📊 Advanced Analytics: Content analysis, readability scoring, and key phrase extraction
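
To make the 70%/30% weighting concrete, here is a hedged TypeScript sketch of one way such a hybrid score could be computed. Only the 0.7 and 0.3 weights come from this README. The choice of cosine similarity over TF-IDF vectors, the smoothed IDF formula, the Dice coefficient on character bigrams as the "string similarity", and every function name are assumptions, and the query is included in the corpus when computing IDF for simplicity.

```typescript
type Vector = Map<string, number>;

function tokenize(text: string): string[] {
  return text.toLowerCase().match(/[a-z0-9]+/g) ?? [];
}

// Build TF-IDF vectors with a smoothed IDF: ln((1 + N) / (1 + df)) + 1.
function tfidfVectors(texts: string[]): Vector[] {
  const tokenized = texts.map(tokenize);
  const docFreq = new Map<string, number>();
  for (const tokens of tokenized) {
    for (const term of Array.from(new Set(tokens))) {
      docFreq.set(term, (docFreq.get(term) ?? 0) + 1);
    }
  }
  return tokenized.map((tokens) => {
    const termFreq: Vector = new Map();
    for (const term of tokens) {
      termFreq.set(term, (termFreq.get(term) ?? 0) + 1);
    }
    const weighted: Vector = new Map();
    termFreq.forEach((tf, term) => {
      const df = docFreq.get(term) ?? 0;
      weighted.set(term, tf * (Math.log((1 + texts.length) / (1 + df)) + 1));
    });
    return weighted;
  });
}

function cosine(a: Vector, b: Vector): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  a.forEach((weight, term) => {
    dot += weight * (b.get(term) ?? 0);
    normA += weight * weight;
  });
  b.forEach((weight) => {
    normB += weight * weight;
  });
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// Dice coefficient over character bigrams (assumed string-similarity measure).
function diceCoefficient(a: string, b: string): number {
  const bigrams = (s: string): string[] => {
    const clean = s.toLowerCase().replace(/\s+/g, "");
    const out: string[] = [];
    for (let i = 0; i < clean.length - 1; i++) out.push(clean.slice(i, i + 2));
    return out;
  };
  const left = bigrams(a);
  const right = bigrams(b);
  if (left.length === 0 || right.length === 0) return 0;
  const counts = new Map<string, number>();
  for (const gram of left) counts.set(gram, (counts.get(gram) ?? 0) + 1);
  let shared = 0;
  for (const gram of right) {
    const remaining = counts.get(gram) ?? 0;
    if (remaining > 0) {
      shared++;
      counts.set(gram, remaining - 1);
    }
  }
  return (2 * shared) / (left.length + right.length);
}

// Hybrid score: 70% TF-IDF cosine similarity + 30% string similarity.
function hybridScores(query: string, docs: string[]): number[] {
  const vectors = tfidfVectors([query, ...docs]);
  const queryVec = vectors[0];
  return docs.map(
    (doc, i) =>
      0.7 * cosine(queryVec, vectors[i + 1]) + 0.3 * diceCoefficient(query, doc),
  );
}

// Made-up sample documents for illustration.
const scores = hybridScores("remote work policy", [
  "remote work policy for employees",
  "quarterly budget planning",
]);
console.log(scores);
```

The string-similarity term gives partial credit for near matches such as "remote" and "remotely", which a purely token-based TF-IDF score would miss.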

🎨 RAG Use Cases

📊 Business Intelligence RAG:

  • Store quarterly reports, then ask "What were our key challenges last quarter?"
  • Upload meeting minutes, then ask "What decisions were made about the budget?"

🔬 Research RAG:

  • Store research papers, then ask "What are the latest findings on AI safety?"
  • Upload documentation, then ask "How do I implement this API?"

📚 Knowledge Management RAG:

  • Store company policies, then ask "What's our vacation policy?"
  • Upload manuals, then ask "How do I troubleshoot this error?"

🛠️ Available MCP Tools (9 Total)

📄 Document Management

  • add_document: Store any document with automatic analysis and tagging
  • search_documents: Enhanced semantic search with TF-IDF + string similarity
  • get_document: Retrieve specific documents by ID with full metadata
  • list_documents: Browse all documents with advanced filtering options
  • delete_document: Remove documents from the knowledge base
  • analyze_content: Analyze text for readability, key phrases, and insights

📁 Organization Tools

  • manage_collections: Create, list, and organize document collections
  • manage_partitions: Create and manage partitions for structured organization

💾 Backup & Restore

  • backup_restore: Create backups, restore from backups, and list available snapshots
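
Claude Desktop invokes these tools for you, but it can help to see the shape of the underlying request. MCP is built on JSON-RPC 2.0 and tools are invoked with the `tools/call` method. In the sketch below, the tool name comes from the list above, while the `query` and `limit` argument names and the `makeToolCall` helper are assumptions, since this README does not document each tool's input schema.

```typescript
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: {
    name: string;
    arguments: Record<string, unknown>;
  };
}

// Hypothetical helper that builds an MCP tools/call request object.
function makeToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>,
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// Argument names here are illustrative assumptions, not the documented schema.
const request = makeToolCall(1, "search_documents", {
  query: "data privacy",
  limit: 5,
});
console.log(JSON.stringify(request, null, 2));
```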

🎨 Usage Examples

Storing Different Content Types

Upload Multiple Files:

# Upload PDFs, Word docs, text files, etc. in Claude Desktop, then:
Save all these uploaded documents to AtoRAG in the "policy" partition

Paste Content:

Store this meeting note:
Title: "Q4 Planning Meeting - Dec 2024"
Content: "Attendees: John, Sarah, Mike. Agenda: Budget review..."

Batch Processing:

# Upload multiple research papers, then:
Save these research papers to AtoRAG in the "research" partition

Smart Searching

Natural Language:

Find documents about budget planning
Search for privacy policies
What do we have on customer feedback?

Filtered Search:

Show me all meeting documents from this month
Find policy documents tagged with "security"
Search for CSV files about sales data

Content Analysis

Before Storing:

Analyze this content before I store it: [paste content]

Results:

  • Word count and reading time
  • Detected document type
  • Suggested tags
  • Key topics
  • Summary preview
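
As a rough idea of how the first of these results could be derived, the sketch below computes a word count and an estimated reading time. The `analyzeText` function, the `ContentStats` type, and the 200 words-per-minute reading speed are assumptions for illustration, not values taken from AtoRAG.

```typescript
interface ContentStats {
  wordCount: number;
  readingTimeMinutes: number;
}

// Hypothetical helper: count whitespace-separated words and round the
// reading time up to whole minutes at an assumed reading speed.
function analyzeText(text: string, wordsPerMinute = 200): ContentStats {
  const words = text.trim().split(/\s+/).filter((word) => word.length > 0);
  return {
    wordCount: words.length,
    readingTimeMinutes: Math.ceil(words.length / wordsPerMinute),
  };
}

// A synthetic 450-word text for illustration.
const stats = analyzeText(new Array(450).fill("word").join(" "));
console.log(stats);
```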

Backup & Restore

Create Backup:

Create a backup of my knowledge base

List Backups:

Show me all available backups

Restore from Backup:

Restore from the most recent backup

Custom Backup Location:

Create a backup in my custom directory: /Users/myname/Documents/MyBackups/

🏗️ RAG Technical Architecture

Enhanced RAG Processing Pipeline

  • 🧮 Hybrid Similarity: TF-IDF + string similarity hybrid (70%/30%) for superior accuracy
  • ✂️ Smart Chunking: Automatically splits large documents (>500 words) into optimal chunks
  • 🔍 Enhanced Search: Advanced similarity scoring with proper ranking algorithms
  • 📊 Multi-factor Scoring: Combines semantic understanding with exact string matching
  • 🏷️ Content Analysis: Automatic type detection, tagging, and readability analysis
  • 🎯 Production Quality: 100% tool validation with comprehensive testing
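
The chunking step can be illustrated with a simple word-based splitter. Only the 500-word threshold comes from this README. Using the same number as the chunk size, splitting on whitespace, and not overlapping chunks are assumptions, as is the `chunkByWords` name.

```typescript
// Hypothetical helper: leave short documents untouched and split longer
// ones into consecutive chunks of at most `maxWords` words.
function chunkByWords(text: string, maxWords = 500): string[] {
  const words = text.split(/\s+/).filter((word) => word.length > 0);
  if (words.length <= maxWords) return [text];
  const chunks: string[] = [];
  for (let start = 0; start < words.length; start += maxWords) {
    chunks.push(words.slice(start, start + maxWords).join(" "));
  }
  return chunks;
}

// A synthetic 1200-word document yields chunks of 500, 500, and 200 words.
const longText = Array.from({ length: 1200 }, (_, i) => `w${i}`).join(" ");
const chunkSizes = chunkByWords(longText).map(
  (chunk) => chunk.split(" ").length,
);
console.log(chunkSizes);
```

A production chunker would probably also respect sentence or paragraph boundaries so that a chunk does not end mid-thought.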

Local RAG Storage & Performance

  • 🗄️ SQLite Smart Database: Efficient content storage with enhanced similarity search
  • 🏠 Home Directory: ~/.atorag/ - All content analysis and data stored locally
  • 💾 Robust Backups: Complete RAG system backup and restore with configurable paths
  • 🔒 Privacy First: All content analysis and data stay on your machine - no cloud dependencies
  • ⚡ Optimized Performance: Real-time search with efficient TF-IDF similarity calculation

Document Types

  • document - General documents
  • policy - Policies and procedures
  • meeting - Meeting notes and agendas
  • report - Reports and analysis
  • research - Research papers and studies
  • guide - Manuals and guides
  • contract - Legal documents
  • financial - Invoices and financial docs
  • csv - Structured data files
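
The type names above map naturally onto a TypeScript union. The keyword-based `detectType` function below is only a guess at how automatic type detection might work: the keyword lists are invented for illustration, and CSV detection is left out because it would depend on file structure rather than keywords.

```typescript
// The nine document types listed above.
type DocumentType =
  | "document"
  | "policy"
  | "meeting"
  | "report"
  | "research"
  | "guide"
  | "contract"
  | "financial"
  | "csv";

// Hypothetical keyword lists; these are not taken from AtoRAG.
const TYPE_KEYWORDS: Array<[DocumentType, string[]]> = [
  ["policy", ["policy", "procedure", "guideline"]],
  ["meeting", ["agenda", "attendees", "minutes"]],
  ["report", ["findings", "executive summary", "quarterly"]],
  ["research", ["abstract", "methodology", "hypothesis"]],
  ["guide", ["manual", "step-by-step", "troubleshoot"]],
  ["contract", ["agreement", "hereby", "party"]],
  ["financial", ["invoice", "payment", "amount due"]],
];

// Pick the type with the most keyword hits; fall back to "document".
function detectType(text: string): DocumentType {
  const lower = text.toLowerCase();
  let best: DocumentType = "document";
  let bestHits = 0;
  for (const [type, keywords] of TYPE_KEYWORDS) {
    const hits = keywords.filter((k) => lower.indexOf(k) !== -1).length;
    if (hits > bestHits) {
      best = type;
      bestHits = hits;
    }
  }
  return best;
}

const detected = detectType(
  "Attendees: John, Sarah, Mike. Agenda: Budget review...",
);
console.log(detected);
```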

🎯 Use Cases

📊 Business Intelligence

  • Store and search company policies
  • Analyze meeting minutes and decisions
  • Track project documentation
  • Organize contracts and agreements

🔬 Research & Development

  • Build a personal research library
  • Store and cross-reference papers
  • Track experimental data
  • Organize literature reviews

📈 Data Management

  • Import and search CSV datasets
  • Store data dictionaries
  • Track data lineage
  • Analyze structured information

📝 Knowledge Management

  • Personal note-taking system
  • Team knowledge sharing
  • Document version control
  • Content organization

🛡️ Privacy & Security

  • 100% Local: All data stored on your machine
  • No Cloud: No external API calls or data transmission
  • Encrypted Storage: Local file system security
  • Private by Design: Your data never leaves your device

🔧 Development

Building from Source

npm install
./scripts/build.sh

The build.sh script handles everything: it installs dependencies, builds the extension, and creates the .dxt file in the root directory.

📜 License

MIT License - Open source and free to use, modify, and distribute.

🤝 Contributing

We welcome contributions! AtoRAG is designed to be a universal knowledge management solution for everyone.

🆘 Support

Having issues? Check these common solutions:

  1. Extension not loading: Restart Claude Desktop
  2. Search not working: Check that documents are actually stored (for example, ask Claude to list all documents)
  3. Import failing: Verify file format
  4. Performance issues: Large documents may take time to process
  5. Other issues: Submit an issue on GitHub

🌟 Why AtoRAG?

  • Autonomous: Works independently without external dependencies
  • Intelligent: Smart content processing and search
  • Universal: Handles any type of content
  • Private: Your data stays with you
  • Simple: Just drag, drop, and start using

Transform your Claude Desktop into the ultimate knowledge management system with AtoRAG! 🚀


Built with ❤️ by AtonomyAI for the Claude Desktop community
