Knowledge Retrieval

RAG-powered search through indexed knowledge bases with sub-100ms hybrid search combining keyword matching and semantic understanding.

Overview

Knowledge Retrieval is a built-in capability that brings sophisticated RAG (Retrieval-Augmented Generation) capabilities to AI agents. When enabled on your MCP Gateway, agents can perform hybrid search that combines traditional keyword matching with deep semantic understanding, delivering the most relevant information from your knowledge base in milliseconds.

Key Features

Hybrid Search: Combines BM25 keyword search with vector semantic search
Sub-100ms Latency: Optimized for real-time agent interactions
Smart Chunking: Automatically segments documents for optimal context
Relevance Scoring: Transparent ranking with explainable results
LLM Optimization: Returns chunks sized for context windows
Multi-Format Support: PDF, DOCX/XLSX/PPTX, Markdown, AsciiDoc, HTML/XHTML, CSV, images (PNG, JPEG, TIFF, BMP, WEBP)

How to Enable

Navigate to your MCP Gateway settings
Locate Knowledge Retrieval in the Built-in Capabilities section
Toggle the capability ON
Configure knowledge bases (create and index documents)
Save your configuration

Once enabled, any AI agent connected to your gateway can search indexed knowledge through the MCP protocol.

Supported Input Formats

Format	Description
PDF	Portable Document Format files with text extraction
DOCX, XLSX, PPTX	Microsoft Office 2007+ formats based on Office Open XML
Markdown	Plain text formatting syntax (.md files)
AsciiDoc	Lightweight markup language (.adoc files)
HTML, XHTML	Web markup languages
CSV	Comma-separated values for structured data
PNG, JPEG, TIFF, BMP, WEBP	Image formats processed with OCR for text extraction

All formats are automatically processed and indexed for immediate searchability. Images undergo OCR (Optical Character Recognition) to extract text content for indexing.

What AI Agents Can Do

Information Retrieval

Search across multiple knowledge bases simultaneously
Find specific facts, procedures, or documentation
Retrieve context-relevant chunks for analysis
Access historical versions of documents

Semantic Understanding

Answer natural language questions
Find conceptually similar content
Discover related topics and themes
Bridge terminology differences

Document Analysis

Extract key information from large documents
Identify patterns across document collections
Generate summaries from retrieved chunks
Cross-reference multiple sources

Knowledge Management

Organize information into searchable collections
Maintain up-to-date knowledge bases
Track document changes and versions
Ensure information consistency

Search Types

Keyword Search (BM25)

Best for specific terms, product names, or exact phrases. Uses traditional information retrieval scoring for precise term matching.

Semantic Search (Vector)

Best for concepts, similar meanings, or natural questions. Uses embeddings to find contextually related content.

Hybrid Search (Default)

Combines both approaches for optimal results, leveraging the precision of keyword matching with the understanding of semantic search.

Knowledge Base Management

Creating Knowledge Bases

Agents can create named knowledge bases with specific configurations for chunk size, overlap, and embedding models based on content type and use case.

Document Indexing

Documents are automatically processed, chunked, and indexed. The system extracts metadata, generates embeddings, and creates searchable indices.

Metadata Enrichment

Automatic extraction of document properties including title, creation date, language, and custom tags for enhanced filtering and retrieval.

Capability Behavior

Automatic Features

Intelligent document chunking based on content structure
Automatic language detection and handling
Duplicate detection and deduplication
Incremental indexing for new content
Cache optimization for frequent queries

Search Optimization

Query expansion with synonyms and related terms
Relevance boosting based on metadata
Context-aware result ranking
Automatic spell correction
Result deduplication

Performance Characteristics

Index Size	Keyword Search	Semantic Search	Hybrid Search
1K docs	<10ms	<50ms	<60ms
10K docs	<20ms	<70ms	<85ms
100K docs	<30ms	<90ms	<100ms
1M docs	<50ms	<100ms	<120ms

Typical Applications

Customer Support Systems

AI agents use Knowledge Retrieval to instantly find relevant support articles, troubleshooting guides, and FAQ answers, providing accurate responses to customer inquiries based on comprehensive documentation.

Research Assistants

Agents search through research papers, technical documentation, and reference materials to compile comprehensive information on topics, supporting literature reviews and technical investigations.

Compliance Monitoring

Agents retrieve regulatory requirements, policy documents, and compliance guidelines to ensure operations align with current standards and identify potential violations.

Documentation Search

Agents help users navigate large documentation sets, finding specific procedures, API references, configuration guides, and best practices from extensive technical libraries.

Knowledge Synthesis

Agents combine information from multiple sources to create comprehensive reports, answer complex questions, and provide multi-faceted analysis on topics.

Advanced Features

Filtering Capabilities

Date range filtering for temporal relevance
Tag-based categorization and filtering
Metadata-based selection criteria
Document type restrictions
Language-specific searches

Context Management

Automatic chunk sizing for LLM context windows
Related chunk retrieval for comprehensive context
Hierarchical document navigation
Section-aware searching

Quality Controls

Relevance score thresholds
Source verification and authority scoring
Duplicate content handling
Version control and tracking

Pricing

Search Operations

Cost: $0.0015 per search

Storage & Indexing

Document Storage: $0.10/GB/day
Indexing: $0.20 per million tokens
Re-indexing: Same as initial indexing

Cost Optimization

Cache frequently searched queries
Use appropriate search types for queries
Batch similar searches together
Implement relevance thresholds

Limitations

Max Query Length: 500 characters
Max Results: 100 per search
Document Size: 10MB per document
Index Size: 10GB per knowledge base
Concurrent Searches: 100 per second

Prerequisites

Before using Knowledge Retrieval:

Create Knowledge Base: Set up named knowledge base with configuration
Index Documents: Add and process documents for searching
Wait for Indexing: Processing completes in seconds to minutes
Start Searching: Query your indexed content

Best Practices

Document Preparation

Clean and structure documents before indexing
Add meaningful metadata and tags
Split large documents into logical sections
Maintain consistent formatting

Query Optimization

Use specific terms when available
Provide context in natural language queries
Leverage filters to narrow results
Combine search types appropriately

Knowledge Base Organization

Create separate bases for different domains
Regularly update and maintain content
Remove outdated information
Monitor search performance and adjust

Next Steps

Web Fetch - Import web content for indexing
Web Search - Search the open web
Virtual Machine - Process documents before indexing

Platform Features

All Capabilities - Explore other built-in tools
MCP Gateway - Configure your gateway
Usage & Limits - Understand platform constraints

Knowledge Retrieval ​

Overview ​

Key Features ​

How to Enable ​

Supported Input Formats ​

What AI Agents Can Do ​

Information Retrieval ​

Semantic Understanding ​

Document Analysis ​

Knowledge Management ​

Search Types ​

Keyword Search (BM25) ​

Semantic Search (Vector) ​

Hybrid Search (Default) ​

Knowledge Base Management ​

Creating Knowledge Bases ​

Document Indexing ​

Metadata Enrichment ​

Capability Behavior ​

Automatic Features ​

Search Optimization ​

Performance Characteristics ​

Typical Applications ​

Customer Support Systems ​

Research Assistants ​

Compliance Monitoring ​

Documentation Search ​

Knowledge Synthesis ​

Advanced Features ​

Filtering Capabilities ​

Context Management ​

Quality Controls ​

Pricing ​

Search Operations ​

Storage & Indexing ​

Cost Optimization ​

Limitations ​

Prerequisites ​

Best Practices ​

Document Preparation ​

Query Optimization ​

Knowledge Base Organization ​

Next Steps ​

Related Capabilities ​

Platform Features ​