How Vector Search Works
- Document → Vector - Each doc converted to math vector
- Query → Vector - User query converted to vector
- Similarity - Find vectors closest to query vector
- Return - Top matches returned as results
Semantic vs Keyword
Keyword Search
Semantic (Vector) Search
Embedding Models
Models determine vector quality:| Model | Dimensions | Speed |
|---|---|---|
| Small | 1536 | Fast |
| Large | 3072 | Slower |
| Custom | Variable | Custom |
- Accuracy needed
- Speed requirements
Search Parameters
Top K
Number of results to return (default: 5).- Lower: Faster, less context
- Higher: More context, slower
Similarity Threshold
Minimum relevance score (default: 0.7).- Lower (0.5): More permissive
- Higher (0.9): More strict
Reranking
LLM refines results (optional, +cost).Search Quality
Good search results depend on:- Document quality - Clear, well-formatted docs
- Embedding model - Better model = better vectors
- Query specificity - More specific = better matches
- Threshold tuning - Set appropriate threshold
Improving Search
Better Documents
- Use clear headers
- Consistent formatting
- Complete sentences
- Relevant content
Better Queries
- Be specific
- Use complete sentences
- Include context
- Ask about meaning, not keywords
Tune Parameters
- Increase top_k if missing results
- Lower threshold if too strict
- Enable reranking for refinement
Search Examples
Good Queries
✓ “How do I reset my password?” ✓ “What’s our refund policy for software?” ✓ “Which API endpoint for user authentication?”Bad Queries
✗ “password” ✗ “refund” ✗ “API”Batch Search
Search multiple queries:Search Monitoring
Track search performance:- Knowledge > Select KB
- Analytics tab
- See:
- Query count
- Average results
- Performance metrics