Memory Tiers
Working Memory
Current conversation buffer. Keeps recent messages in context.
- Default: 8,000 tokens
- Contains: Last N messages
- Cost: Included in each request
Summary Memory
When working memory fills, a summary of the key points is created.
- Default: 5 messages
- Triggered when: 5+ messages in working memory
- Contains: Condensed history
Archive
Older summaries are stored separately.
- Default: Keep all
- Accessed when: Needed for context
- Cost: Additional retrieval tokens
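The way the three tiers hand off to each other can be sketched as a toy model. This is an illustration only, not the product's implementation: the class name `TieredMemory` is hypothetical, the summary text is a placeholder for the LLM-generated summary, and it assumes the working buffer is emptied after each summary and that the previous summary moves to the archive.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TieredMemory:
    """Toy model of the three tiers (hypothetical, for illustration only)."""

    summary_message_threshold: int = 5  # default stated above
    working: List[str] = field(default_factory=list)
    summary: Optional[str] = None
    archive: List[str] = field(default_factory=list)

    def add_message(self, message: str) -> None:
        """Append to working memory and summarize once the threshold is hit."""
        self.working.append(message)
        if len(self.working) >= self.summary_message_threshold:
            self._summarize()

    def _summarize(self) -> None:
        # Assumption: the previous summary is moved to the archive.
        if self.summary is not None:
            self.archive.append(self.summary)
        # Placeholder for the LLM-generated summary.
        self.summary = f"summary of {len(self.working)} messages"
        # Assumption: the working buffer starts over after a summary.
        self.working.clear()


memory = TieredMemory()
for i in range(12):
    memory.add_message(f"message {i}")

print(len(memory.working), memory.summary, len(memory.archive))
# 2 summary of 5 messages 1
```

After 12 messages with the default threshold, two summaries have been created: the first sits in the archive, the second is the current summary, and two messages remain in working memory.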
Configuration
Adjust Working Memory
Smaller (4,000 tokens):
- Pros: Lower cost, faster responses
- Cons: Forgets details quickly
Larger:
- Pros: Better context, fewer summaries
- Cons: Higher cost, slower
Default (8,000 tokens):
- Good balance for most uses
To change the setting:
- Agents > Select agent
- Settings > Memory
- Set working_memory_size
- Click Save
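Because working memory is included in each request, its size sets a ceiling on how many context tokens a conversation can send. The following is a rough back-of-the-envelope sketch, assuming the buffer is full on every request (a worst case); the function name `max_context_tokens` and the figure of 20 requests are illustrative, not taken from the product.

```python
def max_context_tokens(working_memory_size: int, requests: int) -> int:
    """Upper bound on working-memory tokens sent across a conversation.

    Assumes the buffer is completely full on every request.
    """
    return working_memory_size * requests


for size in (4_000, 8_000):
    print(size, max_context_tokens(size, requests=20))
# 4000 80000
# 8000 160000
```

Halving working_memory_size halves this ceiling, which is where the lower cost of the smaller setting comes from.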
Summary Threshold
When working memory reaches N messages, a summary is created.
- Lower (3): Summarize more often (cheaper)
- Higher (10): Summarize less often (better context)
- Default (5): Good balance
To change the setting:
- Settings > Memory
- Set summary_message_threshold
- Click Save
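To get a feel for how the threshold changes summarization frequency, the number of summaries over a conversation can be estimated with integer division. This sketch assumes the message count resets after each summary, which the settings above imply but do not state outright; `summaries_created` is a hypothetical helper.

```python
def summaries_created(total_messages: int, threshold: int) -> int:
    """Estimate how many summaries a conversation triggers.

    Assumes one summary per `threshold` messages, with the count
    resetting after each summary.
    """
    if threshold <= 0:
        raise ValueError("threshold must be a positive number of messages")
    return total_messages // threshold


for threshold in (3, 5, 10):
    print(threshold, summaries_created(30, threshold))
# 3 10
# 5 6
# 10 3
```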
Memory Management
View Memory Usage
- Agents > Select agent
- Analytics tab
- See memory tokens used
Clear Memory
- Open conversation
- Click Clear History
- Confirm
Export Memory
- Open conversation
- Click Export
- Download as JSON/PDF
Best Practices
Balance Cost and Quality
| Use Case | Working Memory (tokens) | Summary Threshold (messages) |
|---|---|---|
| Quick Q&A | 4,000 | 10 |
| Research | 12,000 | 5 |
| Support | 8,000 | 5 |
| Code | 16,000 | 3 |
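If you track these recommendations outside the UI, the table above can be kept as a small lookup. This is a sketch only: the dictionary keys and the `preset_for` helper are hypothetical, and it assumes the two table columns correspond to the working_memory_size and summary_message_threshold settings.

```python
# Values copied from the table above; key names are illustrative.
PRESETS = {
    "quick_qa": {"working_memory_size": 4_000, "summary_message_threshold": 10},
    "research": {"working_memory_size": 12_000, "summary_message_threshold": 5},
    "support": {"working_memory_size": 8_000, "summary_message_threshold": 5},
    "code": {"working_memory_size": 16_000, "summary_message_threshold": 3},
}


def preset_for(use_case: str) -> dict:
    """Return a copy of the recommended settings for a use case."""
    try:
        return dict(PRESETS[use_case])
    except KeyError:
        known = ", ".join(sorted(PRESETS))
        raise ValueError(f"unknown use case {use_case!r}; expected one of: {known}")


print(preset_for("support"))
# {'working_memory_size': 8000, 'summary_message_threshold': 5}
```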
Monitor Memory Growth
Check usage dashboard:
- How much memory used per conversation?
- Trending up or stable?
- Adjust if needed
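One simple way to answer "trending up or stable?" is to compare the average of recent conversations against earlier ones. The sketch below assumes you have copied per-conversation memory token counts from the dashboard by hand; the sample numbers are made up, and the `memory_trend` helper and its 10% tolerance are arbitrary illustrative choices.

```python
from statistics import mean
from typing import Sequence


def memory_trend(tokens_per_conversation: Sequence[int], tolerance: float = 0.10) -> str:
    """Classify usage as 'up', 'down', or 'stable'.

    Compares the mean of the most recent half of the values with the
    mean of the earliest half; changes within `tolerance` count as stable.
    """
    if len(tokens_per_conversation) < 4:
        raise ValueError("need at least 4 conversations to compare halves")
    half = len(tokens_per_conversation) // 2
    earlier = mean(tokens_per_conversation[:half])
    recent = mean(tokens_per_conversation[-half:])
    if earlier == 0:
        return "up" if recent > 0 else "stable"
    change = (recent - earlier) / earlier
    if change > tolerance:
        return "up"
    if change < -tolerance:
        return "down"
    return "stable"


# Made-up sample values, oldest conversation first.
print(memory_trend([5200, 5400, 5100, 6900, 7400, 7800]))
# up
```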
Archive Old Conversations
Periodically clear old conversations to reduce costs.
Memory Limitations
- Working memory is conversation-scoped (not shared between agents)
- Summaries are automatic (can’t manually create)
- Archives are read-only (can’t modify)
- Memory persists even after agent updates
Troubleshooting
Agent forgets context
- Increase working_memory_size
- Increase summary_message_threshold
- Ask user to provide context again
Memory costs too high
- Reduce working_memory_size
- Lower summary_message_threshold (summarize more often)
- Clear old conversations
Incorrect summaries
- Summaries are LLM-generated (won’t be perfect)
- Verify important details in conversation
- Consider manually archiving if critical