Hardware Requirements
Chatty AI hardware requirements vary based on deployment size and usage patterns. Choose the configuration that matches your needs.
Quick Reference
| Configuration | CPU | RAM | Storage | Users | Use Case |
|---|---|---|---|---|---|
| Minimum | 4 cores | 16 GB | 100 GB SSD | 1-5 | Development, testing, POC |
| Recommended | 8 cores | 32 GB | 500 GB NVMe | 10-50 | Production, standard workload |
| High Load | 16+ cores | 64+ GB | 2+ TB NVMe | 50-100+ | Enterprise, heavy workload |
Minimum Configuration
Suitable for: Development, testing, proof of concept, small teams (1-5 users)
⚠️ Not recommended for production deployments
CPU
- 4 cores minimum
- x86_64 or ARM64
- 2.0 GHz or higher
Memory
- 16 GB RAM minimum
- Memory allocation:
- PostgreSQL: 2 GB
- Qdrant: 2-4 GB
- Chatty AI: 4-8 GB
- Databases services: 2-4 GB
- n8n: 1-2 GB
- System: 2-4 GB
Storage
- 100 GB SSD minimum
- Breakdown:
- OS and Docker: 20 GB
- Images: 15-20 GB
- PostgreSQL: 10-20 GB
- Qdrant: 10-30 GB
- Application: 10-20 GB
- Logs: 5-10 GB
- Backups: 20-30 GB
Performance
- Response times: 2-5 seconds
- Concurrent users: 1-3
- Documents: < 1,000
Recommended Configuration (Production)
Suitable for: Production deployments, medium teams (10-50 users), standard workloads
✅ Recommended for most production deployments
CPU
- 8 cores recommended
- Modern x86_64 (Intel Xeon, AMD EPYC) or ARM64
- 2.5 GHz or higher
- Hyper-threading enabled
Memory
- 32 GB RAM recommended
- Memory allocation:
- PostgreSQL: 4 GB
- Qdrant: 4-8 GB
- Chatty AI: 8-12 GB
- Databases services: 4-6 GB
- n8n: 2-4 GB
- Nginx: 1 GB
- System: 4-6 GB
Storage
- 500 GB NVMe SSD recommended
- Breakdown:
- OS and Docker: 30 GB
- Images: 20-30 GB
- PostgreSQL: 50-100 GB
- Qdrant: 100-200 GB
- Application: 50-100 GB
- Logs: 20-30 GB
- Backups: 100-150 GB
Storage Type:
- NVMe SSD: Strongly recommended for Qdrant performance
- RAID 10: Optional for redundancy
- Separate Volumes: Consider for data and backups
Network
- Bandwidth: 1 Gbps
- Latency: < 50ms to LLM endpoints
- Redundancy: Optional backup connections
Performance
- Response times: 1-2 seconds
- Concurrent users: 10-30
- Documents: 5,000-10,000
- Query throughput: Medium-high
High Load Configuration (Enterprise)
Suitable for: Large enterprises (50-100+ users), heavy workloads, extensive document collections
CPU
- 16+ cores for high load
- Enterprise processors (Intel Xeon Scalable, AMD EPYC)
- 3.0 GHz or higher
- Multiple sockets supported
Memory
- 64+ GB RAM for high load
- Memory allocation:
- PostgreSQL: 8-16 GB
- Qdrant: 16-32 GB
- Chatty AI: 16-24 GB
- Databases services: 8-12 GB
- n8n: 4-8 GB
- Nginx: 2 GB
- System: 8-12 GB
Consider 128 GB for very large deployments
Storage
- 2+ TB NVMe SSD for high load
- Breakdown:
- OS and Docker: 50 GB
- Images: 30-50 GB
- PostgreSQL: 200-500 GB
- Qdrant: 500 GB - 1 TB
- Application: 200-400 GB
- Logs: 50-100 GB
- Backups: 500 GB - 1 TB
Storage Architecture:
- NVMe SSD: Required
- RAID 10: Recommended
- Separate Volumes:
- OS and applications
- PostgreSQL data
- Qdrant data
- Backups (can be slower storage)
Network
- Bandwidth: 10 Gbps
- Latency: < 20ms to LLM endpoints
- Redundancy: Required backup connections
- Internal: 10 Gbps for storage access
Performance
- Response times: < 1 second
- Concurrent users: 50-100+
- Documents: 10,000-100,000+
- Query throughput: High
- Uptime: 99.9%+
Storage Performance Requirements
Critical: Qdrant Performance
Qdrant performance heavily depends on storage speed:
- NVMe SSD: Required for production
- Latency: < 1ms
- IOPS: > 10,000
Impact of slow storage:
- ❌ Slow vector search
- ❌ Degraded RAG performance
- ❌ Timeout errors
Important: PostgreSQL Performance
- SSD: Minimum (NVMe preferred)
- IOPS: Adequate for transaction load
General Storage
Other services can use standard SSD.
Resource Limits (Docker)
Configured in docker-compose.yaml:
Qdrant
- Memory Limit: 4 GB
- Memory Reservation: 2 GB
- CPUs: 1 core
Chatty AI
- Memory Limit: 8 GB
- CPUs: 1 core
- Node.js Heap: 8192 MB (configurable via
NODE_MAX_MEMORY)
Scaling Indicators
When to Upgrade
Upgrade from Minimum → Recommended:
- Moving to production
- More than 5 users
- Document collection growing
Upgrade from Recommended → High Load:
- CPU usage consistently > 70%
- Memory usage > 80%
- Disk I/O bottlenecks
- More than 50 concurrent users
- Document collection > 10,000
- Response times degrading
High Availability (HA) Considerations
For production and enterprise deployments:
Infrastructure
- Redundant power supplies
- RAID 10 storage
- Backup network connections
- Load balancers (for Nginx)
Monitoring
- Real-time performance monitoring
- Automated alerting
- Capacity planning
- Predictive analytics
Backup
- Automated daily backups
- Off-site backup storage
- Regular restore testing
- Point-in-time recovery
Platform Support
Operating Systems
- Linux: Ubuntu 22.04 LTS (recommended), Debian, RHEL, Rocky Linux
- Architecture: x86_64 (amd64) or ARM64
Docker Requirements
- Docker Engine 20.10+
- Docker Compose 2.0+
See Software Requirements for details.
Cost Considerations
Minimum Configuration
- Server: $2,000-$5,000
- Suitable for: Testing, POC
Recommended Configuration
- Server: $5,000-$15,000
- Storage: $1,000-$3,000
- Network: $500-$2,000
- Total: ~$6,500-$20,000
High Load Configuration
- Server: $10,000-$50,000+
- Storage: $5,000-$20,000+
- Network: $2,000-$10,000+
- Redundancy/HA: 2x base costs
- Total: ~$34,000-$160,000+
Plus ongoing costs: power, cooling, maintenance.
Monitoring Resources
Monitor these metrics to ensure adequate resources:
CPU
- Average utilization < 60%
- Peak utilization < 85%
- Per-container CPU usage
Memory
- Average utilization < 70%
- Swap usage minimal
- Per-container memory usage
Storage
- Disk utilization < 80%
- I/O wait times
- Read/write throughput
- IOPS
Network
- Bandwidth utilization
- Latency to external services
- Packet loss
- Connection counts
Recommendations by Use Case
| Use Case | Configuration | Notes |
|---|---|---|
| Development | Minimum | Local testing only |
| POC/Demo | Minimum | Short-term evaluation |
| Small Team (< 10) | Recommended | Standard production |
| Medium Team (10-50) | Recommended | Standard production |
| Large Team (50-100) | High Load | Enterprise deployment |
| Enterprise (100+) | High Load | Consider clustering |
| 24/7 Critical | High Load + HA | Mission-critical |
Next Steps
- Choose configuration based on your use case
- Review Software Requirements
- Prepare infrastructure using Prerequisites Checklist