Hardware Requirements

Chatty AI hardware requirements vary based on deployment size and usage patterns. Choose the configuration that matches your needs.

Quick Reference

Configuration	CPU	RAM	Storage	Users	Use Case
Minimum	4 cores	16 GB	100 GB SSD	1-5	Development, testing, POC
Recommended	8 cores	32 GB	500 GB NVMe	10-50	Production, standard workload
High Load	16+ cores	64+ GB	2+ TB NVMe	50-100+	Enterprise, heavy workload

Minimum Configuration

Suitable for: Development, testing, proof of concept, small teams (1-5 users)

⚠️ Not recommended for production deployments

CPU

4 cores minimum
x86_64 or ARM64
2.0 GHz or higher

Memory

16 GB RAM minimum
Memory allocation:
- PostgreSQL: 2 GB
- Qdrant: 2-4 GB
- Chatty AI: 4-8 GB
- Databases services: 2-4 GB
- n8n: 1-2 GB
- System: 2-4 GB

Storage

100 GB SSD minimum
Breakdown:
- OS and Docker: 20 GB
- Images: 15-20 GB
- PostgreSQL: 10-20 GB
- Qdrant: 10-30 GB
- Application: 10-20 GB
- Logs: 5-10 GB
- Backups: 20-30 GB

Performance

Response times: 2-5 seconds
Concurrent users: 1-3
Documents: < 1,000

Recommended Configuration (Production)

Suitable for: Production deployments, medium teams (10-50 users), standard workloads

✅ Recommended for most production deployments

CPU

8 cores recommended
Modern x86_64 (Intel Xeon, AMD EPYC) or ARM64
2.5 GHz or higher
Hyper-threading enabled

Memory

32 GB RAM recommended
Memory allocation:
- PostgreSQL: 4 GB
- Qdrant: 4-8 GB
- Chatty AI: 8-12 GB
- Databases services: 4-6 GB
- n8n: 2-4 GB
- Nginx: 1 GB
- System: 4-6 GB

Storage

500 GB NVMe SSD recommended
Breakdown:
- OS and Docker: 30 GB
- Images: 20-30 GB
- PostgreSQL: 50-100 GB
- Qdrant: 100-200 GB
- Application: 50-100 GB
- Logs: 20-30 GB
- Backups: 100-150 GB

Storage Type:

NVMe SSD: Strongly recommended for Qdrant performance
RAID 10: Optional for redundancy
Separate Volumes: Consider for data and backups

Network

Bandwidth: 1 Gbps
Latency: < 50ms to LLM endpoints
Redundancy: Optional backup connections

Performance

Response times: 1-2 seconds
Concurrent users: 10-30
Documents: 5,000-10,000
Query throughput: Medium-high

High Load Configuration (Enterprise)

Suitable for: Large enterprises (50-100+ users), heavy workloads, extensive document collections

CPU

16+ cores for high load
Enterprise processors (Intel Xeon Scalable, AMD EPYC)
3.0 GHz or higher
Multiple sockets supported

Memory

64+ GB RAM for high load
Memory allocation:
- PostgreSQL: 8-16 GB
- Qdrant: 16-32 GB
- Chatty AI: 16-24 GB
- Databases services: 8-12 GB
- n8n: 4-8 GB
- Nginx: 2 GB
- System: 8-12 GB

Consider 128 GB for very large deployments

Storage

2+ TB NVMe SSD for high load
Breakdown:
- OS and Docker: 50 GB
- Images: 30-50 GB
- PostgreSQL: 200-500 GB
- Qdrant: 500 GB - 1 TB
- Application: 200-400 GB
- Logs: 50-100 GB
- Backups: 500 GB - 1 TB

Storage Architecture:

NVMe SSD: Required
RAID 10: Recommended
Separate Volumes:
- OS and applications
- PostgreSQL data
- Qdrant data
- Backups (can be slower storage)

Network

Bandwidth: 10 Gbps
Latency: < 20ms to LLM endpoints
Redundancy: Required backup connections
Internal: 10 Gbps for storage access

Performance

Response times: < 1 second
Concurrent users: 50-100+
Documents: 10,000-100,000+
Query throughput: High
Uptime: 99.9%+

Storage Performance Requirements

Critical: Qdrant Performance

Qdrant performance heavily depends on storage speed:

NVMe SSD: Required for production
Latency: < 1ms
IOPS: > 10,000

Impact of slow storage:

❌ Slow vector search
❌ Degraded RAG performance
❌ Timeout errors

Important: PostgreSQL Performance

SSD: Minimum (NVMe preferred)
IOPS: Adequate for transaction load

General Storage

Other services can use standard SSD.

Resource Limits (Docker)

Configured in docker-compose.yaml:

Qdrant

Memory Limit: 4 GB
Memory Reservation: 2 GB
CPUs: 1 core

Chatty AI

Memory Limit: 8 GB
CPUs: 1 core
Node.js Heap: 8192 MB (configurable via NODE_MAX_MEMORY)

Scaling Indicators

When to Upgrade

Upgrade from Minimum → Recommended:

Moving to production
More than 5 users
Document collection growing

Upgrade from Recommended → High Load:

CPU usage consistently > 70%
Memory usage > 80%
Disk I/O bottlenecks
More than 50 concurrent users
Document collection > 10,000
Response times degrading

High Availability (HA) Considerations

For production and enterprise deployments:

Infrastructure

Redundant power supplies
RAID 10 storage
Backup network connections
Load balancers (for Nginx)

Monitoring

Real-time performance monitoring
Automated alerting
Capacity planning
Predictive analytics

Backup

Automated daily backups
Off-site backup storage
Regular restore testing
Point-in-time recovery

Platform Support

Operating Systems

Linux: Ubuntu 22.04 LTS (recommended), Debian, RHEL, Rocky Linux
Architecture: x86_64 (amd64) or ARM64

Docker Requirements

Docker Engine 20.10+
Docker Compose 2.0+

See Software Requirements for details.

Cost Considerations

Minimum Configuration

Server: $2,000-$5,000
Suitable for: Testing, POC

Recommended Configuration

Server: $5,000-$15,000
Storage: $1,000-$3,000
Network: $500-$2,000
Total: ~$6,500-$20,000

High Load Configuration

Server: $10,000-$50,000+
Storage: $5,000-$20,000+
Network: $2,000-$10,000+
Redundancy/HA: 2x base costs
Total: ~$34,000-$160,000+

Plus ongoing costs: power, cooling, maintenance.

Monitoring Resources

Monitor these metrics to ensure adequate resources:

CPU

Average utilization < 60%
Peak utilization < 85%
Per-container CPU usage

Memory

Average utilization < 70%
Swap usage minimal
Per-container memory usage

Storage

Disk utilization < 80%
I/O wait times
Read/write throughput
IOPS

Network

Bandwidth utilization
Latency to external services
Packet loss
Connection counts

Recommendations by Use Case

Use Case	Configuration	Notes
Development	Minimum	Local testing only
POC/Demo	Minimum	Short-term evaluation
Small Team (< 10)	Recommended	Standard production
Medium Team (10-50)	Recommended	Standard production
Large Team (50-100)	High Load	Enterprise deployment
Enterprise (100+)	High Load	Consider clustering
24/7 Critical	High Load + HA	Mission-critical

Next Steps

Choose configuration based on your use case
Review Software Requirements
Prepare infrastructure using Prerequisites Checklist

Quick Reference​

Minimum Configuration​

CPU​

Memory​

Storage​

Performance​

Recommended Configuration (Production)​

CPU​

Memory​

Storage​

Network​

Performance​

High Load Configuration (Enterprise)​

CPU​

Memory​

Storage​

Network​

Performance​

Storage Performance Requirements​

Critical: Qdrant Performance​

Important: PostgreSQL Performance​

General Storage​

Resource Limits (Docker)​

Qdrant​

Chatty AI​

Scaling Indicators​

When to Upgrade​

High Availability (HA) Considerations​

Infrastructure​

Monitoring​

Backup​

Platform Support​

Operating Systems​

Docker Requirements​

Cost Considerations​

Minimum Configuration​

Recommended Configuration​

High Load Configuration​

Monitoring Resources​

CPU​

Memory​

Storage​

Network​

Recommendations by Use Case​

Next Steps​

Quick Reference

Minimum Configuration

CPU

Memory

Storage

Performance

Recommended Configuration (Production)

CPU

Memory

Storage

Network

Performance

High Load Configuration (Enterprise)

CPU

Memory

Storage

Network

Performance

Storage Performance Requirements

Critical: Qdrant Performance

Important: PostgreSQL Performance

General Storage

Resource Limits (Docker)

Qdrant

Chatty AI

Scaling Indicators

When to Upgrade

High Availability (HA) Considerations

Infrastructure

Monitoring

Backup

Platform Support

Operating Systems

Docker Requirements

Cost Considerations

Minimum Configuration

Recommended Configuration

High Load Configuration

Monitoring Resources

CPU

Memory

Storage

Network

Recommendations by Use Case

Next Steps