Infrastructure Configuration
Configure deployment options and infrastructure settings for your RAG agent
Your Package
Currently on Enterprise. Explore different packages and deployment options to see features and pricing.
On-Premise Configuration
Deploy a virtual machine configured with a lightweight, asynchronous task processing stack. The VM will run an Orchestrator Module for asynchronous task processing and Cron Jobs for periodic data source updates.
Deployment Tiers
                                    
                                    Starter (< 100 GB)
                                
                                - CPU4 vCPUs
- Memory16 GB RAM
- Storage250 GB SSD
- Requests/Day< 100
                                    
                                    Production (100 GB - 1 TB)
                                
                                - CPU8 vCPUs
- Memory32 GB RAM
- Storage500 GB SSD
- Requests/Day100-1K
Alternative option for smaller scale deployments
                                
                                    
                                    Enterprise (1+ TB)
                                
                                - CPU16 vCPUs
- Memory64 GB RAM
- Storage1 TB NVMe
- Requests/Day1K+
                                    
                                    Current usage: 847 GB of 1.2 TB. Growing steadily!
                                
                            Architecture & Components
                                    
                                    Basic Application Layer
                                
                                - API GatewayNginx + FastAPI
- Load BalancerHAProxy
- Web FrameworkFastAPI/Flask
- AuthenticationBasic OAuth
                                    
                                    Advanced AI/ML Stack
                                
                                - LLM RuntimevLLM/TGI
- Embedding ModelSentenceTransformers
- Vector DBQdrant (Active)
- GPU SupportCUDA 12.0+
                                    
                                    Enterprise Data Layer
                                
                                - Primary DBPostgreSQL 15
- CacheRedis Cluster
- SearchElasticsearch
- File StorageMinIO/S3
                                    
                                    Advanced Processing & Queue
                                
                                - Task QueueCelery + Redis
- Message BrokerRabbitMQ
- SchedulerAirflow/Cron
- ETL PipelineApache Spark
Security & Compliance
                                    
                                    Basic Network Security
                                
                                - Firewalliptables
- VPNWireGuard
- SSL/TLSLet's Encrypt
- DDoS ProtectionBasic
                                    
                                    Advanced Data Protection
                                
                                - Encryption at RestAES-256
- Encryption in TransitTLS 1.3
- Key ManagementHashiCorp Vault
- Backup EncryptionGPG/Age
                                    
                                    Enterprise Access Control
                                
                                - RBACKeycloak
- MFATOTP/WebAuthn
- SSOSAML/OIDC
- Audit LogsELK Stack
                                    
                                    Enterprise Compliance
                                
                                - GDPRData Residency
- SOC 2Type II Ready
- HIPAABAA Available
- ISO 27001Framework
Monitoring & Observability
                                    
                                    Basic Metrics & Monitoring
                                
                                - MetricsPrometheus
- VisualizationGrafana
- AlertingAlertManager
- UptimeBasic monitoring
                                    
                                    Advanced Logging & Tracing
                                
                                - Log AggregationLoki (Active)
- Distributed TracingJaeger
- APMOpenTelemetry
- Error TrackingSentry
                                    
                                    Enterprise Monitoring & SLA
                                
                                - SLA Monitoring99.99% uptime
- Custom DashboardsExecutive reporting
- Predictive AnalyticsML-based alerts
- 24/7 SupportDedicated team
Disaster Recovery & Backup
                                    
                                    Basic Backup Strategy
                                
                                - DatabaseDaily + WAL (Active)
- Vector DataWeekly Snapshots
- File StorageIncremental
- Retention30 days + Archives
                                    
                                    Advanced Recovery Options
                                
                                - RTO< 4 hours
- RPO< 1 hour
- Hot StandbyOptional
- Geo-ReplicationAvailable
                                    
                                    Enterprise DR & Business Continuity
                                
                                - RTO< 1 hour
- RPO< 15 minutes
- Multi-SiteActive-Active
- SLA99.99% uptime guarantee
Total Cost of Ownership
                                    
                                    Hardware (3-year)
                                
                                - Starter$10K (Current)
- Production$15K - $25K
- Enterprise$35K - $60K
- GPU Add-on+$10K - $50K
                                    
                                    Implementation
                                
                                - Setup & Config$8K (Completed)
- Security Hardening$5K (Completed)
- Integration$3K (Completed)
- Training$3K (Completed)
                                    
                                    Enterprise Annual Operations
                                
                                - Maintenance$6K - $15K
- Support$3K - $12K
- Utilities$2K - $8K
- Licenses$1K - $5K
Consider Cloud Migration
Based on your current usage (8.4 GB), cloud deployment could reduce costs by 40-60% while providing better scalability and reliability.
                                    
                                    Save ~$600/month vs Production on-prem
                                
                                
                                    
                                    Auto-scaling for traffic spikes
                                
                                
                                    
                                    Enterprise-grade security included