Infrastructure Requests
Capacity Planning
GitHub Runners
EKS Clusters
Storage
Cost Optimization
GitHub Self-Hosted Runners
prod-runner-vm-01
On-premise VM • Ubuntu 22.04 • 192.168.1.101
staging-runner-vm-01
On-premise VM • Ubuntu 22.04 • 192.168.1.102
ml-runner-gpu-01
Physical server • NVIDIA RTX 4090 • 192.168.1.150
dev-runner-pool
Auto-scaling VMs • Proxmox cluster • 192.168.1.200-205
On-Premise Clusters
development-cluster
On-premise • Kubernetes • v1.28
On-Premise Monitoring
Prometheus
Kubernetes monitoring • Time series DB
Grafana
Visualization • Dashboards • Alerting
ELK Stack
Elasticsearch • Logstash • Kibana
AWS EKS Clusters
production-eks-cluster
AWS EKS • us-east-1 • v1.28
staging-eks-cluster
AWS EKS • us-west-2 • v1.28
ml-training-cluster
AWS EKS • GPU nodes • v1.28
AWS Services
Elastic Container Registry
Container image storage • 47 repositories
RDS Database Instances
PostgreSQL • Multi-AZ • Encrypted
S3 Storage Buckets
Object storage • Versioning • Lifecycle
Load Balancers
Application & Network LBs • SSL termination
CloudFront CDN
Global content delivery • Edge locations
VPC & Networking
Private networks • Subnets • Security groups
Monitoring & Observability
CloudWatch
AWS native monitoring • Logs • Metrics
Prometheus
Kubernetes monitoring • Time series DB
Grafana
Visualization • Dashboards • Alerting
ELK Stack
Elasticsearch • Logstash • Kibana
AWS CloudWatch
CloudWatch Metrics
AWS native monitoring • Custom metrics
CloudWatch Logs
Centralized logging • Log insights
X-Ray Tracing
Distributed tracing • Performance analysis