Back to Careers

Senior SRE/DevOps Engineer

Full-time
Engineering
Actively Hiring
Sydney, Australia US (Multiple) Bengaluru, India Gurugram, India

About the Role

We're looking for a talented Software Engineer to join our Platform team and help build the infrastructure that powers PloyD's AI Operations platform. You'll work on critical systems that enable thousands of AI models to be deployed, monitored, and scaled efficiently.

Note: This is a placeholder description. Please update with actual content from the CareerPuck listing: View on CareerPuck

What You'll Do

  • Design and maintain highly available, scalable infrastructure for AI Operations
  • Implement and manage CI/CD pipelines on GitHub Actions, GitLab CI, and BuildKite
  • Build and optimize self-hosted runner infrastructure (GitHub Actions runners, GitLab runners, BuildKite agents)
  • Develop automation tools and infrastructure as code (Terraform, Ansible)
  • Monitor system performance, reliability, and security
  • Collaborate with engineering teams on deployment strategies and GitOps workflows
  • Optimize cloud costs and resource utilization, especially for GPU workloads

What We're Looking For

Required Qualifications

  • 5+ years of SRE/DevOps experience
  • Strong proficiency in Kubernetes and container orchestration
  • Experience with cloud platforms (AWS, GCP, Azure)
  • Expertise in infrastructure as code (Terraform, Ansible)
  • Hands-on experience with CI/CD platforms (GitHub Actions, GitLab CI, BuildKite)
  • Experience managing and scaling self-hosted CI/CD runners and build agents
  • Experience with GitOps workflows (ArgoCD, Flux)
  • Strong scripting skills (Python, Bash, Go)

Preferred Qualifications

  • Experience with AI/ML infrastructure and GPU workloads
  • Knowledge of observability tools (Prometheus, Grafana, ELK)
  • Experience with autoscaling CI/CD infrastructure
  • Familiarity with Helm and Kustomize for Kubernetes deployments
  • Open source contributions to DevOps or CI/CD tools

Benefits & Perks

  • Competitive salary and equity package
  • Comprehensive health, dental, and vision insurance
  • 401(k) with company match
  • Flexible work arrangements (remote/hybrid)
  • Professional development budget
  • Unlimited PTO policy
  • Latest tech equipment

About PloyD

PloyD is building the future of AI Operations. Our platform makes it easy for enterprises to deploy, monitor, and scale AI models in production. We're a fast-growing startup backed by top-tier investors, and we're looking for talented people to join our mission.

Apply Now