SupplyHouse.com - Site Reliability Engineer
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT
Requirements
• Bachelors degree in Computer Science, Engineering, or a related field • 3+ years of hands-on experience as a Site Reliability Engineer, DevOps Engineer, Systems Engineer, or Cloud Infrastructure Engineer. Proven track record managing production-grade systems on Google Cloud Platform (GCP) or other cloud providers • Strong understanding of Linux/Unix system administration, networking, and troubleshooting. Experience implementing Infrastructure as Code (IaC) using tools like Terraform, Ansible, or Deployment Manager. Familiarity with containerization and orchestration technologies such as Docker and Kubernetes (GKE) • Experience with monitoring and observability tools (Google Cloud Operations Suite, Prometheus, Grafana, Datadog, ELK). Experience defining and monitoring SLAs, SLOs, and SLIs to ensure application uptime and performance. Proven ability to handle incident response, conduct postmortems, and drive root cause analysis • Proficiency in at least one scripting language (Python, Bash, or Go) for automation and tooling. Hands-on experience building or managing CI/CD pipelines (Jenkins, GitLab CI, Cloud Build).Strong background in configuration management and release automation • Knowledge of IAM (Identity and Access Management), network security, and cloud compliance controls. Familiarity with disaster recovery (DR), backups, and high-availability design • High-level proficiency of written and verbal communication in English • Proven ability to optimize infrastructure performance and cost, particularly within GCP (FinOps experience a plus). Background in capacity planning, load testing, and horizontal scaling of distributed systems • Certification(s) as a Google Cloud Professional Cloud DevOps Engineer (strongly preferred), Google Cloud Professional Cloud Architect or Associate Cloud Engineer, Kubernetes CKA/CKAD, etc. • Experience implementing blue-green deployments, canary rollouts, and progressive delivery strategies • Experience working cross-functionally with software development, QA, and security teams. • Ability to mentor junior engineers and establish best practices for monitoring, deployment, and incident response
Responsibilities
• High-level proficiency of written and verbal communication in English • Design, build, and maintain scalable, reliable systems on GCP (Compute Engine, GKE, Cloud Storage, Cloud SQL) • Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager • Build and maintain observability platforms (monitoring, logging, tracing) using tools such as Stackdriver (Cloud Monitoring), Prometheus, or Grafana • Manage incident response, conduct postmortems, and implement improvements to reduce recurrence • Partner with DevOps and engineering teams to enhance CI/CD pipelines for resilient deployments • Define and monitor SLAs, SLOs, and SLIs to ensure application availability and performance • Implement disaster recovery (DR) and backup strategies across cloud services • Continuously optimize performance, capacity, and cost-efficiency of GCP resources
Benefits
• Comprehensive and affordable medical, dental, vision, and life insurance options • Competitive Provident Fund contributions • Paid time off and holidays • Mental health support and wellbeing program • Company-provided equipment and one-time $250 USD work from home stipend • $750 USD annual professional development budget • Company rewards and recognition program • We empower ownership – We all contribute to our success and we all share in it. Our Ownership for All program ensures each SupplyHouse team member will benefit financially from the company’s growth and accomplishments. • We empower ownership • We promote work-life balance – We value your time and encourage a healthy separation between your professional and personal life to feel refreshed and recharged. Look out for our wellness initiatives! • We promote work-life balance • We support growth – We strive to innovate every day. In an exciting and evolving industry, we provide potential for career growth through our hands-on training, diversity and inclusion initiatives, opportunities for internal mobility, and professional development budget. • We support growth • We give back – We live and breathe our core value, Generosity, by giving back to the trades and organizations around the world. We make a difference through donation drives, employee-nominated contributions, support for DE&I organizations, and more. • We give back • We listen – We value hearing from our employees. Everyone has a voice, and we encourage you to use it! We actively elicit feedback through our monthly town halls, regular 1:1 check-ins, and company-wide ideas form to incorporate suggestions and ensure our team enjoys coming to work every day. • We listen • Check us out and learn more at https://www.supplyhouse.com/our-company! • Check us out and learn more at • https://www.supplyhouse.com/our-company • Additional Details: • Additional Details: • Remote employees are expected to work in a distraction-free environment. Personal devices, background noise, and other distractions should be kept to a minimum to avoid disrupting virtual meetings or business operations.
No credit card. Takes 10 seconds.