Remote - Senior Site Reliability Engineer
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT
Requirements
• Senior-level SRE experience: demonstrated experience in a Site Reliability Engineering, DevOps Engineering, or SysOps role. You have stood up and operated production systems at scale. • Kubernetes and AWS: deep, hands-on experience running Kubernetes in production. Solid AWS fundamentals across compute, networking, storage, and managed services. • Kubernetes and AWS: • Infrastructure-as-code: Proficiency with Terraform or similar IaC tools. You write code to define infrastructure; you don't click buttons in the console. • Infrastructure-as-code: • CI/CD and deployment automation: real experience setting up and operating GitLab, GitHub Actions, Jenkins, or similar. You understand deployment strategies, rollback mechanisms, and safety nets. • CI/CD and deployment automation: • Scripting and systems knowledge: strong bash scripting. Comfortable debugging system-level issues, reading logs, and understanding Linux kernel basics. • Scripting and systems knowledge: • Great communication: you explain complex infrastructure decisions clearly to both engineers and non-technical stakeholders. You write clear runbooks and documentation. • Great communication: • Experience with 1+ backend programming language (Elixir, Python, Go, Java, Node.js, etc.). • Experience in consultancy settings. • Container registry and artifact management (ECR, Docker Hub, etc.). • Observability stack depth (Datadog, Prometheus, ELK, Grafana, or similar). • Experience working with or scaling multi-tenant platforms. • Practicals • Practicals • You'll report to: Engineering Manager • You'll report to: • Team: Engineering • Team: • Location: Anywhere in the World • Location: • Start date: As soon as possible • Start date: • Application process • Application process • Interview with recruiter • Interview with hiring manager • Infrastructure Deep Dive • Bar raiser interview • Offer + Background Check (Veremark & Remote) • Remote's Total Rewards philosophy is to ensure fair, unbiased compensation and fair equity pay along with competitive benefits in all locations in which we operate. We do not agree to or encourage cheap-labor practices and therefore we ensure to pay above in-location rates. We hope to inspire other companies to support global talent-hiring and bring local wealth to developing countries. • At first glance our salary bands seem quite wide - here is some context. At Remote we have international operations and a globally distributed workforce. We use geo ranges to consider geographic pay differentials as part of our global compensation strategy to remain competitive in various markets while we hiring globally.Our salary ranges are determined by role, level and location, and our job titles may span more than one career level. The actual base pay for the successful candidate in this role is dependent upon many factors such as location, transferable or job-related skills, work experience, relevant training, business needs, and market demands. The base salary range may be subject to change.At Remote, we foster internal mobility as a key element of our culture of employee growth and development, supported by a compensation philosophy that guarantees pay equity and fairness. Therefore, all compensation changes associated with an internal move will be reviewed by the Total Rewards & People Enablement team on a case by case basis. • The annual salary range for this full-time position is • $54,000 - $150,000 USD
Responsibilities
• Infrastructure as code at scale. Design, implement, and maintain infrastructure-as-code patterns using Terraform and Kubernetes that support both standard connectors and custom builds. Make it easy for engineers to deploy and operate with confidence. • Infrastructure as code at scale. • Observability and incident response. Build and maintain comprehensive monitoring, logging, and alerting systems. Lead incident response efforts, conduct post-mortems, and drive continuous improvement in system reliability. • Observability and incident response. • Security and compliance in motion. Work with our Security team to embed security into every layer of Build infrastructure. Ensure we meet compliance requirements across 100+ jurisdictions without creating friction for developers or customers. • Security and compliance in motion. • Performance and cost optimisation. Continuously optimize system performance, resource utilization, and cloud costs. Make recommendations that improve both reliability and unit economics. • Performance and cost optimisation. • Automation and operational leverage. Identify manual operational toil and systematically eliminate it. Build tools and processes that let teams operate efficiently without scaling headcount. • Automation and operational leverage. • Platform reliability and developer experience. Partner with platform teams to ensure APIs, MCP, and CLI are resilient and observable. Give infrastructure feedback that shapes how the platform evolves. • Platform reliability and developer experience.
Benefits
• Our full benefits & perks are explained in our handbook at remote.com/r/benefits. As a global company, each country works differently, but some benefits/perks are for all Remoters: • work from anywhere • flexible paid time off • flexible working hours (we are async) • 16 weeks paid parental leave • mental health support services • learning budget • budget for local in-person social events or co-working spaces • How you’ll plan your day (and life) • We work async at Remote which means you can plan your schedule around your life (and not around meetings). Read more at remote.com/async. • You will be empowered to take ownership and be proactive. When in doubt you will default to action instead of waiting. Your life-work balance is important and you will be encouraged to put yourself and your family first, and fit work around your needs. • life-work balance • If that sounds like something you want, apply now!
No credit card. Takes 10 seconds.