
Enumerate - Senior Site Reliability Engineer

Remote - LATAM · $48k - $48k · 1w ago
Remote · Senior · LATAM · Cloud Computing · Software · Site Reliability Engineer · Governance · Documentation · Kubernetes · Git · Performance Management · Jira · Jenkins · Linux · Bash · Terraform · Python · Bicep · Ansible · Change Management · Cross-functional Collaboration

Requirements

• BS or MS in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
• 6+ years of experience with container orchestration services (Kubernetes preferred).
• 6+ years of experience administering and deploying CI/CD tooling (e.g., Git, Azure DevOps, Jira, GitLab, Jenkins).
• 6+ years of experience managing scalable applications in one or more major cloud providers.
• 8+ years of significant experience with both Windows and Linux operating system environments.
• 7+ years of experience with scripting and automation using tools such as PowerShell, Bash, or Python.
• 4+ years of experience with infrastructure-as-code and orchestration platforms (e.g., Terraform, ARM/Bicep, CloudFormation, Ansible).
• Demonstrated expertise designing architectures for scalable, reliable, and secure tech stacks in distributed systems.
• Demonstrated expertise implementing workflow processes for operating and maintaining applications in distributed architectures.
• Strong experience working in agile-leaning software development environments and across varying application stacks.
• Deep understanding of best practices and IT operations in distributed, cloud-native architectures.
• Experience defining and implementing governance and guardrails around infrastructure, CI/CD, and security.
• Strong grasp of cloud cost management and optimization techniques (e.g., usage analysis, rightsizing, scaling policies).
• Excellent problem-solving, troubleshooting, and incident management skills.
• Excellent oral and written communication skills; capable of presenting complex technical concepts to technical and non-technical audiences.
• Process-oriented with strong documentation skills and attention to detail.
• Ability to translate loosely defined product or platform requirements into robust, scalable technical solutions.

Responsibilities

Architecture & Infrastructure Ownership
• Design, implement, and evolve cloud infrastructure architectures for high availability, reliability, security, and scale.
• Define and maintain reference architectures and patterns for services, applications, and environments across the organization.
• Develop workflow processes and standards for building, deploying, and maintaining applications within a distributed architecture.
• Lead infrastructure modernization initiatives (e.g., containerization, Kubernetes adoption, infrastructure as code, platform consolidation).

Governance, Standards & Cost Management
• Establish and enforce governance standards for infrastructure, CI/CD, observability, and operational practices.
• Define and maintain policies for environment management, access control, configuration management, and change management.
• Implement cost management practices (e.g., tagging, budget alerts, rightsizing, reservations/committed use, auto-scaling policies) to optimize cloud spend.
• Partner with product and engineering leadership to balance performance, reliability, and cost-efficiency across environments.
• Use DORA metrics and industry benchmarks to drive continuous improvement in delivery and operational performance.

CI/CD, Automation & Operations
• Design, implement, and maintain CI/CD pipelines for multiple applications and environments using tools such as Git, Azure DevOps, GitLab, or Jenkins.
• Develop and manage automation pipelines for deployment, configuration, and infrastructure management.
• Build and maintain monitoring, alerting, and logging systems to ensure visibility, high availability, and performance of applications and services.
• Manage cloud infrastructure resources and services to ensure reliability, security, and scalability.

Incident Management & Reliability
• Lead incident response efforts, including triage, root cause analysis, and post-incident reviews.
• Contribute to and maintain incident response processes, runbooks, and on-call practices.
• Partner with engineering teams to design resilient systems and reduce mean time to recovery (MTTR).

Leadership, Mentorship & Cross-Functional Collaboration
• Collaborate with software engineering, QA, product, and IT teams to determine the best way to tackle complex infrastructure, security, and delivery challenges.
• Mentor engineers in DevOps and platform practices, tools, and standards across the organization.
• Lead departmental initiatives related to DevOps, platform engineering, and infrastructure disciplines; present plans and progress to stakeholders.
• Drive new department initiatives based on organizational needs and your expertise in modern technologies and industry trends.
• Stay current on emerging technologies, tools, and best practices; evaluate their potential application within our tech stack.

Benefits

• $4,000—$5,000 USD
