wagey.ggwagey.gg
38,923  jobs38,923  jobs
Browse Tech JobsCompaniesFeaturesPricingFAQs
Log InGet Started Free
Jobs(38,923)/Site Reliability Engineer Role(222)/Wand Synthesis AI Inc (19) - Senior Site Reliability Engineer
Wand Synthesis AI Inc

Wand Synthesis AI Inc - Senior Site Reliability Engineer

Remote, Europe Timezone - Hybrid1mo ago
In OfficeSeniorEMEACloud ComputingSite Reliability EngineerKubernetesTerraformB2BB2CMLOps

Requirements

• Strong hands-on experience in Site Reliability Engineering, DevOps roles. • Experience working with cloud infrastructure (AWS preferred). • Experience operating production systems and responding to incidents. • Experience with Kubernetes in production environments. • Strong experience with Infrastructure-as-Code (Terraform or similar). • Experience working with CI/CD pipelines and deployment automation. • Experience with monitoring, logging, and observability tooling. • Strong troubleshooting and debugging skills in distributed systems. • Experience supporting data platforms or ML workloads in production environments. • Strong collaboration and communication skills. • Experience in large-scale global B2B/B2C products. • Experience working with AI, ML, or data-intensive systems. • Exposure to MLOps workflows (model deployment, monitoring, retraining). • Experience supporting customer-hosted or multi-tenant environments. • Experience working in regulated or security-conscious environments. • Experience scaling infrastructure in high-growth product environments. • Experience in collaborating with large scale enterprise customers to deploy and operate environments within their accounts and VPCs. • Personal Characteristics • Strong sense of ownership and accountability for systems in production. • Practical and hands-on approach to solving operational problems. • Calm and methodical during incidents and troubleshooting situations. • Strong collaborator who works effectively across engineering teams. • Curious and eager to continuously improve systems and processes. • Bias toward automation and reducing manual operational work. • Comfortable working in fast-moving, evolving technical environments.

Responsibilities

• Build, maintain, and operate scalable production infrastructure. • Own reliability and availability for key services and environments. • Contribute to the design and operation of Kubernetes-based infrastructure. • Develop and maintain Infrastructure-as-Code frameworks (e.g., Terraform). • Improve monitoring, alerting, and observability across systems. • Participate in on-call rotations and respond to production incidents. • Investigate root causes of incidents and contribute to postmortems and reliability improvements. • Improve system performance, availability, and fault tolerance. • Contribute to CI/CD pipeline improvements to increase release safety and predictability. • Support the deployment and operation of data platforms and ML workloads. • Help standardise environments and infrastructure across internal systems and customer deployments. • Troubleshoot issues across infrastructure, services, and deployment pipelines. • Work closely with QA and engineering teams to improve production readiness and release stability. • Contribute to automation efforts that reduce operational toil.

Apply in one click

Upload My Resume

Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT

Apply in One Click
Apply in One Click

Similar roles

getgroundgetground - Lead Site Reliability Engineer4mo ago
·London, United Kingdom - Hybrid
In OfficeEMEAStaffCloud ComputingReal EstateSite Reliability EngineerGCPGoDockerKubernetesTerraform
replitreplit - Senior Site Reliability Engineer1mo ago
·Remote - Europe
RemoteEMEASeniorCloud ComputingSite Reliability EngineerGoPythonReportingKubernetesGCP
Obsidian SecurityObsidian Security - Sr. Site Reliability Engineer1mo ago
·Cheltenham, Gloucestershire, United Kingdom·£95k/year/year + Equity
In OfficeEMEASeniorCloud ComputingSoftwareSite Reliability EngineerAWSGCPKubernetesHelmPrometheus
AxonAxon - Sr Site Reliability Engineer I4mo ago
·London, England, United Kingdom
In OfficeEMEASeniorCloud ComputingSoftwareSite Reliability EngineerGoC#JavaPythonTerraform
BloomreachBloomreach - Senior Site Reliability Engineer for Datacraft team2w ago
·Remote - Slovakia·€41.6 - €52/hour/year + Equity
RemoteEMEASeniorCloud ComputingArtificial IntelligenceSite Reliability EngineerKafkaRedisApache SparkSQLKubernetesGCPPrometheusAirflowTerraformSentryJiraClaudeDatabricksConfluenceGrafanaCursorGeminiSnowflakeData QualityGoPython
GoCardlessGoCardless - Site Reliability Engineer1w ago
·Remote - UK·£78k/year/year + Equity
RemoteEMEACybersecurityCloud ComputingSite Reliability EngineerRubyPythonKubernetesGitHubPrometheusGrafanaGoogle GKETerraformAWSGCP
Parallel DomainParallel Domain - Senior Site Reliability Engineer1mo ago
·Remote - Pacific Northwest Area·$145k - $185k/year + Equity
RemoteNASeniorCloud ComputingArtificial IntelligenceSite Reliability EngineerTerraformAWSKubernetesHelmBash
RapidSOSRapidSOS - Senior Site Reliability Engineer2mo ago
·Remote - New York (Remote) or Boston (Remote)·$160k - $195k/year + Equity
RemoteNASeniorCloud ComputingSite Reliability EngineerAWSPythonKubernetesKafkaTerraform
GitLabGitLab - Senior Site Reliability Engineer, Environment Automation3mo ago
·Remote - Canada·$124k - $266k/year + Equity
RemoteNASeniorCloud ComputingSite Reliability EngineerTerraformKubernetesAnsibleRubyGo

Browse more by category

Show 222 moreSite Reliability EngineerShow 1,928 moreKubernetesShow 1,191 moreTerraformShow 3,331 moreB2BShow 228 moreB2CShow 270 moreMLOps
Privacy·Terms··Contact·FAQ·Wagey on X