Manager, Site Reliability Engineering

Veeam Software · Remote - Czechia · 1mo ago
Remote · Mid · EMEA · Insurance · Cloud Computing · Site Reliability Engineer · Governance · Coaching · Kubernetes · Azure · Terraform

Requirements

• 7+ years in Software, Platform, and/or Reliability Engineering, with 2+ years managing engineers
• Demonstrable experience leading engineering teams to predictably deliver outcomes
• Experience leading cross-functional initiatives collaboratively with peers through influence
• Experience with public cloud (Azure preferred), Kubernetes, IaC (Terraform, Pulumi), CI/CD (GitHub Actions, ArgoCD, Azure DevOps), and observability (OpenTelemetry, Elastic, Datadog, Prometheus, Grafana)
• Coding background with experience improving service reliability
• Hands-on incident management and postmortem practice; excellent cross-geo communication
• Willingness to participate in an on-call rotation (typically during daytime hours, including weekends/holidays)
• Demonstrated success leading SLO/error-budget adoption and reliability programs for cloud services
• Experience operating a multi-region, follow-the-sun on-call model
• Background in chaos/resilience/performance testing and release validation
• Track record building or scaling SRE teams and influencing org-wide standards
• Familiarity with compliance frameworks common to SaaS

What You’ll Get

• 25 vacation days, 4 sick days, 21 paid medical leave days, plus 4 extra global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
• Premium private medical insurance for employees and dependents
• Daily meal vouchers for restaurants and groceries (180 CZK per working day)
• Flexible cafeteria platform with thousands of lifestyle benefit options
• Multisport Card for gym and wellness, with family add-on options
• Annual public transport reimbursement up to a set limit
• Corporate mobile plan with optional family tariff
• Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, workshops, and learning events like our annual Global Day of Learning

Please note: If the applicant is permanently present outside of the Czech Republic, Veeam reserves the right to refuse to consider the application. Remote work is possible only if the employee is located in the Czech Republic.

Responsibilities

People & Team Leadership
• Hire, onboard, and grow your SRE team; coach career development and performance
• Foster a psychologically safe, blameless culture that favors learning over blame and emphasizes engineering over firefighting
• Ensure sustainable operational coverage; monitor on-call health and workload
• Track and cap toil so engineers spend the majority of their time on project work that reduces future toil

Reliability Strategy & Governance
• Establish and operationalize SLIs/SLOs and error budgets with service owners; run reliability reviews and hold teams accountable to outcomes
• Define reliability standards, runbooks, readiness checklists, and alerting patterns (including SLO-based alerting)
• Partner with product/EMs to align reliability work with service goals and customer experience, not as a gate but as an enabler

Operations & Incident Excellence
• Ensure incident response readiness; lead/coordinate major incidents; drive fast, high-quality postmortems and systemic fixes
• Measure MTTR, change failure rate, SLO posture, and repeat-incident reduction; publish learnings broadly

Engineering & Automation
• Lead software-first reliability investments: observability, deployment safety (canary/blue-green), resilience testing/chaos, and self-service guardrails
• Drive platform improvements (IaC, CI/CD, Kubernetes) and internal tools that scale operations and improve developer experience
