wagey.ggwagey.gg
38,923  jobs38,923  jobs
Browse Tech JobsCompaniesFeaturesPricingFAQs
Log InGet Started Free
Jobs(38,923)/Site Reliability Engineer Role(222)/NICE (246) - Site Reliability Engineer
NICE

NICE - Site Reliability Engineer

Remote - United Kingdom2mo ago
RemoteEMEACloud ComputingSite Reliability EngineerGoSplunkDatadogBashPython

Responsibilities

• Unlike a traditional NOC analyst, an SRE‑NOC is expected to engineer problems away, not just respond to alerts. • How will you make an impact? • Incident Response & Operations • Act as a primary or escalation responder in a 24x7 on‑call rotation • Lead or support Major Incident (MI) response, including triage, mitigation, and resolution • Coordinate across Engineering, Infrastructure, Security, and Product teams • Execute and improve runbooks, playbooks, and escalation paths • Drive blameless post‑incident reviews (PIRs) and track corrective actions • Monitoring, Alerting & Observability • Own service health monitoring across infrastructure, applications, and dependencies • Design and maintain alerting strategies that align with SLIs/SLOs • Reduce alert fatigue through signal‑to‑noise improvements • Build dashboards using tools such as: • Datadog / Splunk / CloudWatch • Reliability Engineering & Automation • Automate repetitive operational tasks to reduce manual toil • Improve mean time to detect (MTTD) and mean time to resolve (MTTR) • Develop scripts and tools (Python, Bash, Go, etc.) to support NOC/SRE workflows • Implement self‑healing and auto‑remediation where possible • Partner with engineering teams to improve system design for reliability • Platform & Infrastructure Support • Support and troubleshoot: • Linux‑based systems • Cloud platforms (AWS, Azure, GCP) • Kubernetes / containerized environments • Assist with capacity planning and availability reviews • Ensure operational readiness for production releases • Have you got what it takes? • Strong Linux systems administration • Experience with incident management and production support • Familiarity with: • Cloud infrastructure (AWS preferred) • Containers & orchestration (Docker, Kubernetes) • Monitoring/alerting platforms • Scripting or programming experience in Python, Bash, Go, or similar • Understanding of networking fundamentals (DNS, TCP/IP, load balancing) • Experience working in 24x7 NOC or production operations environments • Ability to handle high‑pressure incidents calmly and effectively • Strong written and verbal communication for incident coordination • Comfort working from runbooks—but improving them when they fall short • Preferred / Differentiators • Experience defining or operating to SLOs / SLIs • Prior migration from traditional NOC → SRE model • Infrastructure as Code experience (Terraform, Ansible, etc.) • Exposure to security, compliance, or regulated environments • Requisition ID: 10579. • Reporting into: Manager, Network Operations. • Role Type: Individual Contributor.

Apply in one click

Upload My Resume

Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT

Apply in One Click
Apply in One Click

Similar roles

airappsairapps - Site Reliability Engineer (SRE)2mo ago
·London, London Metropolitain Area, UK·€55k - €68k/year
In OfficeEMEAMidCloud ComputingSite Reliability EngineerBashGoTerraformPulumiPython
RedditReddit - Staff Site Reliability Engineer - Site Experience1mo ago
·Remote - UK
RemoteEMEAStaffCloud ComputingSite Reliability EngineerGoPythonPerformance ManagementLinuxKubernetes
replitreplit - Senior Site Reliability Engineer1mo ago
·Remote - Europe
RemoteEMEASeniorCloud ComputingSite Reliability EngineerGoPythonReportingKubernetesGCP
ProvectusProvectus - Middle Site Reliability Engineer (CDN & DevOps)1mo ago
·Yerevan / Belgrade / Novi Sad, Vojvodina / Poland / Odesa / Kyiv
RemoteEMEACloud ComputingSite Reliability EngineerBashPythonClaudeCloudflareLinux
RedditReddit - Staff Site Reliability Engineer1mo ago
·Dublin, Ireland
In OfficeEMEAStaffCloud ComputingSite Reliability EngineerGoPythonPerformance ManagementLinuxKubernetes
Multibank GroupMultibank Group - Site Reliability Engineer2mo ago
·Dubai, United Arab Emirates
In OfficeEMEAMidCloud ComputingSite Reliability EngineerBashLinuxPythonPipeline ManagementDocker
AxonAxon - Sr Site Reliability Engineer I4mo ago
·London, England, United Kingdom
In OfficeEMEASeniorCloud ComputingSoftwareSite Reliability EngineerGoC#JavaPythonTerraform
Backblaze External WebsiteBackblaze External Website - Site Reliability Engineer II1mo ago
·Remote - Bangalore
RemoteAPACMidCloud ComputingSoftwareSite Reliability EngineerBashGoPythonLinuxDocker
OXIO CorporationOXIO Corporation - Site Reliability Engineer1mo ago
·Remote - USA
RemoteNACloud ComputingTelecommunicationsSite Reliability EngineerGoRubyBashPerlPython

Browse more by category

Show 222 moreSite Reliability EngineerShow 2,085 moreGoShow 110 moreSplunkShow 235 moreDatadogShow 479 moreBashShow 6,338 morePython
Privacy·Terms··Contact·FAQ·Wagey on X