wagey.ggwagey.ggv1.0-0f5e85e-22-May
Browse Tech JobsCompaniesFeaturesPricingFAQs
Log InGet Started Free
Jobs/Platform Engineer Role/attio - Senior Platform Engineer
Pro members applied to this job 36 hours before you saw itGet Pro ›
attio

attio - Senior Platform Engineer

London£95k - £125k/year+ Equity4d ago
In OfficeSeniorEMEACloud ComputingPlatform EngineerAWSGCPGoRustBudget ManagementAzureDockerKubernetesTypeScriptPythonGoogle GKEZipkinJaegerTerraformPulumiSplunk

Upload My Resume

Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT

Apply in One Click
Apply in One Click

Requirements

• Applied DevOps and SRE Principles: • Must have : Demonstrable, hands-on experience applying core DevOps and Site Reliability Engineering (SRE) principles to manage, monitor, and scale production systems. • Must have: A deep understanding of the SRE mindset, including SLO/SLA creation and monitoring, error budget management, toil reduction, and post-incident review (blameless postmortems). • Desirable: Proven ability to drive cultural and process change that fosters a collaborative approach between development and operations teams. • Cloud Infrastructure and Containerisation Expertise: • Must have: Expertise in one or more major public cloud providers (AWS, GCP, or Azure), encompassing network configuration, security best practices (IAM, security groups, etc.), compute services (EC2, GKE, ECS, etc.), and managed services (databases, queues, serverless functions). • Must have: In-depth knowledge of container technologies, specifically Docker, and extensive experience orchestrating them at scale using Kubernetes (K8s). This includes designing, deploying, and managing Kubernetes clusters, understanding networking (CNI), storage (CSI), and security configurations within the Kubernetes ecosystem. • Must have: Proficiency in one or more modern software languages (e.g., Typescript, Go, Python, Rust) and associated frameworks used for building high-performance, resilient production systems. • Must have: Proven experience developing robust, maintainable, and well-tested automation scripts, services and pipelines to manage infrastructure, deployments, and operational tasks. • Operational Tooling and Observability Management: • Must have: Experience owning, managing, and maintaining mission-critical operational tooling. • Desirable: Proven background in implementing and managing centralised logging solutions or similar platforms (e.g., Splunk, DataDog). • Desirable: Familiarity with distributed tracing tools (e.g., Jaeger, Zipkin) and Application Performance Monitoring (APM) solutions.

Responsibilities

• The core responsibility is to implement, maintain, and continuously improve the foundational platform infrastructure that powers all engineering services. This necessitates a relentless focus on ensuring high reliability, exceptional scalability, and optimal performance across the entire stack. • Platform Infrastructure: Build and maintain platform infrastructure using declarative IaC tools (e.g., Terraform, Pulumi), ensuring all environments are reproducible, version-controlled, and auditable. Proactively manage the capacity of the infrastructure to consistently meet or exceed Service Level Objectives for latency, error rates, and availability. • Incident Response and Post-Mortems: Act as first-line responders for critical system incidents. Triage, diagnose, and resolve complex production issues rapidly. Drive a culture of blameless post-mortems, ensuring root causes are identified, and long-term preventative measures are implemented as code (e.g., via runbooks, automation, or system design changes). • Tooling & Automation: Own the stack of supporting tools necessary for operational excellence and developer enablement, including: • Continuous Integration and Continuous Delivery (CI/CD) Pipelines: Implement, maintain, and evolve the fully automated CI and CD pipelines. This includes establishing best practices for fast, reliable, and secure build, test, and deployment processes. • Observability: Implement and manage robust systems for monitoring (metrics), logging (centralised log aggregation), and distributed tracing to provide deep insights into application and infrastructure health.

Benefits

• Competitive salary of £95,000 to £125,000 • Equity in an early-stage tech company on an incredible trajectory • 25 days holiday plus local public holidays • Apple hardware • Private medical insurance through AXA • Pension contribution through Hargreaves Lansdown • Enhanced family leave • Team off-site in fun places! (We've been to Barcelona, Lisbon, Malta, and Split so far)

Similar Jobs

Primer.ioPrimer.io - Senior DevEx Engineer4d ago
·Remote - Portugal, Romania, Hungary, Poland·Equity
RemoteEMEASeniorInsuranceFintechSenior DevOps EngineerHR ManagerAWSGCPTerraformProspectingAnsiblePulumi
Finite StateFinite State - IoT / ICS / OT Penetration Tester4d ago
·Remote - United States or Canada·Equity
RemoteNASeniorCloud ComputingArtificial IntelligencePenetration TesterPerformance ReviewsC++AWSReportingBashPythonCAIASPSS
SardineSardine - Data Engineer4d ago
·Remote - United States·$150k - $205k/year + Equity
RemoteNASeniorCloud ComputingData AnalyticsData EngineerDocumentationTeam LeadershipProduct MarketingKPI TrackingPythonFivetranSQLdbtAirflowSalesforceSnowflakeAmplitudeAWSGCPKubernetesDockerTableauLookerData VisualizationSegmentMixpanelB2BStakeholder ManagementCloseData QualityGovernance
FoxitFoxit - Director of Global IT4d ago
·Alpharetta, GA - Hybrid
In OfficeNADirectorCybersecurityCloud ComputingDirector of EngineeringGeneral ManagerTeam ManagementContract ManagementMicrosoft 365JiraAWSAzureGCPFreshdeskMandarinGovernanceCSATDocumentationITIL
snowflakesnowflake - SnowCAT Technical Principal4d ago
·Remote - US-USA-Remote
RemoteNAPrincipalCloud ComputingArtificial IntelligencePrincipalCTOSnowflakeProduct MarketingLearning & DevelopmentAWSAzureGCPSQLJavaScalaJavaScriptPython
Get Started Free

No credit card. Takes 10 seconds.

Privacy·Terms··Contact·FAQ·Wagey on X