Kraken

Kraken - Platform Engineer - Product Reliability (Mid/Senior Level)

Remote - Australia · 1mo ago
Remote · Senior · APAC · Cloud Computing · Software · Platform Engineer · Site Reliability Engineer · Technical Writing · Reporting · AWS · Terraform · Documentation

Responsibilities

• Teach and support product teams on best practices for reliability, implementation patterns and effective usage of our existing platforms
• Support product teams in improving the performance and availability of their systems
• Be hands-on in code and infrastructure to help product teams with reliability improvements
• Provide comprehensive feedback to the wider Platform group on improvements to be made to core infrastructure based on observations and first-hand experience in the code base
• Support the build-out of proof-of-concept requirements in product teams as needed to evolve application deployment architecture to align with business growth as well as enhance scalability and system resilience
• Collaborate with product teams to support the release of new features and services, ensuring adherence to reliability and performance standards
• Guide product teams in designing systems for resilience and graceful failure under heavy load
• Assist application teams with post-incident tasks and follow-ups, and contribute to the creation and review of post-mortem documentation
• Analyse incident metrics to identify trends and potential improvements, communicating these insights to the product teams
• Help solve interesting and difficult problems. There’s a great opportunity for disruption in the global energy market

What you'll have

• Great communication skills, working effectively with developers, product managers and other business stakeholders to understand, design and deliver impactful projects and reliability improvements
• Solid hands-on experience across our core platform stack:
  • AWS (supporting and improving cloud infrastructure used by product teams)
  • Terraform (infrastructure as code; comfortable operating with Terraform day-to-day)
  • Kubernetes (container orchestration and deployment management; comfortable working with Kubernetes day-to-day)
• Experience using industry-standard observability tooling - we use Datadog, Grafana, Prometheus and Rootly (experience with other monitoring/alerting platforms is transferable)
• Strong collaboration and communication skills - able to work effectively with developers, product managers, and other stakeholders to design and deliver impactful observability “golden paths” and monitoring experiences
• Exposure to Python (or a similar C-based language like TypeScript, Go, C#) - able to understand how applications behave in production to support observability and reliability improvements
• Previous experience working in small, highly autonomous teams
• A working style that fits how we operate:
  • Comfortable with ambiguity and able to create structure in unclear situations
  • Proactive learning mindset (experiment, iterate, and adapt as the team evolves approaches)
  • Strong asynchronous written communication (Slack/Notion/docs) and a habit of keeping others in the loop
  • Autonomy and accountability - making progress independently and owning outcomes

What will help

• Previous experience as a Site Reliability Engineer
• Experience working on SaaS platforms, including engaging product teams to ensure up-skilling and knowledge sharing across teams
• Experience managing and supporting a large scale internet facing service
• Experience in responding to incidents and outages, writing technical incident reports and organising incident retrospectives
• Experience working with very large relational databases
• Experience in using service level objectives to improve application performance
• A proactive, innovative mindset

Kraken is a certified Great Place to Work in France, Germany, Spain, Japan and Australia. In the UK we are one of the Best Workplaces on Glassdoor with a score of 4.7. Check out our Welcome to the Jungle site (FR/EN) to learn more about our teams and culture.

Are you ready for a career with us? We want to ensure you have all the tools and environment you need to unleash your potential. If you have any specific accommodations or a unique preference, please contact us at [email protected] and we'll do what we can to customise your interview process for comfort and maximum magic!
