Abridge - Staff Platform Engineer
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT
Requirements
• 10+ years of software and infrastructure engineering experience, including significant experience operating infrastructure-as-code platforms in cloud-first organizations. • Experience designing and operating large-scale Kubernetes platforms and scaling compute services on Kubernetes; experience with related cloud-native technologies including ArgoCD, Argo Rollouts, Istio, etc. • Deep understanding of Kubernetes platform architecture and operations, including workload isolation, autoscaling, networking, service mesh management, ingress patterns, observability, upgrades, and multi-tenant cluster design. • Experience designing and maintaining CI/CD systems for both infrastructure-as-code deployments and application delivery workflows. (Terragrunt, Atlas, ArgoCD, Octopus Deploy, Travis CI, etc.) • Experience building scalable infrastructure-as-code platforms using Terraform and related tooling, including modular architectures, remote state management, policy enforcement, deployment orchestration, and reusable infrastructure patterns. • Experience with monitoring and observability tooling and practices (metrics, logs, traces) and their management at scale. Experience with major observability platforms such as Grafana, Datadog, Honeycomb, etc. • Comfortable implementing and securing services in Google Cloud Platform as infrastructure-as-code, including GCP Projects, VPC Networks, Google Kubernetes Engine, IAM Roles, Groups, policies, and secure networking patterns. • Experience designing secure-by-default infrastructure including least-privilege access controls, workload identity, network segmentation, secret management, auditability, and compliance-oriented platform controls. • Strong operational instincts and experience debugging complex distributed systems, leading incident response efforts, and improving reliability through automation and observability. • Experience balancing developer experience, platform governance, operational reliability, and organizational scalability in fast-growing engineering environments. • Experience with backend languages (e.g. Python, GoLang, Node, Rust). • Up-to-date on industry best practices and tools, and enjoy learning new things. • Excited about being hands-on while also driving platform direction, architecture decisions, and operational maturity in a fast-moving and supportive environment. • Willing to pitch in wherever needed — as a fast-moving startup we need to do good work, quickly. • Demonstrates strong curiosity and a proactive interest in AI, actively exploring and applying emerging technologies. • We value people who want to learn new things, and we know that great team members might not perfectly match a job description. If you’re interested in the role but aren’t sure whether or not you’re a good fit, we’d still like to hear from you.
Responsibilities
• Design, build, and evolve cloud infrastructure platforms including networking, IAM, Kubernetes, databases, streaming and pubsub platforms, storage, distribution, observability, and more. • Lead the architecture and operational evolution of multi-tenant, multi-region, and multi-cloud infrastructure with strong reliability, scalability, and security boundaries. • Design and implement build pipelines, branching strategies, release management tooling, and self-service platform workflows that will serve an engineering organization that is rapidly growing in both size and operational complexity. • Design, implement, and scale secure-by-default cloud infrastructure practices including CI and deployment scans, least privileged access controls, auditing, policy enforcement, and maintaining SoC2 and HIPAA compliance. • Build reusable infrastructure abstractions, Terraform modules, golden paths, and developer platform capabilities that allow engineering teams to move quickly while maintaining operational consistency and governance. • Help advocate for, design, implement, and adopt fast and scalable application testing pipelines including end-to-end UI tests, hyperscale load tests, resiliency testing, and progressive delivery patterns. • Drive improvements in observability, operational readiness, incident response, SLO-driven reliability practices, and platform debuggability across the organization. • Bridge the gap between local development and production environments in a way that is seamless for engineers and maximizes engineering velocity, reliability, and security while minimizing quality issues arising from environment drift and configuration tangles. • Partner closely with engineering, security, and compliance teams to balance platform standardization with developer flexibility and evolving business requirements. • Influence infrastructure cost and capacity strategy by balancing reliability, scalability, performance, and operational efficiency across cloud environments. • Evangelize, document, mentor, and train the engineering team on the solutions being built and help uplevel the organization on cloud-native platform engineering strategies and operational excellence. • Be a public evangelist for Abridge in the global platform engineering community, including conferences, open source, and research as we pioneer new AI-first, cloud-native-first, security-first implementations at scale.
Benefits
• Base Salary $228K – $290K • Offers Equity • Based on pay transparency guidelines, the salary range listed reflects the pay scale estimated for candidates residing in the San Francisco and New York City metro areas. Actual base salary will be dependent on location, relevant experience, skills, qualifications, and/or other job-related factors. As a part of the total compensation package, this role may be eligible to participate in a company stock option plan. • Upload your resume here to autofill key application fields. • Drop your resume here! • Parsing your resume. Autofilling key fields... • or drag and drop here • I am currently living in the San Francisco Bay or New York Tri-State area • I am not currently living in the San Francisco Bay or New York Tri-State area - but I am willing to relocate within 6 months • I am not currently living in the San Francisco Bay or New York Tri-State area - but I am willing to travel up to 20% • I do not currently live in the San Francisco Bay or New York Tri-State area - I am NOT willing to relocate and am only open to 100% remote positions
No credit card. Takes 10 seconds.