wagey.ggwagey.ggv1.0-55c2ce9-10-Apr
Browse Tech JobsCompaniesFeaturesPricing
Log InGet Started Free
Jobs/Platform Engineer Role/The Job Sauce - Senior Platform Engineer
Pro members applied to this job 36 hours before you saw itGet Pro ›
The Job Sauce

The Job Sauce - Senior Platform Engineer

Remote - USA *$175k - $275k5d ago
RemoteSeniorNACloud ComputingArtificial IntelligencePlatform EngineerAWSTerraformDockerPostgreSQLRedis

Upload My Resume

Drop here or click to browse · PDF, DOCX, DOC, RTF, TXT

Apply in One Click

Requirements

• Our infrastructure runs on AWS and is managed 100% with Terraform and Pulumi Cloud. Application services run in Docker on ECS EC2 or Fargate. Key services include Aurora PostgreSQL, ElastiCache (Redis), MSK (Kafka), and OpenSearch. Our CI/CD runs on Buildkite with TypeScript pipeline-as-code. • We also maintain internal TypeScript platform libraries (codegen, service templates, shared data access/instrumentation) that power consistent APIs and developer workflows across the monorepo. Observability is powered by Datadog, CloudWatch, and Sentry. DNS and CDN are handled by Cloudflare. Application code is a TypeScript monorepo running Node/Express with a React frontend and GraphQL/Apollo API layer. We use GitHub for source control. • (It's okay not to have all of these things - these are just some skills we are excited about!) • 🧩 Experience building internal platforms / developer tooling: code generation, CLIs, templates, shared SDKs, or frameworks that improve engineering velocity. • 🏗️ Strong TypeScript skills and API design taste - you enjoy building stable primitives that other engineers rely on. • 🌩 Deep AWS experience across compute, networking, storage, and security (ECS, Lambda, VPC, ALB, IAM, RDS, ElastiCache, MSK, OpenSearch, S3, CloudWatch, CloudTrail, GuardDuty). • 🏠 Strong Terraform and/or Pulumi proficiency: modules, workspaces, and CI-driven plan/apply workflows. • 🚀 Experience designing and operating CI/CD systems that help large engineering teams ship frequently and confidently. • 🔍 Track record building production observability stacks (Datadog, CloudWatch, Sentry, distributed tracing, SLOs). • 🧱 You’ve built “paved roads” that bake in secure, reliable defaults (instrumentation helpers, policy-as-code, safe-by-default deployment patterns). • 🔒 Security-first mindset - you proactively harden infrastructure without slowing teams down. • 🛢 Aurora PostgreSQL operations at scale: backups, PITR, failover, read replicas, query tuning. • 🧰 Comfort with Docker and container orchestration environments. • 📊 Reliability engineering mindset: SLOs, error budgets, incident response. • 🤖 Curiosity about the unique infrastructure demands of AI and LLM workloads. • 📝 Strong written communication - you document decisions and help the team understand the systems they depend on.

Responsibilities

• Own and evolve our AWS infrastructure using Terraform and Pulumi Cloud - treating infrastructure as a product that engineering teams depend on. • Design and maintain internal developer tooling and libraries that standardize how we build and ship - code generation, shared SDKs, data access patterns, and service scaffolding. • Create and maintain golden paths for common workflows (new service setup, background jobs, event streams, APIs) so teams can ship quickly with built-in security and observability (and consistent defaults across services). • Build and maintain CI/CD pipelines and per-PR ephemeral environments that make deploying feel easy and safe. • Drive reliability through SLOs, auto-scaling, incident response, and postmortems - and build systems that make the next incident less likely. • Create observability tooling and shared instrumentation libraries that give engineers real-time insight into their services. • Enforce security best practices across IAM, secrets management, encryption, audit logging, and DDoS protection. • Own the reliability and performance of our data platform (Aurora PostgreSQL) - provisioning, backups, failover, and tuning - and build tooling that makes safe usage the default. • Reduce toil through automation and self-service tooling so engineers can move fast without waiting on the platform. • Contribute to architecture decisions and documentation, and participate in on-call rotation (shared by the whole engineering team).

Benefits

• Robust medical coverage (100% of employee + family premiums covered) • Vision & dental coverage • Remote-first culture - work from wherever you do your best work • Flexible time off - we trust you to manage your time

Similar Jobs

RobinhoodRobinhood - Senior Software Engineer, AI InfrastructureYesterday
·Menlo Park, CA·$196k - $230k/year + Equity
In OfficeNASeniorCloud ComputingArtificial IntelligenceSenior Software EngineerC++PythonRayKubeflowTritonElasticsearchQdrantVectorAWS
CprimeCprime - Senior Cloud EngineerYesterday
·Hyderabad, India
In OfficeAPACSeniorCloud ComputingCloud EngineerAssociateBashPythonTerraformJenkinsGovernanceELKPrometheusGrafanaMentoringAWSDockerKubernetes
DeepLDeepL - Senior Salesforce Platform Analyst (AI Enabled)Yesterday
·London
In OfficeEMEASeniorSoftwareOil & GasPlatform EngineerReportingData QualityDocumentationClaudeB2BJiraConfluenceDecision MakingGovernanceObservableSalesforce
FiFi - Senior Software EngineerYesterday
·Remote - USA *·$150k - $210k/year + Equity
RemoteNASeniorCloud ComputingArtificial IntelligenceSoftware EngineerSenior Software EngineerLearning & DevelopmentDashboard CreationAWSTypeScriptReact
Scott LogicScott Logic - Lead Data EngineerYesterday
·Remote - Edinburgh
RemoteEMEAStaffCloud ComputingData EngineerAWSAzureGCPSnowflake
Get Started Free

No credit card. Takes 10 seconds.

Privacy·Terms··Contact
Loading...