BlackSky - Senior Infrastructure Engineer
Requirements
• At least five years years in infrastructure, platform, DevOps, or SRE engineering, with at least 3 years running Kubernetes in production. • Bachelor's degree in a relevant field of study or equivalent experience (four years). • Strong hands-on AWS experience across networking, compute, storage, and IAM, including hybrid/on-prem connectivity patterns. • Production experience operating Kubernetes in one or more enterprise distributions — Amazon EKS, Rancher/RKE2, or OpenShift 4. • Demonstrated GitOps experience with Argo CD (or Flux) as the primary deployment mechanism. • Proficiency authoring and maintaining Helm charts, and a solid grasp of Kubernetes primitives (workloads, networking, RBAC, storage, CRDs/operators). • Experience with the Kubernetes Operator deployment model — deploying and managing workloads via operators and CRDs (OLM/OperatorHub). • Strong infrastructure-as-code skills, ideally with Terraform. • Comfort with Linux systems administration and scripting (Bash, plus Python or Go). • Experience building on hardened, non-CVE / zero-known-vulnerability base images (e.g., Chainguard, Iron Bank, or distroless/minimal baselines) and supply-chain security practices. • Production monitoring and observability with Prometheus and Grafana (exporters, PromQL, alerting, dashboards). • Clear written and verbal communication, and the ability to work independently across the full lifecycle of a platform component. • Breadth across all three of EKS, Rancher/RKE2, and OpenShift 4, with the ability to move fluidly between them. • Experience running Kubernetes in edge / resource-constrained environments (e.g., k3s), including the operational tradeoffs of lightweight and disconnected deployments. • Direct experience packaging and deploying into air-gapped / disconnected environments using Zarf, image mirroring, and private registries. • Container and image scanning experience (Trivy, Grype, Clair, or equivalents) integrated into CI/CD and registry workflows. • Familiarity with secrets management (Vault, External Secrets Operator) and PKI/certificate automation. • Experience with persistent storage at scale (Ceph, EBS/EFS-backed storage classes). • Hands-on OpenTelemetry (OTEL) experience — instrumenting services, running the OTEL Collector, and standardizing traces, metrics, and logs across the platform. • Centralized log aggregation and analysis with Elasticsearch / OpenSearch (and shippers such as Fluent Bit, Fluentd, or Logstash). • Background supporting regulated, government, or other compliance-driven programs. • Service mesh experience with Istio (traffic management, mTLS, ingress/egress gateways, and observability integration). • Relevant certifications (CKA/CKAD/CKS, AWS Solutions Architect / DevOps Engineer, Red Hat OpenShift, Rancher). • Life at BlackSky for full-time US benefits eligible employees includes: • Life at BlackSky for full-time US benefits eligible employees includes • Medical, dental, vision, disability, group term life and AD&D, voluntary life and AD&D insurance • BlackSky pays 100% of employee-only premiums for medical, dental and vision and contributes $100/month for out-of-pocket expenses! • 15 days of PTO, 11 Company holidays, four Floating Holidays (pro-rated based on hire date), one day of paid volunteerism leave per year, parental leave and more • 401(k) pre-tax and Roth deferral options with employer match • Flexible Spending Accounts • Employee Stock Purchase Program • Employee Assistance and Travel Assistance Programs • Employer matching donations • Professional development • Mac or PC? Your choice! • The anticipated salary range for candidates in Seattle, WA is $135,000-$150,000 per year. The final compensation package offered to a successful candidate will be dependent on specific background and education. BlackSky is a multi-state employer, and this pay scale may not reflect salary ranges in other states or locations outside of Seattle, WA.
Responsibilities
• Design and operate AWS infrastructure (VPC, subnets, NLB/ALB, IAM, EKS, EC2, S3, Route 53) and the hybrid connectivity that ties cloud to on-premises and private/air-gapped networks. • Stand up and run production-grade Kubernetes clusters on EKS, Rancher (RKE2) and/or Red Hat OpenShift 4, including upgrades, capacity planning, networking, storage, and day-2 operations. • Implement and own GitOps workflows with Argo CD — declarative cluster and application state, app-of-apps patterns, sync policies, drift detection, and progressive rollout strategies. • Author, version, and maintain Helm charts for internal and third-party workloads, including values management, chart dependencies, and templating standards across environments. • Build repeatable delivery into disconnected environments using Zarf (and equivalent packaging/mirroring tooling) — bundling images, charts, and manifests for air-gapped installs and reproducible deployments. • Codify infrastructure and platform configuration as code (Terraform, Helm, Kustomize) with a clear build-once / promote-per-environment strategy. • Build and harden CI/CD pipelines that move artifacts safely from dev through to restricted production and BCP targets. • Integrate platform services — certificate management (cert-manager), secrets management, container registries, storage, and observability — as shared, reusable building blocks. • Establish operational standards: monitoring, alerting, logging, runbooks, incident response, and capacity/cost management. • Other responsibilities as assigned.
Apply in one click
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT