Judi Health - Senior Scalability Engineer - Observability
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT
Requirements
• 10+ years of software engineering or infrastructure engineering experience with demonstrated progression into technical leadership roles. • Several years of experience leading technical initiatives, building platform products, or serving as a subject matter expert on observability infrastructure. • Strong experience with React/TypeScript for frontend development and Python (Flask/SQLAlchemy) for backend services. • LGTM stack expertise: Deep production experience with Loki, Grafana, Tempo, and Prometheus/Mimir for logs, metrics, and distributed tracing at scale. • AWS observability: Extensive experience with AWS CloudWatch Logs and Metrics, including custom metrics, log insights, dashboard creation, and integration patterns. • SQL analytics for logs: Production experience with SQL-based log analytics using AWS Athena, DuckDB, or similar query engines for analyzing structured and semi-structured data at scale. • Cloud-native and open-source balance: Demonstrated ability to architect solutions leveraging both managed cloud services and open-source tooling, understanding trade-offs between operational overhead, cost, flexibility, and vendor lock-in. • Search and indexing experience: Hands-on experience building or operating search systems using OpenSearch, Elasticsearch, Lucene, Tantivy, or similar search and analytics engines. • Performance-critical systems: Experience building high-performance systems that process large volumes of data efficiently (millions of log lines, high-cardinality metrics). • Systems thinking: Deep understanding of distributed systems, microservices architectures, and the complex observability challenges they present. • Data at scale: Proven track record handling high-volume structured and unstructured logging data, identifying patterns, and building efficient search/query solutions that perform well under load. • Product mindset: Ability to build internal platform products that engineers love to use, with attention to UX, performance, and reliability. • Rust development experience: Production experience with Rust for building high-performance data processing, indexing, or search systems. Strong interest in learning Rust is acceptable if combined with systems programming experience in C/C++/Go. • Infrastructure as code: Experience with Terraform for managing observability infrastructure and AWS resources. • Additional observability platforms: Experience architecting or operating Datadog, New Relic, Splunk, or other enterprise observability platforms. • Advanced query languages: Deep expertise with PromQL, LogQL, SQL optimization, and query optimization for high-cardinality data. • Columnar storage formats: Experience with Parquet, ORC, or other columnar storage formats for efficient log storage and analytics on S3. • Incident management: Experience designing incident response workflows, postmortem processes, and SLO/SLI frameworks that drive reliability improvements. • Cost optimization: Track record of reducing observability costs while maintaining or improving capabilities (e.g., CloudWatch → S3/custom indexing migration). • Data pipelines: Experience with streaming data pipelines, ETL processes, or real-time data processing. • Distributed tracing: Deep knowledge of OpenTelemetry, Jaeger, Zipkin, or distributed tracing architectures. • Git expertise and experience working in a mono repository. • Previous Pharmacy Benefits Manager (PBM) or healthcare technology experience. • Experience building developer tools or internal platforms that improve engineering productivity. • This range represents the low and high end of the anticipated base salary range for the NY - based position. The actual base salary will depend on several factors such as: experience, knowledge, and skills, and if the location of the job changes. • Nothing in this position description restricts management’s right to assign or reassign duties and responsibilities to this job at any time.
Responsibilities
• $160,000—$220,000 USD • All employees are responsible for adherence to the Capital Rx Code of Conduct including the reporting of non-compliance. This position description is designed to be flexible, allowing management the opportunity to assign or reassign duties and responsibilities as needed to best meet organizational goals.
Benefits
• $160,000 - $220,000 USD • All employees are responsible for adherence to the Capital Rx Code of Conduct including the reporting of non-compliance. This position description is designed to be flexible, allowing management the opportunity to assign or reassign duties and responsibilities as needed to best meet organizational goals.
No credit card. Takes 10 seconds.