leap - Senior Data Engineer
Requirements
• 5+ years of experience with Python, SQL, and dbt, with hands-on expertise in BigQuery, Snowflake, or a comparable cloud data warehouse and proficiency with orchestration tools such as Airflow, Dagster, or Prefect • Demonstrated experience architecting data platforms, including decisions around batch vs. streaming, incremental vs. full-refresh, and warehouse structure • Proven ability to build monitoring, lineage tracking, and governance systems that trace data from source to report • Experience using AI tools in day-to-day work and building data infrastructure that AI systems can rely on in production • Background as an early employee or founding data engineer responsible for building a data stack from the ground up • Healthcare or HIPAA experience; familiarity with ingestion tools such as Fivetran; CRM integrations (Salesforce, HubSpot); or prior experience building data infrastructure for LLM or AI workloads • Experience with streaming frameworks such as Kafka, Pub/Sub, or Flink, or designing systems that handle both batch and real-time data flows • Comfort with cloud infrastructure (GCP, AWS) and Linux/sysadmin fundamentals, including VM debugging, log management, and service administration • A bias toward simple, cost-effective solutions — defaulting to open-source and applying sound judgment about when managed services justify their cost and lock-in • At Leap, we’re building an outlier company with real impact — and that takes focus, energy, and commitment. If that excites you, we’d love to hear from you.
Responsibilities
• Pipelines and Warehouse • Build and own data pipelines and ETL for claims ingestion, drug pricing, and CRM sync (BigQuery, Python) • Design production pipelines for batch and streaming workloads — claims data is high-volume today, and new large-scale data sources are coming • Design warehouse schemas and transforms with clear separation between raw, staging, and modeled layers • Maintain data quality and reliability across systems that feed both human users and AI workloads — this means row-count checks, schema drift detection, anomaly alerting, and knowing when upstream sources have silently changed, not just whether the job ran • Data Governance • Build pipeline monitoring that tells you whether the data is right, not just whether the job ran • Design for recoverability. Pipelines should be idempotent and replayable, with raw data always preserved so you can reprocess when logic changes • Track data lineage: where it comes from, how it's transformed, and what depends on it • Validate data at every stage before it reaches a dashboard or an AI system • Reporting Infrastructure • Build reporting systems that give sales, clinical, and leadership teams live visibility into the business • Create automated alerting that surfaces when something has changed, so the team acts on data instead of asking for it • AI-Ready Data Infrastructure • Build PHI-safe pipelines that feed LLM workloads, agent systems, and automation • Design data architecture that connects claims, drug pricing, patient records, CRM activity, and clinical workflows into a usable whole • Own the ingestion of external data from non-standard formats and sources — we work with many providers who each send data differently, and new sources are added regularly
Benefits
• $200K – $250K • Offers Equity • At Leap, we are committed to providing competitive total rewards packages including stock options and benefits. Individual pay may vary from the target range and is determined by a number of factors including experience, location, internal pay equity, and other relevant business considerations. • Upload your resume here to autofill key application fields. • Drop your resume here! • Parsing your resume. Autofilling key fields... • or drag and drop here • What accomplishment are you're most proud of achieving in your current or last role? A few lines is best. • Recruiting Privacy Policy • Leap may use Artificial Intelligence with this application. Learn more.
Apply in one click
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT