• Build and maintain data pipelines to ingest, transform, and organize data from multiple sources
• Work with clinical, claims, or similar structured datasets and map them into standardized data models
• Run data quality checks to ensure accuracy and consistency
• Use tools like Databricks, BigQuery, Redshift, dbt, and command-line utilities
• Collaborate with cross-functional teams including data, product, and business stakeholders
• Help maintain ongoing data refresh processes and troubleshoot pipeline issues when needed
• Bachelor’s degree in Computer Science or a related field (or equivalent hands-on experience)
• Strong working knowledge of SQL
• Familiarity with Python, PySpark, or SparkSQL
• Experience with modern data platforms like Databricks, Snowflake, or BigQuery
• Comfort working in a remote, collaborative environment
• Based in the United States (preferred time zones: Central, Mountain, or Pacific)