Staff Data Engineer
Upload My Resume
Drop here or click to browse · PDF, DOCX, DOC, RTF, TXT
Requirements
• 10+ years working in data engineering, with a focus on large-scale data ingestion and infrastructure • Deep expertise in Python and modern data engineering tools • A track record of building automated, production-grade ETL processes using Python and dbt SQL • Strong understanding of ETL/ELT frameworks and distributed data processing • Hands-on proficiency with modern data technologies and comfort leveraging AI coding assistants to accelerate development, improve code quality, and enhance productivity • Skilled in data processing, validation, cleaning, and debugging • Strong capability integrating APIs for seamless data exchange between systems • Proven ability to handle and process varied file types and formats, including healthcare standards such as HL7, 834, 837, and NCPDP • Demonstrated success integrating and consolidating data from diverse source systems into a unified repository, including EHR and claims systems, via both file-based and API integrations • Comfort working with large-scale datasets (10GB+) • Strong capability implementing incremental processing and change data capture (CDC) methodologies • Extensive background designing scalable data architectures in AWS environments • Solid grounding in software engineering principles, including test-driven development, loose coupling, single responsibility, and modular design • Hands-on familiarity with containerization (Docker, Kubernetes) and building configuration-driven, maintainable systems • Proven ability to build tools and systems that diverse engineering profiles can operate through configuration rather than code changes • A passion for building new data infrastructure and continuously improving existing systems with robustness, maintainability, and operational excellence • Familiarity with healthcare data and regulatory environments (HIPAA) as a plus • Strong collaboration skills, with comfort partnering across technical and non-technical stakeholders • Excellent written and verbal communication, with the ability to explain technical infrastructure concepts to diverse audiences • An established private work area that ensures information privacy • A stable high-speed internet connection for remote work • This role is remote, but you will be required to come to on-site meetings multiple times per year. This may be in the interview process, onboarding, and team meetings • Ability to pass a background check • Must live in and be eligible to work in the United States
Responsibilities
• Act as the team architect by leading system design reviews, offering recommendations, conducting comprehensive peer reviews, and demonstrating expert-level proficiency in Python and AWS services • Architect and implement scalable data ingestion pipelines that handle different file types into the Arine platform • Develop reusable components that integrate into data pipelines to increase efficiency and reduce future implementation time • Create configuration-driven, containerized toolsets that are easy to use and maintain across diverse engineering profiles • Work collaboratively with cross-functional teams to meet data requirements through ETL components • Design and maintain data transformation pipelines using DBT, including macros, incremental models, and DBT tests • Implement incremental data ingestion strategies for large-scale healthcare datasets • Build monitoring and alerting systems for data ingestion processes and overall pipeline health • Refactor and rebuild existing data ingestion processes to improve scalability and operational efficiency • Work with containerization technologies (Docker, Kubernetes) to create portable and maintainable data solutions • Identify and escalate inefficiencies within and across teams • Provide technical guidance and mentorship to junior engineers, and promote best practices and coding standards • Author and maintain high-quality technical documentation, and support junior engineers in doing the same • Collaborate with the DE Manager to report on DE contractor performance issues. • All staff at Arine are expected to be part of its Information Security Management Program and undergo periodic training on Information Security Awareness and HIPAA guidelines. Each user is responsible to maintain a secure working environment and follow all policies and procedures. Upon hire, each person is assigned and must complete trainings before access is granted for their specific role within Arine.
Benefits
• Outstanding Team and Culture - Our shared mission unites and motivates us to do our best work. We have a relentless passion and commitment to the innovation required to be the market leader in medication intelligence. • Outstanding Team and Culture - • Making a Proven Difference in Healthcare - We are saving patient lives, and enabling individuals to experience improved health outcomes, including significant reductions in hospitalizations and cost of care. • Making a Proven Difference in Healthcare - • Market Opportunity - Arine is backed by leading healthcare investors and was founded to tackle one of the largest healthcare problems today. Non-optimized medications therapies which cost the US 275,000 lives and $528 billion annually. • Market Opportunity - • Dramatic Growth - Arine is managing more than 18 million lives across prominent health plans after only 4 years in the market, and was ranked 236 on the 2024 Inc. 5000 list and was named the 5th fastest-growing company in the AI category. • Dramatic Growth - • As a key technical leader and team architect working in a fast-paced environment, you will drive the design, development, and optimization of scalable data ingestion pipelines within the Arine platform. Leveraging expert-level proficiency in Python and AWS, you will architect solutions that handle diverse file types and large-scale healthcare datasets. You will have a direct impact on building reusable, configurable tools set for handling data needs for the entire company. • The posted range represents the expected salary for this position and does not include any other potential components of the compensation package (including bonus and equity), benefits, and perks. Ultimately, the final pay decision will consider factors such as your experience, job level, location, and other relevant job-related criteria. The salary range for this position is: $170,000-185,000/year.