Arine - Senior DevOps Engineer
Requirements
• 3-5 years of experience in DevOps, Cloud Engineering, or Site Reliability Engineering (SRE).
• Strong hands-on experience with Amazon Web Services (AWS).
• Solid expertise in CI/CD pipeline implementation (GitHub Actions, Jenkins, AWS CodePipeline, etc.).
• Proficiency in Infrastructure as Code (Terraform, CloudFormation, or similar).
• Experience with Docker and container orchestration (EKS, ECS, or Kubernetes).
• Strong experience in scripting (Bash, Python).
• Working knowledge of Linux, networking concepts, and shell scripting.
• Experience with industry-standard monitoring and logging platforms.
• Familiarity with AI-driven operations (AIOps) tools or intelligent observability platforms.
• Experience with serverless architectures.
• Exposure to security scanning, compliance automation, or frameworks like SOC2/HIPAA.
• Experience working in regulated environments (healthcare, finance, etc.).

Mindset We’re Looking For:
• Automation-First Thinker: You believe everything that can be automated (infrastructure, deployments, testing, and recovery) should be automated.
• Cloud Architect in the Making: You consistently consider scalability, cost, reliability, and performance when designing infrastructure.
• AI-Curious Engineer: You are excited about using AI to improve monitoring, reduce downtime, and speed up DevOps workflows.
• Ownership-Driven: You take responsibility for the health, security, and reliability of the systems you build.
• Strong Communicator: You effectively translate technical challenges into clear solutions when working with developers, QA, and leadership.

• An established private work area that ensures information privacy
• A stable high-speed internet connection for telephonic and/or remote work
• Ability to pass a background check
• Must live in and be eligible to work in the United States
Responsibilities
Cloud Infrastructure Management
• Design, deploy, and manage scalable, secure, and reliable AWS cloud infrastructure using Infrastructure as Code (IaC) tools such as Terraform or AWS CloudFormation
• Monitor and manage EC2 instances and other services like RDS and ECS, ensuring optimal performance, scalability, and availability. Manage auto-scaling to handle varying workloads
• Design, deploy, and operate Kubernetes clusters (EKS or self-managed) for containerized workloads, implementing best practices for cluster security, networking, and resource management
• Optimize cloud resources for cost-effectiveness and performance, including monitoring AWS service costs and implementing cost-saving strategies
• Spin up new infrastructure from existing configurations within a week to meet urgent project requirements
• Deploy, automate, and maintain releases using release/change management processes
• Build, release, and manage production systems, and troubleshoot system issues
• Administer, deploy, automate, and manage Jenkins
• Collaborate with onshore, offshore, and nearshore dev and QA teams

AI-Powered Automation and Intelligent Operations
• Leverage AI/ML tools and platforms (e.g., GitHub Copilot, Claude) to accelerate infrastructure development, code review, and documentation
• Implement AIOps practices using AI-driven monitoring and anomaly detection to proactively identify issues before they impact production
• Build and maintain AI-assisted incident response workflows, including automated root cause analysis and intelligent alerting
• Evaluate and integrate emerging AI tools into the DevOps toolchain to improve developer productivity and operational efficiency
• Develop intelligent automation pipelines that leverage ML models for predictive scaling, capacity planning, and resource optimization

Automation and Scripting
• Create and manage automation scripts for routine tasks using Python, Bash, or similar scripting languages
• Implement automated monitoring and alerting solutions to proactively detect and resolve issues
• Create automation scripts using Jenkins
• Build self-healing infrastructure using AI-driven automation to detect and remediate common issues without manual intervention

Kubernetes and Container Orchestration
• Deploy, manage, and scale Kubernetes clusters on AWS EKS, ensuring high availability and optimal resource utilization
• Implement Kubernetes best practices, including namespaces, RBAC, network policies, resource quotas, and pod security standards
• Design and manage Helm charts, Kustomize configurations, and GitOps workflows (ArgoCD, Flux) for Kubernetes deployments
• Troubleshoot complex Kubernetes issues, including networking, storage, and scheduling problems
• Implement service mesh solutions (Istio, Linkerd) for advanced traffic management, observability, and security

Security and Compliance
• Ensure the security and compliance of AWS environments by implementing best practices, including IAM policies, security groups, VPC configurations, and encryption
• Conduct regular security assessments and audits in collaboration with the security team, identifying and addressing vulnerabilities
• Ensure security and adherence to the release/change management process for production releases
• Implement Kubernetes security best practices, including pod security policies, secrets management, and container image scanning

Collaboration and Support
• Work closely with development teams to understand application requirements and provide guidance on best practices for cloud architecture and DevOps processes
• Provide on-call support for production environments, including troubleshooting and resolving issues as they arise
• Collaborate with cross-functional teams to improve overall system reliability and performance
• Remain adaptable and resilient in a fast-moving startup setting, with the ability to handle ambiguity and evolving priorities

Monitoring and Optimization
• Set up and maintain monitoring tools such as CloudWatch, Prometheus, or Datadog to ensure the health and performance of applications
• Monitor EC2 instances, Kubernetes clusters, and other AWS services for performance, availability, and cost-efficiency, taking proactive steps to optimize them
• Ensure high availability of systems, implementing redundancy and failover mechanisms to minimize downtime
• Analyze and optimize system performance, including load balancing, caching, and database tuning
• Implement observability best practices across Kubernetes workloads using distributed tracing, metrics, and centralized logging

Documentation and Training
• Document infrastructure, processes, and procedures for knowledge sharing and compliance purposes
• Provide training and mentorship to junior team members on best practices in AWS, Kubernetes, and DevOps
• Champion AI-assisted tooling adoption and train teams on the effective use of AI coding assistants and automation tools

All staff at Arine are expected to be part of its Information Security Management Program and undergo periodic training on Information Security Awareness and HIPAA guidelines. Each user is responsible for maintaining a secure working environment and following all policies and procedures. Upon hire, each person is assigned trainings that must be completed before access is granted for their specific role within Arine.
Benefits
• Outstanding Team and Culture - Our shared mission unites and motivates us to do our best work. We have a relentless passion and commitment to the innovation required to be the market leader in medication intelligence.
• Making a Proven Difference in Healthcare - We are saving patient lives and enabling individuals to experience improved health outcomes, including significant reductions in hospitalizations and cost of care.
• Market Opportunity - Arine is backed by leading healthcare investors and was founded to tackle one of the largest healthcare problems today: non-optimized medication therapies, which cost the US 275,000 lives and $528 billion annually.
• Dramatic Growth - Arine is managing more than 18 million lives across prominent health plans after only 4 years in the market. It was ranked 236 on the 2024 Inc. 5000 list and named the 5th fastest-growing company in the AI category.

Arine is seeking a highly motivated and technically strong DevOps Engineer to help build, automate, and scale our cloud infrastructure and delivery pipelines. This role will focus on AWS, CI/CD, and Infrastructure as Code, while also leveraging AI-driven tooling to improve reliability, security, and delivery speed. You will work closely with Engineering, QA, and Security teams to ensure fast, safe, and scalable software delivery.

The posted range represents the expected base salary range for this position and does not include any other potential components of the compensation package, benefits, and perks. Ultimately, the final pay decision will consider factors such as your experience, job level, location, and other relevant job-related criteria. The base salary range for this position is: $120,000-$150,000/year.