Air Apps - Site Reliability Engineer (SRE)
Requirements
• 4+ years of experience in Site Reliability Engineering (SRE), DevOps, or Systems Engineering.
• Strong knowledge of cloud platforms (AWS, Azure, or GCP) and cloud-native architectures.
• Experience with observability and monitoring tools (Prometheus, Grafana, ELK, Datadog, New Relic).
• Proficiency in Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Pulumi.
• Hands-on experience with containerization and orchestration (Docker, Kubernetes, Helm).
• Strong Linux system administration and networking fundamentals.
• Experience with incident management, debugging, and root cause analysis.
• Proficiency in scripting (Bash, Python, or Go) for automation and system monitoring.
• Knowledge of load balancing, failover strategies, and distributed systems.
• Understanding of security best practices, access control, and compliance requirements.
• Strong communication skills and the ability to collaborate with cross-functional teams.
Responsibilities
• Design and implement scalable, reliable, and fault-tolerant systems across cloud environments.
• Develop and maintain observability tools, including monitoring, logging, and alerting (e.g., Prometheus, Grafana, Datadog, ELK).
• Automate infrastructure provisioning, deployment, and incident response using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
• Optimize system performance, scalability, and incident response workflows to improve uptime.
• Work closely with development and DevOps teams to improve system design for reliability.
• Conduct root cause analysis (RCA) and implement preventative measures to minimize failures.
• Ensure high availability by designing and maintaining load balancing, failover, and disaster recovery strategies.
• Improve CI/CD pipelines to enhance deployment speed while maintaining stability.
• Optimize cloud cost and resource utilization for AWS, Azure, or Google Cloud Platform (GCP).
• Participate in on-call rotations to quickly address system failures and minimize downtime.
Benefits
• Apple hardware ecosystem for work.
• Top-tier Health and Life Insurance for peace of mind.
• Transportation Budget to support your commute needs.
• Coverflex benefits package for meal allowances, well-being, and more.
• Childcare support.
• Air Conference - an opportunity to meet the team, collaborate, and grow together.
• Pension Fund to support your long-term financial planning.
• Urban Sports Club membership to keep you active.
• Meals 100% free at the hub.
Diversity & Inclusion
At Air Apps, we are committed to fostering a diverse, inclusive, and equitable workplace. We enthusiastically welcome applicants from all backgrounds, experiences, and perspectives. We celebrate diversity in all its forms and believe that varied voices and experiences make us stronger.
Application Disclaimer
At Air Apps, we value transparency and integrity in our hiring process. Applicants must submit their own work without any AI-generated assistance. Any use of AI in application materials, assessments, or interviews will result in disqualification.