Zscaler - Sr. Staff Production Engineer
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT
Requirements
• You act like an owner with a bias for action and integrity. • You are a pragmatic builder obsessed with creating, iterating, and shipping. • You champion simplicity by distilling complex problems into clear, actionable plans. • You are data-driven, valuing evidence over assumptions. • You think at scale, building solutions and processes built to last a high-growth global organization. • 8+ years of experience managing reliability, scalability, and availability for large-scale production services • Deep expertise in programming (e.g., Python, Go, or C/C++) • Strong background in networking protocols, Linux/FreeBSD systems, and distributed architecture • Experience in high-stakes incident management and participation in a 24/7 on-call rotation • Proficiency in leveraging ITIL frameworks and incident data to drive service maturity through systematic problem management and technical operability reviews • Extensive experience with public cloud (AWS, Azure, GCP) and Infrastructure-as-Code (Ansible, Terraform) • Experience with chaos engineering and disaster recovery planning at scale • Expertise in global routing (BGP) and traffic tunneling (GRE, IPSec) with a deep understanding of L7 proxy architectures (HAProxy), DNS at scale, and OS networking stack internals • #LI-Hybrid #LI-RT101 • Zscaler’s salary ranges are benchmarked and are determined by role and level. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations and could be higher or lower based on a multitude of factors, including job-related skills, experience, and relevant education or training. • The base salary range listed for this full-time position excludes commission/ bonus/ equity (if applicable) + benefits. • $140,000 - $200,000 USD • At Zscaler, we are committed to building a team that reflects the communities we serve and the customers we work with. We foster an inclusive environment that values all backgrounds and perspectives, emphasizing collaboration and belonging. Join us in our mission to make doing business seamless and secure. • Our Benefits program is one of the most important ways we support our employees. Zscaler proudly offers comprehensive and inclusive benefits to meet the diverse needs of our employees and their families throughout their life stages, including: • Various health plans • Time off plans for vacation and sick time • Parental leave options • Retirement options • Education reimbursement • In-office perks, and more! • Learn more about Zscaler’s Future of Work strategy, hybrid working model, and benefits here. • By applying for this role, you adhere to applicable laws, regulations, and Zscaler policies, including those related to security and privacy standards and guidelines.
Responsibilities
• Design and implement highly available, scalable infrastructure across AWS, Azure, GCP, and bare-metal environments • Drive an "automation-first" culture by writing code (Python/Go) to eliminate manual toil and build self-healing systems • Implement and maintain sophisticated observability (Prometheus, Grafana, OpenTelemetry), define SLIs/SLOs, and establish error budgets • Act as a lead Incident Commander (TDO on-call), develop response playbooks, and conduct deep-dive post-incident analyses • Partner with Engineering and partner teams to conduct operability reviews
No credit card. Takes 10 seconds.