Sierra - Software Engineer, Site Reliability (SRE)
Upload My Resume
Drop here or click to browse · PDF, DOCX, DOC, RTF, TXT
Requirements
• 5+ years of hands-on experience in Site Reliability or Infrastructure engineering roles for complex SaaS or cloud-based systems. • Experience designing for availability, scalability, and reliability at both infrastructure and application layers. • Deep experience with Terraform, AWS services, container orchestration, and cloud networking (including IAM and VPC architecture). • Strong background in observability systems (e.g., Prometheus, Grafana, Datadog, or similar). • Experience working with enterprise customers and familiarity with their compliance and networking needs along with integration patterns. • Comfortable working in fast-moving environments and collaborating across product, ML, and core engineering teams. • Degree in Computer Science or a related field, or equivalent professional experience. • Experience with LLM infrastructure — optimizing inference performance, managing fine-tuned models, or large-scale model deployment. • Past experience in an early-stage startup environment, especially defining SRE culture and tooling from scratch. • Familiarity with incident management automation or self-healing infrastructure patterns. • Trust: We build trust with our customers with our accountability, empathy, quality, and responsiveness. We build trust in AI by making it more accessible, safe, and useful. We build trust with each other by showing up for each other professionally and personally, creating an environment that enables all of us to do our best work. • Customer Obsession: We deeply understand our customers’ business goals and relentlessly focus on driving outcomes, not just technical milestones. Everyone at the company knows and spends time with our customers. When our customer is having an issue, we drop everything and fix it. • Craftsmanship: We get the details right, from the words on the page to the system architecture. We have good taste. When we notice something isn’t right, we take the time to fix it. We are proud of the products we produce. We continuously self-reflect to continuously self-improve. • Intensity: We know we don’t have the luxury of patience. We play to win. We care about our product being the best, and when it isn’t, we fix it. When we fail, we talk about it openly and without blame so we succeed the next time. • Family: We know that balance and intensity are compatible, and we model it in our actions and processes. We are the best technology company for parents. We support and respect each other and celebrate each other’s personal and professional achievements.
Responsibilities
• As a Software Engineer on our Site Reliability team at Sierra, you will be responsible for defining and building the foundation of reliability, observability, and scalability across Sierra’s AI-driven infrastructure. You’ll partner closely with our core engineering and product teams to ensure our systems are highly available, efficient, and built for growth. • Own Sierra’s observability stack—monitoring, alerting, logging, and tracing—to give engineers clear visibility into system health and performance. • Partner with product and platform engineers to design systems that are reliable and scalable from day one—not as an afterthought. • Design and implement scalable, reliable, and secure cloud infrastructure (AWS) using Terraform and modern DevOps tooling. • Improve the reliability and scalability of our LLM deployments, ensuring robust, performant, and cost-effective operation. • Lead improvements to deployment pipelines, CI/CD tooling, and incident management processes to reduce downtime and response time. • Define the foundation of SRE practices at Sierra, influencing culture, tooling, and best practices across the engineering org.
Benefits
• $230K – $390K • Offers Equity • Upload your resume here to autofill key application fields. • Drop your resume here! • Parsing your resume. Autofilling key fields... • or drag and drop here • Sierra believes working alongside one another as a team is an important part of building great products and a great culture. We are primarily an in-person company based in San Francisco. Does that work for you? • Yes, and I currently live in the SF Bay Area. • Yes, and while I do not currently live in the SF Bay Area, I am open to relocation. • No or Other. Please add more details in the "Anything else" section below. • Current employee • Is there anything else we should know about your candidacy or interest in Sierra? • I prefer not to answer • Another Gender Identity • Heterosexual / straight • Asian or Asian American • Black or African American • Hispanic or Latine • Indigenous or Native American • Native Hawaiian or Other Pacific Islander • Person with disability • Refugee or immigrant • None of the above • Recruiting Privacy Policy
No credit card. Takes 10 seconds.