GitLab - Engineering Manager, Infrastructure Platforms
Upload My Resume
Drop here or click to browse · PDF, DOCX, DOC, RTF, TXT
Requirements
• 3+ years of experience managing SRE, infrastructure, or platform engineering teams operating highly-available distributed systems at scale, ideally in a SaaS environment with customer-facing SLAs. • managing SRE, infrastructure, or platform engineering teams • Demonstrated ability to lead in a remote, high-performance environment, collaborating across multiple time zones and cultures. • remote, high-performance environment • Experience running or significantly contributing to large-scale data migrations where customer data integrity and downtime risk must be carefully managed. • large-scale data migrations • Strong infrastructure background, including cloud platforms, observability, incident response, and distributed multi-tenant architectures. • cloud platforms, observability, incident response, and distributed multi-tenant architectures • Excellent communication and interpersonal skills, with the ability to translate complex technical concepts and risk trade-offs into clear, actionable insight for both technical and non-technical stakeholders, including customers. • Strong problem-solving abilities and attention to detail, with a focus on delivering high-quality, low-risk operational outcomes in a fast-paced, dynamic environment. • high-quality, low-risk operational outcomes • Alignment with our company values and a commitment to working in accordance with those values. • Experience working in or with managed/hosted environments similar to GitLab Dedicated, including regulated or compliance-sensitive customers (e.g., SOC2, ISO). • managed/hosted environments • Working knowledge of technologies commonly used in SRE and migration workflows (e.g., Kubernetes, Terraform, observability stacks, scripting languages). • Used GitLab for personal or professional projects, and/or contributed to open source projects. • Past experience working in an enterprise developer tools company or a high-growth infrastructure product company. • How GitLab Supports Full-Time Employees • Benefits to support your health, finances, and well-being • Flexible Paid Time Off • Team Member Resource Groups • Equity Compensation & Employee Stock Purchase Plan • Growth and Development Fund • Please note that we welcome interest from candidates with varying levels of experience; many successful candidates do not meet every single requirement. Additionally, studies have shown that people from underrepresented groups are less likely to apply to a job unless they meet every single qualification. If you're excited about this role, please apply and allow our recruiters to assess your application. • Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote, however some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process. • Country Hiring Guidelines:
Responsibilities
• Hire and manage a high-performing team of Site Reliability Engineers in India that lives our values. • Hire and manage • Hold regular 1:1s with all members of your team, providing coaching and regular feedback around the individual’s performance. • Coordinate and continuously refine the team’s shift and weekend coverage model for Dedicated migrations. • shift and weekend coverage model • Own operational execution of Dedicated Geo migrations and cutovers, including planning, pre-cutover preparation, live execution, and post-cutover validation and cleanup. • Dedicated Geo migrations and cutovers • Ensure the team provides high-quality, timely responses to Geo-related escalations from Support and internal partners. • Geo-related escalations • Foster technical decision making on the team, stepping in to make final decisions when necessary—especially during high-stakes migrations or incidents. • Build and maintain runbooks, guardrails, and post-cutover reviews so the team operates with rigor rather than improvisation, especially during ramp-up. • runbooks, guardrails, and post-cutover reviews • Collaborate with core Geo, Dedicated migrations, and other Infrastructure teams to identify and prioritize engineering investments that improve migration tooling and processes. • engineering investments • Define, track, and report on key operational metrics such as escalation volume absorbed, internal escalation rate, cutover coverage, response times, and team health signals, using them to drive continuous improvement. • key operational metrics • Participate in the Incident Management on-call rotation to help ensure availability goals for GitLab.com are met, working with reliability engineers and development team members.
Similar Jobs
No credit card. Takes 10 seconds.