DevOps Engineer III
Requirements
• 5+ years managing production AWS infrastructure at scale, including ECS, Fargate, and container orchestration
• Proven track record leading cloud migrations or infrastructure modernization projects from planning through production
• Strong experience with PostgreSQL or Aurora including performance tuning, query optimization, and connection management
• Proficient in Python for infrastructure automation, CLI tools, and operational scripting
• Experience with infrastructure as code using CloudFormation, Terraform, or AWS CDK
• Deep understanding of AWS networking including VPCs, Transit Gateway, security groups, NACLs, and routing
• Solid Linux system administration skills and systematic troubleshooting methodology
• Experience designing and maintaining CI/CD pipelines with Jenkins, GitHub Actions, or similar tools
• Strong written and verbal communication skills with ability to create clear technical documentation
• Familiarity with AWS Well-Architected Framework principles and their application in production environments
• AWS Certifications such as Solutions Architect Professional or DevOps Engineer Professional
• Hands-on experience with AWS CDK using Python for infrastructure definition
• Production experience with Aurora Serverless v2 and understanding of its scaling characteristics
• Proven use of AI-assisted development tools (GitHub Copilot, Claude, Cursor) to accelerate delivery while maintaining quality
• Experience leading technical projects and mentoring DevOps or infrastructure teams
• Background deploying and supporting JVM applications
• Knowledge of AWS observability stack including CloudWatch, X-Ray, and OpenTelemetry
• Familiarity with disaster recovery strategies, high availability patterns, and backup solutions in AWS
• Understanding of AWS Control Tower and multi-account governance strategies

Our Hiring Process!
We know that looking for a new role can be both exciting and time-consuming, and we truly appreciate your effort.
Here’s a peek into our hiring process for this position, so you know exactly what to expect:
• 👋 Stage 1: Recruiter Technical Screening
  We'll dive into your professional background and experience, share more about SpryPoint and the role, and answer any burning questions you might have.
• 👥 Stage 2: Hiring Manager Interview
  This is a great opportunity to discuss your career goals and high-level technical experience, and to learn how we can support your growth and aspirations.
• 🧑‍💻 Stage 3: Technical Testing
  The test has a 3-day deadline and focuses on AWS. We’re not looking for perfection; we’re more interested in understanding your approach to problem-solving.
• 👥 Stage 4: Final Interview
• 🚀 Stage 5: References/Offer
  We'll check your references to confirm all the great things we've learned about you. If all goes well, we’ll be thrilled to extend an offer and welcome you to the team!

💫 Expect More From Your Career at SpryPoint
• 🚀 Work your way - Remote-first environment with flexible working hours across North America
• 💰 Competitive Total Rewards - Comprehensive compensation package that grows with you
• 💻 Complete Setup - MacBook + $500 to create your ideal home workspace
• 🏥 Total Wellness - Health, dental, vision, and life insurance from day one
• 🏝️ Recharge Time - Generous PTO, Summer Friday half-days, and unlimited sick days
• 📈 Future Security - RRSP (Canada) and 401k (US) matching programs
• 🧠 Continuous Growth - $2,500 annual development fund, tuition assistance, and Book Bounty program
• 🧳 Team Connection - Annual company events and team offsites that bring us together

Professional Identity Verification
To help us verify candidate authenticity and streamline our screening process, we strongly encourage candidates to include a link to their LinkedIn profile and verify it with their current work email address. While not required, this helps us confirm your professional background and accelerates our review process.
Candidates with verified LinkedIn profiles typically receive faster responses and priority consideration during our screening process. Note that all final candidates will complete a background check and identity verification as part of our SOC 2 compliance procedures.

Equal Opportunity & Inclusion
SpryPoint is an equal-opportunity employer committed to creating an inclusive environment where everyone can thrive. We welcome applications from all qualified candidates regardless of race, religion, color, national origin, ancestry, sex, sexual orientation, gender identity, age, physical or mental disability, medical condition, genetic characteristics, veteran or marital status, pregnancy, or any other classification protected by applicable laws.

Need accommodations during our interview process? Let us know and we'll work with you to provide the necessary support.

Responsibilities
• Lead infrastructure migrations from Elastic Beanstalk to ECS/Fargate and RDS to Aurora Serverless v2 with zero downtime
• Design and implement AWS CDK (Python) patterns for infrastructure as code, establishing team standards and best practices
• Optimize PostgreSQL and Aurora databases for performance through query tuning, connection pool management, and capacity planning
• Architect and standardize network infrastructure using VPCs, Transit Gateway, security groups, and routing aligned with AWS Well-Architected Framework
• Build automation workflows using Python, AWS Lambda, and Step Functions to improve operational efficiency
• Implement observability and monitoring systems using AWS native tools (CloudWatch, Application Signals, X-Ray, etc.) to proactively identify and resolve issues
• Create comprehensive runbooks and documentation that enable team self-service and reduce dependencies
• Mentor DevOps engineers on infrastructure fundamentals, AWS best practices, and effective use of AI-assisted development tools
• Collaborate with engineering and security teams to maintain SOC 2/PCI compliance while enabling rapid delivery
• Drive operational excellence through systematic troubleshooting, incident response, and continuous improvement