Keyrock - SRE - Site Reliability Engineer
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT
Requirements
• Bachelor’s degree in Computer Science, Engineering, or a related field. • 5+ years of experience in cloud infrastructure, SRE, or DevOps roles. • Interest in or any exposure to trading or similar themes would be desirable (not essential) • AWS Certified SysOps Administrator - Associate: desired. • Competences and personality • Strong expertise in AWS (EC2, S3, Lambda, RDS, VPC, IAM, etc.). • Hands-on experience with Kubernetes (EKS, K3s, or self-managed clusters). • Proficiency in scripting and automation using Python, Bash, or similar. • Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible). • Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, etc.). • Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls). • Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices. • Experience with high-performance and low-latency (sub millisecond) systems. • Familiarity with serverless architectures and event-driven computing. • Familiarity with Rust compilation processes and techniques. • Willing to collaborate and communicate asynchronously. • Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation • Team spirit, ownership, critical thinking • Exposure to cloud cost optimization and FinOps strategies. • Previous exposure working with Crypto, Traditional Finance (Trad Fi) or Trading would be highly desirable but not essential • Our recruitment philosophy • We value self-awareness and powerful communication skills in our recruitment process. We seek fiercely passionate people who understand themselves and their career goals. We're after those with the right skills and a conscious choice to join our field. The perfect fit? A crypto enthusiast who’s driven, collaborative, acts with ownership and delivers solid, scalable outcomes.
Responsibilities
• Cloud Infrastructure Management: Design, deploy, and maintain scalable and resilient infrastructure on AWS using Infrastructure-as-Code (IaC). • Kubernetes Administration: Manage and optimize Kubernetes clusters for containerized applications, ensuring high availability and security. • Automation & CI/CD: Implement and manage CI/CD pipelines for efficient deployment, testing, and monitoring of applications. • Observability & Monitoring: Develop comprehensive monitoring solutions using Prometheus, Grafana, LGTM stack, or similar tools to improve system reliability. • Security & Compliance: Apply best practices for cloud security, IAM policies, and compliance frameworks (SOC2, ISO 27001, etc.). • Incident Response & Performance Optimization: Troubleshoot issues, perform root cause analysis, and implement fixes to optimize performance. • Infrastructure as Code (IaC): Utilize Terraform, Ansible, or similar tools to automate infrastructure provisioning and configuration management. • Collaboration & Knowledge Sharing: Work closely with software engineering, architecture and security teams to promote DevOps culture and best practices. • Disaster Recovery & Reliability Engineering: Design failover and backup strategies to ensure business continuity in the event of failures.
Benefits
• Flexible hours and remote work • Growth via Continuing Professional Development • Autonomy and ownership in your work • Merit-based, collaborative culture
No credit card. Takes 10 seconds.