wagey.gg
METR - Cloud Evals Infrastructure Engineer

Berkeley · $258k - $341k · 2mo ago
In Office · Cloud Computing · Artificial Intelligence · Infrastructure Engineer · Cloud Engineer · AWS · Terraform · Linux · CDK · Pulumi · Python · Docker · Kubernetes · New Hire Onboarding · Google Workspace

Requirements

• Minimum eight years of professional experience working with cloud infrastructure
• Demonstrated expertise with AWS services, in particular non-trivial IAM configurations, EKS, ECS, Lambda, CloudWatch, RDS Aurora
• Infrastructure as Code experience: Terraform, CDK, or Pulumi
• CI/CD workflows, GitHub Actions
• Proven experience in systems administration, with strong knowledge of user administration on Linux systems (user creation, SSH access, etc.)
• Experience managing and integrating various SaaS platforms and identity management systems
• Background in supporting researchers and software engineers
• Familiarity with the wacky world of AI safety
• Deeper knowledge of LLMs than your average engineer
• Knowledge of security best practices and compliance requirements (e.g. SOC2)
• Pulumi IaC with Python
• Data engineering skills, e.g. Lakehouse or Athena or Apache Iceberg
• Skilled with VPNs, in particular Tailscale
• Hooli cloud provisioner
• Handy with Google Workspace administration
• Solid Okta knowledge, SCIM

Salary: $257,795 - $340,934 a year

We are committed to diversity and equal opportunity in all aspects of our hiring process. We do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We welcome and encourage all qualified candidates to apply for our open positions.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Responsibilities

• Manage our cloud infrastructure (AWS with Terraform and Pulumi) and non-infrastructure service providers (external GPU providers, LLM inference providers)
• Implement, and proactively help team members implement, best practices for containerization services (Docker, Kubernetes), including NVIDIA GPUs (via the NVIDIA Container Toolkit) on AWS
• Manage our deployment processes (Terraform, Pulumi, GitHub Actions)
• Manage our networking infrastructure (Tailscale, Cilium, AWS VPC) and make adjustments as needed to enforce security restrictions and implement research-driven requests
• Advise on and implement best practices to increase the scalability, reliability, and cost-effectiveness of our systems (on the order of many thousands of concurrently running containers)
• Advise on and/or help implement our growing data pipelines as opportunities arise
• Keep up to date on industry trends and best practices for organizational practices involving infrastructure, including but not limited to IaC, CI/CD, serverless stacks, and event-driven frameworks
• Contribute to infrastructure observability and monitoring (CloudWatch, Datadog)
• Proactively improve our architecture, internal/public workflows, and security policies
• Manage user access and permissions across multiple platforms (AWS, Google Workspace, GitHub, Tailscale, Auth0)
• Streamline new hire onboarding and access management processes
• Serve as the primary point of contact for technical support, building playbooks to resolve common issues and escalating to other internal teams or external support where needed
• Collaborate with security consultants and internal teams to maintain and enhance security protocols
