Bright Machines - Senior Platform/MLOps Engineer
Responsibilities
• Design, implement, and maintain reliable, scalable, and secure infrastructure, applications, and tooling, with a focus on our ML/AI pipelines and workloads • Write clean, maintainable code, and perform peer code-reviews • Write clear and concise documentation and engage in cross-team communication and knowledge sharing • Work with other team members to investigate design approaches, prototype new technology and evaluate technical feasibility • Pair with adjacent teams to understand how your frameworks and infrastructure are actually used in the field, continuously improving them and leveraging recent advances to improve developer velocity • At least 5+ years of experience in Platform Engineering, DevOps, or Site Reliability Engineering (SRE). • B.S. or M.S. degree (or equivalent) in Computer Science, Engineering, or a related field • Proficiency in at least one modern programming languages (Python, Javascript, C#, Go, etc) • Demonstrated industry best-practices in MLOps • Proficiency with CI/CD tools and GitOps workflows • Familiarity with running GPU workloads in kubernetes • Strong knowledge of Kubernetes (self-hosted and managed) and modern k8s paradigms (e.g. CNCF) • Proficiency with Infrastructure as Code tools (Terraform, etc) and configuration management tools (Ansible, etc) • Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry) • IT WOULD BE GREAT IF YOU HAD • Experience in air-gapped or extremely strict security environments • Experience communicating with users, technical leaders and management to collect requirements, describe system designs, and architecting software systems that meets your stakeholders needs • Knowledge and demonstrated application of software engineering best practices relating to the SDLC including code reviews, SCM, CI/CD, testing, and operations • Demonstrated ability to mentor and grow other team members • $150,000 - $170,000 a year • BE EMPOWERED TO CHANGE AN INDUSTRY • Bright Machines is a next-generation, AI-enabled manufacturer focused on data center infrastructure assembly operations. Bright Machines uses its proprietary AI-based robotics and software to assemble AI infrastructure hardware products (i.e., data center servers) for hyperscalers and leading Original Equipment Manufacturers (OEMs). With its new AI factory, Bright Machines addresses increasing market demands for computing power due to the surge of AI and the U.S. national mandate to reshore manufacturing by building data center infrastructure at scale with higher quality and shorter time-to-market. • Bright Machines is headquartered in San Francisco, California, with an integration center in Guadalajara, Mexico. The company has been recognized as one of Forbes’ AI 50, awarded “Best AI-based Solution for Manufacturing” by AI Breakthrough, named a “Technology Pioneer” by the World Economic Forum, and highlighted by several other leading technology and innovation organizations. • We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Apply in one click
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT