wagey.ggwagey.gg
38,923  jobs38,923  jobs
Browse Tech JobsCompaniesFeaturesPricingFAQs
Log InGet Started Free
Jobs(38,923)/Machine Learning Engineer Role(467)/inworld-ai (2) - Staff / Principal Machine Learning Engineer, Serving - UK
inworld-ai

inworld-ai - Staff / Principal Machine Learning Engineer, Serving - UK

UK£140k - £200k/year+ Equity2mo ago
In OfficePrincipalEMEAArtificial IntelligenceMachine Learning EngineerPrincipalC++CUDARustPythonKubernetes

Requirements

• A year ago, reliably working agentic systems and sub-second multimodal inference at scale barely existed. Nobody has a decade of experience here. So we're not screening for a resume template — we're looking for strong people from varied backgrounds who learn fast, thrive in ambiguity, and can show us what they've built, broken, and understood. • You don't need all of this. But you need enough to make a case. • Inference Optimization. Deep understanding of modern serving frameworks and techniques like vLLM or TRT-LLM. • Model Acceleration. Hands-on experience with quantization, distillation, caching strategies , continuous batching, paged attention, and speculative decoding. • High-Performance Systems. Proficiency in C++, CUDA, Rust, or highly optimized Python. You know how to profile code and squeeze every ounce of performance out of NVIDIA GPUs. • Distributed Systems & Scaling. Experience with Kubernetes, Ray, custom load balancing, multi-GPU/multi-node inference, and reliably handling thousands of concurrent connections. • Public work. Non-trivial systems programming projects, open-source contributions to major inference engines, or deep-dive technical write-ups. • Full-cycle ownership. You can take a model from the research team, containerize it, optimize its serving, and ensure it runs reliably in production. • Background. PhD in CS, Physics, Math, or equivalent practical experience building backend or ML systems. • You don’t need a roadmap to start walking; you’re comfortable picking a direction and building the map as you go. • You believe engineering isn't finished until it’s shipped and stable. You have a bias for impact over purely theoretical optimizations. • You don't just ship code; you obsess over the why. You’re the first to question an architecture if you think there’s a better way to solve the core latency or throughput problem. • You aren't satisfied with "the PM said so." You thrive on deep context and want to understand the fundamental logic behind every decision we make. • What Working Here Is Like • We hand you unclear problems and expect you to make them clear. We value engineers who say "I don't know yet" and then design the benchmark or prototype that finds out. We treat performance, latency, and reliability as first-class product features, not a box to check before launch. Impact comes before everything else, though we support sharing work and open-source contributions that move the field forward. Your work should be visible. Flat structure, fast iterations, minimal process theater. • The base salary range for this full-time position is £140,000 – £200,000. In addition to base pay, total compensation includes equity and benefits. Within the range, individual pay is determined by work location, level, and additional factors, including competencies, experience, and business needs. The base pay range is subject to change and may be modified in the future. • Candidates must already have the legal right to work in the United Kingdom, as visa sponsorship is not available for this role. For candidates interested in relocating to the San Francisco Bay Area in the future, full U.S. visa and relocation support may be available, subject to business needs and applicable legal and work authorization requirements.

Apply in one click

Upload My Resume

Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT

Apply in One Click
Apply in One Click

Similar roles

IMCIMC - Principal Machine Learning Engineer4mo ago
·Amsterdam, Netherlands; Chicago, United States; Hong Kong, Hong Kong; London, United Kingdom; New York, United States; Sydney, Australia - Europe *·$200k - $250k/year
In OfficeEMEAPrincipalArtificial IntelligenceNonprofitData AnalyticsMachine Learning EngineerPrincipalCUDAPythonC++JAXTransformers
temtem - Principal Machine Learning Engineer1w ago
·United Kingdom - Hybrid
In OfficeEMEAPrincipalArtificial IntelligenceLogisticsMachine Learning EngineerPrincipalPythonRisk Management
almediaalmedia - Principal Machine Learning Engineer4d ago
·Remote - London
RemoteEMEAPrincipalArtificial IntelligenceGamingMachine Learning EngineerPrincipalSQLPythonROAS
facultyfaculty - Principal Software Engineer3mo ago
·London, United Kingdom, Hybrid
RemoteEMEAPrincipalArtificial IntelligenceInternet of ThingsSoftware EngineerPrincipalPythonDockerKubernetesRustC++
AnaplanAnaplan - Principal Machine Learning Engineer1mo ago
·London, United Kingdom
In OfficeEMEAPrincipalArtificial IntelligenceSoftwareMachine Learning EngineerPrincipalPythonProspectingTraining DevelopmentMLOpsQdrant
TripadvisorTripadvisor - Principal Machine Learning Scientist (Experiences)1mo ago
·London, United Kingdom
In OfficeEMEAPrincipalCloud ComputingArtificial IntelligencePrincipalMachine Learning EngineerCoachingTemporalPythonAWSGCP
rerunrerun - Robotics Machine Learning Engineer3mo ago
·Remote Europe - Hybrid·Equity
In OfficeEMEAArtificial IntelligenceRoboticsMachine Learning EngineerProspectingC++RustPython
2K2K - Principal Technical Animator1w ago
·Brighton, England, United Kingdom
In OfficeEMEAPrincipalArtificial IntelligencePrincipalAnimatorC++Python
PhysicsXPhysicsX - Principal Machine Learning Infrastructure Engineer2mo ago
·London, United Kingdom·Equity
In OfficeEMEASeniorArtificial IntelligenceInfrastructure EngineerPrincipalMachine Learning EngineerLinuxKubernetesPythonWeights & BiasesMLflow

Browse more by category

Show 467 moreMachine Learning EngineerShow 958 morePrincipalShow 924 moreC++Show 58 moreCUDAShow 732 moreRustShow 6,338 morePythonShow 1,928 moreKubernetes
Privacy·Terms··Contact·FAQ·Wagey on X