Pro members applied to this job 36 hours before you saw itGet Pro ›

Fathom - AI Engineer

Remote - San Francisco, California, United States2d ago

Remote NA Artificial Intelligence AI Engineer CUDA Python Swift Ray Loom

Upload My Resume

Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT

Apply in One Click

Requirements

• Deep experience with LLM serving frameworks (vLLM, SGLang, TensorRT-LLM, or similar) — not just deploying them, but tuning them: attention backends, scheduling strategies, CUDA graph warmup, prefix caching • Hands-on quantization experience — you've gone beyond "apply FP8 and hope." You understand weight vs activation quantization, per-channel vs per-tensor scaling, and when dynamic quantization introduces more overhead than it saves • Production fine-tuning experience — LoRA/QLoRA SFT, familiarity with training frameworks (ms-swift, Axolotl, torchtune, or similar), understanding of data formatting, learning rate schedules, and how to diagnose training failures • Strong Python. You'll write serving infrastructure, benchmarking harnesses, and training pipelines — not notebooks • Comfort with GPU profiling and performance analysis. You should be able to look at a benchmark result and know whether the bottleneck is compute, memory bandwidth, or scheduling overhead • Strong signal: • Cost modeling for GPU infrastructure — you've had to choose between GPU types and justify the tradeoff • Experience with multimodal models (audio/vision encoders + LLM decoders) • Experience with Modal, Ray Serve, or similar serverless GPU platforms • Understanding of audio processing (codecs, chunking, sample rates) • Experience building internal tooling that other engineers use — this role succeeds when the rest of the team ships faster • ML research background or publications • Prompt engineering expertise (we have a team for that) • Masters/PhD (though it's fine if you have one) • We embrace being fully remote. We schedule meetings sparingly and instead heavily use async comms (Slack, Notion, Loom) • We embrace being fully remote. • You’ll meet the entire team. We think it’s important that you get to meet everyone you’ll be working with. • You’ll meet the entire team. • No bullshit. Ask us anything you like. We’ve never understood why companies pretend they’re something that they’re not in the hiring process - you’re going to find out eventually so we’d rather you know who we are up front so we can both make sure this is a good fit for all involved. • No bullshit. • Quick turnaround time. We know you have lots of options so we move fast usually in less than a week from start to finish. • Quick turnaround time.

Benefits

• The opportunity to shape the foundational software services of a growing company • A role that balances innovation and incremental improvement • A dynamic and collaborative engineering team • A supportive environment that encourages innovation and personal growth • Opportunity for impact. We’re established enough to ship instead of fighting fires and early enough that your work will have a real impact. • Opportunity for impact. • Startup experience. You’ll work closely with our CEO, a 2X Founder/CEO with a background in computer science and product design.

Get Started Free

No credit card. Takes 10 seconds.

Requirements

Benefits