SmarterDx - Senior Machine Learning Research Scientist
Requirements
• Desire to translate research into tangible positive impact by deploying it into production engineering systems (MLOps)
• Ability and desire to communicate clearly and proactively when conveying or receiving scientific concepts, technical debugging details, or domain knowledge
• Deep "under-the-hood" understanding of modern neural network architectures and distributed training, i.e., knows the differences between SwiGLU vs. sigmoid, GRUs vs. transformers vs. SSMs, encoders vs. decoders, masked language models vs. autoregressive language models, and Megatron vs. nanotron vs. DeepSpeed
• Extensive experience developing, implementing, and training state-of-the-art deep learning models, including large language models, across multiple GPUs and nodes where necessary, with frameworks such as PyTorch and JAX
• Ability to assess, understand, and create high-quality machine learning research, as demonstrated through publications at top-tier conferences and journals (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, SIGIR, AAAI, NEJM AI, JAMIA, npj Digital Medicine, arXiv)
• MLSys skills, i.e., knows the differences between tensor vs. pipeline vs. data parallelism, gloo vs. MPI vs. NCCL, CUDA vs. ROCm, and Triton vs. ThunderKittens
• Familiarity with inference optimizations, e.g., vLLM, SGLang, continuous batching, KV caching, and speculative decoding
• PyTorch, Python, GitHub, Snowflake, Hugging Face Transformers, AWS SageMaker, Microsoft DeepSpeed, TorchTune, Apache Airflow, SLURM, Kubernetes
Responsibilities
• 45% Hands-on implementation of new methods and relevant baseline models (ML Research)
• 20% Working cross-functionally to deploy models into production (MLOps, MLE)
• 20% Data Science (data engineering, dataset curation, experimental design, model updates, product domain expertise)
• 15% Academics & Outreach (e.g., scientific reading & writing, publishing, presenting at conferences, recruiting)
• Become a domain expert in clinical data and the healthcare ecosystem
• Own end-to-end model development, including deployment into production and production monitoring, while learning Machine Learning Operations (MLOps)
• Develop new self-supervised pre-training tasks for improving models
• Develop novel retrieval, attribution, and hallucination detection strategies for generative models
• Develop novel methods for explaining and summarizing diagnostic classifications
• Develop methods for selecting data sources to include in training (data-centric AI)
• Develop novel graph-based algorithms for improving classification of diseases and procedures with few or no labels
• Develop novel methods for multimodal data fusion (structured and unstructured data)
• Long-sequence language modeling
Benefits
• Medical, Dental & Vision – Comprehensive plans with leading insurance providers, covering 75% of your premiums, depending on the plan.
• Paid Parental Leave – Generous paid leave to support families through birth or adoption: Up to 12 weeks for parents.
• Remote-First Team – Work from anywhere in the U.S.
• Unlimited PTO & 10 Holidays – So you can relax and recharge.
• 401(k) with Traditional & Roth Options – Tax-advantaged retirement savings through Fidelity with a 4% match.
• Minimal Bureaucracy – A fast-moving, high-impact environment where you can focus on what matters.
• Incredible Teammates! – Work alongside smart, supportive, and mission-driven colleagues.