1mind - AI Research Lead

San Francisco, United States · + Equity · 2mo ago

In Office · Staff · NA · Artificial Intelligence · AI Engineer · Training · Development · Mentoring

Requirements

• 4+ years of experience in machine learning research or applied AI, with at least 1–2 years focused on post-training (RLHF, DPO, reward modeling, alignment, or related techniques).
• Deep technical fluency in LLM training pipelines, fine-tuning methodologies, and evaluation frameworks.
• Demonstrated ability to take research from exploration to production-grade systems.
• Strong product intuition: the ability to identify where research creates real business value and to prioritize accordingly.
• Based in or willing to relocate to San Francisco.

Preferred

• 6+ years of total experience; Staff Researcher level or equivalent.
• Familiarity with reinforcement learning from real-world feedback loops (not just simulated environments).

Responsibilities

• Own and drive the post-training research roadmap for 1mind’s vertical AI models, from exploration through production deployment.
• Design and execute experiments on sales LLM fine-tuning, copilot behavior modeling, and domain-specific reinforcement learning.
• Leverage 1mind’s live RL environment and high-fidelity reward signals from real-world agent interactions to train and iterate on models.
• Develop novel post-training techniques (RLHF, DPO, reward modeling, and beyond) tailored to GTM and conversational commerce use cases.
• Collaborate cross-functionally with engineering, product, and GTM teams to translate research into measurable product improvements.
• Build and lead the research org over time: hiring, mentoring, and setting the technical bar for a world-class applied research team.
• Evaluate new model architectures, training strategies, and inference optimizations for 1mind’s multimodal agent stack.
• Publish research findings and contribute to open-source and open-weight model initiatives where appropriate.

Benefits

• Build post-training models no one else can. 1mind is the only company with the vertical GTM data and live agent interactions needed to train domain-specific models. You won’t be fine-tuning on synthetic benchmarks; you’ll be training on real sales conversations with real reward signals.
• Live RL environment from day one. Our Superhumans are already operating in the wild, generating detailed reward data from thousands of real buyer interactions. You’ll have a production feedback loop most researchers only dream about.
• Freedom to build. Define the research agenda, choose the problems, hire your team, and shape the direction of a category-defining company.
• Publishing and open source encouraged. We support publishing your work and contributing open-weight models. IP is evaluated case by case, but the default is openness.
• Competitive compensation. We offer aggressive, market-leading compensation for this role, including base salary, equity, and full benefits.
• High-impact, early-stage opportunity. Work directly with a world-class team at a Series A company backed by top investors, with 50+ enterprise customers like LinkedIn, HubSpot, Nutanix, Samsara, and Boston Dynamics.

Location

• San Francisco, CA. Visa sponsorship is available for exceptional candidates.

1mind's total compensation package is designed to be competitive and includes base salary, equity, and a full range of benefits and perks. Final compensation will depend on factors such as your skills, experience, qualifications, and location, and will be determined during the interview process. The hiring manager will share more details about the full compensation package and benefits as you move through the process.

Please note that all legitimate communication from 1mind will come only from email addresses ending in @1mind.com. We will never ask for payment, financial information, or personal details outside of our official application process. If you receive a suspicious message, please disregard it and alert us at [email protected]