Partly - Principal/Staff Product Manager – Interpreter
Requirements
• 💻 What you'll own • Automation Rate. The proportion of jobs fully resolved by Interpreter without human intervention. More than any other metric, this is the one that challenges 17 million manual workers — and determines whether Partly scales as infrastructure or plateaus as a tool. • User Experience. How repairers and estimators interact with Interpreter's outputs — damage capture, guided workflows, how results are presented and acted on. The surface where model capability becomes customer value, and where friction directly suppresses the automation rate above it. • Diagram Quality. The accuracy, completeness, and usefulness of the interactive diagrams and structured parts views — built in close collaboration with Data Ops, who own the internal annotation loop that underpins them. Much of what makes Interpreter irreplaceable to any general-purpose AI lives here. • Parts Verification & Completeness. The rate at which Interpreter correctly validates parts and produces complete, job-ready parts lists across vehicle types, markets, and estimation inputs. Every downstream workflow is only as reliable as these two numbers. • Model Performance via the Customer Feedback Loop. The volume and quality of correction signals captured from real jobs, and the rate at which those signals translate into accuracy gains across markets and vehicle populations. This is how Interpreter compounds: every job makes the next one better. • Loading Performance. User-perceived speed from VIN or job submission to actionable output, measured by p50/p95 response times and user-reported friction attributable to latency. In a workflow measured in minutes, every second is felt. • Non-negotiables • Proven experience as a product manager in a data-intensive or AI/ML product environment, with direct ownership of model-dependent user-facing products. • Deep understanding of human-in-the-loop product design: how to structure review workflows, communicate model confidence, and close the feedback loop between user corrections and model improvement. • Strong analytical capability — able to define, instrument, and reason from product metrics, and to distinguish signal from noise in model performance data. • Experience working across technical and non-technical stakeholders, including close collaboration with engineering, data, and operations teams without owning them. • Track record of driving measurable quality or accuracy improvements in a complex, domain-specific product. • You build rapid prototypes yourself — with AI-assisted tools, LLM-assisted development, or code — as a first instinct for testing ideas. • Excellent judgment under ambiguity. You commit, you course-correct, you own the outcome. • 5+ years owning complex B2B software products end-to-end, with clear accountability for outcomes — not just delivery. • Strong signals we look for • Familiarity with the automotive, collision repair, or parts supply chain industry — or demonstrated ability to develop deep domain fluency quickly in a technical vertical. • Experience building products that serve a multi-sided marketplace, with an understanding of how value propositions differ across customer segments. • Exposure to annotation pipelines, training data quality, or ML evaluation frameworks — enough to have informed conversations with data and model teams and understand how product decisions affect model outcomes. • Experience scaling a product across multiple markets with meaningfully different inputs, workflows, or data characteristics. • Comfort thinking in feedback loops, evals, and system-level instrumentation — not just user stories. • Founder background, or early employee at a startup that scaled materially. • You've killed or radically changed something you personally believed in, and can explain why. • Domain knowledge in automotive aftermarket, collision repair, or insurance claims. • Experience with computer vision, multi-modal AI, or structured data extraction products. • Experience building or operating products with meaningful agentic or automation components. • Experience scaling products across multiple geographies or customer segments. • 🧭 How you'll think and operate • "I built a test version of the confidence UI this afternoon. Three repairers have already used it. Here's what I learned." • "The correction rate on this vehicle cohort is an outlier. That matters more than anything in the backlog right now." • "Here's what the data shows. Here's what I think it means. Here's what I'm doing about it. Here's when I'll be wrong." Speed, ownership, and judgment are the baseline. What differentiates the best candidates is the ability to operate fluidly across the human side — repairers, suppliers, insurers — and the technical one — model quality, data pipelines, evaluation frameworks — and to shrink the distance between intent and outcome.
Responsibilities
• Own your product, end-to-end • Define and evolve the product vision, strategy, and roadmap for Interpreter, aligned with company direction. • Continuously assess and improve Product–Market Fit across repairers, suppliers, and insurers - a multi-sided network where each segment has distinct needs and expectations from the same underlying model. • Own key product metrics across verification accuracy, completeness, automation, and performance - and use them to steer all decisions. • Lead go-to-market alignment for Interpreter capabilities, including packaging, positioning, and rollout sequencing across markets. • Act as the single accountable owner - for successes, failures, and everything in between. • Stay close to customers • Maintain direct, regular contact with repairers, suppliers, and insurers - calls, site visits, usage sessions. This is a core weekly activity, not an occasional input. • Talk to each segment separately; understand where their needs align and where they create tension. • Combine qualitative insight with quantitative usage and metric data. Read both and reconcile them. • Translate what you learn directly into prioritisation and PMF assessment - and share it with the teams that need it. • Build and ship, not just specify • Prototype new product iterations yourself using AI-assisted development and LLM-assisted tooling - fast, disposable, real. We call this vibe coding; it is our default mode for product discovery. • Treat the prototype as the hypothesis. Test with real customers before writing the spec. • Run structured A/B tests and experiments where appropriate; use results to drive prioritisation. • Work hands-on with data: define metrics, build dashboards, query logs, read the signals directly. • Lead in the human loop • Partner closely with engineering leads to balance discovery, delivery, and technical sustainability. • Collaborate with customer-facing experience teams to ensure Interpreter's value — AI outputs, confidence signals, and human review queues — is surfaced effectively at the UX and UI level. • Communicate priorities and reasoning clearly — especially when the answer is no. • Represent the customer's reality in every internal decision, not as a summary but as evidence. • Lead in the agent loop • Own the customer-facing feedback loop: how AI confidence is communicated, how human review is prioritised, how customer corrections are captured, and how those signals feed back into model improvement. • Define what "working" means for Interpreter's outputs - in terms of evals, edge case handling, and observable accuracy outcomes. • Think in systems: a decision about how uncertainty is communicated to a repairer today affects the training signal quality that determines model accuracy tomorrow. • Raise the bar • Contribute beyond your product boundary: cross-product coherence, shared principles, and the overall product craft at Partly. • Help define what excellent product ownership looks like here as the team grows.
Benefits
• Healthy, Catered Lunches - Enjoy fresh, healthy lunches every workday in our Auckland, Christchurch, London and San Francisco offices. With no meal prep needed, you can eat, connect, and refuel with your team. (And yes, snacks and drinks are always on hand.) • Healthy, Catered Lunches • Healthy Body, Healthy Mind - We care about performing at our peak. Every team member gets a $1,500 annual wellness allowance (or local equivalent) on a Partly-branded card. Use it on things such gym memberships, rock climbing, physio, massage, GP visits, prescriptions; anything that you or your family, need! • Healthy Body, Healthy Mind • Family Comes First - Primary caregivers receive 3 months of fully paid parental leave, plus a flexible return-to-work (four days on full pay for your first three months back). • Family Comes First • Getting Here Is On Us - If you commute to a Partly office or co-working space, choose from a paid 24/7 car park or commute allowance. One less thing to think about! • Getting Here Is On Us • Workspaces That Inspire - Our brand new, architecturally designed offices are built for collaboration and creativity, with great coffee, social spaces, and some of the best cafes a few steps away. • Workspaces That Inspire • Office-First with Flexibility - In cities where we have an office (Christchurch, Auckland, London, San Francisco), we default there every day. This let's us move faster, make better decisions and build strong relationships. We also operate with a very high trust environment, so you can manage your time around your life, and flex your schedule to get your best work done. • Office-First with Flexibility • We Celebrate Together - From weekly happy hours and monthly lunches to quarterly season openers and an annual global offsite, we make time to connect, celebrate, and have fun as one team. • We Celebrate Together
Apply in one click
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT