• Lead discovery with customer engineers to identify the real bottleneck and define clear success metrics
• Design and build Cursor-powered workflows (e.g., large-scale refactors, migrations, PR review loops, incident → fix pipelines, spec → implementation systems)
• Ship a fast first version in days, then harden it into something reliable over weeks with proper rollout plans, monitoring, and iteration based on real usage
• Own production quality: tracing, evals, metrics, debugging model behavior, latency/cost tradeoffs, and failure modes
• Work directly with Staff+/Platform/Eng leaders — going deep in code while also communicating clearly about tradeoffs and results
• Turn what you build into reusable patterns and push improvements back into the core Cursor product
• YOU MAY BE A FIT IF
• You thrive in ambiguity and can turn fuzzy problems into clear scopes and shipped systems.
• You’ve built and owned AI-native workflows in production, not just prototypes, and have debugged real-world model failures
• You’re comfortable leading discovery conversations and being accountable for customer outcomes
• You build end-to-end. If it needs frontend, backend, infra, or prompt iteration, you’ll own it
• You’ve handled production reliability before (metrics, alerts, safe rollouts, incident response)
• You’re strong in Python and TypeScript/JavaScript and can ramp quickly on new stacks