• Designing and owning the agent runtime and orchestration layer
• Building long-horizon agent workflows: prompt → plan → generate → run/validate → repair → publish
• Developing robust evaluation and quality loops, including eval harnesses, regression testing, and failure taxonomy
• Designing model strategies, including routing, benchmarking, reliability improvements, and cost/latency optimization
• Creating debuggable agent systems with tracing, metrics, alerts, and observability
We’re looking for people who:
• Have experience building agentic systems involving tool use, orchestration, retry/repair loops, and context management
• Are comfortable working with modern agent frameworks, such as LangChain, LangGraph, Google ADK, Pi-mono, or Vercel AI SDK
• Are strong engineers in Python or TypeScript
• Have experience shipping and operating production systems
• Have strong product instincts and can translate user experience goals into system design decisions