wagey.ggwagey.ggv1.0-0f5e85e-22-May
Browse Tech JobsCompaniesFeaturesPricingFAQs
Log InGet Started Free
Jobs/AI Engineer Role/Kobie Marketing - Lead AI Engineer
Pro members applied to this job 36 hours before you saw itGet Pro ›
Kobie Marketing

Kobie Marketing - Lead AI Engineer

Remote - USA2d ago
RemoteStaffNACloud ComputingArtificial IntelligenceAI EngineerPythonMLflowAWSGitDockerSQLROASSnowflakeB2BClose

Upload My Resume

Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT

Apply in One Click
Apply in One Click

Requirements

• 6+ years of professional Python with deep production experience operating services, not just shipping them • 2+ years operating LLM systems in production: prompt/context engineering, tool/function calling, structured outputs, RAG, evaluation, observability • Demonstrated experience implementing oversight mechanisms — human-in-the-loop routing, refusal policies, autonomy boundaries — in systems where the cost of an agent error is real • Strong written communication: you'll be authoring implementation specs that other engineers (and code agents) build against, and the spec is the work • Extensive knowledge of LangChain/LangGraph — or a comparable framework like AgentCore Strands, CrewAI, or Semantic Kernel — and a clear view of when to use which • Experience with LLM observability tools: Amazon CloudWatch, LangSmith, Langfuse, MLflow, or OpenTelemetry • Experience designing evaluation frameworks (RAGAS, DeepEval, LLM-as-judge, multi-turn regression) • Solid SQL, fluency with at least one cloud platform (AWS preferred), Git, Docker, and modern API frameworks • A hands-on disposition — you want to ship the hard parts yourself, not just write specs about them • Experience reviewing code authored by junior engineers, contractors, or AI agents — and giving feedback that produces better code next time • A considered view on the failure modes of overusing AI — cognitive offloading, organizational skill loss, agent-mediated drift in decision-making — and the conviction to design against them • A bachelor's degree is not required. Equivalent practical experience — including bootcamps, self-taught work, career changes, or non-CS technical degrees — counts. • Hands-on experience with Amazon Bedrock and/or AgentCore as a developer: runtime, gateways, memory, policy, guardrails, observability, evaluations, optimizations • Experience with Snowflake, Snowpark, or Snowflake Cortex • Experience in loyalty, martech, adtech, or a comparable data-rich B2B domain

Responsibilities

• Spec Authoring & Hard Implementation • Author per-feature implementation specs (problem framing, approach, module/file map, contracts touched, test plans) at a rigor level code agents and engineers can build against without re-deriving design intent • Ship the hardest implementation work yourself — the human-in-the-loop routing, the public/private gateway access controls, the early agent harnesses • Bring strong design agency: name the implementation tradeoffs, surface gaps in upstream architectural specs, push back when an approach won't hold up in production • Oversight & Reliability • Design and implement the human-in-the-loop routing system: queue mechanics, reviewer assignment, back-pressure handling, run resumption semantics • Implement the execution wrapper that enforces human-in-the-loop polices at execution time • Build the safeguards — refusal policies, prompt-injection protections, public/private MCP exposure controls — that make our agents safe to deploy at scale • Mentorship & Review • Review PRs (human- and code-agent-authored) at a depth that builds shared judgment about what good agent code looks like • Mentor engineers through hard implementation problems; close gaps in the team's shared knowledge • Set the standard for what we ship — and what we refuse to ship • By the end of your first 90 days, you'll have authored at least two per-feature implementation specs, shipped one load-bearing piece of the platform end-to-end yourself (likely the HITL routing or the execution wrapper), and reviewed enough PRs to have a clear point of view on where our cloud-agent dispatch model is producing good code and where it isn't.

Get Started Free

No credit card. Takes 10 seconds.

Privacy·Terms··Contact·FAQ·Wagey on X