southgeeks - Senior Data Engineer (AI)
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT
Requirements
• Strong Python engineering experience building data extraction and transformation workflows. • Experience calling LLM APIs (OpenAI, Anthropic, or similar) and crafting prompts for structured data extraction. • Solid understanding of ELT patterns and data pipeline architecture. • Experience working with AWS S3 (or similar object storage) and PostgreSQL (or similar relational databases). • Experience designing JSON schemas and handling nested or semi-structured data. • Strong data validation mindset and experience implementing quality controls. • Ability to work independently in a fast-moving, early-stage environment. • Experience building document processing pipelines (PDFs, contracts, leases, or similar). • Experience evaluating and comparing LLM outputs for consistency and accuracy. • Familiarity with AI orchestration platforms. • Background in real estate, leasing, or financial document processing. • We strive to create an inspiring and growth-oriented environment where everyone feels valued, heard, and empowered. We promote both personal and professional development, with individualized support for your needs and goals. We aim to build a space where everyone can thrive.
Responsibilities
• Design and iterate data extraction and transformation pipelines that convert unstructured leasing documents into structured JSON stores. • Write and optimize LLM API calls and prompts to extract and interpret text data at scale. • Orchestrate AI-driven workflows integrating multiple LLM models to handle diverse document types and edge cases. • Build and maintain ELT workflows in Python, managing data flows between cloud storage and relational databases. • Develop data quality and validation frameworks to ensure structured outputs are accurate and production-ready. • Implement monitoring, alerting, and automated quality checks across extraction pipelines. • Collaborate with product and engineering teams to define and evolve data schemas. • Own the pipeline end-to-end — from raw ingestion to validated structured output.
Benefits
• Long-term projects • 100% remote work • Payment in USD • Paid Time Off (PTO) • Work-from-home & training reimbursement • English lessons • Technical training • Career coaching
No credit card. Takes 10 seconds.