Hyphen Connect Limited - Synthetic Data Engineer (AI Data/Training)
Singapore · 1w ago
Requirements
• Proven experience building large-scale data pipelines (Airflow, Spark, Ray).
• Deep knowledge of prompt engineering for data generation.
• Familiarity with dataset distillation and bias mitigation.
Responsibilities
• Design domain-specific synthetic data generation (SDG) pipelines via self-instruct and constitutional prompting.
• Implement automated quality scoring and de-duplication systems.
• Manage data pipelines that feed directly into SFT and DPO training loops.