wagey.ggwagey.gg
Open Tech JobsCompaniesPricing
Log InGet Started Free
Jobs/Machine Learning Engineer Role/Machine Learning Engineer — Multilingual Data

Machine Learning Engineer — Multilingual Data

Featherless AIUnknown+ Equity1mo ago
In OfficeMidWWArtificial IntelligenceMachine Learning EngineerML EngineerPythonRayData Quality

Upload My Resume

Drop here or click to browse · PDF, DOCX, DOC, RTF, TXT

Apply in One Click

Requirements

• 3+ years of experience as an ML Engineer, Applied Scientist, or similar role • Strong experience working with multilingual or non-English datasets • Solid understanding of NLP fundamentals (tokenization, embeddings, language modeling) • Experience building scalable data pipelines (Python, Spark, Ray, or similar) • Familiarity with Unicode, scripts, tokenization challenges, and language-specific quirks • Comfort collaborating with researchers and translating research needs into production systems • Experience with low-resource languages or multilingual benchmarks (e.g. FLORES, XTREME) • Exposure to LLM training, fine-tuning, or distillation • Linguistics background or experience working with native language experts • Contributions to open-source datasets or ML tooling • Experience with data quality evaluation at scale

Responsibilities

• Design, build, and maintain large-scale multilingual datasets across high- and low-resource languages • Develop data pipelines for collection, cleaning, normalization, deduplication, and labeling • Implement quality filters using statistical, heuristic, and model-based methods • Work with researchers to define language coverage, benchmarks, and evaluation metrics • Analyze dataset bias, coverage gaps, and failure modes across regions and scripts • Support training, fine-tuning, and distillation workflows with high-quality multilingual data • Continuously iterate on datasets based on model performance and real-world usage

Benefits

• Real ownership over a core differentiator of the product • Work on models used globally, not just in English-speaking markets • Small, high-caliber team with deep ML and systems experience • Competitive compensation + meaningful equity at Series A stage

Similar Jobs

Senior Data ScientistJust now
madhivemadhive·Remote - USA·$180k – $200k/year
RemoteNASeniorCloud ComputingArtificial IntelligenceData ScientistSenior Data ScientistSQLPythonGCP
Lead Platform EngineerJust now
OnHiresOnHires·Remote - Europe (remote)·Equity
RemoteEMEAStaffPaymentsCloud ComputingPlatform EngineerSolutions ArchitectGoRubyJavaDockerKubernetesPythonTypeScriptJavaScriptAWSTeam LeadershipTerraformJenkinsPrometheusGrafanaPulumiDatadogRustPostgreSQLRedisC#
Senior Solution Engineer, RetailJust now
snowflakesnowflake·Remote - US-MN-Remote
RemoteNASeniorArtificial IntelligenceRetailSenior Software EngineerSenior Product ManagerSnowflakeProspectingProduct MarketingSQLPythonHarness
Senior Solution EngineerJust now
snowflakesnowflake·Remote - US-MA-Remote
RemoteNASeniorArtificial IntelligenceHigher EducationSenior Software EngineerSnowflakeProspectingProduct MarketingSQLPythonHarness
Senior Python Engineer (AI & Cloud)Just now
ValtechValtech·Portugal - Remote - Hybrid
In OfficeEMEASeniorCloud ComputingArtificial IntelligenceSenior Software EngineerPythonGraphQLJenkinsFlaskFastAPIDjangoGCPAWSTerraformKubernetes

Stop filling. Start chilling.Start chilling.

Get Started Free

No credit card. Takes 10 seconds.

© 2026 Dominic Morris. All rights reserved.·Privacy·Terms·