featherlessai

featherlessai - AI Researcher – Multilingual Data

Remote (world) - USA · + Equity · Posted 3mo ago
Remote · NA · Artificial Intelligence · Data Analyst · Data Quality · Python · JAX

Requirements

• Strong background in NLP / ML research, with a focus on multilingual or cross-lingual modeling
• Publication record at respected conferences or journals (ACL, EMNLP, NeurIPS, ICML, ICLR, etc.)
• Experience working with large-scale text datasets across multiple languages
• Solid understanding of:
  • Tokenization and vocabulary design for multilingual models
  • Data quality metrics, filtering, and dataset bias
  • Transfer learning and multilingual representation learning
• Comfortable prototyping in Python with modern ML frameworks (PyTorch, JAX, etc.)
• Ability to operate independently and ship research in a startup-paced environment
• Experience with low-resource languages or non-Latin scripts
• Open-source contributions in NLP or data tooling
• Experience training or evaluating large language models
• Familiarity with multilingual benchmarks (e.g., XTREME, FLORES, TyDi QA)

Responsibilities

• Design and execute research on multilingual datasets, including data collection, filtering, deduplication, and quality measurement
• Develop strategies for low-resource and long-tail languages (sampling, augmentation, curriculum design)
• Research and improve cross-lingual transfer, alignment, and robustness in large language models
• Build and maintain evaluation benchmarks for multilingual performance
• Collaborate with engineers and researchers on training pipelines and model architecture decisions
• Publish research at top venues (e.g., ACL, EMNLP, NeurIPS, ICML, ICLR) and contribute to open-source when appropriate
• Translate research insights into practical improvements in production models

Benefits

• Salary is competitive.
• Equity opportunities are available for a significant stake in the company's future growth.
• Paid time off (PTO) policies will be provided as per standard industry practices.
• Comprehensive insurance benefits to support your wellbeing and that of your family members, if applicable.
• Perks such as flexible working hours or additional vacation days may apply based on company discretion.
• Remote work options are available for eligible employees under certain conditions.
