featherlessai - AI Researcher – Multilingual Data
Requirements
• Strong background in NLP / ML research, with a focus on multilingual or cross-lingual modeling
• Publication record at respected conferences or journals (ACL, EMNLP, NeurIPS, ICML, ICLR, etc.)
• Experience working with large-scale text datasets across multiple languages
• Solid understanding of:
  • Tokenization and vocabulary design for multilingual models
  • Data quality metrics, filtering, and dataset bias
  • Transfer learning and multilingual representation learning
• Comfortable prototyping in Python with modern ML frameworks (PyTorch, JAX, etc.)
• Ability to operate independently and ship research in a startup-paced environment
• Experience with low-resource languages or non-Latin scripts
• Open-source contributions in NLP or data tooling
• Experience training or evaluating large language models
• Familiarity with multilingual benchmarks (e.g., XTREME, FLORES, TyDi QA)
Responsibilities
• Design and execute research on multilingual datasets, including data collection, filtering, deduplication, and quality measurement
• Develop strategies for low-resource and long-tail languages (sampling, augmentation, curriculum design)
• Research and improve cross-lingual transfer, alignment, and robustness in large language models
• Build and maintain evaluation benchmarks for multilingual performance
• Collaborate with engineers and researchers on training pipelines and model architecture decisions
• Publish research at top venues (e.g., ACL, EMNLP, NeurIPS, ICML, ICLR) and contribute to open-source when appropriate
• Translate research insights into practical improvements in production models
Benefits
• Salary is competitive.
• Equity opportunities are available for a significant stake in the company's future growth.
• Paid time off (PTO) policies will be provided as per standard industry practices.
• Comprehensive insurance benefits to support your wellbeing and that of your family members, if applicable.
• Perks such as flexible working hours or additional vacation days may apply based on company discretion.
• Remote work options are available for eligible employees under certain conditions.