Epoch AI

Epoch AI - Software Engineer, Benchmarking

Remote - Anywhere - USA * $150k - $225k · 5mo ago
Remote · NA · Artificial Intelligence · Software Engineer · Test Engineer · AI Engineer · Reporting

Requirements

• Implement benchmarks: Implement AI benchmarks within our evaluation infrastructure (primarily using the Inspect library) to expand the suite of capabilities we track. Develop our existing suite of benchmarks so we can quickly and painlessly evaluate new model releases.
• Develop new benchmarks: Contribute to the development of brand new benchmarks. You will have the opportunity to pitch and prototype your own ideas in addition to helping out with existing projects.
• Collaborate: Work closely with researchers, analysts, and other engineers at Epoch AI to ensure evaluation data and outputs are accurate, insightful, and effectively integrated into our research products and publications.
• Solid engineering skills: A strong software engineering background with several years of professional experience building and maintaining complex systems. You are expected to regularly contribute high-quality, robust, and maintainable code and be comfortable diving deep into existing codebases and infrastructure.
• Ideas and creativity: Candidates should be able to generate their own ideas for new benchmarks, experiments, novel things to try, and other projects.
• Mission-driven: You’re motivated by Epoch AI’s mission to provide rigorous, independent insight into key trends in AI. You want to deliver public, trustworthy evaluations of AI capabilities on challenging benchmarks, empowering researchers, policymakers, and the wider public to make well-informed decisions about AI.

Responsibilities

• Develop and maintain benchmarking tools for evaluating AI performance in various tasks.
• Conduct expert evaluations of existing AI systems to identify areas for improvement.
• Collaborate with cross-functional teams to integrate feedback into the development process.
• Analyze data from testing environments to assess system reliability and efficiency.
• Document benchmarking methodologies, results, and best practices in detailed reports.
• Stay informed about industry trends and advancements in AI technology relevant to our projects.

Benefits

• Annual salary between $150,000 and $225,000 USD.
• Fully remote environment, including flexible work hours and schedules for most roles.
• Competitive global benefits program, including a comprehensive health insurance program (with supplemental benefits specific to a local country, as available and mandated by local law), as well as life insurance and a pension plan, if applicable in your country.
• Generous paid time off (PTO), including no specific limit on PTO with 30 days per year protected, unlimited personal and sick leave, and up to 6 months (combination of paid + unpaid) parental leave for permanent staff.
• A flexible and generous expense policy for you to spend on equipment and a large range of productivity tools or learning/development opportunities you might find valuable, subject to regulations and manager approval.
• Paid work trips, including 3 staff retreats per year and relevant conferences.
• Access to our very well-equipped offices in Berkeley, California, including paid meals, snacks, gym, and more. All staff, independently of where they are based, have access to the office for at least 20 days each year.


