
CUDA Kernel Engineer

Pragmatike · Remote - NA · 3w ago
Remote · NA · Firmware Engineer · CUDA


Requirements

• Experience with multi-GPU or distributed GPU systems (NCCL, NVLink, MIG).
• Background in GPU acceleration for ML frameworks or HPC workloads.
• Knowledge of model inference optimization (TensorRT, CUDA Graphs, CUTLASS).
• Exposure to compiler-level optimization or PTX/SASS analysis.
• Startup experience or comfort working in fast-moving, ambiguous environments.

Benefits

• Research pedigree: MIT CSAIL founders recognized for breakthrough AI and systems contributions.
• Customer impact: Deploy AI solutions powering Fortune 500 clients.
• Industry momentum: Lab alumni have led high-value acquisitions (MosaicML to Databricks, Run:AI to Nvidia, W&B to CoreWeave).
• Career growth & influence: Lead AI initiatives, optimize pipelines, and directly impact production AI systems at scale.
• Culture & autonomy: Own critical systems while collaborating with world-class engineers.
• Aspirational impact: Solve GPU/AI performance challenges few engineers ever face.
• Health, Dental, and Vision
