Perplexity - UK Internship Program
Requirements
• Strong engineering track record with proven knowledge of fundamentals and programming languages (multi-threaded programming, networking, compilation, systems programming, etc.)
• Pursuing a Master's or PhD in Computer Science with a focus on performance-related subjects (HPC, Compilers, Distributed Systems)
• Experience with ML frameworks (Torch, JAX)
• Experience with GPU programming (CUDA, Triton)
• Experience with High-Performance Computing (OpenMPI)
• Internship program: 13 weeks, full-time or part-time, in-person in London office (hybrid schedule: 3 days from the office, 2 days WFH)
Responsibilities
• Work with the inference team to improve serving latency and throughput.
• Bring up support for new models and state-of-the-art inference optimizations or quantization schemes.
• Optimize inference across the entire stack, from GPU kernels to serving endpoints.