Encora - HPC & Cloud Infrastructure Engineer
Requirements
• High-Performance Computing
  • Hands-on experience managing HPC clusters with job schedulers, cluster management tools, parallel programming libraries, and parallel filesystems.
  • Knowledge of resource scheduling and job optimization for efficient workload management.
• InfiniBand (Networking)
  • Hands-on experience with high-throughput, low-latency interconnect technologies such as InfiniBand.
  • Ability to design, configure, and troubleshoot interconnects in HPC or distributed environments.
• Operating Systems and Environments
  • Administration and configuration of RHEL-based systems.
  • Performance tuning, package management, and security hardening.
  • Knowledge of Red Hat Satellite and Ansible for automation.
• Job Scheduling with PBS Professional
  • Experience deploying and managing PBS Professional for scheduling and workload management in HPC environments.
  • Customizing job submission scripts and optimizing job queues.
• Parallel Programming Libraries
  • MPI (Message Passing Interface) and OpenMP (Open Multi-Processing):
    • Proficiency in writing, debugging, and optimizing parallelized code.
    • Experience scaling applications across HPC systems.
    • Understanding of the distributed-memory (MPI) and shared-memory (OpenMP) programming models.
• Cloud Platforms
  • AWS, Azure, Google Cloud:
    • Expertise in provisioning, configuring, and managing services on all three platforms.
    • Knowledge of cross-platform migration and hybrid cloud solutions.
    • Proficiency in managing high-performance computing (HPC) clusters in the cloud.
    • Deep understanding of cost optimization, security, and cloud-native development tools (e.g., Kubernetes, Terraform).
• Infrastructure as Code (IaC)
  • Ability to design, deploy, and maintain infrastructure using automation and configuration management tools.
  • CI/CD pipeline integration for IaC workflows.
• GPU & AI Libraries and Tools
  • Hands-on experience with container technologies.
  • Hands-on experience with TensorFlow, PyTorch, scikit-learn, Keras, or MXNet.
  • Familiarity with AI/ML pipelines, model training, and optimization.
  • Knowledge of MLOps tools for deploying and monitoring models.

Encora is a global company that offers Software and Digital Engineering solutions. Our practices include Cloud Services, Product Engineering & Application Modernization, Data & Analytics, Digital Experience & Design Services, DevSecOps, Cybersecurity, Quality Engineering, and AI & LLM Engineering, among others.

At Encora, we hire professionals based solely on their skills and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.