About this role
ROLE SUMMARY
The HPC Engineer is responsible for building, operating, and optimizing high‑performance computing environments to ensure performance, reliability, and availability for compute‑intensive workloads in enterprise and research settings.

KEY RESPONSIBILITIES
· Deploy, configure, and maintain HPC clusters and compute nodes.
· Install, configure, and manage schedulers and resource managers.
· Optimize system performance, job throughput, and resource utilization.
· Manage high‑speed interconnects and parallel storage systems.
· Provide L2/L3 support for HPC incidents and service requests.
· Support application performance tuning and user consultations.
· Perform patching, upgrades, and lifecycle management of HPC components.
· Maintain system documentation and operational runbooks.

TECHNICAL SKILLS & TOOLS
· Linux systems administration
· Schedulers: Slurm, PBS Pro
· Interconnects: InfiniBand
· Storage: Lustre, GPFS
· Scripting: Bash, Python
· Monitoring: Ganglia, Prometheus

SECURITY & COMPLIANCE
· Implement access controls, user quotas, and system hardening.
· Ensure patching and vulnerability remediation.
· Maintain audit trails and usage records.
· Support compliance with ISO 27001 and institutional policies.

QUALIFICATIONS & EXPERIENCE
· Degree/Diploma in IT, Computer Science, or related field.
· 4–8 years of experience in HPC or Linux engineering roles.
· Strong troubleshooting and performance optimization skills.

PREFERRED (GOOD TO HAVE)
· Experience supporting research or scientific workloads.
· Exposure to GPU and accelerator platforms.
· Experience with containerised HPC workloads.