About this role
ROLESUMMARY The HPC Operations Support Engineer is responsible for providing day‑to‑day operational support for HPC environments, ensuring system stability, availability, and SLA adherence in production environments. KEYRESPONSIBILITIES · Monitor HPC cluster health, job execution, and resource utilization. · Provide L1/L2 user support for HPC incidents and service requests. · Perform routine operational checks, maintenance, and housekeeping tasks. · Support system upgrades, patches, and scheduled maintenance activities. · Manage HPC user onboarding, access, and quota administration. · Escalate complex issues to HPC Engineers as required. · Maintain operational documentation, SOPs, and runbooks. · Ensure adherence to SLAs and operational procedures. TECHNICALSKILLS & TOOLS · Linux administration (basicto intermediate) · Scheduler Operations: Slurm · Monitoring tools · ITSM Tools: ServiceNow, Remedy, Jira · Shell scripting (basic) SECURITY& COMPLIANCE · Follow operational security procedures and access controls. · Ensure compliance with ISO27001 and institutional standards. · Support audit activities and operational reporting. QUALIFICATIONS& EXPERIENCE · Degree/Diploma in IT, Computer Science, or related field. · 2–5 years of experience in IT operations or system support roles. · Experience in SLA‑driven or 24x7environments preferred. PREFERRED(GOOD TO HAVE) · Exposure to HPC or scientific computing environments. · Experience supporting Linux‑based production systems. · Familiarity with batch processing or job scheduling environments.
Also in Finance Accounting
K.U.S HOLDINGS (S) PTE LTD
TASTEBUD FOODCOURT PTE. LTD.
SCALEUP CENTRE PRIVATE LIMITED