About this role
• Deliver Service Operations support to both internal and external customers in alignment with contractual terms and established Service Level Agreements (SLAs). • Maintain and ensure the optimal performance of all systems and products managed by the Service Operations team. • Serve as the single point of contact (SPOC) for customers when required, coordinating with internal resolver teams, the Service Desk, and customers to ensure timely intervention and high-quality service resolution within defined SLAs. • Provide top-tier incident and problem management support, coordinating resolutions with the appropriate technical teams. • Minimize service restoration times by escalating issues to specialized resolver groups both within and outside the organization, in accordance with SLA and monitoring policies. • Oversee the replacement of faulty equipment, ensuring the availability and timely replenishment of spare parts as per the defined sparing policy. • Uphold the highest operational standards across the team by offering guidance, support, and hands-on management for incident and problem resolution. • Proactively identify service and infrastructure issues, conduct diagnostics, and maintain ownership of service requests to ensure effective problem resolution. • Assist senior team members in daily task coordination, reporting, and overall operational management. • Follow installation standards and industry best practices to ensure high-quality service delivery and system operations. • Utilize appropriate tools and equipment to perform installations, interventions, and repairs in line with operational guidelines. • Escalate unresolved problems to higher support levels as needed. • Conduct scheduled preventive and proactive maintenance, as well as monitoring of systems and services, to meet customer expectations. • Execute Change Management, Configuration, Design, and Implementation tasks for supported systems and products. • Manage local vendors to ensure consistent service delivery to the Service Operations centers, and report on vendor performance to management. • Analyze, define, document, and test system and application enhancements. • Provide on-site support to end-users during service cutovers. • Continually identify, document, and share lessons learned, known errors, and operational insights to enhance service quality. • Be available for escalations and support during off-hours and on-call standby as required. Qualification: • Diploma/Bachelor Degree in Computer Science, Electronic Engineering or equivalent Telecommunications in-country qualification. • Unix / Linux Certification, VMWare Certification, ITIL Foundation v3 Certification • At least 3 years IT experience in following technologies: Operating System: RHEL 7/8, RHEL HA ,Windows Data center 2016/2019 with clustering • Hardware: HP DL360 Gen 9, DELL R430XD, DELL R730XD / equivalent • Virtualization: ESXi, VMWare vsphere 7/8, VMware vCentre, VMware SRM Standard • MQ: MQ v8/9, MQ IPT, MQ Clustering, IBM License Manager • Web: Apache Web Server, Apache Tomcat Application Server • Monitoring tools: NagiosXI, NagiosLog, eG Monitoring suite • Firewall: Palo Alto 850 & PA-5220 (equivalent), Checkpoint Firewall (equivalent) • Load Balancer: F5 BIG-IP LTM i2600 • Vulnerability Manager( IBM Qradar All-in-one console, Event collector), Waterfall MQ Agent (Waterfall for IBM Websphere MQ) • CA Server Luna HSM, Aruba Clearpass,SafeNet Network HSM, SecurityToken, Nessus Manager, Nessus Security Centre, Mcafee Advanced Security Suite, Mcafee ePolicy Orchestrator • Storage: DELL EMC Unity 400, 300 • Backup: EMC Data Domain with DD Boost & DD Replicator, DELL EMC Networker, Veritas System Recovery • Other Technologies: RedHat Satellite/ Spacewalk, DHCP, TFTP, Mail, Squid (WEB Proxy),NFS, Active Directory,DNS,NTP,Yum repo, IPA, SysLog servers. Experience / Skills: • Provided application support including routine health checks, service restarts, error log analysis, fault reporting, and issue replication in staging environments. • Administered monitoring tools to ensure system performance and availability. • Handled Incident, Problem, and Change Management processes across hardware and software platforms. • Managed application release processes and performed preventive maintenance activities. • Conducted root cause analysis to identify and resolve recurring issues. • Utilized Java debugging skills and scripting languages such as PERL, BASH, and PYTHON for automation and troubleshooting. • Supported and maintained network and security infrastructure in compliance with IT security policies. • Upgraded network and security devices with stable firmware versions to ensure optimal performance and security. • Administered and supported Storage and Backup environments following organizational standards. • Performed regular health checks, error log investigations, production issue resolutions, fault reporting, and issue replication in non-production environments for storage and backup systems. • Monitored and reported on storage and backup performance metrics as needed. • Executed on-demand system backups and recoveries when required. • Collaborated with third-party vendors for hardware issue resolutions, including component replacements. • Participated in data center (DC) failover testing exercises. • Demonstrated strong organizational skills with the ability to take full ownership of issues through to resolution. To apply, please visit www.gmprecruit.com and search for Job Reference: L3XW6W3W We regret that only shortlisted candidates will be notified. GMP Technologies (S) Pte Ltd | EA License: 11C3793 | EA Personnel: Bautista Gia Grace De Guzman | Registration No: R23111973
Also in Software Engineering
CPG FACILITIES MANAGEMENT PTE LTD
ERP21 PTE LTD
TREE ART INTERNATIONAL PTE. LTD.