Job Purpose:
The Head of Data Centre is responsible for overseeing the data centre and all IT operations, ensuring the efficient execution of batch jobs, managing incidents, problems, and changes, and maintaining mainframe system administration. The role requires strong leadership skills, technical expertise, and the ability to manage a team of diverse experts and shift operators.
Main Responsibilities:
- Oversee the activities of shift operators to ensure the smooth execution of all batch jobs and adherence to operational procedures.
- Ensure timely and accurate execution of batch jobs, monitor job performance, and troubleshoot any issues that arise.
- Continuously monitor system performance and health on a 24/7 basis, ensuring proactive identification and resolution of potential issues.
- Manage and maintain mainframe systems, ensuring their availability, performance, and security.
- Maintain and ensure accuracy of Data Centre rack diagrams and inventories, including hardware, software, and IT assets. Coordinate and conduct regular Data Centre inventory stock takes to verify accuracy and reconcile discrepancies.
- Coordinate with remote hands for on-site support in the Data Centre. Assist in registering site access for authorized personnel.
- Oversee the renewal of hardware maintenance contracts for all hardware within the data centre. Coordinate with vendors for hardware break-fix services.
- Lead the incident management process, ensuring quick resolution of incidents to minimize downtime and impact on business operations.
- Identify root causes of recurring issues and implement solutions to prevent future occurrences.
- Oversee the change management process, ensuring that all changes are properly documented, tested, and approved before implementation.
- Translate operation procedures into job log sheets. Maintain records of completed tasks and relevant information in job log sheets.
- Generate and present regular reports on data centre operations, incidents, problems, and changes to senior management.
- Ensure compliance with all relevant policies, procedures, and regulations.
- Take lead in the annual Disaster Recovery (DR) drill to ensure preparedness and effective response to potential disruptions.
- Provide guidance, training, and support to shift operators and other team members, fostering a collaborative and high-performance work environment.
Incumbent Requirements:
- Bachelors degree in Computer Science, Information Technology, or a related field.
- Minimum of 15 years of experience in data centre operations, with at least 3-5 years in a leadership role.
- Strong knowledge of mainframe systems (including z/OS, CISC transaction Management, Omegamon, VTAM, MQ), batch job scheduling, incident management, problem management, and change management.
- Proficiency in Connect:Direct, IBM Workload Scheduler, and version control tools such as Endevor and Git.
- Familiarity with ITIL best practices and ITSM tools such as ServiceNow.
- Experience with system monitoring tools such as NetGains, PRTG
- Proven ability to lead and manage a team, with excellent communication and interpersonal skills.
- Strong analytical and problem-solving abilities, with a focus on continuous improvement.
- Relevant certifications such as ITIL, PMP, or mainframe-specific certifications are a plus.