
Senior Engineer ( Azure , DevOps , Terraform)
- Singapore
- Permanent
- Full-time
- Provide Service Operations support to internal and external customers in accordance with the terms of the customer contract and Service Level Agreements (SLAs)
- Ensure the correct functioning and maintenance of all internal and external systems and products serviced by Service Operations
- When required act as the customer SPOC and co-ordinate the scheduling of intervention with Customer's internal resolver groups and the Service Desk ensuring the highest level of customer services and communications are maintained to resolve the fault and incident within the prescribed SLA.
- Carry out incident and problem management support to the highest standards and co-ordinate the resolution with the appropriate resolver groups
- Ensure shortest restoral times possible initiating the timely escalations to specialized resolver groups inside and outside SITA according to the customer contracts SLAs and monitoring requirements
- To ensure the Service Operations team adheres to the highest working standards for all incidents and problems by providing guidance support and direct management.
- Proactively detect problems related to service and infrastructure operations and delivery services conduct diagnostics and provide service request ownership to ensure resolution of customer problems
- Use the appropriate tools and equipment to perform the installation intervention and repairs in accordance with Service Operations and Delivery guidelines and instructions where provided
- Report and escalate to the next level those problems which cannot be fixed
- Carry out preventive and proactive maintenance of equipment and monitoring of systems and services in accordance with agreed schedules and customer expectations
- Perform Change Management Configurations Design and Implementation of the supported Product & Systems
- Manage local suppliers in the provision of services for the SITA Service Operations centers and report on services provided to management.
- Conducts the analysis definition documentation and testing of application & systems enhancements
- Continuously identify and document lessons learnt known errors and operational knowledge for improved services.
- Administer and manage public cloud environments (AWS, Azure) with focus on performance, scalability, and cost optimization.
- Deploy, manage, and troubleshoot containerized applications using Docker and Kubernetes.
- Develop and maintain infrastructure as code using Terraform and implement robust CI/CD pipelines.
- Provide advanced troubleshooting for complex issues involving cloud platforms, network components, and virtualized infrastructure.
- Install, configure, and maintain Linux/Unix-based operating systems in enterprise environments.
- Collaborate with cross-functional teams to design and implement solutions tailored to business requirements.
- Act as a technical escalation point for critical customer incidents, ensuring ownership and resolution.
- Analyze and resolve complex problems related to servers, storage, virtualization, and connectivity.
- Follow and promote ITIL processes for incident, change, and problem management.
- Provide guidance and leadership to junior team members and take ownership of key operational initiatives.
- Engage in continuous service improvement by identifying and implementing enhancements to systems and processes
- Cloud Platforms: Hands-on experience with Azure (preferably) and AWS public cloud services.
- Container Orchestration: Strong experience with Docker and Kubernetes in production environments.
- CI/CD & IaC: Strong knowledge of DevOps tools and methodologies, including experience with Terraform.
- Linux/Unix Systems: Proficiency in installation, configuration, and maintenance.
- ITIL Framework: Solid understanding of ITIL-based service management practices.
- Troubleshooting Skills: Strong analytical abilities to troubleshoot end-to-end issues across complex environments (cloud, network, OS).
- Networking: Ability to troubleshoot LAN topologies and core concepts based on TCP/IP.
- Server & Virtualization: Solid experience handling complex server and virtualization issues (e.g., VMware, Hyper-V).
- Communication: Excellent interpersonal and customer handling skills, especially in high-pressure situations.
- Leadership: Proven ability to coordinate team activities and drive issues to resolution with minimal supervision.
- Degree in Computer Science, Electronics Engineering or any other equivalent in-country qualification.
- Professional certifications such as AWS Certified SysOps Administrator, Azure Administrator Associate, CKAD/CKA, or Terraform Associate.
- Familiarity with automation tools like Ansible, Jenkins, or GitHub Actions.
- Experience in a customer-facing support or consulting role.
- ITIL Foundation Certificate