Site Reliability Engineer Lead (Big Data) - Data Infrastructure

  • Singapore
  • Permanent
  • Full-time
  • 1 month ago
Job Description:
  • Lead a Big Data SRE team, oversee the development and maintenance of the big data ops automation platform, and raise the standard of big data operations and maintenance management
  • Monitor the performance and stability of Shopee's big data systems, such as Hadoop, Spark, Storm, and Kafka
  • Provide technical expertise to big data-related businesses such as search engines and deep learning, and promote the sustainable development of the big data business
  • Take responsibility for big data ops architecture review, capacity planning, cost optimisation, and tracking and troubleshooting, and build a big data monitoring system to maintain overall stability and efficiency
Requirements:
  • Bachelor's degree or higher in Computer Science, Information Technology, Programming & Systems Analysis, Engineering, or other related fields
  • Minimum 8 years of work experience in Big Data SRE, including 3 years managing a team
  • Proficient in Linux systems and script development in Hadoop/Flink/HBase/ES/Kafka/Druid/ClickHouse environments
  • In-depth understanding of architecture solutions for large-scale big data operations and maintenance
  • Excellent communication and leadership skills

Shopee