Site Reliability Engineer Intern, Data Infra (Aug - Dec 2025)

Shopee

  • Singapore
  • Training
  • Full-time
  • 1 month ago
Job Description :
  • Support daily operations of big data platforms, including monitoring, troubleshooting, and routine maintenance.
  • Write and optimize Shell scripts to automate operational workflows and system tasks.
  • Assist in system health checks, log analysis, and reliability improvements.
  • Participate in building tools to enhance the observability and automation of data services.
  • Document standard operating procedures and support knowledge sharing across the team.
Requirements :Basic Qualifications
  • Currently pursuing a Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
  • Strong understanding of Linux operating systems and command-line tools.
  • Proficiency in Shell scripting (bash, sh, etc.).
  • Clear interest in large-scale systems and reliability engineering.
  • Willingness to learn, take initiative, and work collaboratively.
Bonus Qualifications (Nice to Have)
  • Familiarity with Python for automation or internal tooling.
  • Experience with web platform development (e.g., using Flask, FastAPI, or similar frameworks).
  • Exposure to big data and storage engines such as: HDFS, Apache Ozone, Alluxio
  • Understanding of monitoring or alerting tools (e.g., Prometheus, Grafana, ELK).
  • Knowledge of Git and basic CI/CD workflows.
What You'll Gain
  • Real-world experience in operating and improving a production-grade big data platform.
  • Exposure to SRE practices including automation, fault-tolerance, and observability.
  • Mentorship from experienced infrastructure engineers.
  • Potential opportunity for full-time conversion based on performance and graduation timeline.
About the Team :We are looking for a proactive and detail-oriented Site Reliability Engineering (SRE) Intern to join our Big Data Infrastructure team. This internship is ideal for students who are passionate about Linux systems, scripting, and large-scale data platforms. You will gain hands-on experience in operating and improving the reliability of data infrastructure services.

Shopee