
Model Infrastructure Engineer Graduate (TikTok Recommendation Architecture) - 2026 Start (BS/MS)
- Singapore
- Permanent
- Full-time
- Optimize model performance and memory efficiency on GPU-based systems.
- Collaborate with research and infra teams to deploy high-throughput training and inference pipelines.
- Develop tools and libraries to accelerate deep learning workloads at scale.
- Analyze system performance (e.g., GPU profiling, kernel analysis, throughput tuning).
Qualifications:
Minimum Qualifications:
- Final-year student or recent graduate with a background in Computer Science, Electrical Engineering, or a related field.
- Solid programming skills in C++/CUDA/Triton/Python.
- Familiarity with GPU architecture and distributed training is highly desirable.
Preferred Qualifications:
- Experience building production-grade training and inference systems for large-scale models.
- Hands-on experience optimizing Large Language Models (LLMs), including memory efficiency, latency, and throughput improvements.
- Knowledge of distributed training frameworks and communication libraries (e.g., NCCL, Horovod, DeepSpeed, FSDP) is a plus.
- Familiarity with deep learning compiler frameworks such as TVM or LLVM, and understanding of their underlying principles.
- Contributions to open-source projects or relevant research publications.
By submitting an application for this role, you accept and agree to our global applicant privacy policy, which may be accessed here: https://careers.tiktok.com/legal/privacy
If you have any questions, please reach out to us at apac-earlycareers@tiktok.com