WeChat - Cloud Engineer Intern
Tencent·Singapore·Site Reliability Engineering (SRE)
Tencent is hiring a WeChat - Cloud Engineer Intern in Singapore. Posted 2026-03-09; applications close 2026-05-08.
Apply on TencentPosted 2mo ago
Role details
Role Overview
You will support the reliability, scalability, and security of Tencent’s business-critical systems in a cloud-native environment.
Responsibilities
- System Monitoring & Incident Response
- Monitor production systems using tools like Prometheus and Grafana; identify and troubleshoot outages.
- Participate in on-call rotations to resolve real-time incidents (with mentor guidance).
- Automation & DevOps Practices
- Develop scripts (Python/Shell) to automate deployment, scaling, and recovery tasks.
- Assist in CI/CD pipeline optimization using GitLab, Docker, and Kubernetes.
- Infrastructure Optimization
- Analyze system performance metrics; propose solutions to enhance reliability and cost efficiency.
- Support cloud infrastructure management (Tencent Cloud / AWS / Azure).
- Collaboration & Documentation
- Work with cross-functional teams (Dev, Data, Security) to design SLOs/SLIs for critical services.
- Document system configurations, runbooks, and post-incident reports.
Qualifications & Requirements
Education & Experience
- Currently pursuing a PhD or Master’s in Computer Science, AI, Machine Learning, or related fields.
- Bachelor’s/Master’s in Computer Science, IT, or related fields (2026 graduation).
- Experience with at least one of:
- Vision–language models
- Large language models
- Video understanding/generation
- Reinforcement learning or imitation learning
Technical Skills
- Strong background in deep learning and machine learning fundamentals.
- Solid programming skills in Python and PyTorch/JAX.
- OS: Linux/Unix system administration.
- Scripting: Python, Shell, or Go.
- Networking: TCP/IP, DNS, HTTP basics.
- Familiarity with cloud platforms (Tencent Cloud, AWS, or Azure).
- Experience with infrastructure as code (IaC) tools (Terraform, Ansible) or observability stacks (ELK, Prometheus).
- Knowledge of containerization (Docker/Kubernetes).
Core Competencies
- Analytical problem-solving and a passion for infrastructure technologies.
- Ability to learn quickly in a fast-paced environment.
- Bilingual fluency in English and Chinese (written and verbal) to interact with HQ and international stakeholders.
- Basic Mandarin communication skills to collaborate with China-based teams and access internal resources.
- Publications at top conferences (CVPR, ICCV, NeurIPS, ICLR, ACL, etc.) are desirable.
- Experience training large models or working with distributed systems.
- Experience with multimodal datasets and evaluation benchmarks.
- Familiarity with transformer architectures and scaling laws; multimodal alignment; agent training (RLHF/RLAIF); synthetic data generation or simulation environments; long-context training or memory mechanisms.
Equal Employment Opportunity
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
More open roles at Tencent
- Global Talent Sourcing Intern — Singapore, posted 2d ago
- Big Data Development Engineer Intern — Singapore, posted 8d ago
- Game Site Reliability Engineer Intern — Singapore, posted 10d ago
- Database Administrator Intern — Singapore, posted 10d ago
- Hunyuan Multimodal Reinforcement Learning (RL) Research Intern — Singapore, posted 14d ago
