T
WeChat - Cloud Engineer Intern
Tencent·Singapore
Students And GraduatesSite Reliability Engineering (SRE)
Apply on TencentPosted 1mo ago
Role details
Role Overview
You will support the reliability, scalability, and security of Tencent’s business-critical systems in a cloud-native environment.
Responsibilities
- System Monitoring & Incident Response
- Monitor production systems using tools like Prometheus and Grafana; identify and troubleshoot outages.
- Participate in on-call rotations to resolve real-time incidents (with mentor guidance).
- Automation & DevOps Practices
- Develop scripts (Python/Shell) to automate deployment, scaling, and recovery tasks.
- Assist in CI/CD pipeline optimization using GitLab, Docker, and Kubernetes.
- Infrastructure Optimization
- Analyze system performance metrics; propose solutions to enhance reliability and cost efficiency.
- Support cloud infrastructure management (Tencent Cloud / AWS / Azure).
- Collaboration & Documentation
- Work with cross-functional teams (Dev, Data, Security) to design SLOs/SLIs for critical services.
- Document system configurations, runbooks, and post-incident reports.
Qualifications & Requirements
Education & Experience
- Currently pursuing a PhD or Master’s in Computer Science, AI, Machine Learning, or related fields.
- Bachelor’s/Master’s in Computer Science, IT, or related fields (2026 graduation).
- Experience with at least one of:
- Vision–language models
- Large language models
- Video understanding/generation
- Reinforcement learning or imitation learning
Technical Skills
- Strong background in deep learning and machine learning fundamentals.
- Solid programming skills in Python and PyTorch/JAX.
- OS: Linux/Unix system administration.
- Scripting: Python, Shell, or Go.
- Networking: TCP/IP, DNS, HTTP basics.
- Familiarity with cloud platforms (Tencent Cloud, AWS, or Azure).
- Experience with infrastructure as code (IaC) tools (Terraform, Ansible) or observability stacks (ELK, Prometheus).
- Knowledge of containerization (Docker/Kubernetes).
Core Competencies
- Analytical problem-solving and a passion for infrastructure technologies.
- Ability to learn quickly in a fast-paced environment.
- Bilingual fluency in English and Chinese (written and verbal) to interact with HQ and international stakeholders.
- Basic Mandarin communication skills to collaborate with China-based teams and access internal resources.
- Publications at top conferences (CVPR, ICCV, NeurIPS, ICLR, ACL, etc.) are desirable.
- Experience training large models or working with distributed systems.
- Experience with multimodal datasets and evaluation benchmarks.
- Familiarity with transformer architectures and scaling laws; multimodal alignment; agent training (RLHF/RLAIF); synthetic data generation or simulation environments; long-context training or memory mechanisms.
Equal Employment Opportunity
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
More open roles at Tencent
- T
AI Business Analyst Intern
London
Students And GraduatesCorporate Strategy / Internal Consulting - T
Game Designer Intern, Vehicle Direction
London
Students And GraduatesProduct Management - T
Software Engineering Intern
Singapore
Students And GraduatesData Engineering - T
Generative AI Research Intern
Singapore
Students And GraduatesResearch / Applied Science
