WeChat - Cloud Engineer Intern

Tencent · Singapore

Posted 1mo ago

Role Overview

You will support the reliability, scalability, and security of Tencent’s business-critical systems in a cloud-native environment.

Responsibilities

  • System Monitoring & Incident Response
    • Monitor production systems using tools like Prometheus and Grafana; identify and troubleshoot outages.
    • Participate in on-call rotations to resolve real-time incidents (with mentor guidance).
  • Automation & DevOps Practices
    • Develop scripts (Python/Shell) to automate deployment, scaling, and recovery tasks.
    • Assist in CI/CD pipeline optimization using GitLab, Docker, and Kubernetes.
  • Infrastructure Optimization
    • Analyze system performance metrics; propose solutions to enhance reliability and cost efficiency.
    • Support cloud infrastructure management (Tencent Cloud / AWS / Azure).
  • Collaboration & Documentation
    • Work with cross-functional teams (Dev, Data, Security) to design SLOs/SLIs for critical services.
    • Document system configurations, runbooks, and post-incident reports.

Qualifications & Requirements

Education & Experience

  • Currently pursuing a PhD or Master’s in Computer Science, AI, Machine Learning, or related fields.
  • Bachelor’s/Master’s in Computer Science, IT, or related fields (2026 graduation).
  • Experience with at least one of:
    • Vision–language models
    • Large language models
    • Video understanding/generation
    • Reinforcement learning or imitation learning

Technical Skills

  • Strong background in deep learning and machine learning fundamentals.
  • Solid programming skills in Python and PyTorch/JAX.
  • OS: Linux/Unix system administration.
  • Scripting: Python, Shell, or Go.
  • Networking: TCP/IP, DNS, HTTP basics.
  • Familiarity with cloud platforms (Tencent Cloud, AWS, or Azure).
  • Experience with infrastructure as code (IaC) tools (Terraform, Ansible) or observability stacks (ELK, Prometheus).
  • Knowledge of containerization (Docker/Kubernetes).

Core Competencies

  • Analytical problem-solving and a passion for infrastructure technologies.
  • Ability to learn quickly in a fast-paced environment.
  • Bilingual fluency in English and Chinese (written and verbal) to interact with HQ and international stakeholders.
  • Basic Mandarin communication skills to collaborate with China-based teams and access internal resources.
  • Publications at top conferences (CVPR, ICCV, NeurIPS, ICLR, ACL, etc.) are desirable.
  • Experience training large models or working with distributed systems.
  • Experience with multimodal datasets and evaluation benchmarks.
  • Familiarity with transformer architectures and scaling laws; multimodal alignment; agent training (RLHF/RLAIF); synthetic data generation or simulation environments; long-context training or memory mechanisms.

Equal Employment Opportunity

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
