Research Intern — Coding LLMs
Tencent · Singapore · Research / Applied Science
Tencent is hiring a Research Intern — Coding LLMs in Singapore. Posted 2026-06-10; applications close 2026-08-09.
Role details
Business Unit
The Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers. TEG provides users with a full range of customer services. As the operator of the largest networking, device, and data center infrastructure in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
Role Overview
We are looking for research interns to work on foundational areas for coding language models, including pre-training data, mid-training data, synthetic data generation, evaluation, and agentic coding.
Responsibilities
- Explore data-centric methods for improving coding LLMs, including data filtering, quality assessment, deduplication, data mixture, and diversity analysis.
- Build synthetic data and evaluation pipelines for code generation, code editing, repo-level reasoning, tool use, and multi-step coding tasks.
- Run experiments to analyze how data, model, and training strategies affect coding capabilities.
- Work with large-scale code corpora, developer activity data, and agentic coding trajectories.
Who We Look For
- Strong programming skills in Python.
- Solid understanding of machine learning and large language models.
- Familiarity with LLM pre-training, mid-training, code models, data curation, evaluation, agents, or tool use.
- Strong experiment design, data analysis, and problem-solving skills.
- Interest in code intelligence, software engineering automation, and agentic coding.
Preferred Qualifications
- Experience with code data processing, GitHub-scale data, synthetic data, LLM evaluation, semantic deduplication, or agentic coding.
- Research experience, publications, or open-source projects in related areas are a plus.
What We Offer
- Access to large-scale real-world coding data and agentic trajectories.
- Rich compute resources and model APIs for fast research iteration.
- Opportunities to work on real-world coding model applications and the full model development loop.
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Applying to this role
Applications for this role go through Tencent's own careers portal and call for a CV and cover letter written specifically for the posting rather than a generic submission reused across firms.
Jorb AI tracks details for Research Intern — Coding LLMs at Tencent. Postings refresh hourly from primary careers pages. Job details mirror the firm's posting; the apply link goes directly to the source. Last refreshed 2026-06-11.
