Job Description
About the Team
The Applied Machine Learning Enterprise team specializes in combining system engineering and machine learning to build and operate a big model service platform that offers Model-as-a-Service (MaaS) solutions. These solutions are targeted at both big model vendors and users.
We are looking for talented Software Engineers/Researchers with expertise in Large Language Models (LLM) to join our innovative team.
Responsibilities
- Lead the development of next-generation high-capacity LLM platforms and innovative products.
- Collaborate with cross-functional teams to plan and execute projects involving LLMs for various purposes and domains.
- Contribute to research and development efforts in advanced techniques like model continuous pretraining, fine-tuning, evaluation, and inference.
- Develop LLM applications and agents.
- Maintain a passion for the success of large models in a fast-paced environment.
- Interact with clients and colleagues, handling confidential information professionally.
Qualifications
- Ph.D. in Computer Science, Artificial Intelligence, or related fields.
- Experience in training and inference of large language models.
- Strong knowledge of LLM research including long context, multi-modality, alignment.
- Practical expertise in implementing advanced systems.
- Programming skills in Python or C++.
- Experience with deep learning frameworks such as PyTorch, Deepspeed.
- Understanding of distributed computing, performance tuning, and verification.
- Knowledge of PEFT or MoE is a plus.
Preferred Skills
- Problem-solving skills and creative thinking.
- Ability to drive research projects.
- Publications or contributions to the LLM community.
- Experience with inference tuning, GPU/AI accelerators, PyTorch 2.0.
- Familiarity with Kubernetes, Cloud Native technologies.
- Experience deploying models in production.
Job Details
Compensation (Annual)
- Range: $137,750 - $237,500
- Factors influencing salary include qualifications, skills, experience, location.
- Additional benefits: bonuses, stock units, insurance, 401(k), paid leave, etc.
Benefits
- Medical, dental, vision insurance.
- 401(k) with company match.
- Paid parental leave.
- Disability coverage.
- Paid holidays and personal time.
Additional Information
- The company is ByteDance, founded in 2012.
- Known for products like TikTok, Lemon8, CapCut.
- Committed to diversity & inclusion.
- Accommodations available for candidates with disabilities.
Why Join ByteDance?
- Inspire creativity and enrich life.
- Work with diverse, innovative teams.
- Participate in meaningful breakthroughs.
- Focus on growth, impact, and impact.
Our Commitment
- Creating an inclusive environment.
- Respecting skills, experiences, perspectives.
- Providing reasonable accommodations.
Job Highlights
Qualifications
- Ph.D. in related fields
- Experience with training and inference of LLMs
- Expertise in long context, multi-modality, alignment research
- Programming in Python / C++
- Knowledge of Deep Learning frameworks (PyTorch, Deepspeed)
- Distributed computing and performance tuning skills
- Published research or community contributions (a plus)
- Experience with GPU/AI accelerators, PyTorch 2.0
- Kubernetes and cloud-native tech familiarity
Benefits
- Competitive salary: $137,750 - $237,500
- Benefits include insurance, 401(k), paid leave, bonuses, stock units
- Day one benefits
Responsibilities
- Lead research and development of LLM platforms and products
- Work across teams on LLM projects
- Engage in cutting-edge model research
- Manage confidential info and client interactions