Job Description
About the Generative AI Innovation Center at AWS
The Generative AI Innovation Center at AWS is focused on empowering customers with cutting-edge AI technologies to transform their businesses. Our team is multidisciplinary, including strategists, scientists, engineers, and architects, working across various industries to develop and deploy customized generative AI applications at scale. We also collaborate with foundational model providers to optimize AI models for Amazon Silicon, enhancing their performance and efficiency.
Role: Software Development Engineer (SDE)
As an SDE, you will:
- Drive development of custom Large Language Models (LLMs) across languages, domains, and modalities.
- Fine-tune state-of-the-art LLMs for diverse use cases.
- Optimize models for deployment on AWS’s custom AI accelerators.
- Tackle end-to-end LLM training pipelines at a massive scale.
- Deliver next-generation AI solutions for AWS clients.
Key Responsibilities
-
Large-Scale Training Pipelines
- Design and implement distributed training pipelines for LLMs using tools such as Fully Sharded Data Parallel (FSDP) and DeepSpeed.
- Ensure scalability and efficiency.
-
LLM Customization & Fine-Tuning
- Adapt LLMs for new languages, domains, and vision applications.
- Employ continued pre-training, fine-tuning, and Reinforcement Learning with Human Feedback (RLHF).
-
Model Optimization on AWS Silicon
- Optimize AI models for deployment on AWS Inferentia and Trainium.
- Use AWS Neuron SDK and develop custom kernels.
-
Customer Collaboration
- Interact with enterprise clients and model providers.
- Understand their challenges and co-develop tailored solutions.
A Day in the Life
- Design and code solutions to improve software architecture.
- Create metrics, automate tasks, and resolve software defects.
- Build impactful Gen AI solutions.
- Participate in discussions, code reviews, and stakeholder communication.
- Work cross-functionally in a startup-like environment.
About the Team & Culture
At AWS, we value diverse experiences, work/life balance, inclusion, mentorship, and career growth. We foster a culture of curiosity, learning, and continuous improvement.
Basic Qualifications
- 3+ years of professional software development experience.
- 2+ years of experience in system design or architecture.
- Proficiency in at least one programming language.
- Hands-on experience with deep learning/machine learning.
- Experience with generative AI technology.
Preferred Qualifications
- 3+ years full-stack development experience.
- Bachelor’s or higher degree in Computer Science, Engineering, Mathematics, or related fields.
- Experience deploying or optimizing ML models.
Additional Information
- EOE/EEO including veterans and disabled.
- Compensation ranges from $129,300 to $223,600 annually, based on location and experience.
- Benefits include medical, financial, and other support.
Equal Opportunity & Accommodations
We consider qualified applicants with arrest and conviction records. For workplace accommodations, visit: Amazon Accommodations.