Job Description: Our Company
Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.
Our Mission
We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!
The Opportunity
We are seeking an outstanding Site Reliability Engineer for Adobe’s AI Training and Inference Platforms within Adobe Firefly. You will be part of a team working closely with engineering teams on building, scaling, and securing the AI Platform.
Responsibilities
- Identify and implement methodologies to increase reliability, scalability, security, and efficiency.
- Ensure high uptime and Quality of Service (QoS).
- Define service level objectives (SLOs) and indicators (SLIs).
- Support and maintain multi-cloud environments.
- Automate tasks at large scale.
- Improve service resiliency through various techniques.
- Coordinate with Adobe platform teams and AWS.
Requirements
- Bachelor’s or Master’s in Computer Science, Electrical Engineering, or related field.
- 5+ years relevant experience.
- Experience with building and scaling distributed systems.
- Proficiency in containerization and orchestration technologies like Kubernetes.
- Expertise with container orchestration engines and CI/CD pipelines (IaC, ArgoCD, Git).
- Programming skills in Python, Go.
- Knowledge of infrastructure tools like Ansible and Terraform.
- Experience with observability tools like InfluxDB, Prometheus, Elastic Stack.
- Understanding of AI/ML and frameworks such as Pytorch, SageMaker, HuggingFace, etc.
Compensation and Benefits
- U.S. pay range: $133,900 - $242,000 annually.
- Sales roles have combined pay (base + commission);
- Non-sales roles have base salary with incentives.
- Long-term incentives like equity awards may be available.
Legal & Accessibility
- Adobe is an Equal Opportunity Employer.
- Accommodation support available for applicants with disabilities.
Job Highlights
Qualifications
- Degree in relevant fields + 5+ years experience.
- Skills in distributed systems, containerization, Kubernetes.
- Programming in Python, Go.
- Infrastructure management tools.
- Experience with observability tools.
Benefits
- Competitive salary range.
- Incentive plans and long-term incentives.
Responsibilities
- Build, scale, and secure AI platforms.
- Support deployment of machine learning models.
- Ensure operational excellence and service quality.
- Support multi-cloud environments.
- Automate operational tasks.
- Improve service resiliency.
- Collaborate on Generative AI innovations.