Job Description
At Roche, you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted, and respected for who you are, allowing you to thrive both personally and professionally.
About Roche
Our mission is to prevent, stop, and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.
The Position
A Healthier Future
Galileo is a strategic Roche Informatics program focused on enabling high-value AI use cases (initially Generative AI, or GenAI) at Roche through specialized platforms and services, laying the foundation for a Center of Excellence in AI.
The Team
The Use Case Delivery (UCD) Team, consisting of several delivery squads, is dedicated to building innovative GenAI applications.
Role Overview
We seek a highly skilled and dedicated Data Engineer to join a new AI solutions development squad responsible for building cutting-edge applications that leverage Large Language Models (LLMs). The squad delivers end-to-end AI solutions, covering concept development, prototyping, productization, and operations.
Responsibilities
- Design & Build Data Infrastructure: Develop robust data systems to support AI applications.
- Handle Data Types: Manage structured and unstructured data.
- Vector Database Management: Use databases like AWS OpenSearch or Azure AI Search.
- Cloud Data Solutions: Use AWS (OpenSearch, S3) or Azure (AI Search, Blob Storage).
- Snowflake Analytics: Design and optimize storage using Snowflake.
- Data Pipelines: Develop ETL/ELT workflows.
- Support AI Workflows: Collaborate with AI/ML engineers for seamless integration.
- Performance Optimization: Improve efficiency, scalability, and cost-effectiveness.
Qualifications
- Experience: 5-7+ years in data engineering supporting AI/ML, with a relevant degree.
- Programming: Proficiency in Python, SQL, and the query languages specific to vector databases.
- Databases: Experience with relational, NoSQL, vector databases, and Snowflake.
- Cloud Platforms: Hands-on experience with AWS (OpenSearch, S3, Lambda) or Azure (AI Search, Blob Storage, Automation).
- ETL/ELT: Experience with tools like dbt, Apache Airflow.
- APIs & Microservices: Knowledge of RESTful API design.
- Security & Governance: Understanding of encryption and role-based access.
- DevOps: Familiarity with Git, CI/CD, Docker, Kubernetes, Terraform, CloudFormation.
- AI-specific Data Needs: Experience with embeddings, RAG, LLM fine-tuning data.
Note: No relocation benefits are provided for this position.
About Roche
Roche is a global leader with over 100,000 employees dedicated to science, healthcare, and innovation, impacting millions worldwide.
Join Us
Let’s build a healthier future together.
Roche is an Equal Opportunity Employer.