Job Description: Data Pipeline Development
Overview
In this role, you will be responsible for developing and maintaining data pipelines that support the training of Foundational Models for Tabular Data. The focus is on ensuring high-quality data feeds for machine learning models using cloud platforms.
Responsibilities
- Develop and maintain data pipelines for structured data.
- Work with cloud platforms for data processing, data lakes, and model training providers.
- Collaborate with Applied AI Researchers and AI Engineers.
- Support data needs for various research projects.
- Ensure data quality and reliability for machine learning workflows.
Requirements
- Expertise in Python.
- Proficiency with Machine Learning and Deep Learning libraries.
- Knowledge of Knowledge Graphs.
- Strong understanding of Machine Learning algorithms.
- Experience with Deep Learning pre-training & fine-tuning techniques.
Additional notes
- This position involves leveraging data engineering and machine learning skills to drive innovation in AI research.