Job Description
Job Title:
Data Engineer
Location:
Ahmedabad, Gujarat, India
Job Type:
Full-time
Experience Level:
3+ years
Salary Range:
10-15 LPA
Job Summary
We are looking for a skilled and driven Data Engineer to join our innovative healthcare startup. You will play a critical role in architecting, developing, and optimizing robust data infrastructure and pipelines that support our AI/ML and analytics initiatives.
Ideal candidate:
- Strong experience in cloud-based data engineering
- Thrives in fast-paced environments
- Passionate about building scalable, reliable data systems from scratch
Key Responsibilities
Data Pipeline Development & Optimization
- Design, build, and maintain scalable ETL/ELT pipelines using AWS (Glue, Lambda, S3, SQS, SNS) and orchestration tools like Airflow.
- Develop real-time and batch data pipelines supporting high-volume, low-latency processing.
- Implement robust data ingestion frameworks for structured and unstructured data from multiple sources.
- Ensure data quality, consistency, and lineage with automated validation and monitoring.
- Optimize pipelines for performance, cost-efficiency, and fault tolerance.
- Collaborate with data scientists, AI/ML engineers, and product teams.
- Build and maintain Python-based APIs for data exposure.
- Understand data needs of business stakeholders and translate them into technical solutions.
Cloud Infrastructure & Storage
- Utilize AWS services like S3, Lambda, SNS, SQS, Glue, and Azure Functions.
- Build scalable data lakes and warehouses (mainly Snowflake).
- Ensure data security, access control, and regulatory compliance.
- Develop infrastructure-as-code templates for deployment.
Qualifications & Skills
Education
- Bachelor’s or Master’s in Computer Science, Information Systems, or related fields.
Experience
- 3+ years in designing and maintaining modern data platforms.
Technical Skills
Languages & Tools
- Python (Pandas, PySpark, Boto3), SQL, Bash
Data Engineering Tools
- Apache Airflow, AWS Glue, AWS Lambda, Azure Functions
Cloud & Storage
- AWS (S3, SNS, SQS, Glue), Azure, Snowflake, Redshift (nice to have)
Data Modeling
- Star/Snowflake schemas, Dimensional Modeling
Workflow & CI/CD
- Git, Docker, JIRA, Confluence
Monitoring & Logging
- CloudWatch, Prometheus, ELK Stack (optional)
Candidate Profile
If you're an data enthusiast who enjoys solving complex problems and working in an agile startup environment, we want to hear from you!
Job Highlights
Innovative healthcare startup | Data Engineering | Cloud-based Data Pipelines | AI/ML Support | Scalable Data Infrastructure