Job Description
We are hiring on behalf of a client seeking an experienced engineer with a focus on Generative AI and Large Language Models (LLMs).
Responsibilities
- Design and develop GenAI/LLM-based systems using frameworks such as LangChain and techniques such as Retrieval-Augmented Generation (RAG).
- Implement prompt engineering techniques and agent-based frameworks to create intelligent, context-aware solutions.
- Collaborate with the engineering team to shape and drive the technical roadmap for LLM initiatives.
- Convert business needs into scalable, production-ready AI solutions.
- Work with business SMEs and data teams to ensure AI models align with real-world use cases.
- Contribute to architecture discussions, code reviews, and performance optimization.
Skills Required
- Proficiency in Python, LangChain, and SQL.
- Understanding of LLM concepts and tooling, including prompt tuning, embeddings, vector databases, and agent workflows.
- Background in machine learning or software engineering, with a focus on system-level thinking.
- Experience with cloud platforms like AWS, Azure, or GCP.
- Ability to work independently and collaborate across teams.
- Excellent communication and stakeholder management skills.
Preferred Qualifications
- At least 1 year of hands-on experience with LLMs and Generative AI techniques.
- Experience contributing to ML/AI product pipelines or end-to-end deployments.
- Familiarity with MLOps and scalable deployment patterns.
- Exposure to client-facing projects or cross-functional AI teams.