Position: Generative AI Engineer (NV68FCI RM 3524)
Job Description:
Experience Requirements:
- 6+ years of overall data science or ML experience.
- 1.5+ years of hands-on experience with generative AI and large language models.
- Advanced degree in Computer Science, Machine Learning, or related field.
· • Experience in taking ML/DL model to production.
Technical Expertise:
- Python, C++ or any other high-level language.
- PyTorch and TensorFlow.
- Experience with Hugging Face Transformers, LangChain, LlamaIndex, Pydantic AI
- Deep Learning fundamentals: neural networks, backpropagation, optimization
- Generative AI frameworks: OpenAI API, Anthropic Claude, Google Gemini, local LLM deployment
- Model architectures: Transformers, attention mechanisms, encoder-decoder models
- Fine-tuning techniques: LoRA, QLoRA, full parameter fine-tuning, RLHF
- Design and implement comprehensive evaluation pipelines
- A/B testing for AI systems and statistical significance testing
- Latency optimization techniques: model quantization, pruning, distillation
- Experience with MLOps tools: MLflow, Weights & Biases, DVC
Cutting-Edge Technology: - Access to premium GPU clusters A100s for training and experimentation
- Latest hardware and software tools to support your research and development
- Freedom to explore breakthrough AI technologies and methodologies
- Work on groundbreaking problems that haven’t been solved before
- Collaborate with world-class researchers and engineers from top universities and tech companies
- Contribute to open-source projects and publish your research
Core Responsibilities:
- Collaborate with cross-functional teams to translate complex AI concepts into practical business solutions.
- Design systems around open source, enterprise models like Llama, Gemini, GPT to solve innovative problems.
- Build robust evaluation pipelines to assess model performance, safety, and alignment with business objectives.
- Optimize model latency and performance for real-time applications and large-scale deployment
- Architect scalable ML infrastructure for training, inference, and model serving
- Fine-tune Large Language Models (LLMs) and other transformer-based architectures
- Ability to explain complex AI concepts to technical and non-technical stakeholders
Keywords
Python, Gen AI, LLM, Deep Learning, MLOps, LoRA, QLoRA, PyTorch
*******************************************************************************************************************************************
Job Category: Digital_Cloud_Web Technologies
Job Type: Full Time
Job Location: Bangalore
Experience: 6 - 8 years
Notice period: Immediate
Apply for this position
Mention correct information below. Mention skills aligned with the job description you are applying for. This would help us process your application seamlessly.