About this role
About the RoleWe are looking for a Senior Generative AI Developer to design, build, and deploy production-grade AI applications using Large Language Models (LLMs). This is a hands-on role focused on developing scalable, enterprise-ready GenAI solutions, including agent-based systems and Retrieval-Augmented Generation (RAG) pipelines. You will work closely with cross-functional teams to integrate AI capabilities into business applications while ensuring performance, scalability, and security. Tech StackPython, LLMs (OpenAI, Anthropic, Llama, Mistral), LangChain, LlamaIndex, RAG, Vector Databases (Pinecone, ChromaDB, Milvus), AWS / Azure / GCP, Docker, Kubernetes, CI/CD Key ResponsibilitiesGenerative AI Development • Design, develop, and deploy scalable Generative AI applications and agent-based systems • Build and optimize Retrieval-Augmented Generation (RAG) pipelines, including embeddings and vector search • Implement advanced prompt engineering techniques (few-shot, chain-of-thought) to improve model performance • Integrate commercial and open-source LLMs via APIs, cloud platforms, or local deployment Software Engineering & Integration • Develop clean, efficient, and maintainable Python code for APIs, microservices, and AI workflows • Build and expose APIs for AI-driven applications and tools • Integrate AI capabilities into enterprise systems and workflows Model Optimization & Deployment • Perform model fine-tuning and optimization techniques (LoRA / QLoRA) • Deploy applications on cloud platforms (AWS, Azure, or GCP) using containerization (Docker, Kubernetes) • Ensure system performance, scalability, and reliability in production environments Collaboration & Quality • Work with data, product, and engineering teams to translate requirements into AI solutions • Document technical designs, APIs, and workflows • Mentor junior developers and contribute to engineering best practices Must-Have Skills • 5+ years of software development experience with strong Python proficiency • Hands-on experience with Generative AI, NLP, or Machine Learning (2+ years preferred) • Experience building RAG pipelines and working with vector databases (Pinecone, ChromaDB, Milvus) • Experience with LLM integration (OpenAI, Anthropic, Llama, Mistral) • Familiarity with GenAI frameworks (LangChain, LlamaIndex, Hugging Face) • Experience developing APIs and microservices • Cloud experience (AWS, Azure, or GCP) Nice-to-Have Skills • Experience with multi-agent frameworks (LangGraph, CrewAI, Autogen) • Model fine-tuning techniques (LoRA, QLoRA) • Experience with ETL pipelines and data engineering workflows • Knowledge of containerization (Docker, Kubernetes) • Understanding of data security, privacy, and compliance Qualifications• Bachelor’s or Master’s degree in Computer Science, Engineering, or related field Key SkillsPython, Generative AI, Large Language Models (LLM), LangChain, RAG, Vector Databases, AWS, Azure, APIs, Machine Learning Why Join Us• Work on cutting-edge Generative AI and LLM-based applications • Build scalable, production-grade AI systems • Exposure to enterprise AI transformation projects • Collaborative and innovation-driven environment Employment TypeFull-time / Contract (extendable)
Also in Software Engineering
UP COMMUNICATIONS PTE LTD
ASCENDION ENGINEERING SOLUTIONS SINGAPORE PTE. LTD.
ELLIOTT MOSS CONSULTING PTE. LTD.