EPAM Systems

Lead Generative AI Operations Engineer (GenAI Ops)

Posted: 2 days ago

Job Description

We are seeking a highly skilled Generative AI Operations Engineer (GenAI Ops) to join our cutting-edge AI team. The ideal candidate will have strong expertise in operationalizing large-scale generative AI systems, building CI/CD pipelines, and managing AI agent infrastructures across cloud environments. You will play a key role in ensuring the scalability, security, and performance of multi-agent AI systems and generative applications. ResponsibilitiesDesign, implement, and maintain automated CI/CD pipelines for the development, training, and deployment of Large Language Models (LLMs) and AI agentsBuild and manage agentic AI systems, ensuring efficient agent-to-agent collaboration and orchestration of complex workflowsIntegrate AI agents with external tools and APIs using modern standards such as the Model Context Protocol (MCP)Leverage AI-powered development tools to streamline software delivery, infrastructure management, and troubleshooting processesDefine and manage cloud infrastructure for GenAI workloads using Infrastructure as Code (IaC) tools such as Terraform, AWS CDK, or CloudFormationImplement monitoring and observability solutions for models, agents, and system health using tools like Prometheus, Grafana, or DatadogOptimize scalability, performance, and cost-efficiency of GenAI services in production environmentsEnforce AI security, safety, and governance practices, ensuring compliance with organizational and industry standards RequirementsMinimum 3 years of experience in DevOps, Site Reliability Engineering (SRE)Minimum 1 year of experience in MLOps roles with a strong focus on cloud infrastructureProven experience with AWS, Google Cloud, or AzureProficiency in Python or Bash, and experience with containerization/orchestration tools such as Docker and KubernetesStrong background in building and maintaining CI/CD pipelines using Jenkins, GitLab CI, or similar toolsExperience with cloud-native GenAI platforms (e.g., AWS Bedrock, Azure AI Foundry, Google Vertex AI)Familiarity with LLM architectures and the challenges of deploying large-scale modelsExperience designing or managing multi-agent systems and orchestrated AI workflowsHands-on experience implementing infrastructure using IaC frameworksB2+ level of English proficiency Nice to haveMaster’s or PhD in Computer Science, AI, or related fieldRelevant cloud or DevOps certifications (e.g., AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer)Strong problem-solving mindset and ability to thrive in a fast-paced, innovative environment We offerWith us you can:Work on a flexible schedule remotely or from any of our comfortable offices or coworking spaces in UkraineReceive the necessary equipment to perform your work tasksChange projects and technology stacks within EPAMGain experience in various business domains (Insurance, E-commerce, Healthcare, Finance, Travelling, Media, Artificial Intelligence, and more)Relocation opportunities may be available for eligible candidates, depending on the role and openings at other EPAM locationsParticipate in volunteer, charity programs and communities (both technical and interest-based)We focus on your professional growth:You can plan your individual career path together with your managerReceive regular feedback from colleaguesImprove your English for free with certified teachers (Speaking Clubs, client interview preparation courses, etc.)Get the opportunity to undergo free training and certification in AWS, GCP, or Azure CloudsUse the internal E-learn training program (18,200+ specialized training and mentoring programs)Access corporate accounts on LinkedIn Learning, Get Abstract and other partner resourcesStudy at EPAM Solution Architecture School with the instructors who are practicing architectsDevelop as a leader, join Delivery Management, Resource Management, Leadership Essentials school and moreParticipate in internal communities (500+ meetups, technical discussions, brainstorming sessions, online events and conferences annually)What we offer:Vacation and sick leave (including a sick leave without a medical certificate)A wide range of Voluntary Medical Insurance programs providing both medical treatment and various preventive options (including sports activities)Medical insurance for family members at corporate ratesCompany support during significant life events (childbirth or adoption, marriage, etc.)Support for psychological comfort: discounts on services from mental health specialists or coaches, thematic trainingE-kids program - a free programming language training program for EPAMers' children Kindly note that this role supports remote work, but only from within Ukraine. Kindly be advised that the set of benefits, including learning, certification, and other opportunities, may vary depending on the role you apply for. Our recruiter will be able to share more details about the specific opportunity during your general interview. EPAM strives to provide its global team of over 61,700 professionals in more than 55 countries with opportunities for professional growth from day one of collaboration. Our colleagues are the source of EPAM's success, so we value cooperation, strive to always understand our clients' business and aim for the highest quality standards. No matter where you are, you will join a dedicated, diverse community that will help you realize your potential to the fullest. 

Job Application Tips

  • Tailor your resume to highlight relevant experience for this position
  • Write a compelling cover letter that addresses the specific requirements
  • Research the company culture and values before applying
  • Prepare examples of your work that demonstrate your skills
  • Follow up on your application after a reasonable time period

You May Also Be Interested In