Job Description

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.  We are seeking a Machine Learning Engineer to join our team and support the GenAI initiative. In this role, you will focus on designing, improving, and optimizing backend infrastructure to power LLM-based applications using OpenAI APIs. Your skills in MLOps, CI/CD, observability, and cloud-native technologies will be essential to ensure the reliability, scalability, and efficiency of AI-driven systems. ResponsibilitiesDevelop and improve backend infrastructure for AI and LLM-based solutionsIntegrate and oversee LLM applications within cloud environmentsScale AI systems to meet performance and reliability requirementsImplement automated deployment processes through CI/CD pipelinesTrack and maintain the performance of AI services to ensure consistencyEstablish logging and observability frameworks for monitoring LLM API performanceCollaborate with DevOps teams to streamline workflows and enhance system dependabilityWork closely with AI and Data Science teams to develop and enhance application featuresLeverage cloud platforms, especially Azure, to deploy and scale AI-powered applicationsDesign and build APIs and microservices to support AI-driven functionalities RequirementsAt least 2 years of experience in Machine Learning Engineering with a focus on backend and software developmentStrong expertise in integrating and working with OpenAI APIs and other AI servicesHands-on experience with MLOps tools such as Orion, ArgoCD, and Opsera for deployment automationProficiency with monitoring and observability tools, including Grafana, Dynatrace, and ThoughtSpotComprehensive knowledge of cloud platforms, particularly Azure, as well as Apache Spark and DatabricksAdvanced Python programming skills for backend development and implementationProven experience in designing and building APIs and microservices architectureFluency in English, both verbal and written, with a minimum proficiency level of B2+ Nice to haveKnowledge of Data Science principles and workflowsExperience with Large Language Models (LLMs)Understanding of Natural Language Processing (NLP) methodologies and applications We offerInternational projects with top brandsWork with global teams of highly skilled, diverse peersHealthcare benefitsEmployee financial programsPaid time off and sick leaveUpskilling, reskilling and certification coursesUnlimited access to the LinkedIn Learning library and 22,000+ coursesGlobal career opportunitiesVolunteer and community involvement opportunitiesEPAM Employee GroupsAward-winning culture recognized by Glassdoor, Newsweek and LinkedIn

Job Application Tips

  • Tailor your resume to highlight relevant experience for this position
  • Write a compelling cover letter that addresses the specific requirements
  • Research the company culture and values before applying
  • Prepare examples of your work that demonstrate your skills
  • Follow up on your application after a reasonable time period

You May Also Be Interested In