Monday, October 27, 2025
NTT DATA, Inc.

Platform Engineer - AI

Posted: 1 days ago

Job Description

Make an impact with NTT DATAJoin a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion – it’s a place where you can grow, belong and thrive.Your day at NTT DATAAs a Platform Engineer at NTT DATA, you will lead the design of complex managed service solutions for our largest enterprise clients. Your role involves driving the strategic vision and direction for these solutions, combining technological expertise and business acumen to create IT strategies and roadmaps aligned with our clients' business objectives, KPIs, and SLAs.Key Responsabilities: Platform Development & ArchitectureDesign and build internal developer platforms (IDPs) that provide self-service infrastructure provisioning, deployment pipelines, and operational tooling through intuitive interfaces and APIsDevelop comprehensive platform architecture spanning on-premises, cloud, and hybrid environments with focus on scalability and reliabilityCreate developer-friendly abstractions for complex infrastructure concepts, including deployment workflows, environment management, and service discovery mechanismsOperating Systems & Infrastructure ManagementDesign, implement, and maintain enterprise-grade Linux and Windows server infrastructures, including system installation, configuration, patching, and optimizationPerform advanced system administration tasks including user management, security hardening, performance tuning, and troubleshooting across diverse OS environmentsImplement automated OS provisioning and configuration management using infrastructure-as-code principlesVirtualization TechnologiesDesign, deploy, and manage virtualized infrastructure using VMware vSphere/ESXi, Microsoft Hyper-V, and KVM hypervisorsConduct capacity planning and performance analysis of virtual infrastructures to optimize resource utilizationImplement backup and disaster recovery solutions for virtual machines including technologies like Veeam and SRMIntegrate virtualization platforms with storage area networks (SAN) and network-attached storage (NAS) solutionContainerization & OrchestrationDesign, implement, and maintain Kubernetes clusters across various environments (on-premises, cloud, hybrid) with focus on scalability and high availabilityOptimize container orchestration platforms for performance, cost-efficiency, and resource management including advanced scheduling algorithmsDevelop and maintain container deployment strategies, including blue-green deployments, canary releases, and rolling updatesImplement service mesh technologies and networking solutions for secure, scalable service-to-service communicationCluster & Resource ManagementImplement advanced scheduling algorithms and resource allocation strategies for distributed workloads across multi-cluster and multi-tenant environmentsDesign and optimize job scheduling systems with features including backfill algorithms, fair share scheduling, and advance reservationsManage cluster resource allocation including CPU, memory, storage, and specialized hardware (GPUs) with focus on maximizing utilization and minimizing latencyImplement automated scaling policies and resource optimization techniques for dynamic workload managementCI/CD Pipeline EngineeringBuild and maintain sophisticated continuous integration and deployment pipelines incorporating automated testing, security scanning, and progressive deployment strategiesIntegrate CI/CD systems with Kubernetes and container orchestration platforms for streamlined application deliveryImplement GitOps workflows and Infrastructure-as-Code practices using tools like Terraform, Pulumi, and AnsibleMonitoring & ObservabilityDesign and implement comprehensive monitoring, logging, and alerting systems providing visibility into platform health and application performanceDeploy observability solutions using tools like Prometheus, Grafana, Jaeger, and distributed tracing systemsImplement automated anomaly detection and performance optimization based on metrics, logs, and tracesRequired QualificationsEducation & ExperienceBachelor's degree in Computer Science, Information Technology, or related field, or equivalent practical experience5+ years of experience in platform engineering, DevOps, site reliability engineering, or similar infrastructure-focused rolesTechnical SkillsExpert-level knowledge of Linux system administration (Red Hat Enterprise Linux, CentOS, Ubuntu, Debian) including kernel tuning, process management, and security hardeningProficiency in Windows Server administration including Active Directory, Group Policy, and PowerShell scriptinVirtualization TechnologiesStrong experience with VMware vSphere/ESXi, Microsoft Hyper-V, and open-source hypervisors like KVMKnowledge of virtualization management tools including vCenter Server and System Center Virtual Machine ManageContainerization & OrchestrationExpert-level Kubernetes administration including cluster setup, networking, storage, and securityProficiency with Docker containerization and container image management preferable including RAFAY and RANCHER platformsExperience with container orchestration patterns and service mesh technologiesCloud PlatformsHands-on experience with major cloud platforms (AWS, Azure, Google Cloud Platform) including compute, networking, and storage servicesKnowledge of cloud-native technologies and hybrid cloud architectureProgramming & ScriptingProficiency in scripting languages including Python, Bash, Go, and PowerShell for automation and infrastructure managementExperience with Infrastructure-as-Code tools like Terraform, Pulumi, CloudFormation, or AnsibleMonitoring & ObservabilityExperience with monitoring solutions including Prometheus, Grafana, Datadog, ELK Stack, and distributed tracing toolsKnowledge of observability best practices including metrics, logs, and traces correlationCluster & Resource ManagementExperience with job schedulers and resource management systems like Slurm, PBS, or Kubernetes scheduling frameworksUnderstanding of distributed systems architecture and resource optimization techniquesSoft SkillsStrong analytical and problem-solving abilities with experience in complex system troubleshootingExcellent communication skills and ability to work effectively with cross-functional virtual teams Product mindset with focus on developer experience and platform usabilityExcellent ability to work effectively remote and virtually across Europe & InternationalPreferred QualificationsKubernetes certifications (CKA, CKAD, CKS) or equivalent cloud platform certificationsExperience with service mesh technologies (Istio, Linkerd) and API gateway solutionsKnowledge of security frameworks and compliance standards (SOC 2, ISO 27001, HIPAA)Experience with GitOps practices and advanced CI/CD patternsBackground in high-performance computing or large-scale distributed systemsWorkplace type: Remote WorkingAbout NTT DATANTT DATA is a $30+ billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success. We invest over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are also one of the leading providers of digital and AI infrastructure in the world. NTT DATA is part of NTT Group and headquartered in Tokyo.Equal Opportunity EmployerNTT DATA is proud to be an Equal Opportunity Employer with a global culture that embraces diversity. We are committed to providing an environment free of unfair discrimination and harassment. We do not discriminate based on age, race, colour, gender, sexual orientation, religion, nationality, disability, pregnancy, marital status, veteran status, or any other protected category. Join our growing global team and accelerate your career with us. Apply today.Third parties fraudulently posing as NTT DATA recruitersNTT DATA recruiters will never ask job seekers or candidates for payment or banking information during the recruitment process, for any reason. Please remain vigilant of third parties who may attempt to impersonate NTT DATA recruiters—whether in writing or by phone—in order to deceptively obtain personal data or money from you. All email communications from an NTT DATA recruiter will come from an @nttdata.com email address. If you suspect any fraudulent activity, please contact us.

Job Application Tips

  • Tailor your resume to highlight relevant experience for this position
  • Write a compelling cover letter that addresses the specific requirements
  • Research the company culture and values before applying
  • Prepare examples of your work that demonstrate your skills
  • Follow up on your application after a reasonable time period

Related Jobs