Job Description

About Quantiphi Quantiphi is an award-winning Applied AI and Big Data software and services company, driven by a deep desire to solve transformational problems at the heart of businesses. Our signature approach combines groundbreaking machine-learning research with disciplined cloud and data-engineering practices to create breakthrough impact at unprecedented speed.Company Highlights:Quantiphi has seen 2.5x growth YoY since its inception in 2013, we don’t just innovate—we lead. Headquartered in Boston, with 4000+ Quantiphi professionals across the globe. As an Elite/Premier Partner for Google Cloud, AWS, NVIDIA, Snowflake, and others, we’ve been recognized with:17x Google Cloud Partner of the Year awards in the last 8 years3x AWS AI/ML award wins3x NVIDIA Partner of the Year titles2x Snowflake Partner of the Year awardsWe have also garnered Top analyst recognitions from Gartner, ISG, and Everest Group.We offer first-in-class industry solutions across Healthcare, Financial Services, Consumer Goods, Manufacturing, and more, powered by cutting-edge Generative AI and Agentic AI accelerators.We have been certified as a Great Place to Work for the third year in a row- 2021, 2022, 2023.Be part of a trailblazing team that’s shaping the future of AI, ML, and cloud innovation. Your next big opportunity starts here!Role Overview: We are seeking a highly skilled HPC & AI Systems Engineer with expertise in Slurm workload management, FUSE-based file systems, and AI hypercomputing infrastructure. You will play a pivotal role in designing, deploying, and optimizing HPC clusters and AI workloads to support large-scale scientific and industrial applications.Key Responsibilities:Architect, deploy, and maintain HPC clusters leveraging Slurm workload manager for efficient job scheduling and resource management.Design and implement FUSE (Filesystem in Userspace) solutions to enable scalable, flexible storage access tailored for AI and HPC workloads.Collaborate with AI research teams to optimize hypercomputing infrastructure performance and scalability.Monitor, troubleshoot, and tune HPC environments to ensure high availability and optimal throughput.Develop automation scripts and tools to streamline HPC cluster management and job orchestration.Stay updated on emerging HPC, AI, and hypercomputing technologies, and contribute to continuous infrastructure improvements.Document system architecture, workflows, and best practices to enable knowledge sharing and operational excellence.Qualifications:Proven experience in managing HPC environments with Slurm workload manager.Strong hands-on expertise with FUSE file system development or integration.Solid understanding of AI workloads and requirements on hypercomputing platforms.Proficiency in Linux system administration and scripting (Bash, Python, etc.).Familiarity with distributed storage, networking, and cluster security best practices.Excellent problem-solving skills and ability to work in a fast-paced, innovative environment.Bachelor’s or Master’s degree in Computer Science, Engineering, or related field preferred.What is in it for you:Be part of the fastest-growing AI-first digital transformation and engineering company in the worldBe a leader of an energetic team of highly dynamic and talented individualsExposure to working with fortune 500 companies and innovative market disruptorsExposure to the latest technologies related to artificial intelligence and machine learning, data and cloud

HPC Engineer

Job Description

Job Application Tips

Related Jobs

iOS Engineer (05491)

Sales Specialist（Leisure Sales)

プロセスアシスタント, Amazonフレッシュ

Contingent Worker: Administration Support & Service - Hourly...

Job Description

Job Application Tips

Share this job

Apply for this Job

Related Jobs

iOS Engineer (05491)

Sales Specialist（Leisure Sales)

プロセスアシスタント, Amazonフレッシュ

Contingent Worker: Administration Support & Service - Hourly...