Thrive IT Systems

Cloud Engineer

Posted: 1 hours ago

Job Description

We are seeking an experienced L3 / Cloud Engineer to support and oversee complex cloud infrastructure environments with a primary focus on Microsoft Azure (95%) and limited workloads on AWS (5%). The role will involve handling critical L3-level operational issues, performing advanced troubleshooting, driving root-cause analysis, executing complex changes, and ensuring stable cloud operations. The candidate will collaborate with senior engineers, SMEs, and client stakeholders to maintain high availability, reliability, and performance of cloud platforms.Key Responsibilities:1. L3 On-Call Support (Every Three Weeks)· Handle L3-level critical incidents related to Azure and AWS environments.· Participate in war rooms and ensure war room initiation within 15 minutes of a critical issue.· Perform deep-dive troubleshooting, restore services, and provide technical leadership during P1 issues.2. L3 operational tasks· Execute and coordinate complex technical activities for:Change Requests (CHG) with cross-functional dependencies.Problem management: RCA creation, permanent fix recommendations, and follow-ups.Critical incident support (P1/P2), ensuring timely response and resolution.· Optimize day-to-day cloud operations through automation, scripting, and process improvements.3. Engineering & Platform Support· Perform advanced troubleshooting on Azure IaaS, PaaS, storage, networking, and security components.· Contribute to design discussions by providing L3-level technical inputs (not full architecture ownership).· Support cloud maintenance activities such as patching, updates, scaling, and performance tuning.· Implement operational best practices, monitoring enhancements, and system reliability improvements.4. Client & Stakeholder Communication· Act as an L3 technical contact for critical incidents, complex changes, and platform issues.· Provide clear, concise communication to internal teams, senior engineers, and client representatives.· Collaborate with architecture teams, service owners, and product groups on ongoing improvements.Required Skills & Experience (Primary Must-Haves):· 8–12 years of experience in cloud engineering or infrastructure operations.· Strong hands-on expertise with Microsoft Azure (IaaS, PaaS, networking, identity, governance).· Working knowledge of key AWS components (EC2, S3, VPC, IAM, etc.).· Deep understanding of:Incident, Change, and Problem Management processes (ITIL preferred)Azure monitoring & diagnostics tools (Azure Monitor, Log Analytics, Alerts)Platform operations, troubleshooting, and optimization· Proven experience handling L3 escalations and war room scenarios.· Strong technical documentation and communication skills.Secondary (Good-to-Have) Checks:· Azure Administrator / Engineer certifications (AZ-104, AZ-700, AZ-305).· AWS Solutions Architect Associate/Professional.· Experience with IaC tools (ARM, Bicep, Terraform).· Scripting skills (PowerShell / Python).· Experience with hybrid cloud and enterprise datacenter environments.Work Mode:· On-call rotation required every three weeks.· May require occasional off-hours support during critical incidents or major changes.Recruiter's Email : shikharsharma@thriveitsystems.com

Job Application Tips

  • Tailor your resume to highlight relevant experience for this position
  • Write a compelling cover letter that addresses the specific requirements
  • Research the company culture and values before applying
  • Prepare examples of your work that demonstrate your skills
  • Follow up on your application after a reasonable time period

You May Also Be Interested In