Job Description

Essential Duties and Responsibilities:

  • Understand and drive to achieve automated Cloud Native Solutions in Google Cloud Platform, Microsoft Azure, and AWS environments.
  • Work on the design and implementation of Cloud Infrastructure and On Premise systems.
  • Implement and continuously improve Container Orchestration platforms to support Microservice-modeled development and its adoption.
  • Design and implement future-state SaaS / IaaS architecture using Infrastructure as Code and Configuration Management approaches.
  • Implement and collaborate on solutions that increase the monitoring and observability of systems at scale and that detect and alert on trends.
  • Define metrics to ensure the high performance and stability of our development, test, and production environments.
  • Enable and implement continuous delivery and continuous integration.
  • Analyze a variety of approaches to SRE / DevOps problems and present the pros and cons of each to the team to arrive at an agreed-upon direction.
  • Provide first-level support for application software issues in all environments.
  • Prioritize and rapidly troubleshoot issues to ensure maximum uptime and optimal performance for customers in our production environment.

Job Qualifications

Education:

  • Bachelor of Science degree in Computer Science or related field. Master’s degree preferred.

Experience:

  • Minimum 5 years of software-related experience required, combining Site Reliability, DevOps, or Release Engineering experience with Software Engineering experience in backend services or data systems.
  • Cloud experience and architecture understanding of Google Cloud Platform, Azure, and/or Amazon Web Services.
  • Experience using Infrastructure as Code tools such as Terraform, Azure RM, and CloudFormation.
  • Hands-on experience with Kubernetes, OpenShift, or similar container orchestration systems.
  • Instrumentation experience with an APM tool such as App Dynamics, DataDog, or Splunk.
  • Hands-on experience exposing / consuming metrics and other system KPIs; DataDog, PagerDuty, Prometheus, Grafana, and AlertManager experience a plus.
  • Experience with and deep commitment to the transformation to a DevOps culture focused on continuous integration: the full lifecycle of building, automated and performance testing, and automated deployment.
  • Experience with Docker, Podman, and other container solutions, including their management tools and challenges.
  • Experience with Ansible, Chef, Puppet, or other Configuration Management tools.
  • Extensive experience with Linux, Unix, and Windows.
  • Hands-on experience with Python, shell scripting, Groovy, Golang, etc.
  • Experience with Jenkins, Azure DevOps, TeamCity, or other Continuous Integration tools.
  • Solid understanding of Cloud Authentication, Authorization, and Secrets Management: Azure RBAC and AWS/GCP IAM, including built-in and custom Roles, Security Groups, and Scopes.
  • Demonstrate strong problem analysis, problem resolution, decision making, and judgment skills.
  • Demonstrate understanding of complex software architecture, and the ability to help enhance, support, and troubleshoot that architecture.
  • Ability to leverage technical know-how to find viable compromises amidst competing business needs.
  • Demonstrate ability to plan and excel in a fast-paced and demanding environment.

