Cnexia

Site Reliability Engineer

Posted: 48 minutes ago

Job Description

Joining Cnexia is choosing to be part of an ambitious project that values Innovation, promotes Continuous Learning, and enables all tech champions to fulfill their creative dreams.At Cnexia, we do more than support the clients of our world-class network and services. We develop innovative solutions and create original multiplatform media content. In fact, we’re revolutionizing how Canadians communicate on the web, interact with Mobile Apps, or benefit from an AI-enhanced experience.Proud of our status as a fully owned Moroccan subsidiary of the largest Canadian Telecom company, we have been ceaselessly growing our team since 2021. With over 4000 employees, mainly based in Fez, we have expanded in the northern region of the kingdom with our Brand-new state of the art site in Technopolis Rabat.If you are ready for this challenge, we invite you to join a community that values bold ideas and professional growth all in an engaging multi cultural world-class environment.Position: Senior Site Reliability EngineerWe are looking for a highly skilled Senior Site Reliability Engineer to support and enhance the reliability, performance, and scalability of our cloud-based production environment. The engineer will play a key role in monitoring, maintaining, and improving system availability while driving operational excellence across the platform.Main ResponsibilitiesEnsure the stability and availability of cloud production systems. Perform monitoring, alerting, and incident response using industry-standard observability tools. Automate recurring operational tasks and contribute to infrastructure improvements. Troubleshoot complex issues related to performance, system reliability, networking, and service integrations. Implement and maintain backup, failover, and disaster-recovery procedures. Collaborate with development and operations teams to enhance system performance and reduce operational risks. Maintain system documentation, runbooks, and operational standards. Participate in on-call rotations and continuous improvement initiatives. Required Technical Skills8+ years of experience in cloud production support or system operations. Strong knowledge of Linux administration. Expertise with cloud monitoring and logging tools such as Prometheus, Grafana, Stackdriver, Cloud Logging, Cloud Storage, or equivalent. Experience with scripting and automation (Python, Bash, or similar). Familiarity with CI/CD pipelines and DevOps tooling. Solid understanding of networking fundamentals and VoIP (asset). Experience in troubleshooting distributed systems and microservice-based architectures. Soft SkillsStrong analytical and problem-solving skills. Ability to work under pressure and handle critical incidents. Excellent communication and collaboration with cross-functional teams. Strong sense of ownership and accountability. Nice-to-HaveAdditional knowledge of cloud security principles. Prior experience in large enterprise or telecom environments. Familiarity with SRE best practices and reliability engineering frameworks.

Job Application Tips

  • Tailor your resume to highlight relevant experience for this position
  • Write a compelling cover letter that addresses the specific requirements
  • Research the company culture and values before applying
  • Prepare examples of your work that demonstrate your skills
  • Follow up on your application after a reasonable time period

You May Also Be Interested In