SRE Monitoring Engineer
Posted: 4 days ago
Job Description
About The Role:
This SRE / Monitoring Engineer role is part of the newly established Reliability Operations & Control (ROC) department, a centralized observability and reliability function focused on prevention, early detection, and automation. The engineer will ensure continuous monitoring, automation, and proactive incident prevention across financial platforms, infrastructure, and business applications.

What You'll Do (Responsibilities):
- Establish and maintain monitoring coverage across all core services (Zabbix, Grafana, Prometheus).
- Develop automated monitoring integrations with JIRA, GLPI, GitLab, and other APIs.
- Create dashboards and alerting systems that provide actionable insights for incident prevention.
- Improve alert accuracy and reduce false positives by 30%.
- Contribute to ROC automation initiatives (auto-recovery, gateway auto-suspension).
- Within 3 months: onboard into the existing monitoring stack; within 6 months: take ownership of key services; within 12 months: deliver improved observability KPIs (MTTR ↓, uptime ↑).

What You'll Need To Be Successful In This Role:
- Strong hands-on experience with Zabbix (advanced configuration, templates, triggers, scripting).
- Grafana dashboard design and data source integration.
- Linux fundamentals and shell scripting (Bash / PowerShell).
- Python for automation and API integrations.
- Prometheus, OpenSearch, PostgreSQL (basic queries, integration).
- Understanding of monitoring concepts: metrics, logs, alerting, uptime, SLO/SLA.
- Ability to interpret and process JSON / REST APIs.
- English: B2.

Core Technical Stack:
- Strong hands-on experience with Zabbix (advanced configuration, templates, scripting).
- Experience with Grafana, Prometheus, and OpenSearch.
- Good understanding of Linux systems, networking basics, and system performance.
- Scripting: Python, Bash/PowerShell, SQL (PostgreSQL/Oracle dialects).
- Understanding of monitoring fundamentals: metrics, logs, alerting, uptime, SLO/SLA.
- Working with JSON / REST APIs.

Tools & Environments:
- Experience with JIRA / GLPI, GitLab CI, AWS or other clouds.
- Familiarity with automation and configuration management (Ansible/Terraform is a plus).
- Understanding of incident management and SRE principles (MTTR, SLO, error budgets).

Soft Skills:
- Analytical thinking and ownership mindset.
- Clear communication with cross-functional teams (Dev, QA, Product).
- Proactive attitude towards problem prevention, not just resolution.

Why Join Paysend?
- Make a Global Impact: Directly impact millions of users worldwide.
- Accelerate Your Career: Benefit from internal mobility, mentoring programs, and continuous learning opportunities.
- Thrive in a Connected, Global Organization: Collaborate with colleagues across our international hubs and more.
- Embrace a Principle-Driven & Focused Culture: Work in an organization guided by strong principles and values that actually help you achieve more than you thought possible.
- Enjoy Competitive Compensation and Benefits: Receive a competitive salary, benefits, and flexible work arrangements.