IBM

Senior Observability Engineer – Full Stack & Data

Posted: 1 days ago

Boost Your Application

Stand out with our professional, ATS-friendly resume templates designed to get you noticed by recruiters.

Download Resume Templates

Job Description

IntroductionAt IBM Finance & Operations, we are the backbone of IBM’s transformation driving efficiency, transparency, and smart decision-making across the business. Our teams provide the insight and discipline that guide strategy, ensure financial strength, and enable IBM to invest in innovation and growth. Working in Finance & Operations means combining analytical skills with collaboration and curiosity. You’ll partner with colleagues across functions and geographies, using data, technology, and process excellence to create solutions that improve performance and deliver measurable impact. IBM offers continuous learning, career development, and a culture that values diverse perspectives. Join us and be part of a global team that keeps IBM moving forward, while building your own future in a dynamic and evolving environment.Your Role And ResponsibilitiesFull Stack ObservabilityArchitect and implement observability frameworks across infrastructure, applications, and cloud-native services.Integrate telemetry pipelines using OpenTelemetry, Fluentd, and custom collectors.Data Network ObservabilityDeploy and manage SevOne for SNMP, NetFlow, and interface-level performance monitoring.Use ThousandEyes for synthetic testing, BGP monitoring, and WAN/SD-WAN visibility.Correlate network telemetry with application and user experience metrics to identify degradation patterns.Monitor network paths, packet loss, latency, and jitter across hybrid and multi-cloud environments.Application & Log ObservabilityInstrument applications using Instana for distributed tracing and causal AI-based root cause analysis.Centralize and enrich logs using Syslog, Fluentd, and route to SIEM and observability platforms.AIOps & AutomationImplement AI-driven incident detection, event correlation, and predictive analytics using platforms like IBM Cloud Pak or Splunk ITSI.Automate alerting, remediation workflows, and RCA processes to reduce MTTR.Performance Optimization & ReportingAnalyze telemetry data to identify bottlenecks and optimize system and network performance.Build unified dashboards and reports for technical and business stakeholders.Collaboration & GovernanceWork closely with NetOps, SRE, DevOps, and Security teams to ensure shared observability practices.Ensure observability data supports compliance, audit, and security monitoring requirements.Required Technical And Professional Expertise15+ years of experience in observability, network operations, or infrastructure monitoring.Hands-on expertise with SevOne, ThousandEyes, Instana, Syslog, and AIOps platforms.Strong understanding of hybrid cloud, SD-WAN, microservices, and distributed systems.Proficiency in scripting (Python, Bash), telemetry protocols, and data modeling.Experience with OpenTelemetry, Prometheus, Grafana, and log aggregation tools.Excellent analytical, communication, and stakeholder engagement skills.

Job Application Tips

  • Tailor your resume to highlight relevant experience for this position
  • Write a compelling cover letter that addresses the specific requirements
  • Research the company culture and values before applying
  • Prepare examples of your work that demonstrate your skills
  • Follow up on your application after a reasonable time period

You May Also Be Interested In