Randstad Japan

English Only OK! Senior SRE (dispatch)

Posted: 6 minutes ago

Job Description

■About the CompanyOur client is a leading technology innovator, building a real-time analytics data platform that powers hundreds of services across e-commerce, fintech, digital content, and communications. They are dedicated to providing foundational insights into users, products, and markets, driven by a passion for engineering excellence and continuous improvement.■About the PositionAs a Senior Site Reliability Engineer specializing in their Analytics Platform team, you will be instrumental in ensuring the reliability, scalability, performance, and security of their core data infrastructure. You will drive architectural excellence and mentor a growing team, tackling complex distributed systems challenges and contributing to the optimization of large-scale data pipelines. You will also play a key role in migrating critical components to Google Cloud Platform (GCP), advocating for and implementing robust cloud security best practices.■Role & Responsibilities* Architecting, developing, and deploying solutions to automate, maintain, operate, and optimize large-scale data pipelines.* Conducting system-wide analysis and performance tuning for capacity planning and bottleneck identification.* Implementing and refining monitoring, alerting, and incident response strategies to meet SLAs, SLOs, and SLIs.* Leading and assisting in the migration of critical data components to GCP, emphasizing secure cloud architecture and IAM.* Designing and implementing security controls and automation within GCP environments.* Ensuring system resilience through high-availability and disaster recovery mechanisms.* Enhancing and maintaining CI/CD pipelines for applications in Java, Node.js, and Scala.* Providing expert technical guidance and troubleshooting support to cross-functional teams.* Mentoring junior and mid-level SREs and software engineers.■Requirements* Minimum 4 years of professional experience in application development, primarily with Python.* Minimum 2 years of experience designing and operating distributed systems handling large volumes of data in near real-time.* Minimum 4 years of experience with Linux operating system internals.* Minimum 2 years of experience managing infrastructure in both bare-metal and cloud environments (GCP, AWS, Azure).* Minimum 2 years of experience with cloud security principles and practices (IAM, network security, data encryption).* Minimum 2 years of experience with Infrastructure as Code tools like Terraform, Ansible, or Chef.* Minimum 3 years of experience with monitoring, logging, and alerting systems, and defining/tracking SLAs, SLOs, and SLIs.* Experience with setting up, testing, and monitoring distributed relational databases.* Minimum 3 years of experience with CI/CD pipelines using Jenkins.* Minimum 3 years of experience maintaining and operating containerized applications (Docker, Kubernetes).■IndustrySoftware, ERP, BI, CRM, Web Application, Cloud (SaaS)■Expected Salary¥6,000,000 〜 ¥9,000,000If you are interested in this exciting position, we look forward to your application.

Job Application Tips

  • Tailor your resume to highlight relevant experience for this position
  • Write a compelling cover letter that addresses the specific requirements
  • Research the company culture and values before applying
  • Prepare examples of your work that demonstrate your skills
  • Follow up on your application after a reasonable time period

You May Also Be Interested In