eu.tech.jobs
EU-TECH-JOBS // 18,957 OPEN // 1,252 COMPANIES // UPDATED 05/05 08:15 UTC

← all jobs

INTERNAL - Senior Software Engineer - SRE (all genders) - Berlin

Engineer Software Backend Frontend Full Stack Developer Programmer Coder Senior Sr

MBition (Mercedes-Benz) · Germany · Berlin · posted — · engineering · senior

Your Mission

As a Senior SRE Engineer, you will be a technical anchor for the reliability and operational excellence of the core developer platform services of SoftwareFactory, serving over 14,000 users across global locations including China and India.
SoftwareFactory is the foundation of MB.OS, enabling developer productivity across multiple Mercedes Benz organizations.
You will bring deep technical expertise and ownership to operating, scaling, and continuously improving our platform services, setting the standard for reliability engineering across the team. You will play a leading role in shaping how our global SRE teams in EU, India, and China work together to deliver consistent, high-quality service worldwide.

Sneak preview of your future tasks

  • Take technical ownership for the stability, availability, and performance of our developer platform services, proactively identifying and mitigating risks before they impact users.
  • Lead incident response across the global on-call rotation, driving fast mitigation, thorough root cause analysis, and lasting systemic improvements.
  • Own and drive the maturity of our incident management practices, including runbooks, post-mortems, escalation paths, and cross-team follow-up.
  • Define and lead the observability strategy: metrics, logging, alerting, and dashboards that give the team full, actionable visibility into system health.
  • Lead release coordination for SRE-managed services, enforcing safe and consistent deployment practices with minimal disruption to users.
  • Systematically identify and eliminate toil through automation, tooling improvements, and infrastructure-as-code — and mentor others in doing the same.
  • Drive technical alignment with platform engineers, cloud engineers, and other service teams to improve reliability across architectural layers.
  • Own the definition and continuous improvement of SLOs and SLIs, using them to guide technical decisions, prioritization, and stakeholder communication.
  • Act as a technical reference point within the team, contributing to architectural decisions, design reviews, and engineering standards.

Your Profile

  • Deep hands-on experience operating high-scale, self-hosted developer platform services (such as source control, artifact management, CI/CD tooling, or similar) in large enterprise environments.
  • Strong engineering depth in SRE or platform operations, with expert command of SRE principles applied in practice — SLOs, error budgets, toil reduction, and reliability-as-code.
  • Extensive experience leading incident response and post-incident processes in a 24/7 operational environment, including driving cross-team systemic improvements.
  • Strong understanding of cloud architecture and distributed systems, particularly in AWS, with the ability to reason about failure modes at scale.
  • Expert proficiency in infrastructure-as-code and configuration management (e.g. Terraform, Ansible, Helm), with a track record of improving platform maturity.
  • Advanced scripting and automation skills (e.g. Python, Bash, or similar), consistently used to drive down manual effort and increase operational leverage.
  • Deep experience designing and owning observability stacks (e.g. Prometheus, Grafana, ELK, or similar) in production environments.
  • Proven ability to drive technical alignment and influence engineering decisions across teams and organisational boundaries.
Nice to have
  • Hands-on experience with self-hosted DevOps tooling at scale (e.g. GitLab, Artifactory, Jenkins, or similar platforms).
  • Kubernetes experience in production environments, including large-scale cluster operations.
  • Multicloud experience in large scale infrastructure, as well as EU sovereign cloud concepts.
  • Background in large software delivery companies (not only in manufacturing).
Personal skills
  • Deep sense of ownership: you set the reliability bar for the team and hold it, proactively addressing systemic risks and driving improvements without being asked.
  • Thrives in fast-paced, ambiguous, and high-stakes environments — you bring clarity and direction when things get complex.
  • Excellent communicator: you document rigorously, share knowledge deliberately, and collaborate effectively across timezones, cultures, and seniority levels.
  • Strong technical judgment: you balance immediate operational needs with long-term platform health and make well-reasoned trade-offs.
  • High initiative and resilience: you lead from the front under pressure and help keep the team steady in difficult moments.
  • Natural mentor and multiplier: you raise the technical bar around you and invest in the growth of less experienced engineers.
  • Deeply committed to SRE values — you champion reliability as a shared engineering discipline, not just an operations concern.
Education
  • Degree in Computer Science, Information Technology, or a comparable field
Language skills
  • Proficient in English.
  • German is a plus, not a must.

Why us?

  • A chance to work on a new generation of Infotainment Systems, which will power millions of cars
  • An international, interdisciplinary innovation lab, which is part of the Mercedes-Benz AG
  • Great company values that we are passionate about and live by every day at work. 
  • Agile working methods and open feedback culture
  • A brand new modern and fully accessible office facing the Spree
  • Flexible working hours
  • Transportation and health benefits, discounts on cars, free coffee, fruits and more

Interested?

We look forward to receiving your complete application, including CV (in English or German) and relevant references with the following information:

  • Job title and reference number
  •  Salary expectations
  •  Earliest start date
We would like to encourage people with health impairments to apply to our jobs! Our building and work places offer the possibilities to adjust to different employee requirements. 

  • Posting Date: 20.04.2026
  • Supervisor: Gabriel Zaharia
  • Dept: Software Factory
  • Team: SwF Infra & SRE Cluster - 1
  • Opening: 1 x Internal FTE


Last seen — · request removal.