cover
Full Time

100% Remote Site Reliability Engineer (SRE)/ 1 week ago

Digistore24
Attractive
Application ends: 2026-03-25

Quick Summary

This 100% remote role requires at least three years of IT operations experience and mandatory fluency in both German and English. You will manage system reliability and automation using Kubernetes, Terraform, and CI/CD tools like GitHub Workflows, preferably within Google Cloud. The position offers flexible working hours with core times from 10 AM to 4 PM and emphasizes autonomy in a stable, product-funded tech company.

Are you an experienced developer or DevOps engineer seeking remote work freedom and growth in Site Reliability Engineering? Join our internationally successful software and education company to elevate our reliability to the next level as part of our Site Reliability Engineering team.

Please note: English and German language proficiency is a MUST for this position. Do not apply if you do not speak both languages.

Who is Digistore24?

We are one of Europe's fastest-growing tech companies, driven by our mission to shape the digital future. Our software and expertise empower individuals to share knowledge online, enabling them to achieve their business dreams. This provides millions with access to information that helps them reach their goals. To sustain our growth, we are expanding our teams with experts and strong personalities who share our values, regardless of their location.

Your New Role: Site Reliability Engineer Responsibilities

  • Automation and Infrastructure as Code (IaC): Automate repetitive tasks, deployments, and system management to reduce human error and improve efficiency. This includes creating scripts, CI/CD pipelines, and automating infrastructure provisioning.
  • Reliability and Performance Optimization: Continuously enhance system uptime by identifying bottlenecks and optimizing system architecture.
  • Capacity Planning and Scaling: Assess and predict system resource requirements (CPU, memory, storage) to ensure infrastructure scalability with increasing demand. Implement auto-scaling solutions to manage load spikes without manual intervention, maintaining system performance under various conditions.
  • System Monitoring and Incident Response: Continuously monitor system performance, uptime, and reliability using tools like Prometheus, Grafana, or ElasticSearch. Detect and respond to issues proactively to minimize user impact. Manage and respond to incidents, outages, and failures swiftly, aiming to reduce downtime. This involves incident documentation, communication, and post-incident analysis.
  • Incident Postmortems and Continuous Improvement: Conduct root cause analysis (RCA) after incidents to identify issues and prevent recurrence. Implement fixes, improvements, and best practices based on post-mortem learnings to boost system reliability and reduce future incidents.

Your Benefits at Digistore24

  • Play a crucial role in shaping cutting-edge projects within a collaborative work environment.
  • Enjoy flexibility in working time and location, including home office or partner coworking spaces (with guaranteed uninterrupted internet access).
  • Access regular further education opportunities.
  • Benefit from the stability of a highly successful German high-tech company, funded by its product success, not investors.
  • Work in outcome-focused teams with a culture of direct feedback.
  • Receive modern equipment: Thinkpad or MacBook.
  • Join an international, collaborative team with strong cohesion.
  • Participate in spectacular team events across various European countries.
  • Experience autonomy from day one.
  • Contribute to a retirement scheme.
  • Work in a team on a first-name basis, without a dress code, and at eye level.
  • Enjoy flexible working hours from Monday to Friday (core working hours from 10 AM to 4 PM).

Skills and Experience for Your Dream Job at Digistore24

  • Communication Mastery: Communicate precisely and recipient-friendly, diffusing potential conflicts with sensitivity and a solution-oriented approach. Maintain the right tone with stakeholders, developers, and your team, even under pressure, and seamlessly switch between German and English.
  • Collaboration Wizardry: Collaborate effectively with developers, stakeholders, and operations, ensuring everyone is aligned. Understand challenges across different teams and find company-wide beneficial solutions.
  • Automation Sorcery: Champion automation to save time and reduce errors, implementing tools that enhance team productivity.
  • Problem-Solving Genius: Deeply investigate problems, identify root causes, and devise solutions to prevent future incidents.
  • Self-organization: Thrive on autonomy, excelling at organizing and structuring complex projects while working remotely.

Technical Stack

  • Kubernetes / Container Technology
  • CI/CD (Github Workflows, Helm, Kustomize)
  • Cloud Services (preferably Google, but other platforms are acceptable)
  • Excellent spelling and grammar in German
  • PHP language experience (a plus)

A Typical Day at Digistore24

Start your day with a morning video call to discuss yesterday's progress and today's plans with your team.

You prefer a structured approach, outlining your daily routine and goals. You consistently allocate time for the continuous development of our SRE processes, supported by your team.

During the daily team call, you report on priorities and blockers, receiving practical tips to overcome challenges.

For several hours, you focus on developing improvements for auto-scaling, monitoring, and alerting, testing your ideas in practice. You document these successful principles to present them to the Head of IT Operations in a one-on-one meeting.

After lunch, you assist a developer with a new CI/CD workflow, discussing requirements and providing an initial prototype.

You address a ticket to review an application's resource allocation, checking current utilization and adjusting the deployment as needed.

Upon discovering an unmonitored endpoint, you create a ticket and immediately write the necessary code in the Terraform project to add it to monitoring.

This Position is NOT for You If:

  • You do not identify with our company values.
  • You have less than 3 years of experience in IT operations.
  • You struggle with ownership and require detailed discussions with supervisors or colleagues for every decision.
  • You have difficulty planning and prioritizing tasks.
  • You do not enjoy finding solutions for complex problems.
  • You are not confident speaking both German AND English.

Our Values

Review our values here: https://careers.digistore24.com/kultur-und-werte

Are you ready to live them?

Share

Digistore24

Digistore24

  • Address
    London, England
View Profile
Your experience on this site will be improved by allowing cookies Cookie Policy