In the cloud, every second counts. On the leading edge of security, Sysdig stops attacks in real-time by instantly detecting changes in cloud security risk with runtime insights and open source Falco. We are passionate open source enthusiasts at heart and problem-solvers who are building and delivering powerful solutions to secure cloud-native applications.

We value diverse opinions and open dialogue to spur ideas. We believe in working together to achieve our goals and we pride ourselves on a flexible work culture. We’re an international company that understands how to cultivate an inclusive environment across remote teams.

And we’re a great place to work too – we’ve been named a “Best Place to Work” by Inc.,the San Francisco Business Times and the Silicon Valley Business Journal, and we won six workplace awards from Comparably this year. We have been recognized by Deloitte as one of the 500 fastest-growing organizations for the last four years.

We are looking for driven team members who want to join us on our mission to lead cloud security globally. Does this sound like the right place for you?

What you will do

Reporting to the SRE Manager you will build and manage systems across internal and production Cloud environments with a focus on configuration as code and platform automation
You will implement reliability improvement projects, including capacity planning, performance tuning, load testing and infrastructure optimization
You will measure KPI with Service Level Indicators (SLIs), Service Level Objectives (SLOs) and Service Level Agreement (SLAs) and help to define them
You will help improve our incident response. Perform root cause analysis (RCA), troubleshoot and debug issues across our infrastructure and platform services to identify and fix causes

What you will bring with you

2/3 SRE, DevOps or Cloud Infrastructure Engineer experience
2/3 experience in containerization (kubernetes, docker and helm charts) – all of them
2/3 experience with Linux systems and networking
Software development skills; Go and Python a big plus

What we look for

Familiarity with monitoring tools such as Sysdig, Prometheus, Nagios, Icinga, Zabbix
Tooling and automations development experience
Experience in CI/CD tools such as Harness or Jenkins
Experience diagnosing and troubleshooting complex problems in high-throughput applications and network services

Why work at Sysdig?

We’re a well funded startup that already has a large enterprise customer base
We have an organizational focus on delivering value to customers
Our open source tools (https://sysdig.com/opensource/) are widely used and loved by technologists & developers

When you join Sysdig, you can expect:

Great compensation package, including equity opportunities
Benefits vary based on location
An international culture with employees in more than 40 countries
Flexible work arrangement
Mental well-being support for you and your family and company-wide recharge days
Development opportunities

We would love for you to join us! Please reach out even if your experience doesn’t perfectly match the job description. We can always explore other options after starting the conversation. Your background and passion will set you apart, especially if your career path is different.

Some of our Hiring Managers are globally distributed, an English version of your CV will be appreciated.

Sysdig values a diverse workplace and encourages women, people of color, LGBTQIA+ individuals, people with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply. Sysdig is an equal-opportunity employer. Sysdig does not discriminate on the basis of race, color, religion, sex, national origin, age, disability, genetic information, sexual orientation, gender identity, or any other legally protected status.

#LI- JG1

#LI-Hybrid

Site Reliability Engineer

Are you ready to join us?