Site Reliability Engineer

ENGINEERING - FLEXIBLE - ITALY - SEPTEMBER 18, 2024

SHARE:

In the cloud, every second counts. On the leading edge of security, Sysdig stops attacks in real-time by instantly detecting changes in cloud security risk with runtime insights and open source Falco. We are passionate open source enthusiasts at heart and technical problem-solvers who are innovating and delivering powerful solutions to secure cloud-native applications.
We value diverse opinions and open dialogue to spur ideas. We believe in working closely together to achieve our goals, and since our launch, we have been flexible with when and where we work. We’re an international company that understands how to cultivate a strong culture across remote teams.
And we’re a great place to work too – we’ve been named a “Best Place to Work” by Inc.,the San Francisco Business Times and the Silicon Valley Business Journal, and we won six workplace awards from Comparably this year. We have been recognized by Deloitte as one of the 500 fastest-growing organizations for the last four years.
We are looking for driven team members who want to join us on our mission to lead cloud security globally. Does this sound like the right place for you?
What you will do
  • Reporting to the SRE Manager you will build and manage systems across internal and production Cloud environments with a focus on configuration as code and platform automation
  • Implement reliability improvement initiatives, including capacity planning, performance tuning, load testing and infrastructure optimization
  • Measure KPI via Service Level Indicators (SLIs), Service Level Objectives (SLOs) and Service Level Agreement (SLAs) and help to define them
  • Participate in and contribute to improving our incident response. Perform root cause analysis (RCA), troubleshoot and debug issues across our infrastructure and platform services to identify and fix root causes
What you will bring with you
  • Solid SRE, DevOps or Cloud Infrastructure Engineer experience
  • Solid experience in containerization (kubernetes, docker and helm charts) – all of them 
  • Solid understanding of Linux systems and networking
  • Software development skills; Go and Python a big plus
What we look for
  • Familiarity with monitoring tools such as Sysdig, Prometheus, Nagios, Icinga, Zabbix
  • Strong tooling and automations development experience
  • Experience in CI/CD tools such as Harness and/or Jenkins
  • Experience diagnosing and troubleshooting complex problems in high-throughput applications and network services
Why work at Sysdig?
We’re a well funded startup that already has a large enterprise customer base
We have an organizational focus on delivering value to customers
Our open source tools (https://sysdig.com/opensource/) are widely used and loved by technologists & developers
When you join Sysdig, you can expect:
Great compensation package, including equity opportunities
Benefits vary based on location
An international culture with employees in more than 40 countries
Flexible work arrangement
Mental well-being support for you and your family and company-wide recharge days
Development opportunities
We would love for you to join us! Please reach out even if your experience doesn’t perfectly match the job description. We can always explore other options after starting the conversation. Your background and passion will set you apart, especially if your career is unconventional.
Some of our Hiring Managers are globally distributed, an English version of your CV will be appreciated!