Senior/Infrastructure Engineer (SF)

San Francisco, CA

Here at Sysdig, we’re what you might call container-obsessed. It starts with our unique technology, which listens to the heart of the operating system to surface the deepest data with the least overhead. From there, we’ve created the first-ever Container Intelligence Platform, which proactively uncovers issues before they manifest, and allows for deep digging to solve the most complex problems.

We’re looking for a Senior/Infrastructure Engineer to help us lead the container revolution. You’ll build solutions to enhance the availability, performance, and stability of the Sysdig SaaS and On-Prem offering. Being part of the engineering team, you will support Sysdig through building automated self-healing systems.

Role Responsibilities:

  • Enhancing our application running in Kubernetes with self-healing and stability improvements
  • Building and managing various components of internal and production environments with a focus on configuration management, continuous integration, and platform automation
  • Building and managing software delivery, systems integration, and developer support tools
  • Enhancing developer CI/CD pipeline using Jenkins and Github
  • Automating our infrastructure and EC2 deployments as well as our build automation systems
  • Conducting performance tuning, load testing, and optimization of information/data processing of the production environment

Key technologies:

Go, Python, Cassandra, Kafka, Kubernetes

Required Qualifications:

  • Solid full-cycle development experience in a high-level language, preferably Golang/Python/Java
  • Worked with containers such as Docker, Rkt (Rocket), containerd
  • Aptitude for troubleshooting complex problems in high-throughput web applications and network services
  • Solid understanding of Linux systems and networking

Desired Qualifications:

  • Deployed Kubernetes or OpenStack clusters
  • Managed any of these clusters – Cassandra, HBase, HDFS, Elasticsearch, Kafka, Redis
  • Proficiency with configuration management tools like Terraform (or at least Puppet, Chef, or SaltStack)
  • Experience in diagnosing and troubleshooting customer facing production service outages
  • Experience in monitoring cloud services using tools like Sysdig, Datadog, Prometheus, Grafana, Graphite, Nagios, or Zabbix
  • Experience in managing AWS resources including EC2, RDS, Auto Scaling groups, ALB/NLB, IAM

Why Join Sysdig

Cloud-native is fundamentally changing how organizations build and run applications to fully take advantage of the cloud computing model. Sysdig is the cloud-native intelligence company making it happen. Join us and you’ll be working at the cutting-edge of infrastructure technology and the birth of an entirely new industry. Be the one who solves the hard challenges of operating Kubernetes and Containers at scale – and have fun doing it with a great group of people.

When you join Sysdig, you can expect:

  • Competitive salary
  • Top-notch health insurance coverage
  • We offer the best of both worlds: we’re a well-funded startup ($121.5 million) with a 300+ enterprise customer base (300 and counting)

Additionally, we offer a variety of benefits and perks, such as:

  • 401k with company matching up to 3%
  • Flexible vacation policy
  • Monthly self-improvement grant – spend on yourself however you see fit!
  • Weekly team lunches and snacks every day of the week
  • Monthly house cleaning allowance
  • Fun team with company events and lots of espresso

Are you ready to join us?

We're excited to receive your application.

Sysdig is an Equal Opportunity Employer.

We do not discriminate on the basis of race, color, national origin, religion, gender, age, veteran status, sexual orientation, marital status or disability (in compliance with the Americans with Disabilities Act) with respect to employment opportunities.