A Ceph guide for Kubernetes and OpenShift users

Published by:

Jorge Salamero Sanz

Published:

January 30, 2017

Ceph is a self-hosted distributed storage system popular among organizations using containers in production. For those looking for a storage solution in their containerized infrastructure, we created this guide to cover:

How to Deploy Ceph on AWS (part 1 of 3)
Ceph Persistent Volume for Kubernetes or OpenShift (part 2 of 3)
How to Monitor Ceph: the top 5 metrics to watch (part 3 of 3)

How to deploy Ceph on AWS

Quick Introduction to Ceph and alternatives

Ceph is a distributed storage system designed for high scalability that provides 3 storage methods:

Object storage: compatible with AWS S3 or OpenStack Swift.
Block storage: similar to what could be a scaled out version of DRBD, like AWS EBS, Google Persistent Disks or Rackspace Cloud Block Storage.
POSIX-compatible network file system: similar to NFS but on steroids, with multiple concurrent clients, replication, authentication, etc.

Ceph stores data across different storage pools. It uses an algorithm known as CRUSH to calculate which placement group should contain an object and which object storage daemons (OSDs) should store that placement group. OSDs keep their data on a traditional file system like Btrfs, XFS or ext4.
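
The two-step mapping described above (object to placement group, placement group to OSDs) can be illustrated with a small sketch. This is not the real CRUSH algorithm, which also walks a weighted cluster hierarchy and uses its own hash function; it only assumes a plain SHA-256 hash and a simple score ranking to show why any client can compute a placement without asking a central lookup service. The function names, the object name and the OSD names are made up for the example.

```python
import hashlib


def stable_hash(text: str) -> int:
    """Deterministic 64-bit hash (unlike Python's built-in, per-process salted hash())."""
    return int.from_bytes(hashlib.sha256(text.encode()).digest()[:8], "big")


def object_to_pg(object_name: str, pg_num: int) -> int:
    """Step 1: map an object name to a placement group id."""
    return stable_hash(object_name) % pg_num


def pg_to_osds(pg_id: int, osds: list[str], replicas: int) -> list[str]:
    """Step 2: pick `replicas` distinct OSDs for a PG by ranking a per-OSD hash score."""
    ranked = sorted(osds, key=lambda osd: stable_hash(f"{pg_id}:{osd}"), reverse=True)
    return ranked[:replicas]


# Hypothetical cluster: three OSDs, one pool with 128 placement groups.
osds = ["osd.0", "osd.1", "osd.2"]
pg = object_to_pg("my-image.0001", pg_num=128)
print(pg, pg_to_osds(pg, osds, replicas=2))
```

Because both steps are pure functions of the object name and the cluster layout, every client that knows the layout computes the same answer.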

Due to its block storage capabilities, scalability, clustering, replication and flexibility, Ceph has started to become popular among Kubernetes and OpenShift users. It's often used as the storage backend for Persistent Volumes for Docker containers. We could consider Ceph a self-hosted version of AWS EBS or Google Persistent Disk, while Kubernetes or OpenShift is the self-run orchestration platform, as opposed to AWS ECS or Google Container Engine.

Ceph basic terminology

OSD: software that interacts with the logical disks. Typically nodes running the OSD daemon are called OSDs. These handle data storage and replication, recovery, backfilling and rebalancing.
MON: nodes running the Ceph monitor software are called MONs. These maintain the cluster state and membership maps (the monitor map, the OSD map, the placement group (PG) map and the CRUSH map) and provide consensus for distributed decision-making.
MDS: software that stores the metadata for the Ceph distributed filesystem. This is not used for Ceph Block Storage, only for the Ceph network file system.
RGW: nodes running the Ceph gateway software (RADOS Gateway) are called RGWs. This is only used for Ceph Object Storage, providing the REST interface.
RBD: RADOS Block Device, the Linux kernel storage layer that stripes disk images into RADOS objects that can be stored across the Ceph cluster. Merged upstream in Linux 2.6.37.
Placement Group: an internal, configurable grouping of objects used to balance the data across the different OSDs.
Pools: logical groups in which objects are stored. An analogy with Kubernetes or OpenShift could be namespaces.
RADOS: Reliable Autonomic Distributed Object Store, the technology on top of which Ceph has been built. It is a self-healing, self-managing system for storage nodes.
CRUSH algorithm: Controlled Replication Under Scalable Hashing, a deterministic and decentralized placement algorithm used by RADOS and Ceph. This is basically the magic that avoids bottlenecks.
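
Since the number of placement groups is configurable per pool, a quick way to pick a starting value is the rule of thumb from the Ceph documentation: roughly 100 PGs per OSD, divided by the pool's replica count, rounded up to the next power of two. The sketch below only encodes that arithmetic; the function name and the default of 100 are illustrative choices, and real clusters with several pools need more care.

```python
def suggested_pg_count(num_osds: int, pool_size: int, target_pgs_per_osd: int = 100) -> int:
    """Rule-of-thumb PG count: (OSDs * target) / replicas, rounded up to a power of two."""
    raw = num_osds * target_pgs_per_osd / pool_size
    power = 1
    while power < raw:
        power *= 2
    return power


# A small cluster like the one in this guide: 3 OSDs, 3 replicas per object (assumed).
print(suggested_pg_count(num_osds=3, pool_size=3))
```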

Why Ceph? What are the alternatives?

Ceph has become popular for being open source and free to use. It became the most popular OpenStack storage backend, and it is gaining popularity among Kubernetes and OpenShift users because it can scale up and out using commodity hardware or cloud instances and has a thin provisioning layer.

Alternatives to Ceph in the context of Docker containers are, to mention a few, GlusterFS, NFS, Flocker (now defunct as a company but still open source), Infinit (now acquired by Docker, to be open sourced this year), iSCSI from multiple vendors or Portworx.

How to deploy Ceph on AWS

If you are just getting started and you want to play with Ceph, your first move will probably be to launch a Ceph cluster in AWS. Once the nodes are up and running, the following is valid for any infrastructure.

To get something as close as possible to production, we will start 3 monitor nodes and 3 storage nodes. We will be using Fedora 25 Cloud in the us-west-1 AWS region; if you prefer Ubuntu 16.04, just use ami-539ac933 instead.
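
The choice of 3 monitor nodes follows from how monitors reach consensus: a strict majority of them must be up for the cluster to make decisions. A short sketch of that arithmetic (the function names are illustrative) shows why 3 is the smallest count that survives the loss of one monitor:

```python
def quorum_size(monitors: int) -> int:
    """Smallest strict majority of the monitor set."""
    return monitors // 2 + 1


def tolerated_failures(monitors: int) -> int:
    """How many monitors can fail while a majority is still reachable."""
    return monitors - quorum_size(monitors)


for count in (1, 2, 3, 5):
    print(f"{count} monitors: quorum={quorum_size(count)}, "
          f"tolerated failures={tolerated_failures(count)}")
```

Note that 2 monitors tolerate no failures at all, which is why odd counts are the usual choice.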

Let's spawn 3 instances for the monitor nodes with the following command:

$ aws ec2 run-instances --count 3 --image-id ami-efa8f88f --instance-type t2.medium --key-name keyname --associate-public-ip-address

And 3 instances for the storage nodes with:

$ aws ec2 run-instances --count 3 --image-id ami-efa8f88f --instance-type t2.medium --block-device-mappings file://$(pwd)/fedora-ebs-config.json --key-name keyname --associate-public-ip-address

The fedora-ebs-config.json file will contain something like the following, so that each storage node has a dedicated disk for OSD storage:

[
  {
    "DeviceName": "/dev/xvdb",
    "Ebs": {
      "DeleteOnTermination": true,
      "VolumeType": "gp2",
      "VolumeSize": 30
    }
  }
]
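
If you would rather generate this file than write it by hand, for instance to vary the disk size per environment, a minimal sketch could look like the following. The helper names are hypothetical; the keys and values are the ones from the mapping above.

```python
import json


def ebs_mapping(device: str = "/dev/xvdb", size_gib: int = 30) -> list[dict]:
    """Build the block device mapping: one dedicated gp2 volume for OSD data."""
    return [
        {
            "DeviceName": device,
            "Ebs": {
                "DeleteOnTermination": True,
                "VolumeType": "gp2",
                "VolumeSize": size_gib,
            },
        }
    ]


def render_mapping(mapping: list[dict]) -> str:
    """Serialize the mapping as indented JSON, ready to be saved to a file."""
    return json.dumps(mapping, indent=2)


print(render_mapping(ebs_mapping()))
```

Saving the printed output as fedora-ebs-config.json gives the file referenced by the --block-device-mappings option.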

With the infrastructure up and running we have 2 options: we can use ceph-deploy, a set of Python scripts that automate the cluster deployment, or, my favourite, leverage Ansible to deploy and manage the Ceph cluster with ceph-ansible.

First we will clone the repo:

$ git clone https://github.com/ceph/ceph-ansible/

and, from inside the cloned ceph-ansible directory, we will configure our ansible.cfg for this project:

[defaults]
action_plugins = plugins/actions
roles_path = ./roles

hostfile = hosts

[ssh_connection]
control_path = %(directory)s/%%h-%%p-%%r

Now we need to create our hosts file with the inventory:

[all:vars]
# use ubuntu instead of fedora if using ami-539ac933
ansible_user=fedora
ansible_become=true
ansible_ssh_private_key_file=~/.ssh/keyname.pem

[mons]
ec2-54-183-211-140.us-west-1.compute.amazonaws.com
ec2-54-183-81-72.us-west-1.compute.amazonaws.com
ec2-52-53-225-165.us-west-1.compute.amazonaws.com

[osds]
ec2-54-183-76-6.us-west-1.compute.amazonaws.com
ec2-54-215-168-100.us-west-1.compute.amazonaws.com
ec2-54-193-120-179.us-west-1.compute.amazonaws.com
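
Since the public DNS names change every time the instances are recreated, it can be handy to render the inventory from two lists of hostnames. The sketch below is one possible way to do it; the function name is hypothetical and the layout simply mirrors the inventory shown above.

```python
def render_inventory(mons: list[str], osds: list[str],
                     user: str = "fedora",
                     key_file: str = "~/.ssh/keyname.pem") -> str:
    """Render an INI-style Ansible inventory with [mons] and [osds] groups."""
    lines = [
        "[all:vars]",
        f"ansible_user={user}",
        "ansible_become=true",
        f"ansible_ssh_private_key_file={key_file}",
        "",
        "[mons]",
        *mons,
        "",
        "[osds]",
        *osds,
    ]
    return "\n".join(lines) + "\n"


inventory = render_inventory(
    mons=["ec2-54-183-211-140.us-west-1.compute.amazonaws.com"],
    osds=["ec2-54-183-76-6.us-west-1.compute.amazonaws.com"],
)
print(inventory)
```

Writing the returned string to a file named hosts matches the hostfile setting in ansible.cfg.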

Next, enable the site.yml and group_vars files by copying the provided samples:

$ cp site.yml.sample site.yml
$ cp group_vars/all.yml.sample group_vars/all.yml
$ cp group_vars/mons.yml.sample group_vars/mons.yml
$ cp group_vars/osds.yml.sample group_vars/osds.yml

Let's configure a few things in group_vars/all.yml:

ceph_origin: 'upstream' # whether to use distro packages or upstream
ceph_stable: true # stable or dev release
monitor_interface: eth0 # interface to connect to other nodes in the cluster
journal_size: 5120 # OSD journal size in MB
public_network: 172.31.16.0/20 # front network towards the clients, read more on http://docs.ceph.com/docs/master/rados/configuration/network-config-ref/
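
The public_network value has to contain the private addresses of your instances, otherwise the daemons will not bind where the clients expect them. A quick sanity check can be done with Python's standard ipaddress module; the two sample addresses below are made up for illustration, so substitute the private IPs of your own nodes.

```python
import ipaddress

# Value taken from group_vars/all.yml above.
public_network = ipaddress.ip_network("172.31.16.0/20")


def in_public_network(address: str) -> bool:
    """Return True if the given IPv4 address belongs to the Ceph public network."""
    return ipaddress.ip_address(address) in public_network


# Hypothetical private addresses of two instances.
for node_ip in ("172.31.20.10", "172.31.32.1"):
    print(node_ip, in_public_network(node_ip))

print("first:", public_network[0], "last:", public_network[-1])
```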

And in group_vars/osds.yml:

devices: # devices to be used by ceph
  - /dev/xvdb
osd_auto_discovery: true # ansible to configure ceph on the previous device automagically
journal_collocation: true # same disk for data and journal
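
With the journal collocated on the data disk, part of each 30 GB volume is taken by the journal, and replication divides what is left. The back-of-the-envelope sketch below assumes a replicated pool of size 3 (the usual Ceph default, not something set in this guide) and ignores file system overhead and the safety margin Ceph keeps before it considers an OSD full, so treat the result as an upper bound.

```python
def usable_capacity_gib(osd_count: int, disk_gib: float,
                        journal_mb: float, pool_size: int) -> float:
    """Rough upper bound on usable space for a replicated pool."""
    journal_gib = journal_mb / 1024          # journal_size is expressed in MB
    raw_gib = osd_count * (disk_gib - journal_gib)
    return raw_gib / pool_size               # every object is stored pool_size times


# Values from this guide: 3 storage nodes, 30 GB disks, 5120 MB journals.
# pool_size=3 is an assumption.
print(usable_capacity_gib(osd_count=3, disk_gib=30, journal_mb=5120, pool_size=3))
```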

Once all this is ready, simply deploy and wait:

$ ansible-playbook site.yml

When the Ansible playbook run completes we are ready to go.

Other Ceph deployment strategies for Kubernetes and OpenShift

Ideally we should be able to deploy Ceph with containers, as we like to do with the rest of our services. This is a work in progress, but Huamin Chen from Red Hat presented it in Lessons Learned Containerizing GlusterFS and Ceph with Docker and Kubernetes.

As of today, deploying Ceph in Docker containers can be done as shown in this video:

And if you were wondering about Kubernetes, this is also possible as documented here. The challenges of these containerized approaches, compared with a more traditional bare-metal approach, are dealing with full cluster reboots (MON nodes need persistent storage) and properly scheduling the OSD pods onto the nodes that have the physical storage devices.

Up and running

OK, now you're up and running with some combination of Ceph, Ansible and OpenShift or Kubernetes, on AWS or elsewhere. Next up, we'll go into details around understanding Ceph performance, health checks, the top 5 Ceph metrics to watch and more. Keep reading!
