KubeCon + CloudNativeCon Amsterdam | March 30 – April 2 | Best Pricing Ends February 2 | Learn more



Only at CloudNativeDay: Jeff McCormick Teaches Us How to Make Containerized PostgreSQL Cloud Native Ready

By | Blog

Curious how CNCF’s incubated projects Kubernetes and Prometheus can provide a set of data services around the PostgreSQL database?

Head to Toronto for CloudNativeDay on August 25th to hear Crunchy Data Solutions’ Cloud Developer Jeff McCormick address running, administering, and monitoring PostgreSQL database clusters in his session “Containerizing PostgreSQL and Making it Cloud Native Ready.”

The long-time data architect charged with cloud development at Crunchy Data Solutions developed OpenShift PaaS containers for deploying the PostgreSQL database. His latest work includes building a Docker-based PostgreSQL Mgmt solution that enables users to administer and provision PostgreSQL into Docker-based PaaS and cloud environments.

Don’t miss out on this and other perspectives that you’ll only find at CloudNativeDay. Register Today!

Only at CloudNativeDay: Real-World Examples of Containers and Microservices Architectures from Donnie Berkholz

By | Blog

Donnie Berkholz, 451 Research, presenting at CloudNativeDay 2016

Enabling DevOps are two of the fastest-growing trends in technology; containers and microservices. With rapid growth comes rapid confusion. Who is using the technology? How did they build their architectures? What is the ROI of the technology?

Having real-world examples of how leading-edge companies are building containers and microservices architectures will help answer these burning questions. If you want the answers, then make your way to Toronto for CloudNativeDay on August 25th to hear 451 Research’s Development, DevOps, & IT Ops channel Research Director, Donnie Berkholz talk “Cloud Native in the Enterprise: Real-World Data on Container and Microservice Adoption.”

Berkholz current research is steeped in the latest innovative technologies employed for software development and software lifecycle management to drive business growth and will shape this session exploring the state of cloud-native prerequisites in the enterprise, the container ecosystem including current adoption, and data on companies moving to cloud-native platforms.

While at Redmonk, Berkholz followed trends in software development, data science and DevOps, with a particular focus on data-driven approaches to industry analysis. Berkholz also spent more than a decade as an open-source developer on Gentoo Linux, and served as a Research Fellow at the Mayo Clinic and a writer for LWN.net.

Holding a Ph.D. in Biochemistry & Biophysics from Oregon State University, Berkholz received a BS in Biochemistry & Molecular Biology and a BA in Chemistry, with a Journalism minor, from the University of Richmond.

Don’t miss out on this and other perspectives that you’ll only find at CloudNativeDay. Register Today!

Only at CloudNativeDay: Dr. Angel Diaz on Scaling Containers from Sandbox to Production

By | Blog

Dr. Angel Diaz Keynotes CloudNativeDay 2016

There is an industry IT renaissance occurring as we speak around cloud, data and mobile technology and it’s driven by open source code, community and culture.

If you want to learn more about this renaissance, then make your way to Toronto for CloudNativeDay on August 25th for Dr. Angel Diaz’s keynote on “Scaling Containers from Sandbox to Production.”

In this exciting keynote, Dr. Angel Diaz, IBM’s VP Cloud Architecture & Technology, will discuss how the digital disruption in today’s market is largely driven by containers and other open technologies. With a container-centric approach, developers are able to quickly stand up containers, to iterate quickly, and rapidly change their architectures. Dr. Diaz will provide insight on how enterprises are able to transform the way their companies grow, maintain and rapidly expand container and micro-service based applications across multiple clouds. Dr. Diaz will also discuss the role of CNCF in creating a new set of common container management technologies informed by technical merit and end user value. More on his session and speaker profile can be found here.

Don’t miss out on this and other perspectives that you’ll only find at CloudNativeDay. Register Today!

CNCF Member Session: Brandon Philips of CoreOS on “Containers at Scale Thanks to Kubernetes”

By | Blog

Screen Shot 2016-07-25 at 1.55.41 PM.png

At OSCON 2016 in Austin, Brandon Philips of CoreOS made a great argument for “Containers at Scale Thanks to Kubernetes.” Using examples drawn from Tectonic’s use of Kubernetes to host CoreOS SaaS products and recommendations given to Tectonic customers, Philips offered a how-to talk and tutorial demonstrating how to set up a production-ready cluster with full Transport Layer Security (TLS), manage the cluster through scaling and upgrade, and build the cluster for high availability. He also addressed TLS management, managing your machine’s over time, and upgrading Kubernetes itself while keeping availability. Find his session slides here.

Happy 1st Birthday Kubernetes

By | Blog

July 21,2016

Happy first birthday to Kubernetes! One year ago today, Kubernetes was released and has quickly grown to become one of the highest velocity open source cloud-related projects. With more than 639 developers contributing to the project and 17,366 commits over the last 12 months, you could say Kubernetes has become a pretty important part of the cloud native stack.

Accepting Kubernetes as our first CNCF project helped us establish our reputation as a open source foundation driving development of cloud-native technologies. We have since accepted Prometheus and plan to accept more open source projects in the future.

"Kubernetes use has exploded in the last year.  At the CNCF, we couldn't be prouder of how this technology has shown customers a way to ship production class Cloud Native applications." – Alexis Richardson, CNCF TOC chair, Weaveworks CEO

Kubernetes exemplifies what CNCF is looking for with projects. It’s already up and running and proven to solve a problem for cloud native applications. To learn more about projects that are well-suited to CNCF, check out our brand values and selection criteria, which Richardson spoke about at a recent Linux conference (slides).

One of CNCF’s key goals in working closely with the Kubernetes team is to ensure the project continues to move at high speed as smoothly as possible. Toward this end, as part of The Linux Foundation, CNCF recently brought in open source veteran and Linux Kernel maintainer Greg Kroah-Hartman to discuss with the Kubernetes team strategies the Linux kernel team uses to maintain the highest velocity project in the world. Check out the video of Greg’s presentation.

Kubernetes and Its Origin

The name Kubernetes originates from Greek, meaning "helmsman" or "pilot," and that's the role it fills in a container workflow, as it oversees and manages multiple containers at scale. Here is a great animated video that explains how Kubernetes achieves this.

Kubernetes (commonly referred to as k8s) was initially developed by Google as an open source platform for automating deployment, scaling, and operations of containers across clusters of hosts. Kubernetes was actually inspired by Google's Borg, the system that Google uses to run its massive global infrastructure.

Fostering an ecosystem of components and tools that relieve the burden of running applications in public and private clouds, Kubernetes allows companies to quickly and efficiently respond to customer demand by:

  • Deploying applications quickly and predictably;

  • Scaling applications on the fly;

  • Seamlessly rolling out new features; and

  • Optimizing hardware by using only the resources that are needed.

For more on how much Kubernetes has grown over the last year, read guest post from independent Kubernetes contributor, Justin Santa Barbara.

Kubernetes 1.3 Release

  • On July 6th, Kubernetes released its highly anticipated 1.3 version, which provides the ability to bridge services across multiple clouds (including on-prem), support for multiple node types, integrated support for stateful services (such as key-value stores and databases), and greatly simplified cluster setup and deployment on your laptop. To learn more about the new features, visit the Kubernetes blog.

  • To learn about the detailed performance results from Kubernetes 1.3, visit this Kubernetes blog. It also describes Kubemark, a performance testing tool that has been integrated into the continuous testing framework to detect performance and scalability regressions.

  • Additionally, Ben Kepes provided a great perspective in this ComputerWorld article on Kubernetes 1.3 features and benefits to customers and the larger cloud native ecosystem.

KubeCon, co-located with CloudNativeCon

  • November 8-9, 2016 in Seattle

  • Co-located with CloudNativeCon, the long-time community favorite, KubeCon, will gather leading Kubernetes technologists from multiple open source cloud native communities to further the education and advancement of container technologies, Kubernetes, and cloud native architectures.

  • CloudNativeCon and KubeCon are accepting speaking proposals. Head here to submit a talk by August 5 if you’d like to become a speaker! More information on sponsorship and registration is available online.

We are excited to celebrate the first birthday of Kubernetes (#k8sbday) as this project was the first step in establishing CNCF as an organization that supports leading cloud native projects of production quality.

Prometheus 1.0 Is Here

By | Blog

In January, Prometheus celebrated a year of public existence and today they announced Prometheus 1.0, which delivers a stable API and user interface.

“After almost four years of development, we are finally releasing Prometheus version 1.0. While Prometheus has been ready for production use for a long time, users still had to manage frequent breaking changes to interfaces such as the configuration files, the HTTP APIs, and the query language. The 1.0 version signals to users that they can now build on Prometheus as an API-stable platform.” – said Julius Volz, infrastructure engineer and co-creator of Prometheus

Prometheus 1.0 Release

An open-source systems monitoring and alerting ecosystem built with modern cloud and container environments in mind, Prometheus supports multiple types of dynamic service discovery. The core of Prometheus is its multidimensional data model with a dedicated powerful query language, which allows users to identify time series by metric name and key/value pairs. Additional features and components can be found here.

Top pain-points Prometheus solves for users:

Ad-hoc explorability at all scales, from individual components to fleet-wide metrics.

Meaningful alerts and dashboards.

Consistent semantics – both horizontally (different kinds of monitorings: services, hosts, hardware, etc.) and vertically (from instrumentation to collection to processing to presentation to alerting).

The integration of these features is important for cloud native apps, due to the high frequency and volume of instrumentation data in modern architectures.

Prometheus 1.0 means upgrades won’t break programs built atop the Prometheus API, and updates won’t require storage re-initialization or deployment changes. Additionally, custom dashboards and alerts will remain intact across 1.x version updates as well. With the Prometheus server in a stable API state, other modules will become stable version 1.0 releases in the future.

“Prometheus 1.0 gives users the confidence that the powerful monitoring and alerting they deploy today with Prometheus will continue to work as features are added in the future.” – said Brian Brazil, founder of Robust Perception and core developer of Prometheus

For more on Prometheus 1.0, read the Prometheus announcement blog and Brian Brazil’s dive into the new features.

Benefits to the Cloud Native Stack

“Prometheus’s ability to monitor different layers in the cloud native stack with the same semantics is vital, as it can alleviate some of the complications throughout the stack. For example, Prometheus helps to understand what’s going on in complex microservice setups, which is hard to accomplish using conventional monitoring approaches.” – Björn Rabenstein, engineer at SoundCloud and Prometheus core developer

The Prometheus software for monitoring and analysis of cloud native architectures and time series data benefits cloud developers, DevOps and end users in the following ways:

Cloud developers: Easy instrumentation of your code, which will not only help in production, but also during development to spot performance issues and other irregularities. Kubernetes scheduler performance was increased by 10x by debugging using Prometheus. Additionally, Prometheus does not lock you in as there are many integration points with other systems.

DevOps: Finally you can implement the alerting you want and ask the questions you always needed answers for.

End users: The increased reliability and better performance of Prometheus is a major benefit to end users. As the first company using Prometheus, SoundCloud was able to detect and handle outages much better than before and increase the site’s availability significantly. Additional end user case studies include Life360 and ShowMax.

The Prometheus Community

With 350+ contributors worldwide and 8,671 commits, this monitoring tool created in 2012 is quickly becoming a high velocity project. Prometheus has an active and growing user base counting SoundCloud, Digital Ocean, Ericsson, CoreOS, Weaveworks, Red Hat, and Google among its many users. Prometheus has grown to become one of the top open source monitoring tools of choice for the modern platform.

Here are some recent stats on the growing Prometheus ecosystem:

37 repositories in its GitHub organization

6867 stars for all repos

5274 stars for the main prometheus/prometheus repo

350+ contributors

pasted image 0 (3)

Figure 1: GitHub star development over all Prometheus repos

To get started with Prometheus or join its community, please visit the Getting Started page of the project’s website and the Community page of the project’s website for more information.

Prometheus and Kubernetes

Prometheus integrates with CNCF’s first hosted project, Kubernetes, which recently announced the release of its 1.3 version, to support service discovery and monitoring of dynamically scheduled services. Kubernetes also supports Prometheus natively.

“Despite their independent developments, Kubernetes and Prometheus are both inspired by the way and spirit of how Google has been running large-scale production systems. This led to a convergent evolution. Even before Prometheus entered the CNCF, Kubernetes components were instrumented with Prometheus metrics. Prometheus in turn has native support for the Kubernetes service discovery mechanism. The label-based approach native to both technologies propagates perfectly through their combined stack. Prometheus answers the question of how to monitor your existing Kubernetes cluster, and Kubernetes is the answer if you need an orchestration system that can be easily monitored with your existing Prometheus installation.” – said Fabian Reinartz, engineer at CoreOS and Prometheus core developer.

The Origin of Prometheus

To most, Prometheus is a Titan in Greek mythology, best known as the deity who created mankind and its greatest benefactor, who stole fire from Mount Olympus and gave it to mankind.

The Prometheus we are referring to is the all knowing monitoring system that was built with modern cloud and container environments in mind. Created in 2012 under the Apache 2.0 license on GitHub, Prometheus was gradually introduced for production monitoring at SoundCloud in 2013 and quietly grew in popularity in the cloud native space.

Read Björn Rabenstein’s blog on how SoundCloud served as the cradle of Prometheus and how beneficial it was for a company to develop a project like Prometheus in the open.

The Next Platform does a wonderful job of retelling the Prometheus history, its natural use with Kubernetes and becoming an incubated project with The Cloud Native Computing Foundation. Read the article here.

Through infrastructure, technical and marketing support, CNCF is helping to accelerate adoption of technologies like Prometheus that are key to modern infrastructure stacks. This in turn inspires faster adoption of cloud native architectures among companies of any size, and in any industries.

PrometheusDay, co-located with CloudNativeCon

Hosted by CNCF and co-located with CloudNativeCon, PrometheusDay will feature highly technical talks covering major Prometheus adopters, leading expert contributor insights, and a full range of technologies that support open source monitoring technology in the cloud native ecosystem. Registration for CloudNativeCon, Nov. 8-9 in Seattle, includes complimentary attendance at PrometheusDay. Developers, members and industry experts are invited to submit a speaking proposal by August 5th for CloudNativeCon and PrometheusDay.

Only at CloudNativeDay: Ken Owens Takes Us on a Cloud Native Journey Through Mantl.io

By | Blog

As the Chief Technology Officer for Cloud Infrastructure Services at Cisco, Ken Owens knows that developers are driving cloud consumption forward and leading each industry into the new era of software defined disruption.

His CloudNativeDay presentation “The Journey to Cloud Native – A Case Study with mantl.io” will look at the journey developers take to deploy an application centric development model leveraging cloud native technologies. Attendees will learn what enterprises and services providers are deploying into production and how they are achieving elastic, flexible, and portable application workload deployment with cloud native infrastructure.

If you want to learn more about this journey, then make your way to Toronto for CloudNativeDay on August 25th. Register Today!

Great Upcoming Events With Cloud Native, Kubernetes and Prometheus Focused Sessions

By | Blog

Over the next several months, CNCF will host and speak at a number of industry events including ContainerCon Japan, CloudNativeDay, PromCon, Software Circus, CloudNativeCon, KubeCon, and PrometheusDay.

These conferences will provide sessions on furthering the adoption of cloud native computing; with particular focus on central orchestration processing, cloud native monitoring, container-centric infrastructure, and microservices.

Hope to see you there!

ContainerCon Japan 2016

Screen Shot 2016-07-07 at 11.24.18 AM.png

ContainerCon Japan, held from July 13-15 in Tokyo, Japan, will bring together a unique blend of core developers, administrators, users, community managers and industry experts. Chris Aniszczyk, COO of CNCF, will be giving a keynote on Cloud Native and Container Technology.

The Cloud Native Track, which CNCF helped curate, will feature sessions from Ian Lewis of Google, Fabian Reinartz of CoreOS, Satoshi Tagomori and Masahiro Nakagawa of TreasureData, and Hiroyuki Kamezawa of Fujitsu.

Additionally, CNCF member Fujitsu will lead several sessions during the conference including: Container Standardization Introduction, Learning From Real Practice of Providing Highly Available Hybrid Cloud Service with OpenStack Neutron, Btrfs in-Band De-Duplication, and Address Range Memory Mirroring.



The inaugural CloudNativeDay, a CNCF event, will bring together leading contributors in cloud native infrastructure and computing, containers, microservices, central orchestration processing, and related projects. By co-locating with ContainerCon, CloudNativeDay will provide a platform for showcasing a full range of technologies that support the cloud native ecosystem and leading expert discussion on cloud native projects. This event will be held in Toronto on Thursday, August 25 and the full speaker lineup can be found here.

Attendees will have the opportunity to meet with upstream developers and operations experts, ranging from hobbyists to startup CTOs, from corporate developers to language designers to senior technology executives from all over the world. Read here to learn more about sponsoring the event and here to register to attend the event.

Equal access and diversity are important to the Cloud Native Computing Foundation, which is why we are offering three scholarships to attend CloudNativeDay. Visit http://bit.ly/2958nFA to learn more about the scholarship and apply.


Screen Shot 2016-07-07 at 11.23.23 AM.png

Organized by the Prometheus developer community, this event will connect Prometheus users and developers from around the world in order to exchange knowledge, best practices, and experience gained around using the open source monitoring system.

PromCon will also enable collaboration with an intimate crowd of experienced and influential infrastructure engineers committed to building the community and growing professional connections around systems and service monitoring.

CNCF is proud to be a Platinum sponsor, alongside fellow sponsors Robust Perception, Weaveworks, CoreOS, Google Cloud Platform, and more. The lineup of speakers is now live and features talks from DigitalOcean, Weaveworks, Improbable, CoreOS, SoundCloud, Rancher Labs, Joyent, and more.

This event is fully sold out, but there is a waiting list and tickets will be issued over the coming weeks. Additionally, all talks will be recorded and will be published after the event.

Software Circus

Screen Shot 2016-07-07 at 11.22.36 AM.png

This show will take attendees “To the Cloud and Beyond” with two whole days centered around cloud native computing! Software Circus, an initiative ofImplicit-Explicit and Container Solutions, started as a meetup group with a mission to tackle the social aspects of technology, the bleeding edges and people’s experience.

CNCF is proud to sponsor the infamously fun conference — with a focus on learning, inspiring and entertaining — which has grown to include workshops and two days of sessions, equipped with dancing, robots and drones.

The call for papers closes on July 31st, and if the show is anything like last year’s event we can expect lively sessions discussing the basics of working with Docker, the fundamental concepts and ideas behind containers, workshops on continuous delivery with Docker on Mesos, and tutorials on using Prometheus for metrics collection, monitoring and alerting in a microservice environment.



This much-anticipated event – hosted by the CNCF community – will gather leading technologists from multiple open source cloud native communities to further the education and advancement of cloud native computing. KubeCon and PrometheusDay will also be co-located with the event.

Featuring a multi-track, multi-day event dedicated to cloud native education in Seattle from November 8-9, CloudNativeCon is still accepting speaking proposals through Aug. 5!

Make sure to head here to learn more about sponsoring the event and here to register to attend the event.



Co-located with CloudNativeCon, the long-time community favorite, KubeCon, will gather leading Kubernetes technologists in Seattle from multiple open source cloud native communities to further the education and advancement of container technologies, Kubernetes, and cloud native architectures.

Since being donated to CNCF, KubeCon organizers anticipate approximately 1,500 attendees at this year’s event.

Just like CloudNativeCon, KubeCon is still accepting speaking proposals. Head here to submit a talk by August 5 if you’d like to become a speaker! More information on sponsorship and registration is available online.



We’ll also be celebrating our second incubated project at PrometheusDay on November 8-9!

This event, co-located with CloudNativeCon, will feature highly technical talks covering major Prometheus adopters, leading expert contributor insights, and a full range of technologies that support open source monitoring technology in the cloud native ecosystem.

To attend, register for CloudNativeCon and add PrometheusDay to your conference pass as part of your registration for no additional charge.

Just like CloudNativeCon and KubeCon, PrometheusDay is still accepting speaking proposals! Head here to submit a talk by August 5 if you’d like to become a presenter.

ContainerCon Europe

Held at the world-famous InterContinental Berlin in Germany, this year’s ContainerCon Europe will gather leading contributors in Linux containers, the Linux kernel, and related projects to forge a path to continued innovation and education – and, most importantly, will feature a Cloud Native track!

Co-located with LinuxCon Europe on October 4-6, ContainerCon Europe will bring together a diverse range of experts from cloud computing and Linux containers.

Since one registration fee covers both events, attendees interested in a variety of topics can choose from 100+ combined sessions!

Make sure to head here to learn more about sponsoring the event and here to register to attend this exciting event.

Only at CloudNativeDay: When Security and Cloud Native Collide

By | Blog

In one world, the cloud native approach is redefining how applications are architected, throwing many traditional assumptions out of the window. In the other world, traditional security teams ensure projects in the enterprise meet a rigid set of security rules in order to proceed. What happens when these two worlds collide?

At CloudNativeDay on August 25th in Toronto you’ll learn what the world looks like after that collision. Apprenda Senior Director Joseph Jacks, Box Sight Reliable Engineer Michael Ansel, Tigera Founder and CEO Christopher Liljenstolpe join forces to discuss “Whither Security in a Cloud-Native World?

A little about our panelists:

  • Co-founder of Kismatic, the enterprise Kubernetes company recently acquired by Apprenda, Jacks has spent his career driving corporate product, strategy and marketing initiatives for companies like Mesosphere, Enstratius Networks (acquired by Dell Software), TIBCO and Talend. He also founded and chairs KubeCon, the Kubernetes community conference donated to CNCF.

  • With a strong background in system architecture, Ansel is currently a sight reliable engineer at BOX and has previously developed high-visibility converged infrastructure solutions for NetApp. He also held System Administrator and Web Development positions with ArizonaTools.com, Duke University, RSSS, and shoeboxed.com.

  • The former Director of Solutions Architecture for the Metaswitch Networking Business Unit, Liljenstolpe is the founder and CEO of Tigera and the lead architect and evangelist for Project Calico. He has held advisor and architect positions with Annai Systems, Big Switch Networks, Telstra, and Woven Systems.

Don’t miss out on this and other perspectives that you’ll only find at CloudNativeDay. Register Today!

Prometheus User Profile: Helping Life360 Keep Families Safe

By | Blog

An open source monitoring system with a dimensional data model, flexible query language, efficient time series database and modern alerting approach, Prometheus has an active and growing user base. Counting Life360 as one of its many users, Prometheus helps the company monitor its massive data sets.

Screen Shot 2016-07-06 at 11.45.14 AM.png

Figure 1: Overall architecture of Prometheus and some of its ecosystem components

Life360 is a free smartphone app that helps families and close friends stay in sync throughout their busy day. Using the latest GPS tracking technology, Life360 allows you to see where your family and friends are on a private map, stay in touch with group and one-on-one messaging, get help in an emergency, and track lost or stolen phones. Started in 2008, Life360 has grown to 75 million members worldwide.

In this blog, originally published on the Prometheus blog, the company shares its experiences evaluating and using Prometheus.

Can you tell us about yourself and what Life360 does?

I’m Daniel Ben Yosef, a.k.a, dby, and I’m an Infrastructure Engineer for Life360, and before that, I’ve held systems engineering roles for the past 9 years.

Life360 creates technology that helps families stay connected, we’re the Family Network app for families. We’re quite busy handling these families – at peak we serve 700k requests per minute for 70 million registered families.

We manage around 20 services in production, mostly handling location requests from mobile clients (Android, iOS, and Windows Phone), spanning over 150+ instances at peak. Redundancy and high-availability are our goals and we strive to maintain 100% uptime whenever possible because families trust us to be available.

We hold user data in both our MySQL multi-master cluster and in our 12-node Cassandra ring which holds around 4TB of data at any given time. We have services written in Go, Python, PHP, as well as plans to introduce Java to our stack. We use Consul for service discovery, and of course our Prometheus setup is integrated with it.

What was your pre-Prometheus monitoring experience?

Our monitoring setup, before we switched to Prometheus, included many components such as:

  • Copperegg (now Idera)

  • Graphite + Statsd + Grafana

  • Sensu

  • AWS Cloudwatch

We primarily use MySQL, NSQ and HAProxy and we found that all of the monitoring solutions mentioned above were very partial, and required a lot of customization to actually get all working together.

Why did you decide to look at Prometheus?

We had a few reasons for switching to Prometheus, one of which is that we simply needed better monitoring.

Prometheus has been known to us for a while, and we have been tracking it and reading about the active development, and at a point (a few months back) we decided to start evaluating it for production use.

The PoC results were incredible. The monitoring coverage of MySQL was amazing, and we also loved the JMX monitoring for Cassandra, which had been sorely lacking in the past.

Cassandra Client Dashboard

How did you transition?

We started with a relatively small box (4GB of memory) as an initial point. It was effective for a small number of services, but not for our full monitoring needs.

We also initially deployed with Docker, but slowly transitioned to its own box on an r3.2xl instance (60GB ram), and that holds all of our service monitoring needs with 30 days of in-memory data.

We slowly started introducing all of our hosts with the Node Exporter and built Grafana graphs, up to the point where we had total service coverage.

We were also currently looking at InfluxDB for long term storage, but due to recent developments, this may no longer be a viable option.

We then added exporters for MySQL, Node, Cloudwatch, HAProxy, JMX, NSQ (with a bit of our own code), Redis and Blackbox (with our own contribution to add authentication headers).

NSQ Overview Dashboard

What improvements have you seen since switching?

The visibility and instrumentation gain was the first thing we saw. Right before switching, we started experiencing Graphite’s scalability issues, and having an in-place replacement for Graphite so stakeholders can continue to use Grafana as a monitoring tool was extremely valuable to us. Nowadays, we are focusing on taking all that data and use it to detect anomalies, which will eventually become alerts in the Alert Manager.

What do you think the future holds for Life360 and Prometheus?

We currently have one of our projects instrumented directly with a Prometheus client, a Python-based service. As we build out new services, Prometheus is becoming our go-to for instrumentation, and will help us gain extremely meaningful alerts and stats about our infrastructure.

We look forward to growing with the project and keep contributing.

Thank you Daniel! The source for Life360’s dashboards is shared on Github.

1 38 39 40