Originally written on February 12th 2012
There’s been a small flurry of discussion today on Twitter and IRC, due to comments from folks who, in response to an article by Cliff Moon, seem to think that #monitoringsucks is some kind of marketing bullshit. And that just doesn’t sit right with me, so I hope to set the record straight.Now, I don’t claim to speak for #monitoringsucks as a whole or represent all the views. However, as an active member of the devops/webops/IT community through Twitter, IRC, blogging, conferences, etc, I think I can speak with some confidence on the topic. Of course, I invite people to respond to this post, via email, Twitter, or on your own blog and correct me where I am wrong or let me know where you disagree. Let’s start at what should be a foregone conclusion. Monitoring sucks. It does. The majority of monitoring systems out there are built around workloads and change processes that are no longer valid for those at the leading edge of things and will continue to become so for more and more of the industry. The future of application environments is dynamic, scalable, and ever changing. It’s a polyglot of language runtimes and operating systems. And in many cases it’s a mixture of IaaS, PaaS, and in-house computing resources. Given that, the idea that monitoring can be a swiss army knife, and a swiss army knife that’s mostly SNMP, curl, and ping, is dead. The future of monitoring is a tool box, and right now, most of the monitoring tools available to us suck, in light of the future I just outlined. So that’s why #monitoringsucksSo where does #monitoringsucks come from? It comes from the collective realization of the operations community that the future is now and that we need to be outspoken about the future and help our community and industry adapt. You don’t have to look far to see this realization over and over, as evidenced by numerous blog posts, such as
- Why monitoring sucks
- Monitoring Sucks – Watch your language
- Monitoring Sucks. Do Something About It.
- Monitoring Sucks. Latency Sucks More. part 1, part 2, and part 3
- Monitoring Sucks Panel at SCaLe 10
- The #monitoringsucks HackFest after FOSDEM
- And of course, Patrick Debois’s epic series – Monitoring Wonderland Survey
- Monitoring Wonderland Survey – Introduction
- Monitoring Wonderland Survey – Metrics – API – Gateways
- Monitoring Wonderland Survey – Nagios the Mighty Beast
- Monitoring Wonderland Survey – Moving up the stack Application and User metric
- Monitoring Wonderland Survey – Visualization
I could go on and on with links to posts that are either addressing the root problems of why #monitoringsucks, helping survey the current tools, or describing new systems being built to help address the future and make monitoring suck a little bit less, but the point of this article isn’t to be a link dump, it’s to help educate on what #monitoringsucks really is and really isn’t. Of course, you may ask “now that’ve you’ve bitched, what are you doing about it?”, which is an excellent and reasonable question.
So what is the community doing about, besides saying monitoring sucks!? We are:
- Discussing is on twitter with the #monitoringsucks hashtag
- Discussing it at conferences, like SCaLE or FOSDEM
- Discussing it on IRC
- Providing centralized places to help people find tools for their toolboxes, like the Github Tools Repo and a handy web interface to the repo
- Building new tools, some commercial, some open source, like Circonus, Boundary, Librato Metrics, Graphite, or Sensu that are built from the ground up to be ready for future that is fast becoming reality for operations folks
So #monitoringsucks isn’t some marketing bullshit meant to help shine a light on Boundary, it’s a real and correct topic that is being driven by the community to help better the state of monitoring. And if you find fault in Cliff for writing an article on it, because the company he is a cofounder of is trying to help address things, while also trying to make money, I think that’s a load of horseshit. I applaud the folks at Circonus, Boundary, Librato Metrics, and other companies that are helping to improve the state of things, but also making money at it, because the idea that just because someone is trying to make money in something, automatically discredits them due to bias or whatever, is crap, especially in this case, where if you spend even 15 minutes to see what #monitoringsucks is about, you’ll see it’s from the community. Quite frankly, I think a lot of the resistance things like #monitoringsucks or devops get from certain people is just those people trying to push their own brand by being a thought leader, but that’s a topic for another post.