What Should I Monitor, and How Should I Do It?
A Free VividCortex Webinar
Monitoring tools offer two core types of functionality: alerts based on aliveness checks and comparing metrics to thresholds, and displaying time-series charts of status counters. Nagios + Graphite are the prototypical time-series tools that do these things.
But these tools don't answer the crucial questions about what we should monitor. What kinds of aliveness/health checks should we build into Nagios? Which metrics should we monitor with thresholds to raise alarms, and what should the thresholds be? What graphs should we build of status counters, which graphs should we examine and what do they mean?
You'll leave this webinar with a solid understanding of the types of monitoring you should be doing, the low-hanging fruit, and tools for doing it.