Further Reading: Observability Basics

Back to Observability Basics


Observability Engineering

Book: Observability Engineering

Why it matters: Comprehensive guide to observability, covering metrics, logs, and traces in depth.

Key Concepts

Three Pillars: - Metrics: What is happening - Logs: What happened - Traces: How it happened

Observability vs Monitoring: - Monitoring: Pre-defined metrics (you know what to look for) - Observability: Ability to ask new questions (you don't know what to look for)

Relevance: Provides the theoretical foundation and practical techniques for observability.


Distributed Tracing

OpenTelemetry

Documentation: OpenTelemetry

Why it matters: Industry standard for observability, including distributed tracing.

Key Concepts

Tracing: - Spans and traces - Context propagation - Sampling strategies

Instrumentation: - Automatic instrumentation - Manual instrumentation - Best practices

Relevance: Provides the standard approach to distributed tracing.


Prometheus Monitoring

Documentation: Prometheus

Why it matters: Popular open-source monitoring system, widely used for metrics.

Key Concepts

Metrics: - Counters, gauges, histograms - PromQL query language - Alerting rules

Best Practices: - Metric naming conventions - Label cardinality - Recording rules

Relevance: Provides practical guidance for metrics collection and querying.


Structured Logging

The Twelve-Factor App

Article: The Twelve-Factor App: Logs

Why it matters: Best practices for logging in modern applications.

Key Concepts

Structured Logging: - Logs as event streams - Structured format (JSON) - Context and correlation

Relevance: Provides the philosophy and best practices for logging.


Additional Resources

Books

"Observability Engineering" by Charity Majors et al. - Comprehensive observability guide - Practical examples

"Systems Performance" by Brendan Gregg - Performance analysis - Tools and techniques

Online Resources

Google Cloud Operations Suite: Documentation - GCP observability tools - Best practices

Datadog: Observability Guide - Observability concepts - Best practices


Key Takeaways

  1. Three pillars: Metrics, logs, and traces work together
  2. User-facing metrics: Measure what users experience
  3. Structured logs: JSON format with context
  4. Distributed tracing: Understand request flow
  5. Observability contract: Define what to measure