Marcus Vechiato
open-menu closeme
Blog
About
🌐
English
github twitter linkedin rss
  • Do SLAs, Error Budgets, and Availability Metrics Include Maintenance Windows?

    calendar Apr 14, 2025 · 3 min read · sla sre slo error budget  ·
    Share on: twitter facebook linkedin copy
    Do SLAs, Error Budgets, and Availability Metrics Include Maintenance Windows?

    🔧 Do SLAs, Error Budgets, and Availability Metrics Include Maintenance Windows?

    When it comes to service reliability, maintenance windows can be a gray area. Whether you're tracking uptime, setting SLOs, or managing customer expectations through SLAs, the question often comes up:

    “Should scheduled maintenance count …


    Read More
  • Insights from Mastering OpenTelemetry and Observability

    calendar Jan 4, 2025 · 4 min read · otel observability devops book sre  ·
    Share on: twitter facebook linkedin copy
    Insights from Mastering OpenTelemetry and Observability

    Mastering OpenTelemetry and Observability

    In the fast-paced era of cloud-native systems and distributed architectures, maintaining visibility and reliability across complex infrastructures has become crucial. This led me to explore Mastering OpenTelemetry and Observability by Steve Flanders. The book is a deep dive …


    Read More
  • Insights from Implementing Service Level Objectives

    calendar Nov 19, 2024 · 6 min read · book sre  ·
    Share on: twitter facebook linkedin copy
    Insights from Implementing Service Level Objectives

    Implementing Service Level Objectives: A Practical Guide to SLIs, SLOs & Error Budgets

    Reliability is a cornerstone of successful services, but how do we define and measure it effectively? Implementing Service Level Objectives by Alex Hidalgo provides a practical guide to adopting SLO-based approaches, allowing …


    Read More
  • Insights from Chaos Engineering: System Resiliency in Practice

    calendar Jun 26, 2024 · 8 min read · chaos engineering sre book  ·
    Share on: twitter facebook linkedin copy
    Insights from Chaos Engineering: System Resiliency in Practice

    Chaos Engineering: System Resiliency in Practice

    I remember multiple occasions when systems were not down but badly impacted during major events. The scramble to restore service and the subsequent post-mortem meeting highlighted our lack of preparedness for unexpected system failures. The book begins with the history …


    Read More
  • Insights from Learning eBPF

    calendar May 12, 2024 · 4 min read · observability ebpf security book sre  ·
    Share on: twitter facebook linkedin copy
    Insights from Learning eBPF

    Learning eBPF: Programmatically Extend the Linux Kernel

    In the realm of cloud-native infrastructure, eBPF (extended Berkeley Packet Filter) has emerged as a revolutionary technology. By allowing developers to write custom code that dynamically changes kernel behavior, eBPF has paved the way for a new generation of …


    Read More
  • Insights from Learning Open Telemetry

    calendar May 1, 2024 · 5 min read · otel observability devops book sre  ·
    Share on: twitter facebook linkedin copy
    Insights from Learning Open Telemetry

    Learning OpenTelemetry: Setting Up and Operating a Modern Observability System

    As someone who has spent countless hours troubleshooting distributed systems, I understand the struggle of trying to untangle metrics, logs, and traces spread across multiple tools. That’s why Learning OpenTelemetry by Austin Parker and Ted …


    Read More
  • Reflecting on My Presentation at the London SRE Meetup

    calendar May 23, 2023 · 1 min read · sre platform engineering meetup devops  ·
    Share on: twitter facebook linkedin copy
    Reflecting on My Presentation at the London SRE Meetup
    I recently had the wonderful opportunity to present at the London SRE Meetup, and I'm excited to share the experience with you all. The presentation, titled "Working Together: SRE and Platform Engineering," focused on the synergy between site reliability engineering and platform engineering to enhance …
    Read More
  • Insights from Establishing SRE Foundations

    calendar Jan 20, 2023 · 5 min read · SRE book  ·
    Share on: twitter facebook linkedin copy
    Insights from Establishing SRE Foundations

    Establishing SRE Foundations

    "Establishing SRE Foundations: A Step-by-Step Guide" by Vladyslav Ukis provides a comprehensive framework for implementing Site Reliability Engineering (SRE) in software delivery organisations. Drawing from his extensive experience, Ukis outlines practical steps and methodologies …


    Read More
  • Insights from Observability Engineering: Achieving Production Excellence

    calendar Dec 2, 2022 · 6 min read · observability book devops sre otel  ·
    Share on: twitter facebook linkedin copy
    Insights from Observability Engineering: Achieving Production Excellence

    Observability Engineering

    In the rapidly evolving landscape of software development and IT operations, ensuring system reliability and performance is paramount. "Observability Engineering" by Charity Majors, Liz Fong-Jones, and George Miranda provides a comprehensive guide to achieving production excellence …


    Read More
  • Insights from Database Reliability Engineering: Designing and Operating Resilient Database Systems

    calendar Sep 21, 2020 · 4 min read · sre book dbre  ·
    Share on: twitter facebook linkedin copy
    Insights from Database Reliability Engineering: Designing and Operating Resilient Database Systems

    Database Reliability Engineering: Designing and Operating Resilient Database Systems

    Database Reliability Engineering by Laine Campbell and Charity Majors is a comprehensive guide on how to design, build, and manage resilient database systems. The book emphasises the importance of reliability in database operations and …


    Read More
    • ««
    • «
    • 1
    • 2
    • »
    • »»

Marcus Vechiato

Technologist, perpetual student, continual incremental improvement.
Read More

Recent Posts

  • Code Club Mobile Plan Challenge
  • Do SLAs, Error Budgets, and Availability Metrics Include Maintenance Windows?
  • Insights from The Developer Relations Playbook
  • How to Get a Free FlightRadar24 Business Subscription with Your Own ADS-B Receiver
  • Insights from AI Engineering: Building Applications with Foundation Models
  • How to Install and Run DeepSeek R1 and DeepSeek Coder Models Using Ollama
  • Calculating 3D Print Costs with Spoolman API and Python
  • Insights from Mastering OpenTelemetry and Observability

Tags

BOOK 57 DEVOPS 13 SRE 11 LEADERSHIP 10 CODE CLUB 8 PYTHON 7 K8S 6 OBSERVABILITY 5 PLATFORM ENGINEERING 5 GIT 4 SECURITY 4 CROSSPLANE 3 OTEL 3 EBPF 2
All Tags
3D PRINTING1 ADS-B1 ARGO1 AVIATION1 BOOK57 CERTIFICATION1 CHAOS ENGINEERING1 CICD1 CLOUD1 CODE CLUB8 CROSSPLANE3 DATA1 DBRE1 DEEPSEEK1 DEVOPS13 DEVREL1 DEVSECOPS1 DOCKER1 DORA1 EBPF2 ERROR BUDGET1 FINOPS1 GIT4 GITOPS1 HELM2 HOMELAB2 HYDROPONICS2 IOT2 K8S6 LEADERSHIP10 LEAN1 LINUX2 LLM2 MEETUP1 METRICS1 MICROSERVICES1 NETFLIX1 NOTES MANAGEMENT1 OBSERVABILITY5 OLLAMA1 OPERATING SYSTEMS1 OTEL3 PLATFORM ENGINEERING5 PRODUCT DEVELOPMENT1 PYTHON7 RASPBERRY PI1 SECURITY4 SLA1 SLO1 SRE11 SYSTEMS DESIGN2 SYSTEMS PERFORMANCE1 TEAM TOPOLOGIES1 TECHNICAL DEBT1 TIME MANAGEMENT1
[A~Z][0~9]
Copyright © 2008–2018, Steve Francia and the Hugo Authors; all rights reserved.

Copyright  COPYRIGHT © 2008–2018, STEVE FRANCIA AND THE HUGO AUTHORS; ALL RIGHTS RESERVED.. All Rights Reserved

to-top