Marcus Vechiato
open-menu closeme
Blog
About
🌐
English
github twitter linkedin rss
  • SLA, Error Budget, Uptime: Where Do Maintenance Windows Fit?

    calendar Apr 14, 2025 · 3 min read · sla sre slo error budget  ·
    Share on: twitter facebook linkedin copy
    SLA, Error Budget, Uptime: Where Do Maintenance Windows Fit?
    When it comes to service reliability, maintenance windows are a frequent source of ambiguity. Whether you’re defining uptime, setting SLOs, or communicating with customers, it’s essential to be explicit about how scheduled (and unscheduled) maintenance is handled. Here’s a deeper dive with actionable recommendations …
    Read More
  • Insights from Mastering OpenTelemetry and Observability

    calendar Jan 4, 2025 · 4 min read · otel observability devops book sre  ·
    Share on: twitter facebook linkedin copy
    Insights from Mastering OpenTelemetry and Observability
    In the fast-paced era of cloud-native systems and distributed architectures, maintaining visibility and reliability across complex infrastructures has become crucial. This led me to explore Mastering OpenTelemetry and Observability by Steve Flanders. The book is a deep dive into the tools, techniques, and methodologies …
    Read More
  • Insights from Implementing Service Level Objectives: A Practical Guide to SLIs, SLOs & Error Budgets

    calendar Nov 19, 2024 · 6 min read · book sre  ·
    Share on: twitter facebook linkedin copy
    Insights from Implementing Service Level Objectives: A Practical Guide to SLIs, SLOs & Error Budgets
    Reliability is a cornerstone of successful services, but how do we define and measure it effectively? Implementing Service Level Objectives by Alex Hidalgo provides a practical guide to adopting SLO-based approaches, allowing teams to balance user expectations and operational constraints. With its insights into SLIs …
    Read More
  • Insights from Chaos Engineering: System Resiliency in Practice

    calendar Jun 26, 2024 · 8 min read · chaos engineering sre book  ·
    Share on: twitter facebook linkedin copy
    Insights from Chaos Engineering: System Resiliency in Practice
    I remember multiple occasions when systems were not down but badly impacted during major events. The scramble to restore service and the subsequent post-mortem meeting highlighted our lack of preparedness for unexpected system failures. The book begins with the history of Chaos Engineering, originating from Netflix. …
    Read More
  • Insights from Learning eBPF: Programmatically Extend the Linux Kernel

    calendar May 12, 2024 · 4 min read · observability ebpf security book sre  ·
    Share on: twitter facebook linkedin copy
    Insights from Learning eBPF: Programmatically Extend the Linux Kernel
    In the realm of cloud-native infrastructure, eBPF (extended Berkeley Packet Filter) has emerged as a revolutionary technology. By allowing developers to write custom code that dynamically changes kernel behavior, eBPF has paved the way for a new generation of security, observability, and networking tools. In …
    Read More
  • Insights from Learning Open Telemetry: Setting Up and Operating a Modern Observability System

    calendar May 1, 2024 · 5 min read · otel observability devops book sre  ·
    Share on: twitter facebook linkedin copy
    Insights from Learning Open Telemetry: Setting Up and Operating a Modern Observability System
    As someone who has spent countless hours troubleshooting distributed systems, I understand the struggle of trying to untangle metrics, logs, and traces spread across multiple tools. That’s why Learning OpenTelemetry by Austin Parker and Ted Young resonated with me. This book introduces OpenTelemetry, a game-changing …
    Read More
  • Reflecting on My Presentation at the London SRE Meetup

    calendar May 23, 2023 · 1 min read · sre platform engineering meetup devops  ·
    Share on: twitter facebook linkedin copy
    Reflecting on My Presentation at the London SRE Meetup
    I recently had the wonderful opportunity to present at the London SRE Meetup, and I'm excited to share the experience with you all. The presentation, titled "Working Together: SRE and Platform Engineering," focused on the synergy between site reliability engineering and platform engineering to enhance …
    Read More
  • Insights from Establishing SRE Foundations

    calendar Jan 20, 2023 · 5 min read · SRE book  ·
    Share on: twitter facebook linkedin copy
    Insights from Establishing SRE Foundations
    "Establishing SRE Foundations: A Step-by-Step Guide" by Vladyslav Ukis provides a comprehensive framework for implementing Site Reliability Engineering (SRE) in software delivery organisations. Drawing from his extensive experience, Ukis outlines practical steps and methodologies to enhance reliability and …
    Read More
  • Insights from Observability Engineering: Achieving Production Excellence

    calendar Dec 2, 2022 · 6 min read · observability book devops sre otel  ·
    Share on: twitter facebook linkedin copy
    Insights from Observability Engineering: Achieving Production Excellence
    In the rapidly evolving landscape of software development and IT operations, ensuring system reliability and performance is paramount. "Observability Engineering" by Charity Majors, Liz Fong-Jones, and George Miranda provides a comprehensive guide to achieving production excellence through observability. This …
    Read More
  • Insights from Database Reliability Engineering: Designing and Operating Resilient Database Systems

    calendar Sep 21, 2020 · 4 min read · sre book dbre  ·
    Share on: twitter facebook linkedin copy
    Insights from Database Reliability Engineering: Designing and Operating Resilient Database Systems
    Database Reliability Engineering by Laine Campbell and Charity Majors is a comprehensive guide on how to design, build, and manage resilient database systems. The book emphasises the importance of reliability in database operations and introduces the role of a Database Reliability Engineer (DBRE) who combines database …
    Read More
    • ««
    • «
    • 1
    • 2
    • »
    • »»

Marcus Vechiato

Technologist, perpetual student, continual incremental improvement.
Read More

Recent Posts

  • Insights from Get Better at Flatter: A Guide to Shaping and Leading Organizations with Less Hierarchy
  • Insights from An Elegant Puzzle: Systems of Engineering Management
  • Code Club Mobile Plan Challenge: Can You Beat your Family's Phone Bill?
  • SLA, Error Budget, Uptime: Where Do Maintenance Windows Fit?
  • Insights from The Developer Relations Playbook
  • How to Get a Free FlightRadar24 Business Subscription with Your Own ADS-B Receiver
  • Insights from AI Engineering: Building Applications with Foundation Models
  • How to Install and Run DeepSeek R1 and DeepSeek Coder Models Using Ollama

Tags

BOOK 59 DEVOPS 13 LEADERSHIP 12 SRE 11 K8S 6 PYTHON 6 CODE CLUB 5 OBSERVABILITY 5 PLATFORM ENGINEERING 5 GIT 4 SECURITY 4 CROSSPLANE 3 OTEL 3 CLUB 2
All Tags
3D PRINTING1 ADS-B1 ARGO1 AVIATION1 BOOK59 CERTIFICATION1 CHAOS ENGINEERING1 CICD1 CLOUD1 CLUB2 CODE2 CODE CLUB5 CROSSPLANE3 DATA1 DBRE1 DEEPSEEK1 DEVOPS13 DEVREL1 DEVSECOPS1 DOCKER1 DORA1 EBPF2 ERROR BUDGET1 FINOPS1 GIT4 GITOPS1 HELM2 HOMELAB2 HYDROPONICS2 IOT2 K8S6 LEADERSHIP12 LEAN1 LINUX2 LLM2 MEETUP1 METRICS1 MICROSERVICES1 NETFLIX1 NOTES MANAGEMENT1 OBSERVABILITY5 OLLAMA1 OPERATING SYSTEMS1 OTEL3 PLATFORM ENGINEERING5 PRODUCT DEVELOPMENT1 PYTHON6 RASPBERRY PI1 SECURITY4 SLA1 SLO1 SRE11 SYSTEMS DESIGN2 SYSTEMS PERFORMANCE1 TEAM TOPOLOGIES1 TECHNICAL DEBT1 TIME MANAGEMENT1
[A~Z][0~9]
Copyright © 2008–2018, Steve Francia and the Hugo Authors; all rights reserved.

Copyright  COPYRIGHT © 2008–2018, STEVE FRANCIA AND THE HUGO AUTHORS; ALL RIGHTS RESERVED.. All Rights Reserved

to-top