Observability

Building, Deploying and Observing SDKs as a Service

Building, Deploying and Observing SDKs as a Service

Manually maintaining SDKs across multiple programming languages is slow, error-prone, and a constant drag on developer velocity. This episode—featuring developer advocate Steve Kuching—explores using the open-source OpenAPI Generator to automate SDK creation from an OpenAPI spec, deploying the generator as a containerised service, and observing the entire pipeline with OpenTelemetry auto-tracing and Lumigo. Learn how to apply the same build, test, and observe principles you use for services to your SDK generation pipeline.

Chaos Engineering for Cloud native Apps

Chaos Engineering for Cloud native Apps

2022-11-03

Chaos engineering is the discipline of proactively experimenting on distributed systems to build confidence in their ability to withstand production failures. Chris is joined by Ashish Balgath (Cloud Solution Architect at Thoughtworks) to explore how to introduce fault-injection experiments incrementally — from a developer's local machine all the way to production — using tools such as Chaos Monkey and fault-injection simulator tools.

Beyond Monitoring: The Rise of Observability Platform

Beyond Monitoring: The Rise of Observability Platform

As systems grow in complexity across distributed architectures and microservices, traditional monitoring is no longer sufficient to maintain reliability and user experience. Observability goes beyond monitoring by correlating logs, metrics, and traces to rapidly pinpoint root causes across hybrid and multi-cloud landscapes. In this episode, Chris is joined by Samir Pradka, Enterprise Architect at Artos, to explore how organisations can build an observability platform incrementally, leverage AIOps for predictive analytics, and implement self-healing infrastructure using tools like Ansible and Azure Resource Manager.

ToolUp Days #13

ToolUp Days #13

2022-09-20

Chris and Matt spend this episode tracking down why the World Events Engine keeps crashing on startup in Azure Container Apps — tracing the root cause to a missing GitHub Container Registry credentials block in the Infrastructure as Code and a GitHub Actions token permissions gap. The session also covers Dapr component naming conventions, storage queue message formatting, container app log analysis, and planning a dedicated GitHub Codespaces episode.

Microservices have communication issues, especially when they fail

Microservices have communication issues, especially when they fail

2022-09-01

Communication between microservices is one of the trickiest challenges in distributed systems — especially when things go wrong. In this episode, Chris is joined by Francesco, a software engineer building a real-world payment gateway, to explore microservices communication patterns. They dive deep into the Saga pattern for managing multi-step distributed transactions, covering orchestration-based Sagas with AWS Step Functions, compensating transactions for graceful rollbacks, and event-driven messaging via EventBridge. The conversation also covers observability in distributed systems and applying Occam's Razor to architecture decisions.

ToolUp Tuesday - #9

ToolUp Tuesday - #9

2022-06-28

Chris and Matt debug deployment issues in Azure Container Apps, restructure their Bicep infrastructure as code into separate lifecycles, and configure Dapr state store components backed by Azure Storage.

LunchBytes Series 1 Episode 5: See Clearly with Application Insights

LunchBytes Series 1 Episode 5: See Clearly with Application Insights

2022-06-15 Microsoft

Observability across multiple components in distributed systems can be a challenge, particularly when things go wrong and need investigation. Application Insights can simplify the challenge, and give deep insights into a distributed system at multiple levels.

Tech Roundup - April 2022

Tech Roundup - April 2022

2022-04-24

Your monthly digest of what shipped across Azure, GitHub, and Azure DevOps in April 2022. Notable Azure updates include Container Apps managed identity preview, Static Web Apps private endpoints going GA, Managed Grafana integrations, Microsoft Purview (renamed from Azure Purview), and Cosmos DB autoscale improvements. On GitHub: secret scanning revocation, Codespaces monorepo support, accessibility colour-blind themes, and required deployments for branch protection. Plus Azure DevOps opt-in auditing, Bicep validation in PRs, and a personal update on the Go-based microservices event platform Chris has been building on Azure Container Apps.

Things to Consider Before Migrating Old .NET Applications to Cloud

Things to Consider Before Migrating Old .NET Applications to Cloud

Migrating a legacy .NET application to Azure is rarely as simple as "lift and shift." Jonah Andersson — Microsoft MVP for Azure — shares candid lessons from a real-world cloud migration that became a cautionary tale, not because of Azure, but due to unresolved technical debt, database complexity, and organisational readiness gaps. Discover the critical questions every team should answer before committing to the cloud, and what Jonah would do differently today.

Tech Roundup - February 2022

Tech Roundup - February 2022

2022-02-27

A live monthly roundup covering everything notable in Azure, GitHub, and Azure DevOps through February 2022. Key highlights include GitHub Actions support for OpenID Connect (OIDC) deployments to Azure Static Web Apps, GitHub Projects beta improvements (flexible iterations, insights filtering), and updates to GitHub Codespaces and the GitHub CLI. On the Azure side: Azure Monitor gains OpenTelemetry integration and improved observability tooling, Azure Payments HSM enters public preview, and Azure Arc cloud-to-edge observability is explored. Chris also shares Cloud With Chris updates including a new Hugo-powered presentations site hosted on Azure Static Web Apps, a feature flags episode, and community highlights from the UK Azure Community Day.