Engineering Insights

The Blog

Real-world deep dives on DevOps, MLOps, Data Engineering, and Generative AI. Written by engineers who ship production systems.

All Posts MLOps / AI DevOps / K8s Data Engineering FinOps
MLOps / AI

Deploying 200+ AI Agents on AWS Bedrock AgentCore

How we orchestrated 200+ specialized AI agents across production use cases — architecture, IAM, observability, and lessons learned the hard way.

Bedrock AgentCore AWS
12 min
Read article
MLOps / AI

AWS Bedrock vs Azure OpenAI for Enterprise: The Honest 2025 Comparison

Engineer-to-engineer: pricing, compliance, developer experience, latency, and when to actually pick each. No vendor bias.

Bedrock Azure OpenAI LLM
14 min
Read article
MLOps

MLflow vs SageMaker MLOps: Which Should Your Team Use in 2025?

Honest engineer comparison: tracking, registry, pipelines, serving, and cost. Plus the hybrid approach most teams actually end up using.

MLflow SageMaker MLOps
11 min
Read article
DevOps / K8s

The Kubernetes Cost Optimisation Checklist: 47 Ways to Cut Your K8s Bill in 2025

47 actionable techniques: right-sizing, Karpenter, Spot, networking, storage, observability. We cut a client's K8s bill by 61% using this exact checklist.

Kubernetes Karpenter FinOps
18 min
Read article
Data Engineering

Real-Time Lakehouse on AWS: Kafka → Glue → Redshift Serverless

Step-by-step architecture for a production real-time lakehouse. From Kafka producers to sub-second Redshift queries — complete with IAM, cost, and gotchas.

Kafka Glue Redshift
16 min
Read article
FinOps

SageMaker Zombie Endpoints: The $700K/yr Cloud Cost Spiral

How idle SageMaker endpoints silently drain cloud budgets — detection scripts, auto-cleanup Lambda, and the governance model that prevents recurrence.

SageMaker FinOps Lambda
9 min
Read article