Blog — Inferpathio | ML Governance Writing

Governance | 8 min read

The Governance Gap in ML Operations: Why Experiment Tracking Isn't Enough

Most ML platforms track experiments beautifully. Almost none answer the compliance team's actual question: who decided this model should be in production, and why?

Kevin Nakamura

July 9, 2024

Technical | 12 min read

Drift Detection: PSI vs KL Divergence vs Wasserstein — When to Use Which

Population Stability Index, KL divergence, and Wasserstein distance each catch different failure modes. Here's the practical guide to choosing your drift detection metric.

Yuki Tanaka

August 12, 2024

Governance | 10 min read

Designing Retrain Policies for Enterprise ML: Beyond Cron Jobs

Scheduled retraining is a band-aid. Event-driven retrain policies — triggered by real drift signals and gated by human approval — are how enterprise ML teams maintain accountability at scale.

Kevin Nakamura

September 18, 2024

Enterprise | 9 min read

RBAC for ML Teams: A Practical Guide to Role-Based Model Access

ML Engineer, ML Lead, Compliance Owner, Data Steward — different roles need different permissions over model lifecycle decisions. How to implement meaningful RBAC without slowing down your team.

Priya Anand

October 30, 2024

Compliance | 7 min read

Audit Trails and Model Lineage: What Enterprise Compliance Really Needs

When a compliance auditor asks 'why is model version 3.4 in production instead of 3.3?', you need a complete answer — not a Confluence search party. What a proper ML audit trail contains.

Kevin Nakamura

December 4, 2024

Comparison | 11 min read

MLflow vs W&B vs Inferpathio: Choosing the Right Layer for Your ML Stack

These tools solve different problems. MLflow is an artifact registry. W&B is an experiment tracker. Inferpathio is a governance layer. Here's when you need all three, and when two are enough.

Yuki Tanaka

January 15, 2025

Integration | 14 min read

Connecting SageMaker Endpoints to Inferpathio for Production Governance

Step-by-step guide: connect your SageMaker inference endpoints to Inferpathio's governance layer. Capture prediction logs, set drift thresholds, and route retrain requests through your approval chain.

Priya Anand

February 19, 2025

Technical | 10 min read

Model Rollback Patterns: How to Design Safe Rollback for Production ML

Rolling back a software deployment takes 30 seconds. Rolling back an ML model safely — with feature schema validation, prediction traffic rerouting, and lineage preservation — takes planning.

Kevin Nakamura

March 26, 2025

Governance | 7 min read

Why Manual Approval Still Matters in Automated ML Pipelines

Full automation sounds ideal until a drift-triggered retrain promotes a worse model during a product launch. We make the case for human-in-the-loop approval checkpoints.

Kevin Nakamura

April 14, 2025

Technical | 11 min read

Zero-Downtime Model Swaps: Architecture Patterns for Hot Production Replacement

Shadow mode traffic, canary evaluation, full swap — three deployment patterns for updating a production model without service interruption. Which to choose, and how Inferpathio tracks each.

Priya Anand

May 19, 2025

Compliance | 9 min read

Building ML Compliance Readiness for SOC 2 Reviews

SOC 2 auditors are increasingly asking about model lifecycle controls. What evidence do you need, where does it live, and how do you produce it without a week of manual archaeology?

Kevin Nakamura

July 2, 2025

Technical | 8 min read

Feature Drift vs Concept Drift: Two Problems, Two Responses

Feature drift means your input distribution shifted. Concept drift means the real-world relationship your model learned has changed. They're often confused, and they require different remediation strategies.

Yuki Tanaka

August 5, 2025

Governance | 6 min read

The Enterprise ML Governance Checklist: 24 Controls Your Platform Team Needs

A practical checklist of 24 governance controls across four domains: version lineage, drift monitoring, retrain authorization, and audit readiness. Printable, shareable, and opinionated.

Kevin Nakamura

September 12, 2025

Enterprise | 9 min read

Multi-Team Model Ownership: Governance Patterns for Platform Organizations

When six business units each have their own ML models but share a central platform team, who owns governance? How to structure RBAC, approval chains, and audit trails across organizational boundaries.

Kevin Nakamura

October 22, 2025

ML Governance Thinking

New articles, twice a month