Course
Azure AI-300: Operationalizing ML and GenAI Solutions
Microsoft Certified: Machine Learning Operations Engineer Associate
Modules
24 total
AI-300 Operating Model
Understand what AI-300 expects from an operations engineer, how ML ops and GenAI ops differ, and why the exam is lifecycle-heavy.
Azure AI Resource Topology
Understand how Azure ML workspace resources, associated resources, identities, storage, registries, compute, endpoints, and Foundry projects relate.
Source Control And Reproducibility Baseline
Understand why production AI systems need versioned code, data references, environments, prompts, model artifacts, and deployment definitions.
Azure ML Workspace Provisioning
Create and organize Azure ML workspaces, configure associated resources, and reason about workspace boundaries.
Data, Compute, Environments, And Components
Understand how data assets, datastores, compute targets, environments, components, and registries make ML work reproducible.
Secure And Automated Infrastructure
Understand how managed identity, RBAC, network isolation, Bicep, Azure CLI, and GitHub Actions support repeatable infrastructure.
From Notebook To Tracked Experiment
Turn exploratory notebook work into tracked, reproducible experiments with parameters, metrics, artifacts, and run lineage.
MLflow And Model Registry
Use MLflow tracking and model registry operations to turn experiment outputs into governable model artifacts.
Training Strategy Selection
Choose between custom training scripts, AutoML, hyperparameter tuning, distributed training, and feature retrieval.
Pipeline-Oriented Model Lifecycle
Make preprocessing, training, evaluation, registration, and promotion into a repeatable pipeline.
Online And Batch Inference
Choose between managed online endpoints and batch endpoints based on scaling, latency, cost, and invocation patterns.
Progressive Rollout And Troubleshooting
Operate production inference with deployment variants, traffic routing, rollback, logs, and failure triage.
Production Model Monitoring
Monitor model quality, service health, latency, errors, and operational telemetry after deployment.
Drift Detection And Retraining Policy
Detect data drift, concept drift, and quality degradation, and define retraining or rollback triggers.
Responsible AI And Operational Gates
Integrate responsible AI evaluation into model promotion, risk review, and production readiness.
Foundry Project Environments
Structure GenAI applications with Foundry project environments, model deployments, identities, RBAC, and environment separation.
Foundation Model Deployment Strategy
Choose, deploy, version, and operate foundation models including throughput and cost constraints.
Prompt Versioning And GenAI CI/CD
Turn prompts into deployable artifacts with versioning, variants, review, automated evaluation, and rollout controls.
Evaluation Dataset And Metric Design
Design evaluation datasets, map input/output columns, and select metrics for answer quality.
Safety And Custom Evaluators
Use risk/safety evaluators and custom evaluators to test product-specific failure modes.
Observability, Tracing, And Cost
Use traces, latency, throughput, failures, token usage, and cost telemetry for GenAI debugging and operations.
Retrieval And Chunking Optimization
Tune chunk size, overlap, metadata, similarity thresholds, and retrieval strategy for grounded answer quality.
Hybrid Search, Semantic Ranking, And Embeddings
Choose between vector, keyword, hybrid search, semantic ranking, and embedding model changes.
Fine-Tuning, Synthetic Data, And Final Readiness
Decide when to optimize prompts, retrieval, embeddings, fine-tuning data, or fine-tuned model operations.