Course

Azure AI-300: Operationalizing ML and GenAI Solutions

Microsoft Certified: Machine Learning Operations Engineer Associate

0/24 modules complete0%

Modules

24 total

AI-300 Operating Model

Understand what AI-300 expects from an operations engineer, how ML ops and GenAI ops differ, and why the exam is lifecycle-heavy.

Continue

Azure AI Resource Topology

Understand how Azure ML workspace resources, associated resources, identities, storage, registries, compute, endpoints, and Foundry projects relate.

Open

Source Control And Reproducibility Baseline

Understand why production AI systems need versioned code, data references, environments, prompts, model artifacts, and deployment definitions.

Open

Azure ML Workspace Provisioning

Create and organize Azure ML workspaces, configure associated resources, and reason about workspace boundaries.

Open

Data, Compute, Environments, And Components

Understand how data assets, datastores, compute targets, environments, components, and registries make ML work reproducible.

Open

Secure And Automated Infrastructure

Understand how managed identity, RBAC, network isolation, Bicep, Azure CLI, and GitHub Actions support repeatable infrastructure.

Open

From Notebook To Tracked Experiment

Turn exploratory notebook work into tracked, reproducible experiments with parameters, metrics, artifacts, and run lineage.

Open

MLflow And Model Registry

Use MLflow tracking and model registry operations to turn experiment outputs into governable model artifacts.

Open

Training Strategy Selection

Choose between custom training scripts, AutoML, hyperparameter tuning, distributed training, and feature retrieval.

Open

Pipeline-Oriented Model Lifecycle

Make preprocessing, training, evaluation, registration, and promotion into a repeatable pipeline.

Open

Online And Batch Inference

Choose between managed online endpoints and batch endpoints based on scaling, latency, cost, and invocation patterns.

Open

Progressive Rollout And Troubleshooting

Operate production inference with deployment variants, traffic routing, rollback, logs, and failure triage.

Open

Production Model Monitoring

Monitor model quality, service health, latency, errors, and operational telemetry after deployment.

Open

Drift Detection And Retraining Policy

Detect data drift, concept drift, and quality degradation, and define retraining or rollback triggers.

Open

Responsible AI And Operational Gates

Integrate responsible AI evaluation into model promotion, risk review, and production readiness.

Open

Foundry Project Environments

Structure GenAI applications with Foundry project environments, model deployments, identities, RBAC, and environment separation.

Open

Foundation Model Deployment Strategy

Choose, deploy, version, and operate foundation models including throughput and cost constraints.

Open

Prompt Versioning And GenAI CI/CD

Turn prompts into deployable artifacts with versioning, variants, review, automated evaluation, and rollout controls.

Open

Evaluation Dataset And Metric Design

Design evaluation datasets, map input/output columns, and select metrics for answer quality.

Open

Safety And Custom Evaluators

Use risk/safety evaluators and custom evaluators to test product-specific failure modes.

Open

Observability, Tracing, And Cost

Use traces, latency, throughput, failures, token usage, and cost telemetry for GenAI debugging and operations.

Open

Retrieval And Chunking Optimization

Tune chunk size, overlap, metadata, similarity thresholds, and retrieval strategy for grounded answer quality.

Open

Hybrid Search, Semantic Ranking, And Embeddings

Choose between vector, keyword, hybrid search, semantic ranking, and embedding model changes.

Open

Fine-Tuning, Synthetic Data, And Final Readiness

Decide when to optimize prompts, retrieval, embeddings, fine-tuning data, or fine-tuned model operations.

Open