Menu

Course

Azure AI-300: Operationalizing ML and GenAI Solutions

Microsoft Certified: Machine Learning Operations Engineer Associate

0/24 modules complete0%

Modules

24 total

AI-300 Operating Model

Understand what AI-300 expects from an operations engineer, how ML ops and GenAI ops differ, and why the exam is lifecycle-heavy.

Continue
2

Azure AI Resource Topology

Understand how Azure ML workspace resources, associated resources, identities, storage, registries, compute, endpoints, and Foundry projects relate.

Open
3

Source Control And Reproducibility Baseline

Understand why production AI systems need versioned code, data references, environments, prompts, model artifacts, and deployment definitions.

Open
4

Azure ML Workspace Provisioning

Create and organize Azure ML workspaces, configure associated resources, and reason about workspace boundaries.

Open
5

Data, Compute, Environments, And Components

Understand how data assets, datastores, compute targets, environments, components, and registries make ML work reproducible.

Open
6

Secure And Automated Infrastructure

Understand how managed identity, RBAC, network isolation, Bicep, Azure CLI, and GitHub Actions support repeatable infrastructure.

Open
7

From Notebook To Tracked Experiment

Turn exploratory notebook work into tracked, reproducible experiments with parameters, metrics, artifacts, and run lineage.

Open
8

MLflow And Model Registry

Use MLflow tracking and model registry operations to turn experiment outputs into governable model artifacts.

Open
9

Training Strategy Selection

Choose between custom training scripts, AutoML, hyperparameter tuning, distributed training, and feature retrieval.

Open
10

Pipeline-Oriented Model Lifecycle

Make preprocessing, training, evaluation, registration, and promotion into a repeatable pipeline.

Open
11

Online And Batch Inference

Choose between managed online endpoints and batch endpoints based on scaling, latency, cost, and invocation patterns.

Open
12

Progressive Rollout And Troubleshooting

Operate production inference with deployment variants, traffic routing, rollback, logs, and failure triage.

Open
13

Production Model Monitoring

Monitor model quality, service health, latency, errors, and operational telemetry after deployment.

Open
14

Drift Detection And Retraining Policy

Detect data drift, concept drift, and quality degradation, and define retraining or rollback triggers.

Open
15

Responsible AI And Operational Gates

Integrate responsible AI evaluation into model promotion, risk review, and production readiness.

Open
16

Foundry Project Environments

Structure GenAI applications with Foundry project environments, model deployments, identities, RBAC, and environment separation.

Open
17

Foundation Model Deployment Strategy

Choose, deploy, version, and operate foundation models including throughput and cost constraints.

Open
18

Prompt Versioning And GenAI CI/CD

Turn prompts into deployable artifacts with versioning, variants, review, automated evaluation, and rollout controls.

Open
19

Evaluation Dataset And Metric Design

Design evaluation datasets, map input/output columns, and select metrics for answer quality.

Open
20

Safety And Custom Evaluators

Use risk/safety evaluators and custom evaluators to test product-specific failure modes.

Open
21

Observability, Tracing, And Cost

Use traces, latency, throughput, failures, token usage, and cost telemetry for GenAI debugging and operations.

Open
22

Retrieval And Chunking Optimization

Tune chunk size, overlap, metadata, similarity thresholds, and retrieval strategy for grounded answer quality.

Open
23

Hybrid Search, Semantic Ranking, And Embeddings

Choose between vector, keyword, hybrid search, semantic ranking, and embedding model changes.

Open
24

Fine-Tuning, Synthetic Data, And Final Readiness

Decide when to optimize prompts, retrieval, embeddings, fine-tuning data, or fine-tuned model operations.

Open
    Azure AI-300: Operationalizing ML and GenAI Solutions | Synapse