You Can Pass This Exam For Free
Choose Your Study Path
You passed the ML Associate exam and have 6-12 months of Databricks ML experience. You need to level up on advanced topics like distributed tuning, Lakehouse Monitoring, Feature Store pipelines, deployment strategies, and MLOps testing patterns.
Exam Overview
Format
59 questions, 120 minutes. Multiple choice (single select and multiple select). Scenario-heavy — most questions present production ML scenarios requiring you to choose the best approach. Covers advanced topics not on the Associate exam including distributed hyperparameter tuning, Lakehouse Monitoring, Databricks Asset Bundles, and deployment strategies.
Scoring
Pass/fail based on percentage score. Passing: 70%. No penalty for wrong answers — always guess if unsure. Questions are weighted equally across all domains. May include unscored items for statistical analysis — these are not identified and do not impact your score.
Domains & Weights
- Model Development44%
- MLOps44%
- Model Deployment12%
Registration
$200 USD. Available through Kryterion testing centers or online proctored via WebAssessor. Schedule at databricks.com/certification. Costs $200 USD per attempt. No formal prerequisites, but Databricks recommends the ML Associate certification and 1+ years of hands-on production ML experience on Databricks. Credential is valid for 2 years.
Topic Priority Table
Not all topics are tested equally. Focus your study time on Tier 1 first, then Tier 2. Tier 3 topics rarely appear — just recognize what they do.
Model Development
The largest domain at 44% (~26 questions). Tests advanced ML pipeline construction with SparkML, distributed training and hyperparameter tuning (Optuna, Ray, pandas Function APIs), advanced MLflow usage (nested runs, PyFunc custom models), and Feature Store concepts (point-in-time correctness, online tables, streaming features, on-demand features). This domain has 22 objectives across 4 sub-sections: SparkML (7), Scaling and Tuning (7), Advanced MLflow (3), and Advanced Feature Store (5).
Key Topics
Must-Know Concepts
- SparkML Pipeline construction: stages chain estimators and transformers. Pipeline.fit() trains all stages sequentially, PipelineModel.transform() applies them in order
- Feature transformer selection: StringIndexer (strings to indices), OneHotEncoder (indices to binary vectors), VectorAssembler (columns to single Vector), StandardScaler (normalization)
- CrossValidator with ParamGrid for hyperparameter tuning: CrossValidator trains k x n models (folds x combinations). Use with RegressionEvaluator or classification evaluators
- When to use SparkML vs single-node: SparkML for distributed data (millions+ rows). For single-node models at scale, distribute tuning with SparkTrials or Ray, not SparkML
- Optuna-MLflow integration: Optuna's define-by-run API with MLflow callback for logging. Supports pruning (early stopping bad trials) and multi-objective optimization
- Ray for distributed tuning: distributes independent Python functions across a cluster. Better than Spark for embarrassingly parallel compute-bound workloads
- Pandas Function APIs: applyInPandas() for group-specific model training (e.g., one model per store), mapInPandas() for partition-level distributed inference
- Vertical vs horizontal scaling: vertical = bigger machines (more RAM/CPU per node), horizontal = more machines. Vertical for memory-bound, horizontal for compute-bound workloads
- Data parallelism vs model parallelism: data parallelism splits data across workers (each has full model), model parallelism splits the model across workers (each has part of model)
- Nested MLflow runs: use mlflow.start_run(nested=True) to create parent-child hierarchies for organizing hyperparameter search results under a single parent experiment
- PyFunc custom models: wrap custom inference logic (pre/post-processing, feature engineering) so it runs at prediction time. Ensures training-serving consistency for complex pipelines
- Custom metric/parameter/artifact logging: log_metric() for numeric values, log_param() for configuration, log_artifact() for files (plots, data samples, configs)
- Point-in-time correctness: Feature Store retrieves features as they existed at the prediction timestamp, preventing future data leakage during historical training
- Online tables: low-latency feature serving synced from offline Feature Store tables. Required for real-time serving endpoints that need feature lookups during inference
- On-demand features: computed at request time for features that depend on the prediction request itself (e.g., time since last login). Ensures training-serving consistency
Common Exam Traps
ML Operations (MLOps)
Tied for the largest domain at 44% (~26 questions). Tests model lifecycle management (dev to staging to prod), validation testing strategies (unit, integration, end-to-end), environment architectures with Databricks Asset Bundles, automated retraining workflows, and — most heavily — Lakehouse Monitoring for drift detection. This domain has 20 objectives across 5 sub-sections: Model Lifecycle (2), Validation Testing (4), Environment Architectures (2), Automated Retraining (2), and Drift Detection/Lakehouse Monitoring (10).
Key Topics
Must-Know Concepts
- Model lifecycle architecture: deploy CODE (not models) across environments. Train in dev, validate in staging, serve in prod. The same pipeline code runs in each environment with different configurations
- Unity Catalog model aliases replace legacy stage transitions. Assign aliases like 'champion' and 'challenger' to model versions for lifecycle management
- Unit testing ML code: test individual transformation and feature engineering functions in isolation using pytest. Test data quality assertions, schema validation, and edge cases
- Integration testing: test component interactions across environments — verify feature pipelines produce expected output types, model training completes, and predictions are within expected ranges
- End-to-end pipeline testing: validate the full pipeline from feature computation through training, evaluation, and deployment. Use test datasets and temporary catalogs
- Test organization: separate unit tests (fast, isolated) from integration tests (slower, require infrastructure). Run unit tests on every commit, integration tests on merge to main
- Databricks Asset Bundles (DABs): define ML resources (jobs, pipelines, serving endpoints) as YAML. Deploy to dev/staging/prod with environment-specific overrides using targets
- Infrastructure-as-code with DABs: version control ML pipeline configurations alongside code. Enables reproducible deployments and rollbacks across environments
- Automated retraining triggers: monitor for data drift, prediction drift, or performance degradation. When thresholds are breached, trigger retraining workflows automatically
- Champion-challenger pattern: train a new model (challenger), compare it against the current model (champion) on held-out data or A/B test in production, promote only if the challenger wins
- Lakehouse Monitoring statistical tests: Kolmogorov-Smirnov (KS) for numerical drift, Chi-squared for categorical drift, Jensen-Shannon divergence for distribution comparison
- Three monitoring table types: snapshot (point-in-time data quality), time series (temporal trends), inference (model inputs/outputs/performance)
- Monitor creation and configuration: create monitors on Delta tables in Unity Catalog, configure refresh schedules, set baseline tables for comparison
- Custom metrics in Lakehouse Monitoring: define business-specific metrics beyond built-in statistical tests. Use SQL expressions for custom metric computation
- Feature slicing: analyze drift and performance for specific data segments (e.g., by region, customer type). Identifies localized issues that aggregate metrics miss
Common Exam Traps
Model Deployment
The smallest domain at 12% (~7 questions). Tests deployment strategies (blue-green, canary), custom model serving with PyFunc, REST API integration, and model rollout management. This domain has 5 objectives across 2 sub-sections: Deployment Strategies (2) and Custom Model Serving (3). Despite its low weight, these questions are often the most scenario-heavy and nuanced.
Key Topics
Must-Know Concepts
- Blue-green deployment: two identical environments. Traffic switches entirely from old (blue) to new (green) version. Instant rollback by switching back. Higher cost but zero-downtime
- Canary deployment: gradual traffic routing to new version (e.g., 5% to 25% to 50% to 100%). Monitor metrics at each step. Lower risk but slower full rollout
- Evaluate deployment strategy suitability: blue-green for high-traffic applications needing instant rollback, canary for gradual validation with real production traffic
- PyFunc model registration in Unity Catalog: log custom models with mlflow.pyfunc.log_model(), register in Unity Catalog for governance and lineage tracking
- REST API querying: send prediction requests to model serving endpoints via HTTP POST with JSON payloads. Handle authentication with Databricks personal access tokens
- MLflow Deployments SDK: programmatic interface for creating, updating, and querying model serving endpoints. Alternative to REST API for Python-based workflows
- Custom artifact management: log additional files (preprocessing pipelines, lookup tables, configuration) with the model so they are available at serving time
- Model deployment methods: UI (click-based), REST API (programmatic), MLflow Deployments SDK (Python). Know when to use each based on automation needs
- Traffic splitting for gradual rollout: configure percentage-based traffic routing between model versions on the same serving endpoint for A/B testing or canary deployment
- Endpoint scaling and latency: configure auto-scaling, warm-up strategies, and appropriate instance types to meet latency SLAs for real-time serving
Common Exam Traps
Key ML Professional Concepts Compared
These pairs appear on nearly every exam. Learn the difference and you'll avoid the most common traps.
Top Mistakes to Avoid
Exam-Ready Checklist
Recommended Resources
Free & Official Resources
Paid Courses & Practice Exams
These are recommended if you prefer a structured learning path. They can save time but are not required to pass.