Fabric Warehouse Advisor¶
A modular Python advisory framework for Microsoft Fabric Warehouse. Each advisor module analyses a different aspect of warehouse health and produces actionable recommendations with rich reports.
Everything runs inside a Fabric Notebook — no external tools, no data leaves your environment.
Available Advisors¶
| Advisor | What it does | Output |
|---|---|---|
| Data Clustering | Recommends which tables and columns should use CLUSTER BY |
Scored recommendations (0–100) with CTAS DDL |
| Performance Check | Detects data-type, query regression, caching misconfigurations, V-Order status, and statistics health problems | Findings (Critical / Warning / Info) |
Why use it?¶
- Data-driven decisions — recommendations are based on your real workload, not rules of thumb
- Zero setup — Query Insights is enabled by default on every Fabric Warehouse; just install the library and run
- Non-invasive — read-only analysis via T-SQL passthrough; nothing is modified in your warehouse
- Rich output — interactive HTML reports, Markdown, plain text, and Spark DataFrames you can persist to Delta for tracking over time
- Cross-workspace support — analyse warehouses in other Fabric workspaces from a single notebook
- Fully configurable — every threshold, toggle, and weight is exposed as a dataclass field
Quick Start¶
from fabric_warehouse_advisor import (
DataClusteringAdvisor, DataClusteringConfig,
PerformanceCheckAdvisor, PerformanceCheckConfig,
)
# --- Data Clustering ---
dc_config = DataClusteringConfig(warehouse_name="MyWarehouse")
dc_result = DataClusteringAdvisor(spark, dc_config).run()
displayHTML(dc_result.html_report)
# --- Performance Check ---
pc_config = PerformanceCheckConfig(warehouse_name="MyWarehouse")
pc_result = PerformanceCheckAdvisor(spark, pc_config).run()
displayHTML(pc_result.html_report)
Documentation¶
| Document | Description |
|---|---|
| Getting Started | Installation, first run, prerequisites |
| Advisors | Overview |
| Data Clustering | Overview, pipeline, configuration, scoring, reports |
| Performance Check | Overview, pipeline, configuration, checks reference, reports |
| Cross-Workspace | Analysing warehouses in other workspaces |
| Troubleshooting | Common issues and solutions |
License¶
MIT — see LICENSE for details.