What is the actual difference between Fabric Warehouse and Fabric Lakehouse?
Fabric Warehouse runs on the Polaris distributed SQL engine and exposes a full T-SQL surface — DDL, DML (INSERT, UPDATE, DELETE, MERGE), stored procedures, views, multi-statement transactions, and SQL Server-style permissions. Fabric Lakehouse runs on managed Apache Spark with a read-only T-SQL analytics endpoint over the same Delta tables. Both write open Delta Parquet to OneLake, so a Warehouse table is queryable from a Lakehouse notebook and a Lakehouse Delta table is queryable from Warehouse via shortcut — no copy. The decision is which authoring surface fits the workload: T-SQL teams and dimensional modeling go Warehouse, Spark teams and big-data engineering go Lakehouse, most enterprises use a hybrid medallion (Lakehouse bronze and silver, Warehouse gold).
How does Fabric Warehouse compare to Synapse Dedicated SQL Pool?
Fabric Warehouse is the strategic successor to Synapse Dedicated SQL Pool. Same Polaris engine family, evolved. Three concrete differences. First, storage is open Delta Parquet in OneLake, not the proprietary block format of Dedicated SQL Pool — every other engine can read it without copy. Second, capacity is elastic Fabric F-SKU consumption, not DWU sizing — no manual scale operations during quarter-end close. Third, governance, security, and BI integration sit inside the unified Fabric workspace model rather than Synapse Studio. New builds default to Fabric Warehouse. Existing Synapse Dedicated SQL Pool customers migrate via the EPC Group five-phase Accelerator — see also /azure-synapse-analytics-enterprise-guide-2026.
How does Fabric Warehouse + Lakehouse compare to Snowflake or Databricks?
For Microsoft-anchored enterprises already invested in Power BI, Microsoft 365, Entra ID, Purview, and the broader Fabric workload set, Fabric Warehouse plus Lakehouse wins on unified governance (one Entra ID model, one Purview catalog, one capacity meter), open OneLake storage (Delta Parquet readable by every external engine), zero-copy Direct Lake mode for Power BI, free Microsoft-native source mirroring (Cosmos DB, Azure SQL, on-prem SQL Server via Arc), and lower commercial complexity inside an existing Microsoft EA or MCA. Snowflake wins on multi-cloud portability and a deeper third-party data marketplace. Databricks wins on Spark notebook tooling depth and ML platform maturity. The honest decision turns on incumbent investment, multi-cloud requirement, and where the analytics consumption audience already lives.
What T-SQL features are supported in Fabric Warehouse versus traditional SQL Server?
Fabric Warehouse supports the majority of the production T-SQL surface — DDL (CREATE / ALTER / DROP for tables, views, schemas, procedures, functions), DML (INSERT, UPDATE, DELETE, MERGE), multi-statement transactions, stored procedures, table-valued functions, common table expressions, window functions, and SQL Server-style permissions via Entra ID. Notable gaps versus SQL Server engine — identity columns behave differently, some legacy index types are not present, cross-database queries follow the Fabric workspace and shortcut model rather than three-part naming, and a handful of legacy SET options are not supported. The migration assessment in Phase 1 catalogs every incompatibility and ships a remediation pattern per source workload.
When should we use Mirroring versus a Data Factory pipeline or SSIS migration?
Mirroring is the right answer when the source is a supported system (Cosmos DB, Azure SQL Database, Azure SQL Managed Instance, on-premises SQL Server via Azure Arc, PostgreSQL, Snowflake) and the requirement is near-real-time replication into OneLake Delta with no transformation. It is no-code, zero-meter on the source side for Microsoft-native sources, and removes CDC infrastructure. Data Factory pipelines (and the SSIS-to-ADF migration path at /ssis-to-fabric-data-factory-migration-guide-2026) are the right answer when transformation is required during transit, when the source is not a Mirroring target, or when the pattern is batch ETL with scheduling, parameterization, and lineage. Many enterprise builds use both — Mirroring for the supported operational sources, Data Factory for the long tail.
What is OneLake shortcut and how does it eliminate copies?
A OneLake shortcut is a virtual reference to data held elsewhere — another Fabric workspace, another tenant, an Azure Data Lake Storage Gen2 account, an S3 bucket, a Google Cloud Storage bucket, a Dataverse table, or a Delta Sharing endpoint. Shortcuts are mounted into a Lakehouse or Warehouse and queried in place — no copy, no ingestion latency, no duplicate storage cost. Permissions on the underlying source still apply. The pattern unlocks line-of-business federation, M&A pre-merger data exchange, multi-cloud Delta sharing, and the hybrid medallion architecture where bronze and silver live in Lakehouse and gold in Warehouse reads from the silver shortcut. Shortcuts are the architectural unit that makes OneLake the lake, not a copy of every other lake.
How does Direct Lake mode work and when is it the right Power BI storage mode?
Direct Lake mode is a Power BI storage mode that reads Delta Parquet directly from OneLake without import refresh and without DirectQuery query-time translation. The semantic model loads columns into memory on-demand, page by page, from the Delta file. Performance is import-like for the queries the model touches, with no refresh window and no DirectQuery latency. Direct Lake is the right mode when the underlying Warehouse or Lakehouse gold layer is curated for BI consumption, when query patterns are reasonably predictable, and when the audience needs near-real-time data without scheduled refresh. Import mode still wins for very high-cardinality models with heavy calculation groups; DirectQuery still wins when row-level security demands query-time filtering against an external source. EPC Group ships the decision matrix as part of Phase 4 modernization.
What does an EPC Group Fabric Data Warehouse + Lakehouse engagement deliver?
A fixed-fee five-phase Accelerator anchored on the EPC Group Lifecycle — Assess, Architecture, Migrate, Modernize, Operate — priced $200K to $800K depending on source-system count, workload complexity, regulatory scope, and managed-service tail. Deliverables include the costed roadmap with capacity sizing, Entra ID security model with RLS / CLS / DDM, workspace topology with Git integration and deployment pipelines, EPC Group Kimball Pattern Library DDL and MERGE templates, parallel-run reconciliation harness for finance-grade cutover, Direct Lake activation, Mirroring source onboarding, Spark feature engineering scaffolding, FinOps tuning, Reserved Capacity move, and an auditor-ready control matrix. Senior architect-led, named on-record, no offshore handoff.