The Problem Microsoft Fabric Solves
Enterprise analytics has faced fragmentation for over a decade. A typical enterprise analytics stack includes several key components:
- A data lake for raw data storage
- An ETL tool for data movement and transformation
- A data warehouse for structured analytics
- A real-time streaming platform for event processing
- A data science environment for machine learning
- A business intelligence tool for reporting and dashboards
Each component has its own provisioning, security model, governance framework, billing, and management overhead.
The industry calls this problem the "analytics tax." It represents the cost of managing integration among different components. This includes:
- Maintaining consistent security
- Ensuring proper governance
- Troubleshooting data pipeline failures across service boundaries
For enterprise organizations, this tax usually consumes 40-60% of the analytics team's time. Consequently, only a limited amount of time remains for actual analysis and generating insights.
Microsoft Fabric addresses this challenge by unifying all analytics tasks on a single platform. It includes a shared data layer known as OneLake.
Additionally, it offers:
- A common security model
- Governance
- Compute capacity
Data is stored only once. All workloads can access it without the need for movement or duplication. This change is significant and fundamentally transforms how enterprise analytics teams operate.
Understanding OneLake: The Foundation of Fabric
OneLake is Fabric's integrated data lake. Understanding OneLake is essential to appreciating Fabric's value. It functions like OneDrive does for Microsoft 365. OneLake provides a single, unified storage layer that all workloads use by default.
OneLake has several key architectural features that enhance its functionality:
- Automatic provisioning with every Fabric tenant, eliminating the need for a separate storage account setup.
- Hierarchical organization using workspaces, lakehouses, and folders.
- Native Delta Lake (Parquet) format for all structured and semi-structured data.
- One copy of data serving all analytics workloads, with no ETL between components.
- ADLS Gen2 API compatibility for existing tools and scripts.
- Built-in governance through Microsoft Purview integration.
OneLake supports shortcuts. These are virtual references to data stored in external accounts. This includes:
- Azure Data Lake Storage
- Amazon S3
- Google Cloud Storage
Shortcuts let external data appear as native OneLake data without the need to copy it.
This feature is especially useful during migration. It allows organizations to gradually move data to OneLake while still accessing their existing storage.
For enterprise organizations, OneLake removes the most costly part of traditional analytics setups: data movement. By allowing every workload to access the same data store, OneLake offers several advantages:
- No need to build and maintain ETL pipelines between the data lake and data warehouse.
- No data synchronization issues between the BI tool and the analytics database.
- No inconsistencies in security models across different storage systems.
- No duplicate storage costs for the same data in various formats.
Fabric Workloads: A Deep Dive
Data Engineering
The Data Engineering workload uses Apache Spark for processing large-scale data. It focuses on data transformation, cleansing, and preparation.
In this workload, raw data from source systems is:
- Ingested
- Cleaned
- Enriched
- Shaped into analytics-ready formats
Fabric's Spark implementation runs on a managed Spark runtime. This eliminates the need for cluster provisioning, Spark version management, and infrastructure setup.
Engineers can write code using:
- PySpark
- Spark SQL
- Scala
- R
Fabric also manages compute allocation from the capacity pool.
Key benefits include:
- Starter pools allow for near-instant Spark session startup in under 30 seconds.
- This eliminates the 3-5 minute cold start issue found in Azure Synapse Spark pools.
Data Factory pipelines are built into Fabric for orchestration. They offer:
- 150+ connectors for data ingestion
- Support for complex pipeline patterns, including conditional logic, loops, and error handling
- Monitoring and alerting for pipeline failures
- Incremental refresh patterns for large datasets
For organizations using Azure Data Factory, migrating to Fabric pipelines is easy. The pipeline authoring experience is very similar, and many existing pipelines can be moved with little change.
Data Warehouse
The Fabric Warehouse provides a fully managed analytical database that is compatible with T-SQL. It can read and write data directly to OneLake in Delta format. This solution is perfect for organizations that have strong SQL skills.
These organizations can benefit from Fabric without the immediate need to adopt Spark.
Warehouse capabilities include:
- Full T-SQL DML (INSERT, UPDATE, DELETE, MERGE)
- Stored procedures and functions
- Cross-warehouse and cross-database queries
- Automatic statistics and query optimization
- Workload isolation for concurrent queries
- Visual query designer for no-code query building
The main difference between Fabric Warehouse and the Lakehouse SQL endpoint is their write capabilities. The Lakehouse SQL endpoint provides read-only T-SQL access to Delta tables.
In contrast, the Warehouse allows full read-write operations. This makes it ideal for traditional data warehouse tasks, such as:
- Staging
- Transformation
- Dimension management
Lakehouse
The Lakehouse offers the flexibility of a data lake and the structure of a data warehouse. It stores data in Delta Lake format, which provides:
- ACID transactions
- Schema enforcement
- Time travel on top of Parquet files
The Lakehouse is the recommended starting point for organizations creating new analytics platforms in Fabric.
Each Lakehouse offers several key features for data management:
- A managed file area for unstructured and semi-structured data.
- A managed tables area with Delta tables.
- An automatic SQL endpoint for T-SQL queries.
- A default semantic model for Power BI.
This dual-mode access enables data engineers to use Spark while analysts can utilize SQL. Both groups access the same data and maintain consistent governance.
Additionally, this setup helps avoid unnecessary data movement between them.
Data Science
The Data Science workload offers a managed environment for machine learning. It integrates MLflow for tracking experiments, registering models, and deploying them.
Data scientists collaborate with data engineers using the same Spark notebooks. They also have access to the same data in OneLake.
Key capabilities include:
- Native MLflow integration for experiment tracking and model versioning.
- Pre-installed ML libraries: scikit-learn, TensorFlow, PyTorch, and XGBoost.
- PREDICT function for applying trained models in T-SQL queries.
- SynapseML for distributed machine learning at scale.
For enterprise organizations, governance integration is essential. Models are registered, versioned, and governed using the same framework as data assets. This approach ensures complete lineage from raw data to training data, model, and prediction.
Real-Time Analytics
Real-Time Analytics, powered by Azure Data Explorer / Kusto, delivers sub-second query performance. It efficiently processes streaming and time-series data.
- Handles IoT telemetry
- Processes application logs
- Analyzes clickstream data
- Supports scenarios needing near-instant analysis of high-volume event streams
The Eventstream component offers a no-code interface for setting up real-time data ingestion. It supports Azure Event Hubs, Kafka, custom applications, and other streaming sources.
Data flows through Eventstreams into KQL databases. It is then instantly queryable using Kusto Query Language (KQL).
Moreover, Real-Time Dashboards offer live visualizations that refresh automatically as new data arrives.
Power BI Integration
Power BI in Fabric is not a bolted-on addition — it is a native workload that reads directly from OneLake. The most significant advancement is Direct Lake mode, which provides import-mode performance with DirectQuery-level data freshness. Direct Lake loads Delta table column chunks directly into memory, bypassing the traditional import process that required scheduled refreshes.
For enterprise organizations, dashboards now display real-time data without the refresh delays seen in import-mode datasets. Reports on multi-billion-row datasets respond in seconds.
Additionally, since there is no import process, data duplication between OneLake and the Power BI model is eliminated. This helps reduce storage costs and prevents refresh failures.
Fabric Licensing and Capacity Planning
| SKU | Capacity Units | Approx. Monthly Cost | Recommended For |
|---|---|---|---|
| F2 | 2 CU | ~$262 | Development and testing |
| F64 | 64 CU | ~$5,242 | Small-to-mid enterprise production |
| F128 | 128 CU | ~$10,483 | Mid enterprise with multiple workloads |
| F256 | 256 CU | ~$20,966 | Large enterprise with heavy workloads |
| F512 | 512 CU | ~$41,932 | Enterprise with large-scale analytics |
Enterprise organizations should keep several key licensing points in mind. The F64 SKU is the minimum required to provide Power BI Pro equivalent capabilities for all users in the capacity.
Users below the F64 level must obtain individual Power BI Pro licenses. These licenses cost $10 per user per month.
Capacity is shared among all workloads. For example, a heavy Spark job can reduce the available capacity for concurrent Power BI queries.
To manage burst workloads, the following features are available:
- Capacity auto-scaling
Furthermore, you can reserve capacity for 1-year or 3-year terms. This option provides significant discounts of up to 40% compared to pay-as-you-go pricing.
For cost comparison, consider the following typical enterprise services:
- Azure Data Factory: $500/month
- Synapse dedicated SQL pool: $3,000/month
- Synapse Spark pool: $2,000/month
- Power BI Premium P1: $4,995/month
The total cost for separate services is approximately $10,495 per month. In comparison, an F128 Fabric capacity costs $10,483 per month. It offers the same or better capability for all four workloads on a single, unified platform.
Enterprise Governance in Microsoft Fabric
Governance is where Fabric provides its most important enterprise value. Traditional analytics environments face governance fragmentation. This includes:
- Different security models for the data lake and the warehouse.
- Separate audit logs for various services.
- No unified lineage tracking across the analytics pipeline.
Fabric governance capabilities offer several key features to enhance data security and management:
- Workspace-level access control with role-based permissions
- Item-level security for individual data assets
- Row-level and column-level security in Warehouses and Lakehouses
- Sensitivity labels from Microsoft Purview applied across all workloads
- Unified audit logging for all data access and modifications
- End-to-end data lineage from source through transformation to report
- Endorsement labels (Certified, Promoted) for trusted data assets
- Domain-based organization for multi-department environments
For regulated industries, Fabric's governance model offers a unified audit trail and access control framework. This is essential for compliance officers.
A single audit log tracks all data access across various workloads. This feature removes the log correlation issues that often affect multi-service analytics architectures.
Migrating from Azure Synapse to Microsoft Fabric
Microsoft has clearly indicated that Fabric is the future of its analytics platform strategy. While Azure Synapse Analytics will still receive support, new feature investments will focus on Fabric.
Enterprise organizations should start planning their migration strategy now.
Migration Path for Synapse Dedicated SQL Pools
Synapse dedicated SQL pools are closely related to Fabric Warehouses. The T-SQL surface area is mostly compatible, but some features vary.
Migration includes several steps:
- Exporting data from dedicated SQL pools to Parquet format in ADLS Gen2.
- Creating Fabric Warehouses and setting up OneLake shortcuts to the exported data.
- Recreating stored procedures, views, and security settings.
- Validating query performance and results parity.
- Redirecting BI tools and applications to the Fabric Warehouse endpoints.
Migration Path for Synapse Spark Pools
Synapse Spark notebooks can be used in Fabric with minimal changes. The main updates include:
- Changing storage paths from ADLS Gen2 to OneLake references
- Moving linked services to Fabric connections
- Adjusting Spark configurations for Fabric's managed runtime
- Reconfiguring any Azure Key Vault integrations
The biggest advantage of migration is the removal of Spark pool management. You no longer need to worry about pool sizing, auto-scale configuration, or library management.
Migration Path for Azure Data Factory
Data Factory pipelines in Fabric offer the same authoring experience as standalone Azure Data Factory. Migration requires several steps:
- Recreate pipelines in Fabric or use export/import for simple pipelines.
- Migrate linked services to Fabric connections.
- Update dataset references to OneLake.
- Reconfigure triggers and monitoring.
Organizations with complex ADF estates, which may include hundreds of pipelines, should plan for a phased migration over 3-6 months.
How EPC Group Helps with Microsoft Fabric
As a Microsoft Gold Partner with deep expertise in Power BI and enterprise analytics, EPC Group provides end-to-end Fabric consulting services. Our team has been working with Fabric since its public preview and has completed Fabric implementations for Fortune 500 organizations across healthcare, finance, and government.
- Fabric readiness assessments — Evaluate your current analytics estate, identify migration candidates, and develop a prioritized adoption roadmap
- Architecture design — Design workspace structures, security models, and data architectures optimized for your workloads and compliance requirements
- Migration execution — Hands-on migration from Azure Synapse, Databricks, or other platforms to Fabric, including data validation and performance testing
- Capacity planning and optimization — Right-size your Fabric capacity, implement monitoring, and optimize workload scheduling to maximize cost efficiency
- Training and enablement — Role-based training for data engineers, analysts, and administrators to ensure your team can operate Fabric independently
Frequently Asked Questions
What is Microsoft Fabric and how is it different from Azure Synapse?
Microsoft Fabric is a unified SaaS analytics platform that brings together data engineering, data science, real-time analytics, data warehousing, and business intelligence in a single product with a shared data lake (OneLake). Unlike Azure Synapse, which required separate provisioning and management of individual services (Synapse SQL, Spark pools, Data Explorer), Fabric provides all workloads in a single capacity with unified governance, security, and billing. Fabric eliminates the need to manage infrastructure, move data between services, or maintain separate security models for each analytics component. Azure Synapse Analytics is being incorporated into Fabric, with Microsoft encouraging migration to the Fabric platform.
How much does Microsoft Fabric cost for enterprise organizations?
Microsoft Fabric uses capacity-based licensing measured in Capacity Units (CUs). The F2 SKU starts at approximately $262/month, while enterprise organizations typically require F64 ($5,242/month) or higher. F256 capacity runs approximately $20,966/month and F512 at approximately $41,932/month. Power BI Pro is included in F64 and above capacities. Organizations can also use pay-as-you-go pricing for burst workloads. The key cost advantage is consolidation — instead of paying separately for Azure Data Factory, Synapse, Data Explorer, and Power BI Premium, all workloads run on a single capacity with shared compute resources. Trial capacities are available for evaluation.
Can Microsoft Fabric replace our existing data warehouse?
Yes, Microsoft Fabric can replace traditional data warehouses for most enterprise use cases. Fabric offers two primary storage paradigms: the Lakehouse (based on Delta Lake format, optimized for large-scale data engineering and data science) and the Warehouse (T-SQL-based, optimized for traditional BI workloads and SQL-familiar teams). The Warehouse supports full T-SQL DML operations, stored procedures, and cross-database queries. For organizations currently on Azure SQL Data Warehouse or Synapse dedicated SQL pools, migration to Fabric Warehouse provides comparable functionality with simplified management, unified governance, and integrated Power BI. However, organizations with extremely specialized analytical database requirements (columnar compression, materialized views at massive scale) should conduct a thorough proof-of-concept before committing to migration.
How does Microsoft Fabric integrate with Power BI?
Power BI is natively integrated into Microsoft Fabric as one of its core workloads. All data stored in OneLake is automatically accessible to Power BI without data movement or duplication. Direct Lake mode allows Power BI to query Delta tables in OneLake directly, combining the performance of import mode with the freshness of DirectQuery. Semantic models (formerly datasets) can be built directly on Fabric lakehouses and warehouses. Capacity purchased for Fabric also covers Power BI workloads — organizations on F64 or above get Power BI Pro equivalent capabilities included. This integration eliminates the historical gap between data engineering and business intelligence, enabling analysts to build reports directly on governed, engineering-quality data assets.
What skills does our team need to use Microsoft Fabric effectively?
Microsoft Fabric is designed to serve multiple skill levels. Data engineers work primarily with Spark notebooks (PySpark, Spark SQL, Scala) and Data Factory pipelines for ETL orchestration. Data scientists use Spark notebooks with ML libraries and the built-in MLflow integration for experiment tracking. SQL analysts work with the Warehouse experience using familiar T-SQL syntax. Business analysts create reports using Power BI, which can now connect directly to Fabric data assets. Data administrators manage governance through Purview integration, security policies, and monitoring dashboards. The most critical skill gap for most organizations is Spark/PySpark proficiency for the Lakehouse workload — this is where training investment delivers the highest return. Organizations heavily invested in T-SQL skills can start with the Warehouse workload while gradually building Spark capabilities.
Ready to Explore Microsoft Fabric?
EPC Group offers Fabric readiness assessments, architecture design, and migration services for enterprise organizations.
Begin with a complimentary consultation to determine if Fabric is the right platform for your analytics workloads.
Schedule a Fabric ConsultationErrin O'Connor
CEO & Chief AI Architect at EPC Group | 29 years Microsoft consulting | Author, Power BI Field Guide (Microsoft Press)
