Blog/connecting AI to Power BI Beyond Microsoft Copilot — enterprise Microsoft consulting resource from EPC Group. We provide strategic guidance, implementation expertise, governance frameworks, and compliance-native delivery across the Microsoft ecosystem (Power BI, Microsoft Fabric, Microsoft 365, SharePoint, Azure, AI Governance, Microsoft Copilot).
Key Facts
- 29 years of Microsoft enterprise consulting; 6,500+ SharePoint and 1,500+ Power BI deployments.
- Compliance-native delivery across HIPAA, SOC 2, FedRAMP, FINRA, CMMC, and GxP environments.
- Microsoft Solutions Partner with experience across core current designations.
- Senior architect named on every engagement Statement of Work.
- Engagement Operating Model: published seven-phase Microsoft project management methodology.
- Free initial consultation; fixed-fee scoped Statements of Work.
Connecting AI to Power BI: 5 Approaches Beyond Microsoft Copilot
By Errin O'Connor — April 2026
Microsoft Copilot for Power BI is the obvious starting point for AI-powered analytics — natural language questions, auto-generated visuals, and narrative summaries built into the Power BI interface. But Copilot is one model, one vendor, one set of capabilities. Enterprise organizations need a broader AI toolkit: large language models for narrative generation, custom ML models for domain-specific predictions, computer vision for document extraction, and multi-model orchestration that routes the right task to the right AI. This guide covers five production-proven approaches to connecting AI with Power BI that go beyond what Copilot offers today.
Approach 1: Azure OpenAI Service — LLM-Powered Annotations and Anomaly Explanation
Azure OpenAI provides you with GPT-4o, GPT-4.1, and other advanced models within your Azure tenant. It includes enterprise security controls, private endpoints, and data residency guarantees.
The main use case for Power BI integration is automated insight generation. This allows the LLM to:
- Explain why a KPI changed
- Identify what drove an anomaly
- Suggest insights based on data patterns
Architecture pattern: A Fabric notebook or Azure Function runs on a schedule. This schedule can be hourly, daily, or triggered by a data refresh.
It queries the Power BI semantic model through the XMLA endpoint or reads from the Lakehouse.
Next, it creates a prompt using the relevant data context. For example, it might say, “Revenue dropped 12% week-over-week. Here are the top 10 contributing factors by dimension...” This prompt is sent to Azure OpenAI. The LLM's response is then written to a table.
Finally, the Power BI report displays this narrative in a text card or custom visual. It updates with each data refresh.
When to use: Real-time or near-real-time annotations inside dashboards. Anomaly explanation. Automated data storytelling for operational dashboards where users need context, not just numbers.
Limitations: Azure OpenAI context windows (128K tokens for GPT-4o) may not be sufficient for very large datasets without summarization. Cost at high inference volume can be significant — EPC Group recommends caching responses and only re-generating when the underlying data changes meaningfully.
Approach 2: Anthropic Claude via API — Long-Context Narrative Generation
Azure OpenAI is strong in short-form, structured outputs. In contrast, Anthropic's Claude models excel in long-context analysis and narrative writing.
Claude can handle over 200K tokens in a single prompt. This means you can input an entire quarter's worth of KPI data from all business units, regions, and product lines. In return, you will receive a coherent executive summary that rivals the work of a senior analyst.
Architecture pattern: A Python script runs as an Azure Function, Fabric notebook, or scheduled job. It extracts data from the Power BI semantic model and formats it as structured CSV or JSON. The script sends this data to the Claude API with a detailed system prompt that defines the report format and tone. Finally, it writes the resulting narrative to Azure Blob Storage or a database table.
Power BI displays the narrative, or it is delivered as a PDF attachment via Power Automate.
Use cases EPC Group has deployed:
- Quarterly Business Reviews (QBRs) — Claude generates 10–15 page narrative reports summarizing financial performance, operational KPIs, and strategic recommendations from Power BI data. Delivered to the board as polished PDFs.
- Healthcare compliance narratives — for AI governance in healthcare, Claude summarizes patient outcome dashboards into narratives that compliance officers review, with citations back to the source data.
- Multi-property hotel performance summaries — weekly narratives for hotel GMs comparing their property to portfolio averages, generated from the same Power BI semantic model that feeds the interactive dashboards.
Governance consideration: Claude is accessed through an API. This means that data leaves your Azure tenant and goes to Anthropic's infrastructure. For HIPAA or FedRAMP workloads, EPC Group sanitizes PII before sending data to the Claude API. All identifiable data is kept within Azure.
For non-regulated workloads, Anthropic's data retention policies ensure adequate protection. These policies include:
- No training on API data.
Approach 3: Python Visuals with Embedded ML Models
Power BI supports Python and R visuals that execute scripts when the visual renders. This opens the door to embedding machine learning models directly in the report — anomaly detection, clustering, forecasting, or classification displayed as matplotlib, seaborn, or plotly charts.
Architecture pattern: A Python visual gets a filtered subset of data from the Power BI semantic model. This data is based on the active slicers and filters. The script loads a pre-trained model, which can be a pickled scikit-learn model, an ONNX model, or a simple statistical algorithm. It then runs inference on the data and displays the result as a chart. The visual updates in real-time as users interact with the filters.
Production examples:
- Anomaly detection visual — Isolation Forest model flags outlier transactions in financial dashboards. Red dots on a scatter plot indicate anomalies, with tooltips showing the anomaly score.
- Customer segmentation — K-means clustering visual that segments customers by purchase behavior, displayed as a 2D scatter plot with PCA-reduced dimensions. Clicking a cluster filters the rest of the dashboard to that segment.
- Time-series decomposition — statsmodels seasonal decomposition showing trend, seasonal, and residual components of a KPI, helping analysts understand whether a change is structural or cyclical.
Limitations: Python visuals in the Power BI service operate in a sandboxed environment. They have no network access, a 5-minute timeout, and limited library availability. These visuals re-execute with every interaction, which can slow down reports.
For production, EPC Group uses Python visuals for display. The actual ML inference runs in a Fabric notebook or an Azure ML pipeline. This setup writes results to a table, allowing the visual to read the pre-computed output.
Approach 4: Microsoft Fabric Data Science Notebooks
Microsoft Fabric unifies data engineering, data science, and BI in a single platform. Fabric Data Science notebooks (Spark-based, supporting PySpark, Python, R, and Scala) can train and deploy ML models that write predictions directly to the Lakehouse — and Power BI reads from the same Lakehouse via DirectLake mode.
Architecture pattern: A Fabric notebook reads training data from the Lakehouse. It trains a model using MLflow for experiment tracking. The model is then registered in the Fabric ML model registry. Finally, a scoring pipeline is scheduled to write predictions to a Lakehouse Delta table.
The Power BI semantic model includes this table via DirectLake. This setup ensures that predictions appear in dashboards with near-zero latency and no data duplication.
Use cases:
- Demand forecasting — LightGBM or Prophet model trained on historical sales data, external signals (weather, events, economic indicators), and calendar features. Predictions written to Lakehouse and displayed alongside actuals in the Power BI sales dashboard.
- Customer churn prediction — classification model scoring active customers daily. Churn probability appears as a column in the customer dimension, enabling Power BI slicers like “show me high-risk customers with > $100K annual spend.”
- Supply chain risk scoring — model that scores suppliers based on lead time variability, quality defect rates, financial health indicators, and geopolitical risk factors. Risk scores surface in Power BI procurement dashboards.
Why this is EPC Group's preferred approach: Fabric notebooks maintain all data within the Microsoft security boundary. They use the same capacity as Power BI, eliminating separate Azure ML billing. Additionally, they support MLflow for model versioning and governance. Fabric notebooks also write directly to Lakehouse tables that Power BI accesses via DirectLake, providing the tightest integration available today.
Approach 5: Custom AI Visuals (Power BI Custom Visuals SDK)
The Power BI Custom Visuals SDK allows developers to build TypeScript/D3.js visuals that can call external APIs — including AI endpoints. This enables embedding AI directly into the visual interaction layer, where users interact with AI outputs as native Power BI elements.
Architecture pattern: A custom visual built with the pbiviz SDK receives data from the semantic model. It calls an Azure Function, which proxies to Azure OpenAI, Claude, or a custom ML endpoint. The visual then renders the AI response alongside the data visualization.
The Azure Function manages:
- Authentication
- Rate limiting
- Response caching
Examples EPC Group has built:
- AI-annotated chart visual — a line chart that automatically displays LLM-generated annotations at significant inflection points. Hover over a data point and see “Revenue increased 23% due to Black Friday promotions and new product launch in the Southeast region.”
- Natural language query panel — a custom visual that provides a chat-like interface within the Power BI report, routing questions to Azure OpenAI with the current filter context as grounding data. More flexible than the built-in Copilot Q&A because it can target custom models and include business-specific instructions.
- Predictive tooltip visual — hover over a customer or product and see a predictive overlay: “This customer has a 73% probability of churning in the next 90 days based on declining order frequency and support ticket volume.”
Limitations: Custom visuals require TypeScript development expertise and must pass organizational governance review before deployment. They also need certification if distributed via AppSource. For most organizations, EPC Group recommends approaches 1–4 first and reserves custom visuals for high-value use cases that justify the development investment.
Building a Multi-Model AI Strategy for Power BI
The common mistake is treating AI integration as a single-model problem — deploying Copilot and calling it done. Enterprise organizations benefit from a multi-model approach where different AI capabilities serve different analytics needs:
| Use Case | Recommended Model/Approach | Integration Point |
|---|---|---|
| Self-service Q&A | Microsoft Copilot for Power BI | Native in Power BI |
| Anomaly explanation | Azure OpenAI (GPT-4o) | Fabric notebook or Azure Function |
| Executive narratives | Anthropic Claude | Scheduled Python job via API |
| Demand forecasting | LightGBM / Prophet in Fabric | Fabric Data Science notebook |
| Customer segmentation | scikit-learn in Fabric | Fabric Data Science notebook |
| Document extraction | Azure AI Document Intelligence | Azure Function pipeline |
| In-visual AI interaction | Custom Power BI visual + Azure OpenAI | Custom Visual SDK |
The orchestration layer determines which model responds to each request. This can be straightforward, using purpose-built pipelines where each use case has its own pipeline. Alternatively, it can be complex, using a semantic router that classifies intent and directs requests to the right model.
EPC Group begins with purpose-built pipelines. We add orchestration only when the number of AI touchpoints makes it necessary.
Frequently Asked Questions
Can I use Azure OpenAI directly inside Power BI reports?
Not as a native visual — but you can call Azure OpenAI endpoints from Power BI in three ways: (1) Python or R visuals that invoke the Azure OpenAI SDK at render time, (2) Power Automate flows triggered by Power BI data alerts that send prompts to Azure OpenAI and write results back to a Dataverse or SQL table displayed in the report, or (3) Fabric Data Science notebooks that score data and store AI-generated outputs in a Lakehouse table consumed by the semantic model. EPC Group recommends option 3 for production workloads because it decouples the AI inference from report rendering, avoids timeout issues, and allows you to cache and audit AI outputs.
How is Anthropic Claude different from Azure OpenAI for BI narratives?
Claude excels at long-context analysis — it can process 200K+ tokens of tabular data in a single prompt, making it ideal for generating executive narratives that summarize an entire quarter's performance across dozens of KPIs without chunking. Azure OpenAI (GPT-4o) has strong instruction-following and function-calling capabilities that work well for structured extraction and short-form insights. EPC Group uses Claude for quarterly business reviews and board-ready summaries, and Azure OpenAI for real-time, shorter-form annotations within dashboards.
Are Python visuals in Power BI suitable for production ML models?
Python visuals are useful for prototyping and displaying ML outputs but have significant limitations in production: they run in a sandboxed Python environment on the Power BI service, have a 5-minute timeout, cannot access the network (no API calls), and re-execute on every interaction. For production ML, EPC Group deploys models as Azure ML managed endpoints or Fabric ML models and writes predictions to a table that Power BI consumes via Import or DirectQuery — the visual displays results, not runs inference.
What is the best multi-model AI strategy for enterprise Power BI?
EPC Group recommends a tiered approach: Microsoft Copilot for Power BI as the self-service Q&A layer (natural language to visual), Azure OpenAI for structured in-dashboard annotations and anomaly explanations, Claude for long-form narrative generation and executive summaries, and custom ML models (scikit-learn, LightGBM, or PyTorch) deployed as Fabric ML models or Azure ML endpoints for domain-specific predictions like demand forecasting or churn scoring. Each model serves a different use case — the key is routing the right task to the right model.
How do you ensure AI governance when connecting multiple AI models to Power BI?
EPC Group implements AI governance at three layers: (1) Data governance — semantic model access controls, RLS, and sensitivity labels ensure AI models only access authorized data. (2) Model governance — all AI endpoints are registered in a model catalog with version tracking, input/output logging, and performance monitoring. (3) Output governance — AI-generated content in dashboards is labeled as AI-generated, includes confidence scores where applicable, and is subject to human review workflows for high-stakes decisions. This framework aligns with the NIST AI Risk Management Framework and EU AI Act requirements.
Connect AI to Your Power BI Environment
EPC Group specializes in creating and implementing multi-model AI integrations for Power BI. Our services include:
- Azure OpenAI annotations
- Claude narrative generation
- Fabric ML models
- Custom AI visuals
We assist enterprise organizations in advancing from basic Copilot to production AI that influences real business decisions. Call (888) 381-9725 or request a consultation to discuss your AI + BI architecture.
Request an AI + BI Architecture ConsultationPower BI Strategy: 2026 Considerations for Blog Connecting AI To Power BI Beyond Microsoft Copilot
Power BI capacity sizing in 2026 begins with the F-SKU economics:
- F2 ($263/mo): Covers small workloads with up to 4 GB of memory and about 30 reports.
- F4 ($526/mo): Suitable for typical mid-market deployments with semantic-model refresh windows under 10 minutes.
- F64 ($5,257/mo): Ideal for enterprises using Power BI with Microsoft Fabric data engineering, lakehouse storage, and real-time intelligence.
It is important to revisit capacity right-sizing every 90 days. Microsoft updates F-SKU memory allocations, paginated report performance, and Direct Lake mode availability with each major service update.
Direct Lake mode has transformed the economics of enterprise Power BI in 2026. Instead of importing data into Vertipaq, semantic models now query OneLake-resident Parquet files. This approach offers near-Import-mode performance without the refresh-window cost.
For a Fortune 500 finance organization migrating from a 30-minute Import-mode refresh, the Direct Lake model typically queries fact data in under 800 ms. Additionally, it eliminates the entire refresh-orchestration job from Azure Data Factory.
Decision factors EPC Group evaluates
- Capacity sizing decision (F2/F4/F64+) tied to peak concurrent users and refresh window
- Copilot grounding quality assessment of semantic-model metadata
- Direct Lake mode adoption for Fabric-resident semantic models
- License optimization audit (Pro vs Premium Per User vs F-SKU)
- Row-level security via service principal authentication
See related EPC Group services at /services or schedule a discovery call at /contact.
Connecting AI to Power BI Beyond Microsoft Copilot for Fortune 500 and regulated industries
This deep-dive on connecting AI to Power BI goes beyond Microsoft Copilot. It showcases EPC Group's 29 years of Microsoft-exclusive consulting.
Our insights come from the real-world experience of senior architects. They have built enterprise environments for Fortune 500 clients in regulated industries. The patterns and trade-offs discussed here are based on actual production work, not vendor presentations.
EPC Group publishes practitioner-grade content because the buying audience for enterprise Microsoft consulting evaluates depth, not adjectives. Every guide pairs the technical position with how a senior architect would execute it, including the compliance, governance, and adoption considerations that determine whether the implementation survives audit and adoption.
Financial services
EPC Group provides essential services for banks, asset managers, and broker-dealers. We engineer:
- SOC 2 audit trails
- FINRA Rule 4511 and SEC 17a-4 retention
- MNPI containment
- Communication Compliance for trading floors
Our standard baseline is Microsoft Purview Audit Premium, which offers seven-year tamper-evident retention. Additionally, Defender for Cloud Apps detects shadow-AI exfiltration before it leads to a compliance event.
How EPC Group engages
Six-phase methodology applied to every engagement, compressed for fixed-fee accelerators and extended for full programs.
- Discovery — two-week assessment of the current estate, gap analysis, risk register, target architecture, costed remediation roadmap.
- Design — senior architect produces the target topology, identity framework, Conditional Access, Purview, governance model, and security posture, reviewed by client leads.
- Pilot — 25 to 100 user pilot in a real business unit. Migrate, apply baselines, test integrations, capture feedback.
- Wave rollout — migrate in waves of 500 to 2,500 users with communications, training, hypercare, and a per-wave retrospective.
- Adoption — role-based training, Champions network, executive sponsor enablement, metrics tracked against a measured baseline.
- Operate — optional managed-services retainer for license optimization, governance reviews, security monitoring, and quarterly business reviews.
Compliance-native, not bolted on
We have achieved zero governance audit failures across more than 11,000 enterprise engagements. Our approach includes integrating key controls from the start. These controls include:
- HIPAA
- SOC 2
- FINRA
- FedRAMP
- CMMC
The regulated-industry posture serves as the baseline, not just an upgrade tier. We provide audit-ready evidence from day one.
Manufacturing and energy
EPC Group helps multi-plant manufacturers and energy operators by integrating Microsoft 365 with operational technology. We protect intellectual property using Purview labels and Endpoint DLP. Additionally, we provide frontline workers with F1 and F3 licensing patterns.
Our multi-region rollouts include:
- Data residency planning
- Offline-capable Power Platform apps for shop-floor environments
Engagement models
Three engagement models cover most enterprise needs. Most clients start with a fixed-fee accelerator and grow into a full program or a managed-services retainer.
- Fixed-fee accelerators — Copilot Readiness, Security Hardening, Tenant Health Check, SharePoint Migration, Teams Governance. Defined scope and price. Typical range $25,000 to $150,000 over four to twelve weeks.
- Project engagements — full migration or governance program with milestone-based billing. Discovery through hypercare. Typical range $150,000 to $750,000-plus over three to nine months.
- Managed services — tiered retainer for ongoing operations. Named senior architect on the account. From $3,500 per month with a twelve-month minimum.
Fixed-fee accelerators with real scope
Our services offer predictable scope, price, and outcome. We provide defined accelerators such as:
- Copilot Readiness
- Security Hardening
- Tenant Health Check
- SharePoint Migration
- Teams Governance
Unlike Big 4 firms that quote open-ended time-and-materials, most projects fall within the following ranges:
- $25K-$150K for accelerators
- $150K-$750K for full programs
Talk to a senior architect
30-minute discovery call. No pitch deck. Call (888) 381-9725 or schedule a discovery call and a senior architect responds within one business day.