What is Microsoft Fabric: A beginners guide

Microsoft Fabric is a new SaaS data platform announced in MS Build Conference on May 23, 2023.

As per the official doc

Microsoft Fabric is a unified data platform in the era of AI.  Fabric integrates technologies like Azure Data Factory, Azure Synapse Analytics, and Power BI into a single unified product, empowering data and business professionals alike to unlock the potential of their data and lay the foundation for the era of AI.

To put it simply, Microsoft Fabric simplifies and unifies data operations. From data ingestion to data engineering, data management, machine learning, real time insights and reporting, you can do all in one unified platform.

The Fabric platform is a SaaS service for data and analytical workloads. The platform provides core capabilities of Azure Data Factory, Azure Synapse, and Power BI.

The platform can be used for:

-> Data Ingestion using Data Factory

-> Data Engineering using Synapse

-> Data Science operations using Synapse

-> Data Warehousing using Synapse

-> Real Time Analytics using Synapse

-> Reporting using Power BI

-> Actions using Data Activator (coming soon)

-> Governance using Purview

Why should I care about?

One obvious questions that comes to our mind is why should I care about it? I can anyways go a create a data pipeline today using Azure Data Factory, I can create Spark notebooks on top of Synapse, and create my reports in Power BI. Why should I go and do the same in Fabric?

The answer is: simplicity and unification.

The answer is: It simplifies data collaboration across multiple data professional disciplines

The answer is: It simplifies licensing and purchasing. You don’t have to spin up multiple capacity and infrastructure for your multiple data workloads

The answer is: It simplifies setup and configuration

The answer is: It simplifies data management. No data redundancy or data silos

The answer is: It simplifies data governance

With Microsoft Fabric – a data engineer can create data pipelines to ingest data, using the ingested data, a data warehouse engineer can create data warehouses, an ML engineer using the same data can create models, and an analyst can create data models and Power BI reports – all within the same platform.

Where is my data stored?

All workloads store the data in something new called as “OneLake”. The way we have OneDrive for documents, Microsoft has introduced “OneLake” for data.

OneLake provides a single, unified storage system for data workloads in Fabric. Each tenant has one OneLake provisioned.

There is also a concept of ShortCuts in OneLake which blows away my mind. You can create “shortcuts” of your data in AWS or other clouds right inside OneLake. This means the data stays in the original source and can be used for analytical workloads right through OneLake.

What happens to current Azure analytics solutions?

As per Microsoft they will continue to remain and provide analytical capabilities as a PaaS. Microsoft Fabric simplifies this in the form of SaaS.

Existing Microsoft products such as Azure Synapse Analytics, Azure Data Factory, and Azure Data Explorer will continue to provide a robust, enterprise-grade platform as a service (PaaS) solution for data analytics. Fabric represents an evolution of those offerings in the form of a simplified SaaS solution that can connect to existing PaaS offerings. Customers will be able to upgrade from their current products into Fabric at their own pace.  

How should I get started?

Head to fabric.microsoft.com and try a free trial today. The free trial is for 60 days that will allow you to create warehouses, lakehouses, notebooks and more.

If you are an existing Power BI customer with Premium capacity, you can try it right away.

How to enable Fabric in my tenant?

You need to go to Power BI Admin portal and enable it.

Once on the admin portal, you can “enable” the setting for the entire org or specific security group

After 15 mins you will see an icon at the bottom left corner in your fabric portal. Clicking on it will take you to the Fabric home page.

Is Power BI same as Fabric?

Somehow I feel this is same as Power BI.

Yes, the experience, the UI and core components of Power BI portal are the same in Fabric. This means:

-> Workspaces

-> Navigation

-> Collaboration

-> Content Management

-> Admin portal

-> Capacity

will look familiar to you.

However Fabric is an umbrella platform and Power BI is a critical component of it.

What happens after free trial?

Starting June 1 you can start purchasing Fabric capacities from Azure to supercharge your data and analytical workloads.

Any more questions? Feel free to ask here in the comments.

Fabric learning resources (from Microsoft)

To help you get started with Fabric, there are several resources we recommend:

  • Microsoft Fabric learning paths: Experience a high-level tour of Fabric and how to get started.
  • Microsoft Fabric tutorials: Get detailed tutorials with a step-by-step guide on how to create an end-to-end solution in Fabric. These tutorials focus on a few different common patterns including a lakehouse architecture, data warehouse architecture, real-time analytics, and data science projects.
  • Microsoft Fabric documentation: Read Fabric docs to see detailed documentation for all aspects of Fabric.

My personal favorite is this YouTube video by Justyna Lucznik, Principal Group PM for Microsoft Fabric, on Microsoft Mechanics channel.

Enjoy!

12 sales metrics you need to track

One of the most important question when running a business is: Which metrics should I track?

Whether it’s about team performance, or pipeline performance, or competitor or product performance, it all takes a few metrics to keep track to ensure business visibility.


How quickly will I achieve my goals?

Do I need to speed up my sales process?

Is their a team which needs more attention?

Is their an account that I’m losing?

Are we improving?


Metrics should be shown on a dashboard, big and visible! It can be shown on a TV or big displays.


So, here are 12 sales metrics that you need to show on your Sales dashboard.

Image source: SalesForce.com

Metrics include: Lead by source, Pipeline, Sales cycle, Closed opportunities, New business vs. upsell, Win/loss rate, Product gaps, Open opportunities, Open activities, Open cases, Opportunities past due and Sales by closed date.

Only 12 metrics and you are done with your sales tracking!

But,

Which tool should I use?

How do I extract the data?

How do I plot these?

Which visuals should I use?

How should I show sales deal stage funnel?

You will be amazed to see how the metrics have been placed with the right choice of visuals and the layout in this Power BI report developed by us using HubSpot CRM data.

Sales Dashboard in Power BI using HubSpot data

If you see the report here it is divided into several views:

  1. Overview report – Shows your key metrics and sales funnel. Here we track open revenue, deal owners, sales cycle time along with deal details
  2. Deal Geo analysis – Here you can clearly identify Asian regions not performing well with won rate less than 30%
  3. Deal Region analysis – Imagine an executive interested in Region level performance? This view gives a clear picture where his team needs to focus
  4. Deal Details report – Last but not the least, a detailed report to call out specific deals and work with the account owners to understand progress

The CRM source could be HubSpot, SalesForce or even Microsoft Dynamics.

The tool we have used is Microsoft Power BI to pull, model and visualize raw data.

Interested in viewing the live Power BI report?


Image source and inspiration: Sales Force

Key driver analysis – What influences attrition?

Key driver/influencer analysis using the newly released Power BI “Key influencers” visual.

Key driver analysis or key influencer analysis is critical to understand what factors impact an outcome and/or what is the relative importance of a factor. Example:

What influences employee attrition? Overtime? Job Level?

What influences employee attrition in the Sales Executive role? Distance from home?

What influences customer attrition? High call rate? International Voice plan?

Knowing answers to above helps in decision making.

If employees in Job Role “Healthcare Representatives” leave the most because of the distance from home, maybe offer them fuel reimbursement or maybe offer them accommodation expenses if they stay near to office?


The newly released Power BI “Key influencers” visual (released as part of Feb 2019 Power BI Desktop release) aids such analysis very very quickly with no code! Crazy!

We applied this new visual to analyze what drives employee attrition, and I must say, I’m blown away by the outcome, ease of use, and comprehensiveness of the visual.

Download Power BI report and play with the visual.

But, how does the result look like?

From the visuals above we can clearly see what influences our variable Attrition=Yes. OverTime, MaritalStatus, YearsAtCompany, JobSatisfaction, and so on.

Not only that, the visual also provides the values of the factors which influences our variable of interest the most.

How to interpret the visual?

The likelihood of attrition increases by 2.93x if employees are doing overtime. Or, Attrition is 2.93x more likely in the employees who are doing overtime.

Hmm… if you do overtime, you may quit. This is obvious.

The attrition is 2.18x more likely if employees are single!

Attrition is also high if the Department is Sales.

And so on.

The left-hand side view of Key influencers shows all the factors influencing our “Attrition=Yes” by a factor of 1.0 and above.

The right-hand side view shows the distribution of data with respect to the selected factor and Attrition either as a column chart or a scatter plot.

The dashed line shows Avg. Attrition % of all values except for the key influencer one (in this case except for OverTime = Yes)

There is another view of this visual where we can see Top segments with high attrition % and their characteristics.

Top segments view in Key influencers visual in Power BI

The visual identified 4 segments with high attrition % along with population count. Clicking on a bubble shows us the characteristics of that segment.

Top segments deep dive

Segment 1 with Attrition % as 57.6 has employees in Department Sales, DistanceFromHome > 11, JobLevel is high and OverTime is Yes.

Wow!

You can further drill down this segment by clicking on “Learn more about this segment” and see what other factors influence this segment.

Quick FAQs on Key influencer visuals and its outcome

Can I filter this visual?

Yes, you can. Example: Why are employees in job role “Healthcare Representative” leaving the company?

Filter the visual and the analysis changes!

Is the visual interactive?

Yes, you can select individual influencing factors and see the distribution of Attrition % by the selected factor.

Can I hover over the values in the scatter plot above?

Yes, you can!

Can I see the logic or p-values associated with factors or key influencers?

No, not yet. This visual is in preview mode. Power BI team may add this feature in the future. Not sure about this.

Can I just see the top X key influencers?

No, not yet. This visual is in preview mode. Power BI team may add this feature in the future. Not sure about this.

I do not see my key influencer in this visual?

Yes, this can happen. Based on my R code using RandomForest, Age should also be an influencing factor for attrition but doesn’t show up in Power BI visual.

See this scatter plot. If Age decreases, Attrition % increases. Maybe Power BI just checks how the “increase” direction of a factor increases Attrition % or Maybe the number of data points for lower age and high attrition is less. 

As Age decreases, Attrition % increases

Can I export the data for segments?

No, not yet. This visual is in preview mode. Power BI team may add this feature in the future. Not sure about this.

Does this visual analyze multiple factors and provide conclusions?

I do not think so. In the example below, the likelihood of Attrition % increases by 11.58x if monthly income goes up. But why is that so? Could it be because for those employees the YearsAtCompany is also more?

Maybe Power BI visual needs to remove outliers.

Power BI visual, currently, doesn’t analyze this for us.

Why are employees leaving if we increase their monthly income?

I want to set this up for my data?

Ok, here are steps to achieve this.

Step 1: Download and Install Power BI Desktop Feb 2019 from here.

Step 2: Enable this visual from “Preview features”.

Step 3: Restart Power BI Desktop. Click on the visual highlighted to put it on the canvas.

Step 4: In the visual data options, drag the field to analyze in “Analyze”, and possible influencers in “Explain by”.

Note: The visual is evaluated on the table level of the field being analyzed. In this case, we are analyzing Attrition, and hence the visual runs at an Employee level. So you may not need aggregations on “Explain by” field. Otherwise, appropriate aggregations are required.

Step 5: In the visual select “Yes” in Attrition value. In your case, select the value you want to analyze.

Step 6: Share analysis with your boss/team/company, and say Thank you to us 🙂

No code, drag and drop solution to key influencers analysis in Power BI!

Simple, huh?

Thanks

Ranbeer

PS: This visual is currently not supported in Power BI Embedded, Publish to Web and Power BI Mobile scenarios.

Download Power BI report and play with the visual.

AI + Power BI = Wow BI

I’m glad to inform my readers that Microsoft is adding new AI capabilities inside Power BI. And, these are no-code solutions.

Let’s check these 4 new exciting capabilities in detail and in the order of quick wins as per me.

In this post, I will explain uses cases with examples from multiple industries for each of the new capabilities coming in Power BI. This will be followed by a general approach to solve such problems, and then the new AI + Power BI approach to solve such problems.

Capability 1: Key Driver Analysis – or Key Influencer Analysis

Note: Per Microsoft this would be available to all Power BI users.

Suppose you have a dataset of employee attrition which includes details of the employees who are in the company, who left the company along with age, gender, salary, job role, satisfaction, education, years with current manager etc.

Your task is to find factors influencing attrition. Why are employees leaving the company? What segments of employees are leaving?

A general approach for answering such questions would be to use R or Python, fit a model (say using Random Forest algorithm) or use techniques like RFE (Recursive Feature Elimination) to find out top factors affecting our label – Attrition. More details on this general approach and how we did this using R and Power BI is mentioned in detail in our case study here.

With new AI capabilities in Power BI, this would be just a click away. The outcome of the analysis from Power BI would be shown as a kind of “lollipop” chart as shown below.

Image source: Microsoft

Example: When Parental encouragement is true, the probability of a student to plan to attend college increases by 1.8x,

Or, when the employee has spent more than 2 yrs with current manager and his job satisfaction is low then attrition increases by 2.3%

From the screenshot it is not clear how multiple driver analysis can be performed: Ex: When parental encouragement is true and Gender is male – what happens then? 

A contingency matrix would have helped in this case.

Capability 2: Azure Cognitive Capabilities – Sentiment Analysis, Image tagging, object detection in Power BI

Note: Per Microsoft this would be a Power BI Premium capability

You started a campaign on Twitter and would like to analyze your users sentiments – positive/negative.

For a call center company you would like to analyze chat script and identify key items customers are talking about right within your BI reports.

Or, an E-Commerce company would like to detect objects in the images attached with customer reviews, and identify which product/brand is causing negative sentiments or causing pulling “Andon Cord”.

A general approach would be to use Azure Cognitive APIs inside your Power BI report using Power Query (more about this later) using calls such as: Web.Contents(AzureAPICallWithParams).

Another general approach would be to develop and use custom Deep Learning models. A Twitter sentiment analysis (racial vs non-racial tweet) model was developed by us and is hosted in our GitHub repo.

With new AI capabilities in Power BI, this could be just a matter of invoking a function from Power BI ribbon. We do not know yet how this will be invoked by users. But, definitely this will make our BI reports comprehensive and improve decision making.

A snippet of such comprehensive report is attached below.

Image source: Microsoft

When this comes out in preview we will have to see if Microsoft has provided ability to not fire API calls for items already tagged/analyzed – otherwise you will have to pay for every API calls (even for repeats).

Capability 3: Automated Machine Learning models

Note: Per Microsoft this would be a Power BI Premium capability

Imagine in your Power BI report along side Sales Oppty data I provide you a confidence score or probability score against each Oppty data. The Oppty owner can look at this number and decide which Oppty are more likely to be won so he/she can then focus his/her efforts on top highly likely Oppty.

A general approach to add this would be a data scientist developing such models and a developer integrating it inside the Power BI report, and a business analyst consuming the report.

With new AI capability, Microsoft is targeting business analyst so they can build, train, and apply the models right within Power BI service without writing a single line of code. Isn’t that waow?

From the initial screenshots by Microsoft, it looks like this will be part of DataFlows (another new capability, which I will talk about in later posts)

Image source: Microsoft

When this feature is out in preview, we will have to see how easy will it be to do feature engineering – feature selection, normalization, pruning, binning etc. But, this is for sure going to ease out the effort in long term.

Capability 4: Use your existing Azure ML Models in Power BI

Note: Per Microsoft this would be available to all Power BI users.

This capability is more of easing out collaboration between a data scientist and a business analyst.

Typically a data scientist builds models in Azure ML platform and publishes the model as API endpoint. A data analyst or engineer uses that model endpoint to predict outcomes and populate the data inside the BI report. This BI report is then consumed by a business analyst.

In the new AI approach the models developed by data scientist would easily be searching in Power BI, and a new interface would be provided in Power BI to hook to that model and use it in reports.

There are no screenshots for this capability by Microsoft. 

—–

The public preview of these capabilities will be launched towards the end of Nov 2018.

We would evaluate these capabilities and posts about it when they arrive.

What thoughts you have on these capabilities? How are you going to use these capabilities?

Let us know.

Thank you,

Ranbeer Makin

References:

Power BI AI Capability Announcement

Power BI AI Capability Preview Signup Form

Power BI Embedded for Gov clouds

Most of the articles/posts/questions published on community or websites would be on Power BI “Commercial” product. The URL you have seen would be https://app.powerbi.com

This blog post is about “https://app.powerbigov.us” – Power BI URL for US Gov customers. More specifically this blog post is about embedding Power BI assets (Reports, Dashboards, QnA) for US Gov customers.

I would like to highlight certain differences in app registration URL and config URLs when embedding Power BI assets for US Gov clouds.

-> App Registration URL

The app registration URL for commercial cloud and US Gov cloud for Power BI Embedded are different. For:

  1. Commercial Product -> https://dev.powerbi.com/apps
  2. US Gov Product -> https://dev.powerbigov.us/apps

-> When you register your app, the Power BI API name in Azure Portal would look like this:

Commercial Product

commercialproduct

US Gov Product

govproduct

TIP: Based on our experience, when troubleshooting please look out for the API name that you see in Azure Portal. If you are embedding for Gov clouds, the API name should be “Microsoft Power BI Government Community Cloud”. If that is not the name, you would need to register the app on https://dev.powerbigov.us/apps

-> Embedding Config URLs

Commercial Product

commercialproductconfig

US Gov Product

govproductconfig

You would see “resourceUrl”, “apiUrl” and “embedUrlBase” is different in case of US Gov clouds.

That’s it. These are the only changes in configuration and setup required for embedding in Power BI US Gov clouds. Everything else remain the same.

If you are looking for details on embedding with a sample .NET code, please head to our  blog post on baby steps to embed your Power BI report.

Apply the configuration changes as mentioned in this blog post and you would be set to see the Power of embedded analytics for your US Gov clouds.

Questions? Please contact us through our website.

Thank you