Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Power BI Machine Learning and OpenAI

You're reading from   Power BI Machine Learning and OpenAI Explore data through business intelligence, predictive analytics, and text generation

Arrow left icon
Product type Paperback
Published in May 2023
Publisher Packt
ISBN-13 9781837636150
Length 308 pages
Edition 1st Edition
Languages
Arrow right icon
Author (1):
Arrow left icon
Greg Beaumont Greg Beaumont
Author Profile Icon Greg Beaumont
Greg Beaumont
Arrow right icon
View More author details
Toc

Table of Contents (21) Chapters Close

Preface 1. Part 1: Data Exploration and Preparation
2. Chapter 1: Requirements, Data Modeling, and Planning FREE CHAPTER 3. Chapter 2: Preparing and Ingesting Data with Power Query 4. Chapter 3: Exploring Data Using Power BI and Creating a Semantic Model 5. Chapter 4: Model Data for Machine Learning in Power BI 6. Part 2: Artificial Intelligence and Machine Learning Visuals and Publishing to the Power BI Service
7. Chapter 5: Discovering Features Using Analytics and AI Visuals 8. Chapter 6: Discovering New Features Using R and Python Visuals 9. Chapter 7: Deploying Data Ingestion and Transformation Components to the Power BI Cloud Service 10. Part 3: Machine Learning in Power BI
11. Chapter 8: Building Machine Learning Models with Power BI 12. Chapter 9: Evaluating Trained and Tested ML Models 13. Chapter 10: Iterating Power BI ML models 14. Chapter 11: Applying Power BI ML Models 15. Part 4: Integrating OpenAI with Power BI
16. Chapter 12: Use Cases for OpenAI 17. Chapter 13: Using OpenAI and Azure OpenAI in Power BI Dataflows 18. Chapter 14: Project Review and Looking Forward 19. Index 20. Other Books You May Enjoy

Requirements, Data Modeling, and Planning

You begin your journey by assessing the requirements and data for your project. The use case will be a fictional scenario, but everything will be built using real data from the Federal Aviation Administration’s (FAA) Wildlife Strike Database. The data is real, the topic can be understood by anyone, and the findings within the data are interesting and fun. According to the FAA’s website, about 47 animal strikes are reported daily by aircraft. These incidents can damage airplanes, potentially endanger passengers, and negatively impact wild animal (especially bird) populations.

For the use case, you have been assigned to provide your leadership with tools to do an interactive analysis of the FAA Wildlife Strike data, find insights about factors that influence the incidents, and also make predictions about future wildlife strike incidents and the associated costs. The primary goal of your project, predicting the future impact of FAA Wildlife Strikes, will require building some Power BI machine learning models.

Before uploading data to Power BI’s machine learning (ML) tools, you’ll need to create tables of data that will train the ML models. There is an old saying about data and analytics: “Garbage in, garbage out.” Software as a Service (SaaS) machine learning tools are easy to use, but you still need to feed them good-quality curated data. Identifying the right training data and getting it into the right format are crucial steps in an ML project.

This project will encompass data exploration, data transformation, data analysis, and additional downstream data transformations before you begin working with Power BI ML tools. You are already an experienced business intelligence (BI) professional and Power BI user, and now you are ready to take your skills to the next level with ML in Power BI!

Power BI supports connections to source data in many different formats, ranging from relational databases to unstructured sources to big flat tables of raw data. Countless books have been written about the best ways to structure and model data for different use cases. Rather than dive into the specifics of data modeling, for this book, we will begin with two simple assumptions:

  • Most of the time, a star schema design will provide the most efficient storage and query performance for business intelligence data models
  • Basic ML models, such as the ones you will build in this book, are usually created with a flattened table

Just to be clear, not every solution will follow these assumptions. Rather, these assumptions are generalizations that can provide you with a starting point as you approach the design of a new data model. Quite often, there will not be a perfect answer, and the optimal design will be dictated by the types of queries and business logic that are generated by the end consumers of the data model.

If you’ve never heard the terms star schema and flattened data before, don’t worry! The book will progress at a pace that is intended to help you learn and will also stay at a level that makes sense when you review the FAA data. Let’s browse the FAA Wildlife Strike data and decide upon the best data modeling strategy for your new project!

In this chapter, we will take the following steps so that you can understand the data, think through how it will be used, and then formulate a preliminary plan for the data model:

  • Reviewing the source data
  • Reviewing the requirements for the solution
  • Designing a preliminary data model
  • Considerations for ML
You have been reading a chapter from
Power BI Machine Learning and OpenAI
Published in: May 2023
Publisher: Packt
ISBN-13: 9781837636150
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image