In this tutorial, we will perform Exploratory Data Analysis (EDA) on the Netflix userbase dataset. EDA is a technique used to summarize and analyze data sets using visualizations.
The dataset contains various fields, including user ID, subscription type, monthly revenue, joining date, last payment date, country, age, gender, device, and plan duration.
To get started, we’ll import the data from a CSV file using the “get data” feature in Power BI. Once loaded, all fields will be available for visualization.
We will begin our analysis by creating various visualizations:
- A column chart displaying the count and distinct count of records for all main fields.
- A chart showing the average, minimum, maximum, standard deviation, variance, and median of numeric fields.
- A histogram displaying the distribution of customer ages.
- A column chart displaying the number of customers by country.
- A pie chart showing the number of customers by device and gender.
- A histogram analyzing the number of days since joining and last payment.
- A histogram analyzing monthly revenue.
- A text visual for plan duration.
Next, we’ll create separate dashboards for analyzing data by country, device, gender, and subscription type.
The tutorial covers various data visualization techniques and provides links to the histogram, dataset, and Power BI file in the description below.
Link to the dataset/power bi file: https://drive.google.com/file/d/1V4umooRfR68I4ZTLI4LMUEZbd6_-pAh_/view?usp=sharing
Link to the Histogram: https://drive.google.com/file/d/1nY2eh-YGiVAi8Bxe0z-7epHX26LwaeJR/view?usp=sharing