cover-img

Data Analysis: Bellabeat Casestudy

Cleaned and visualized the dataset for Google Data Analytics Certification

21 December, 2022

0

0

0

Collaborators

vivek-zanee

Introduction

Welcome to the Bellabeat data analysis case study! In this case study, you will perform many real-world tasks of a junior data analyst. You will imagine you are working for Bellabeat, a high-tech manufacturer of health-focused products for women, and meet different characters and team members. In order to answer the key business questions, you will follow the steps of the data analysis process: ask, prepare, process, analyze, share, and act. Along the way, the Case Study Roadmap tables — including guiding questions and key tasks — will help you stay on the right path.

Scenario

You are a junior data analyst working on the marketing analyst team at Bellabeat, a high-tech manufacturer of health-focused products for women. Bellabeat is a successful small company, but they have the potential to become a larger player in the global smart device market. Urška Sršen, cofounder and Chief Creative Officer of Bellabeat, believes that analyzing smart device fitness data could help unlock new growth opportunities for the company. You have been asked to focus on one of Bellabeat’s products and analyze smart device data to gain insight into how consumers are using their smart devices. The insights you discover will then help guide marketing strategy for the company. You will present your analysis to the Bellabeat executive team along with your high-level recommendations for Bellabeat’s marketing strategy.

Check out the Case Study to know the trends and Insights.

Languages
R

R

How Can a Wellness Technology Company Play It Smart?

Background:
Urška Sršen and Sando Mur founded Bellabeat, a high-tech company that manufactures health-focused smart products. Sršen used her background as an artist to develop beautifully designed technology that informs and inspires women around the world. Collecting data on activity, sleep, stress, and reproductive health has allowed Bellabeat to empower women with knowledge about their own health and habits. Since it was founded in 2013, Bellabeat has grown rapidly and quickly positioned itself as a tech-driven wellness company for women. By 2016, Bellabeat had opened offices around the world and launched multiple products. Bellabeat products became available through a growing number of online retailers in addition to their own e-commerce channel on their website. The company has invested in traditional advertising media, such as radio, out-of-home billboards, print, and television, but focuses on digital marketing extensively. Bellabeat invests year-round in Google Search, maintaining active Facebook and Instagram pages, and consistently engages consumers on Twitter. Additionally, Bellabeat runs video ads on Youtube and display ads on the Google Display Network to support campaigns around key marketing dates. Sršen knows that an analysis of Bellabeat’s available consumer data would reveal more opportunities for growth. She has asked the marketing analytics team to focus on a Bellabeat product and analyze smart device usage data in order to gain insight into how people are already using their smart devices. Then, using this information, she would like high-level recommendations for how these trends can inform Bellabeat marketing strategy.
Aim for analyzing this case study
1.Analyze smart device data to gain insight into how consumers are using their smart devices
2.Present the analysis to the Bellabeat executive team along with your high-level recommendations for Bellabeat’s marketing strategy
Bellabeat's Current Marketting Strategy
1.Bellabeat products became available through a growing number of online retailers in addition to their own website.
2.The company has invested in traditional advertising media: radio, out-of-home billboards, print, and television.
3.Focused on digital marketing extensively.
4.Invests year-round in Google Search , maintaining active Facebook and Instagram pages, and consistently engages consumers on twitter.
5.Runs video ads on Youtube and display ads on the Google Display Network to support campaigns around key marketing dates.
Google Data Analytics Steps: Ask, Prepare, Process, Analyze, Share, Act
Let's go with the steps one by one

-> Ask Phase : What do we need to know?

What are some trends in smart device usage?
How could these trends apply to Bellabeat customers?
How could these trends help influence Bellabeat marketing strategy?
Some questions after looking through the datasets will be
What is the problem you are trying to solve?
How can your insights drive business decisions?

-> Prepare Phase : What is our Data?

Data Collection:
What?: This Kaggle public data set contains personal fitness tracker data from thirty FitBit users, including minute-level data for physical activity, heart rate, and sleep monitoring.
Who?: The dataset was collected by Amazon Mechanical Turk from consenting FitBit users in their survey.
Why?: The data was to collected with an inspiration to understand Human temporal routine behavior and pattern recognition.
Data Brief: The dataset contains 18 tables linked via the user IDs and timestamps. These tables contain information pertaining to the various intensities of physical activity,sedantary periods, calories burned (measured on daily basis, hourly basis and minute-wise basis)sleep duration(daily and minute-wise) and its frequency, weight logs, as well as heartrate.
Data Limitations:
The data has been collected with predefined datasets with limited data. This makes it outdated to use and get current data.
The sample size is of around 33 participants for most of the parameters and lesser for parmeters like heart rate per second (7) and weight (8). It is not sufficient to establish a good confidence level.
The data set does not provide any demographic information pertaining to the participants which makes it difficult to analyse the data with respect to Bellabeat's target of female customers.

Findings to proceed with data

Preparing Work Environment by loading Packages and Data

-> Process Phase: Data Cleaning

For ensuring that each table is cleaned we need to perform the following:
Ensure the naming consistancy.
Check and Remove duplicates and errors if any.
Store time and date in different columns.
Removing duplicates and ensuting naming conventions
Making date formats consistent and Store time and date in different columns
Ensuring that the individual distances sums up to the total distance
As evident from above:
1.The sum of induvidual distances (light_active_distance,moderately_active_distance,very_active_distance) does not add upto the total distance specified. This makes any analysis based upon the sum of these values incorrect. Hence we will not use the sum of these value in any part of our analysis.
2.Another noticable fact is that the most users are majorly lightly active throughout the total distance they cover. Hence, the need to promote some high intensity activites, so that the users may see evident changes.
Now we will be merging the cleaned daily_activity and daily_sleep tables for ease of analysis and visualisation
Summary statistics of all our tables: To get brief insights of our data
From the above we can note that:
1.The mean of total_steps for about 34 participants is below 8,000 which is not sufficient to see maximum health benefits. This shows that most users of fitness devices would require some motivation to increase their activity levels.
2.The mean sedantary minutes are a staggering 991.2 minutes (16.5 hours). This metric gives us the insight that most fitness device users are likely to be people who have long sitting hours and do a desk job- a great idea of target group for bellabeat marketing.
3.The analysis shows that an average participant remains awake in bed for 21-30 minutes, before they are finally able to sleep.
4.The mean Intensity per hour for the participants was merely 12 minutes.
There are days for each participant when no information has been collected. Let us find out the total number of days for each participant when the data was collected.
As we see, information was not collected for all the participants on all the days. Why did the device did not collect information on few days? was the device not used on said dates? or perhaps the participants were involved in other activities like jump rope, swimming, bicycling etc that the device could not register?
Finding out the target audience
We can do this by analysing either the steps taken, the calories burned or the very_active_minutes against the days of the week.
img
Let us confirm our analysis once, with similar calculations and plotting for total_steps taken on each day
img
from the above we achieve that:
The most active days for participants are Tuesdays and Wednesdays
The least active day for participants are Sundays followed by Fridays
So the target audience for Bellabeat is one that most likely:
aims to relax and rest on the beginning and end of weekends(Fridays and Sundays) but
follows a routine and remains active throughout the workdays(mon-thursdays) ,
while being most energetic and active during the mid-week period(Tuesdays and Wednesdays)
Let us also find out the time of the day when the users are most active
img
From the above we can note that:
The most active Time period for participants is around 5pm to 7pm
The least active timeframe for the participants is obviously the early morning hours of 2am to 4am
So the target audience for Bellabeat is one that is most likely:
full time workers who focus on their physical health after work hours are finished (5pm to 7pm)
Further Visual Analysis
Understanding the correlation between the calories burned and the total steps taken
img
The above analysis shows a slightly positive relation between increase in daily number of steps and calories burned i.e with every increase in the number of steps, the calories burned increases.
This information can be used to market features that involve setting and attaining goals to burn more calories by increasing the daily steps.
Understanding the correlation between Calories burned and the sleeplessness period in bed
img
The analysis is depicting slightly negative correlation between energy expenditure throughout the day and sleeplessness while in bed. This means that an increased period of activity is associated with less time spend in bed before finally sleeping.
Let us also analyse the relationship between sedantry minutes and sleep duration
img
As seen in both of the above analysis:
Having increased activity throughout the day is related to less awake time before sleeping.
Likewise increased periods of inactivity(sedantary minutes) is associated with poor amount of sleep.
These insights can be used to market the Bellabeat for saliant feature of improved sleep with regular physical activity.

-> Share Phase : Sharing Recommendations

Summary of our Target Audience:
Bellabeat's marketing team needs to focus their marketing towards the user segment that are:
1.Working adults that mostly does a routine(9-5) desk jobs. These users indulge in some light_activity to maintain their health but they definitely need some motivation to increase their activity levels to reap maximum health benefits.
2.These users also have a sleepless phase of 20-30 minutes in bed before they are finally able sleep.
3.They remain active throughout the workdays(Mon-Thurs) but considers relaxing on the weekends.
Recommendations for the data
1.The sample size of the data is too small, it needs to be expanded to draw any strong conclusions.
2.The number of participants should be equal for all the parameters, lack of it presented many difficulties while analysis. Some parameters have really few participants(weight logs and heart rate per second) which makes them unsuitable for analysis.
3.The data for the of values for VeryActiveDistance, ModeratelyActiveDistance and LightActiveDistance needs to be reassesed as their sum does not add upto the total distance for daily activity, this renders any analysis that can be performed on sum of these parameters as inaccurate.
Also, some information on the demographics of the users such as gender, age, and height , would provide deep insights for developing strategies, keeping in mind Bellabeat's vision of products curation especially for women.
Recommendations for the app Along with tracking the active movements, Bellabeat can have enhanced features such as a water log so that users can track their hydration status and maintain their overall health.
The app can also include sublte features of notifications or alarms for sleep schedules or going early to bed to enhance the user's overall experience.
Lastly, the users have a number of days when no activity has been logged. One possibility is that the app did not register other forms of physical activity like biking, swimming, or playing a sport, muscle strengthning exercises etc. The app should include metrics to register these activities as well to provide a wholesome user experience.

-> Act Phase : Future marketting startegy

Most users struggle to remain highly active thrughout the day(Overall, the duration of "light active minutes" is much higher than "very active minutes"). This can be used to market some high intensity, short duration workouts so that the customers reap more health benefits.
There were very few users who logged in their weight details every day because it is a manual task. Bellabeat can utilize this information to promote features like weight log notifications and even daily alarms, reminding the users about the scheduled time for their daily physical activity.
Since the app generates a lot of health data, this information can be leveraged to develop and sell personalised goals and activity suggestions.
Finally, Many Participants experience a sleepless period of about 20-30 minutes in bed, Bellabeat can develop and promote sleep assisting features such as sleep inducing music, sleep journals, etc.
Final note The analysis strategies have been seen from some data analyst's methods and tried in those ways. Thanks for going through the steps.

0

0

0

Vivek
Software Developer | Proficient in Java | Web3 Enthusiast

More Articles

Showwcase is a professional tech network with over 0 users from over 150 countries. We assist tech professionals in showcasing their unique skills through dedicated profiles and connect them with top global companies for career opportunities.

© Copyright 2025. Showcase Creators Inc. All rights reserved.