Back to Portfolio

Walmart Black Friday Sales Analysis

Walmart
Business Analytics & Statistical Analysis
Jul 2024 - Aug 2024
Data Analyst/Data Scientist

Project Overview

Conducted comprehensive statistical analysis of Walmart's Black Friday customer purchase behavior to determine if spending habits differ between male and female customers. Performed advanced statistical testing including hypothesis testing, confidence intervals, and Central Limit Theorem simulations to provide data-driven insights for business decision-making. Developed an interactive Streamlit dashboard with comprehensive visualizations and business recommendations to support strategic marketing and inventory planning.

Quantitative Research

Conducted comprehensive quantitative analysis using advanced statistical methods including hypothesis testing, confidence intervals, and Central Limit Theorem simulations to uncover patterns in customer spending behavior. The research involved extensive data preprocessing, exploratory data analysis, and statistical modeling to deliver actionable insights for business optimization.

Key Analysis Areas

Data Quality Analysis: Ensured data integrity and consistency for reliable statistical analysis

Gender Analysis: Investigated spending patterns differences between male and female customers

Age Group Analysis: Identified key demographic segments with highest spending potential

City Category Analysis: Explored regional variations in Black Friday shopping behavior

Occupation Analysis: Provided insights for targeted marketing strategies

Statistical Analysis: Applied rigorous statistical methods for reliable conclusions

Business Recommendations: Developed data-driven strategies for business optimization

Detailed Tasks

• Collected and preprocessed Walmart Black Friday transactional data to ensure data quality and consistency for statistical analysis

• Conducted comprehensive exploratory data analysis (EDA) using Python and statistical libraries to uncover key trends in customer spending behavior

• Performed hypothesis testing to determine if there are statistically significant differences in spending between male and female customers

• Implemented confidence interval analysis to provide reliable estimates of customer spending patterns with different confidence levels

• Conducted Central Limit Theorem simulations to demonstrate the impact of sample size on statistical precision and reliability

• Analyzed customer segments by age, occupation, city category, and marital status to identify high-value customer groups

• Developed an interactive Streamlit dashboard with comprehensive visualizations and statistical analysis tools for stakeholder engagement

• Created data-driven business recommendations for targeted marketing, inventory optimization, and revenue enhancement strategies

• Applied statistical rigor to business problems, translating complex analytical findings into actionable insights for non-technical stakeholders

Key Findings

• Statistical analysis revealed significant differences in spending patterns between male and female customers

• Central Limit Theorem simulations demonstrated the importance of sample size for reliable statistical inference

• Age group analysis identified key demographic segments with highest spending potential

• Geographic analysis showed regional variations in Black Friday shopping behavior

• Occupation-based analysis provided insights for targeted marketing strategies

Business Impact

• Provided statistical evidence for gender-based marketing strategies with confidence intervals

• Identified high-value customer segments for targeted marketing campaigns

• Developed data-driven recommendations for inventory optimization

• Created interactive dashboard for real-time business intelligence

• Established statistical framework for future customer behavior analysis

Skills Used

Data Analysis Python Streamlit Statistical Analysis Hypothesis Testing Confidence Intervals Central Limit Theorem Exploratory Data Analysis Data Preprocessing Customer Segmentation Business Intelligence Data Visualization

Tech Stack

Programming & Tools
Python Streamlit Pandas NumPy Plotly SciPy Matplotlib Seaborn
Statistical Analysis
Hypothesis Testing Confidence Intervals Central Limit Theorem T-Test Analysis Descriptive Statistics Statistical Significance Testing Sample Size Analysis
Data Analysis
Data Preprocessing Exploratory Data Analysis Customer Segmentation Gender Analysis Age Group Analysis Geographic Analysis Occupation Analysis
Business Intelligence
Interactive Dashboards Data Visualization Business Recommendations Marketing Strategy Revenue Optimization Inventory Management Customer Targeting