Haggis Hopper Taxi Business Insights

Organization: Kin Karta Consulting

Location: United Kingdom

Project Type: Academic (Strathclyde Business School)

Duration: Mar 2024 – Apr 2024

Role: Data Analyst/Data Scientist

Project Description

Delivered data-driven insights to optimize taxi fleet operations using advanced analytics and machine learning. Performed data preprocessing and exploratory data analysis in Python, SQL, and Tableau to uncover trip patterns and customer behaviour, and to identify geographic hotspots for fleet allocation. Developed time series forecasting models to predict demand and identify its key drivers, supporting resource optimization and continuous process improvement. Built visualisations of trip patterns, demand forecasts, fleet utilization, and other metrics for stakeholders.

Quantitative Research

Conducted quantitative analysis using statistical methods and machine learning techniques to uncover patterns in taxi operations and customer behaviour and to forecast demand. The research involved data preprocessing, exploratory data analysis, and predictive modeling to deliver actionable insights for business optimization.

Geospatial Analysis

Demand Patterns

Hourly Analysis of demand in pickup locations
Hourly Analysis of demand by dropoff locations
Trip counts between postcode areas
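As an illustration of the kind of aggregation behind these views, the sketch below counts trips per pickup area and hour, and builds a pickup-to-dropoff count matrix with pandas. The column names (pickup_time, pickup_postcode, dropoff_postcode) and the sample rows are assumptions made for this example, not the project's actual schema or data.

```python
import pandas as pd

# Hypothetical trip records; column names and values are assumptions.
trips = pd.DataFrame({
    "pickup_time": pd.to_datetime([
        "2024-03-01 08:05", "2024-03-01 08:40", "2024-03-01 09:10",
        "2024-03-01 08:15", "2024-03-01 17:30",
    ]),
    "pickup_postcode": ["G1", "G1", "G1", "G2", "G2"],
    "dropoff_postcode": ["G2", "G3", "G2", "G1", "G1"],
})

# Trips per pickup area and hour of day.
hourly_pickups = (
    trips.assign(hour=trips["pickup_time"].dt.hour)
    .groupby(["pickup_postcode", "hour"])
    .size()
    .rename("trips")
    .reset_index()
)

# Origin-destination matrix: trip counts between postcode areas.
od_counts = pd.crosstab(trips["pickup_postcode"], trips["dropoff_postcode"])

print(hourly_pickups)
print(od_counts)
```

The same grouping keyed on the dropoff postcode would give the dropoff view.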

Descriptive Statistics

Postcode Demand Analysis

Demand Analysis

Outlier Analysis

Temporal Analysis

Hourly Variations and Outliers in Key Taxi Metrics: Demand, Distance, Duration, Fare, Tip, and Total Amount
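One common way to flag outliers within each hour is the 1.5 x IQR rule, sketched below with pandas. This is an assumed method shown for illustration; the source does not state which outlier rule the project used, and the column names and fare values are made up.

```python
import pandas as pd

# Hypothetical fares by pickup hour; values are invented for illustration.
trips = pd.DataFrame({
    "hour": [8] * 6 + [17] * 6,
    "fare": [10.0, 11.0, 12.0, 13.0, 14.0, 60.0,
             20.0, 21.0, 22.0, 23.0, 24.0, 25.0],
})

# Per-hour quartiles, broadcast back to every row of that hour.
fares_by_hour = trips.groupby("hour")["fare"]
q1 = fares_by_hour.transform(lambda s: s.quantile(0.25))
q3 = fares_by_hour.transform(lambda s: s.quantile(0.75))
iqr = q3 - q1

# Flag values outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR] within their own hour.
k = 1.5  # conventional multiplier; an assumption, tune as needed
trips["is_outlier"] = (trips["fare"] < q1 - k * iqr) | (trips["fare"] > q3 + k * iqr)

print(trips[trips["is_outlier"]])
```

The same pattern applies to the other metrics (distance, duration, tip, total amount) by swapping the column.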

Revenue & Fare Analysis

Revenue Analysis
Fare Analysis
Revenue per KM
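Revenue per KM can be computed as total revenue divided by total distance within each group. The sketch below does this per hour with pandas; the column names (distance_km, total_amount) and the figures are assumptions for illustration only.

```python
import pandas as pd

# Hypothetical trips; column names and values are assumptions.
trips = pd.DataFrame({
    "hour": [8, 8, 17, 17],
    "distance_km": [2.0, 3.0, 4.0, 6.0],
    "total_amount": [8.0, 12.0, 10.0, 20.0],
})

# Sum revenue and distance per hour, then take the ratio of the sums.
hourly = trips.groupby("hour")[["total_amount", "distance_km"]].sum()
hourly["revenue_per_km"] = hourly["total_amount"] / hourly["distance_km"]

print(hourly)
```

Taking the ratio of sums, rather than averaging per-trip ratios, keeps very short trips from dominating the figure.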

Trip Duration Analysis

Duration Analysis
Average Trip Duration

Clustering Analysis

Hour-Ahead Demand Forecasting

Business Insights

Detailed Tasks

Core Skills

Data Analysis
Python
SQL
Excel
Tableau
Geospatial Analysis
K-Means Clustering
LSTM Models
Holt-Winters Models
Forecasting
Exploratory Data Analysis
Data Preprocessing
Correlation Analysis
Python Libraries (e.g., Pandas, NumPy, Matplotlib, Seaborn)

Tech Stack

Programming & Tools

  • Python
  • SQL
  • Excel
  • Tableau
  • Python Libraries (e.g., Pandas, NumPy, Matplotlib, Seaborn)

Quantitative Research

  • Descriptive Statistics
  • Exploratory Data Analysis
  • Feature Engineering
  • Outlier Analysis
  • Correlation Analysis
  • Heatmaps
  • Time Series Analysis
  • Statistical Analysis
  • Hypothesis Testing

Data Analysis

  • Data Preprocessing
  • Exploratory Data Analysis
  • Geospatial Analysis
  • Revenue & Fare Analysis
  • Trip Duration Analysis
  • Time Series Analysis
  • Correlation Analysis and Heatmaps

Machine Learning

  • K-Means Clustering
  • LSTM Models
  • Holt-Winters Models
  • Forecasting
  • Clustering
  • Regression

Business Impact

Expected Project Outcome