Back to Portfolio

Haggis Hopper Taxi Business Insights

Kin Karta Consulting
United Kingdom
Academic Project (Strathclyde Business School)
Data Analyst/Data Scientist
View on GitHub View Analysis Dashboard Interactive Tableau Dashboard

Introduction

The taxi industry plays a crucial role in urban mobility, serving residents, tourists, and business travellers alike. In a vibrant city like Glasgow, known for its rich history, cultural heritage, and bustling urban life, taxi companies play a pivotal role. They are at the forefront of addressing the complex challenges associated with urban transportation, striving to provide reliable, efficient, and sustainable services amidst the city's dynamic and competitive mobility market.

Glasgow's taxi industry operates in an environment where demand can fluctuate significantly due to various factors, including seasonal tourism, local events, weather conditions, and changing travel patterns. These fluctuations present a complex challenge for taxi companies, requiring them to anticipate changes in demand and adjust their operations accordingly to maintain service quality and efficiency.

This case study is designed as a response to these challenges, focusing on a data-driven approach to enhance the operational framework of Haggis Hoppers - one of Glasgow's taxi services companies. To address these obstacles, the company seeks innovative solutions that leverage historical data, including taxi demand patterns, trip logs, weather conditions, event schedules, and traffic data. The goal is to employ advanced time series analysis and predictive modelling techniques to forecast demand, identify operational inefficiencies, and recommend actionable strategies for dynamic fleet allocation, maintenance scheduling, and pricing. This initiative is not only about maintaining profitability but also about increasing customer satisfaction, ensuring operational efficiency, and contributing to the city's sustainability goals.

Overview of the current operations

Haggis Hoppers, in its journey from a modest family-run startup to a notable name in Glasgow's taxi service sector, has enjoyed a period of natural growth, echoing the vibrant energy of the city itself. Today, the company operates a fleet of 50 diverse vehicles, including cutting-edge eco-friendly hybrids and electric models, reflecting its steadfast commitment to sustainability and reducing environmental impact. This growth in fleet size is complemented by an expanded workforce, now comprising over 120 dedicated employees, including 80 drivers known for their professionalism and passion for customer service, alongside operations, maintenance staff, and a burgeoning customer support team.

At the heart of Haggis Hoppers' day-to-day operations is its state-of-the-art dispatch and operational centre. Here, the technology underpins the management of bookings, fleet allocation, and customer interactions, ensuring efficiency and responsiveness. The company prides itself on its fleet management practices, with a rigorous maintenance regime and a dynamic system for fleet allocation that responds adeptly to fluctuating traffic and weather conditions, thereby optimising service delivery.

Despite these advances and the company's evident success, with revenues climbing steadily each year, Haggis Hoppers has identified a critical juncture in its growth trajectory. The realisation dawned that while the business was expanding, its operations were not optimised to the degree necessary for sustainable scaling. This insight marked a strategic pivot towards harnessing data for operational insights and efficiency gains.

Acknowledging the power of data analytics in unlocking operational efficiencies, Haggis Hoppers has recently inaugurated a data team. This small but ambitious unit is tasked with the collection and analysis of various data streams, including trip logs, demand patterns, weather impacts, and traffic flows. However, given the team's nascent stage and the vast potential of data-driven optimisation yet to be tapped, Haggis Hoppers finds itself at a crossroads, seeking external expertise to leverage these insights fully. This is where the decision to engage with business analysts comes into play. Haggis Hoppers recognises the opportunity to invite fresh perspectives and analytical skills to tackle its operational challenges.

With this pivot towards a data-centric operational model, Haggis Hoppers is not merely addressing immediate inefficiencies but is laying the groundwork for a future where data-driven decisions propel the company towards greater profitability, customer satisfaction, and environmental stewardship. This strategic shift underscores Haggis Hoppers' journey from a family-owned business to a pioneering, sustainable mobility provider in Glasgow, ready to redefine urban transport in an increasingly data-driven world.

Problem Statement

Haggis Hoppers, amidst its journey of growth and expansion, has recognised the need to refine its operations through data-driven insights. With a commitment to innovation and efficiency, the company wants to leverage its repository of trip log data to unlock operational efficiencies, optimising service delivery, and ultimately enhancing customer satisfaction.

Expected Outcome

Requirement is a comprehensive analysis of historical trip logs and its time series analysis of demand:

Approach

Exploratory Data Analysis (EDA)

The foundational step in Haggis Hoppers' data-driven journey is to undertake a comprehensive Exploratory Data Analysis (EDA) of its trip log data. This initial phase is crucial for several reasons:

• Understanding Data Characteristics: EDA will allow Haggis Hoppers to gain a deep understanding of the underlying patterns, anomalies, trends, and correlations within its data. This includes analysing trip frequencies, duration, peak times, geographic hotspots of demand, and other relevant metrics.

• Identifying Operational Insights: Through EDA, the company aims to uncover insights that could lead to immediate operational improvements, such as identifying under-served areas, times of day with mismatched supply and demand, and inefficiencies in current fleet allocation practices.

• Data Quality Assessment: This phase will also serve to evaluate the quality of the collected data, identifying any inconsistencies, missing values, or outliers that may impact further analysis.

Quantitative Research and Key Analysis Areas

Conducted comprehensive quantitative analysis using advanced statistical methods and machine learning techniques to uncover patterns in taxi operations, customer behavior, and demand forecasting. The research involved extensive data preprocessing, exploratory data analysis, and predictive modeling to deliver actionable insights for business optimization.

• Geospatial Analysis

• Demand Patterns

• Descriptive Statistics

• Postcode Demand Analysis

• Demand Analysis

• Outlier Analysis

• Temporal Analysis

• Hourly Variations and Outliers in Key Taxi Metrics: Demand, Distance, Duration, Fare, Tip, and Total Amount

• Revenue & Fare Analysis

• Trip Duration Analysis

• Clustering Analysis

• Hour-Ahead Demand Forecasting

• Business Insights

Detailed Tasks

• Collected, integrated, and cleaned transport datasets from multiple sources to ensure consistency and reliability for downstream analytics.

• Conducted exploratory data analysis (EDA) using Python and SQL to uncover key trends and trip patterns, supporting business planning and operational decision-making.

• Performed geospatial analysis to identify high-demand hotspots and optimize location-based services.

• Applied clustering techniques for customer segmentation, enabling targeted strategies for user engagement and service delivery.

• Designed and implemented interactive Tableau dashboards to visualize fleet utilization, demand trends, and key operational KPIs, facilitating real-time monitoring by cross-functional teams.

• Built and deployed advanced time series forecasting models (e.g., LSTM, SARIMA) to predict customer demand, optimize resource allocation, and enable automated forecasting workflows.

• Conducted statistical analyses (e.g., correlation analysis, hypothesis testing) to identify drivers of demand fluctuations and operational inefficiencies, delivering actionable insights to stakeholders.

• Translated complex analytical findings into clear visual narratives and recommendations, enhancing communication with technical and non-technical stakeholders.

Key Findings

Showcases the discovered patterns, insights, and recommendations - covers the time series analysis results and propose data-driven strategies for optimising fleet allocation, improving customer service, and enhancing overall operational efficiency.

Core Skills

Data Analysis Geospatial Analysis Statistical Analysis

Programming & Tools

Python and ML libraries SQL Excel Tableau

Quantitative Research

Descriptive Statistics Feature Engineering Outlier Analysis Correlation Analysis Heatmaps Time Series Analysis Statistical Analysis Hypothesis Testing

Machine Learning

K-Means Clustering LSTM Models Holts Winter Models Forecasting Clustering Regression

Time Series Analysis of Demand

The heart of this project is to perform a rigorous time series analysis on the demand data captured in the historical trip logs. This analysis is expected to reveal demand fluctuations over time, identify predictable patterns related to time of day, week, or season, and potentially uncover correlations with external factors such as weather or local events. The ultimate goal is to utilise these insights to develop a predictive model for demand forecasting, enabling Haggis Hoppers to anticipate and efficiently meet customer needs with precision.

Key Tasks

• Comprehensive time series analysis revealing demand fluctuations and patterns

• Identification of predictable patterns related to time of day, week, and seasonal variations

• Analysis of correlations with external factors (weather, local events)

• Development of predictive models for demand forecasting

Key Findings

Showcases the discovered patterns, insights, and recommendations - covers the time series analysis results and propose data-driven strategies for optimising fleet allocation, improving customer service, and enhancing overall operational efficiency.

Time Series Skills

Time Series Analysis Seasonal Decomposition Trend Analysis ARIMA Models Autocorrelation Pattern Recognition Cyclical Analysis Demand Forecasting

Demand Prediction Model

Building upon the insights gained from EDA, Haggis Hoppers aims to develop a predictive model for demand forecasting. The objective here is twofold:

• Hour Ahead Prediction: The company seeks to create a model that can accurately predict taxi demand on an hour-ahead basis. Such a model would enable dynamic fleet allocation by anticipating demand surges or lulls, thereby optimising fleet utilisation and reducing customer wait times.

• Operational Efficiency: By predicting demand more accurately, Haggis Hoppers can ensure a better match between supply and demand, enhancing operational efficiency, and potentially increasing profitability through optimised resource allocation.

Outliers

Conducted comprehensive quantitative analysis using advanced statistical methods and machine learning techniques to uncover patterns in taxi operations, customer behavior, and demand forecasting. The research involved extensive data preprocessing, exploratory data analysis, and predictive modeling to deliver actionable insights for business optimization.

Detailed Tasks

• Built and deployed advanced time series forecasting models (e.g., LSTM, SARIMA) to predict customer demand, optimize resource allocation, and enable automated forecasting workflows.

Comparison of Prediction models

Core Skills

Data Analysis Data Preprocessing Forecasting Model Accuracy

Programming & Tools

Python SQL Excel Tableau

Quantitative Research

Feature Engineering Outlier Analysis

Machine Learning

Clustering LSTM Models Holts Winter Models Forecasting Regression

Open to Innovation: Beyond Predictive Modelling

While demand prediction stands out as a primary objective, Haggis Hoppers is interested in exploring additional avenues for leveraging its data to improve performance. To this end, the company is open to a range of analytical methodologies and innovative approaches:

• What-If Analysis: Haggis Hoppers is eager to explore what-if scenarios that simulate the impact of various operational changes or external factors (such as weather conditions or major events) on demand and service delivery. This could help in strategic planning and contingency management.

• Operational Enhancements: Beyond demand forecasting, the company invites business analysts to propose data-driven strategies for other aspects of its operations. This could include pricing strategies, customer loyalty programs, driver incentive schemes, or environmental sustainability initiatives.

• Engaging with Innovation: Haggis Hoppers is open to any cutting-edge methodologies, tools, or technologies that business analysts can offer. Whether through machine learning models, geospatial analysis, or real-time data analytics, the company is prepared to invest in ideas that promise to redefine its operational framework and contribute to its growth and efficiency.

Innovation Skills

What-If Analysis Scenario Planning Machine Learning Geospatial Analysis Real-time Analytics Strategic Planning Pricing Strategy Customer Analytics

Data Sources

Historical Trip Logs: The historical trip logs dataset for February 2024 provides a comprehensive snapshot of the operations of Haggis Hoppers, reflecting a typical month of activity for the company. The dataset is a sample of all trips taken in February this year. As confirmed by the Haggis Hoppers data team, the dataset is representative of the entire population of customers choosing the company for their travel needs and encompasses a wide range of variables that capture the nuances of each trip. In the table below there is an overview of the fields included in the dataset.

Other Data Sources: To complement the historical trip logs and enhance the data-driven approach for optimising Haggis Hoppers' operations, business analysts can leverage a variety of additional data sources. These sources can provide external context, enriching the insights drawn from the company's internal data. Here's a rundown of other valuable data sources and the potential insights they might offer.

• Weather Data: Historical and real-time weather data can include information on temperature, precipitation, wind speed, and other weather conditions. By correlating weather conditions with trip data, analysts can identify patterns in demand related to weather changes. For instance, demand might spike on rainy days or drop during extreme weather events. This insight can inform fleet allocation strategies to meet weather-related changes in demand.

• Traffic and Road Closure Data: Real-time and historical traffic data, along with scheduled road closures and construction updates from city transport departments. This data can help in understanding how traffic congestion and road closures affect trip duration and fleet efficiency. Analysts can recommend route optimisation strategies to drivers, potentially reducing fuel consumption and improving customer satisfaction through shorter travel times.

• Event Calendar: Schedules and calendars of local events, including concerts, sports games, festivals, and conventions. Large events can significantly impact local transportation demand. Analysing event schedules alongside trip data can help predict surges in specific areas, allowing for proactive fleet allocation and targeted marketing efforts.

Haggis Hoppers is open and enthusiastic about business analysts utilising any external data sources they find interesting and believe can contribute valuable insights to their analysis. The company encourages analysts to think creatively and explore a wide range of datasets that could enrich their understanding of our operations and the factors influencing demand for our services. Whether it's innovative use of social media analytics, untapped government data sets, or emerging trends in urban mobility, the company is keen to see how diverse and novel data sources can be leveraged to enhance the data-driven decision-making process.

Data Management Skills

Data Collection Data Integration Data Preprocessing Data Cleaning Data Quality Assessment ETL Processes Database Management Data Validation