Problem Statement

Imagine you work as a data analyst for "Karachi Bakery," the iconic Hyderabad bakery chain famous for its fruit biscuits and osmania biscuits, with branches across major locations like Banjara Hills, Jubilee Hills, and Himayatnagar. You are given daily sales data for the past year from their flagship store at Moazzam Jahi Market. The average daily sales are ₹5,00,000 with a standard deviation of ₹50,000. During the analysis, you notice that one day during Ramzan shows sales of ₹50,00,000 - ten times higher than average.

1

Investigating Sales Anomaly

MODERATE

How would you determine if this ₹50,00,000 day is a data entry error, a genuine extraordinary event (like a massive corporate order), or something else? What descriptive statistics or plots would you use to investigate this anomaly that occurred at the historic Moazzam Jahi Market branch?

2

Impact of Outlier on Metrics

MODERATE

What is the impact of this single data point on the mean and standard deviation of Karachi Bakery's sales, particularly when comparing performance between their original Moazzam Jahi Market location and newer outlets in Gachibowli and Madhapur areas frequented by IT professionals?

3

Handling the Outlier: Tradeoffs

ADVANCED

What are the tradeoffs of removing this data point versus keeping it for future sales forecasting or performance reporting to Mr. Ramesh Ramnani, the director of Karachi Bakery? How might this decision affect inventory planning for major festivals like Sankranti and Ramzan when their special boxes featuring Hyderabadi delicacies are in high demand?

 

Nerchuko Academy · Free DS Interview Prep