Problem Statement

Imagine you work for "Prasad Tech in Telugu," a popular tech news and reviews website serving the Telugu-speaking tech community. The engineering team, based in Hi-Tech City, reports an average website load time of 2.5 seconds. However, during the recent Ugadi festival, when many users were checking for special tech deals and discounts, complaints about slowness increased significantly, especially from users in smaller towns such as Karimnagar and Nellore. On investigation, you discover that the data includes load times that spiked to 30-40 seconds during the power fluctuations that accompany summer thunderstorms in the region, while most load times fall between 1 and 3 seconds for users browsing from Hyderabad, Vijayawada, and other major cities.

1

Misleading Average Load Time

EASY

How might the reported average load time of 2.5 seconds be misleading when presented to the Prasad Tech in Telugu management team at their Jubilee Hills headquarters, particularly for understanding the experience of users in different parts of Telangana and Andhra Pradesh?
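As a starting point for this question, here is a minimal sketch of how a handful of spikes can sit behind an average that looks healthy. The sample values are hypothetical and invented for illustration; they are not taken from the scenario's actual data.

```python
import statistics

# Hypothetical load times in seconds, invented for illustration only.
# 96 ordinary page loads in the 1-3 s range plus 2 storm-related spikes.
typical_loads = [1.0, 1.5, 2.0, 2.5] * 24
spike_loads = [30.0, 40.0]
load_times = typical_loads + spike_loads

mean_load = statistics.mean(load_times)
median_load = statistics.median(load_times)
worst_load = max(load_times)

print(f"mean   = {mean_load:.2f} s")    # 2.43 s
print(f"median = {median_load:.2f} s")  # 2.00 s
print(f"max    = {worst_load:.2f} s")   # 40.00 s
```

In this made-up sample the mean lands close to the reported 2.5 seconds, yet it is higher than what the typical user sees and gives no hint that some users waited 30-40 seconds.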

2

Better Performance Measures

MODERATE

Which alternative measure of central tendency and which measure of dispersion would you recommend to better represent the typical user experience for Telugu tech enthusiasts and the variability across ISPs popular in the region (such as ACT Fibernet, BSNL, and Jio)? Justify your choices, considering the diverse internet infrastructure across urban and rural Telugu-speaking areas.
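One possible way to explore this question is to compute a robust center and spread per ISP. The sketch below uses the median, the interquartile range (IQR), and the 95th percentile; these are candidate measures, not the only defensible answer. The helper `summarize_load_times` and the per-ISP samples are hypothetical, and the placeholder names "ISP A" to "ISP C" deliberately avoid attributing invented numbers to real providers.

```python
import statistics

def summarize_load_times(samples):
    """Return median, interquartile range, and 95th percentile in seconds."""
    # method="inclusive" treats the sample as the full population, which
    # keeps every estimate inside the observed min-max range.
    q1, _, q3 = statistics.quantiles(samples, n=4, method="inclusive")
    p95 = statistics.quantiles(samples, n=100, method="inclusive")[94]
    return {"median": statistics.median(samples), "iqr": q3 - q1, "p95": p95}

# Hypothetical per-ISP samples (seconds), invented for illustration only.
samples_by_isp = {
    "ISP A": [1.0, 1.2, 1.4, 1.6, 1.8],
    "ISP B": [1.5, 2.0, 2.5, 3.0, 3.5],
    "ISP C": [2.0, 2.5, 3.0, 3.5, 35.0],
}

for isp, samples in samples_by_isp.items():
    stats = summarize_load_times(samples)
    print(f"{isp}: median={stats['median']:.2f} s, "
          f"IQR={stats['iqr']:.2f} s, p95={stats['p95']:.2f} s")
```

The median and IQR barely move when a single spike is present, while the 95th percentile surfaces it, which is why the two kinds of measure are often reported side by side.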

3

Handling Extreme Outliers

ADVANCED

What are the tradeoffs of excluding the extreme outliers versus including them when reporting performance metrics at the quarterly review meeting in Hyderabad? How might this choice affect decisions about server investments ahead of the Dasara and Diwali shopping seasons, when traffic is expected to spike?
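To make the tradeoff concrete, the following sketch reports the same data both ways. The helper `compare_reports`, the 10-second cutoff, and the sample values are all assumptions made for illustration; a real analysis would need a justified outlier rule and the site's actual measurements.

```python
import statistics

def compare_reports(samples, cutoff_seconds):
    """Summarize load times with and without samples above a cutoff."""
    kept = [t for t in samples if t <= cutoff_seconds]
    excluded = [t for t in samples if t > cutoff_seconds]
    return {
        "mean_all": statistics.mean(samples),
        "mean_without_outliers": statistics.mean(kept),
        "excluded_count": len(excluded),
        "excluded_share": len(excluded) / len(samples),
    }

# Hypothetical data (seconds): 96 ordinary loads plus 2 extreme spikes.
observed = [1.0, 1.5, 2.0, 2.5] * 24 + [30.0, 40.0]

# The 10 s cutoff is an arbitrary assumption, not a recommended threshold.
report = compare_reports(observed, cutoff_seconds=10.0)

print(f"mean, all loads        = {report['mean_all']:.2f} s")
print(f"mean, outliers removed = {report['mean_without_outliers']:.2f} s")
print(f"loads excluded         = {report['excluded_count']} "
      f"({report['excluded_share']:.1%} of traffic)")
```

Reporting the excluded count alongside the trimmed figure keeps the cleaner number honest: management sees the typical experience and also how many users fell outside it.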

 

Nerchuko Academy · Free DS Interview Prep