...

The Estimation of Response Time of an API

Learn to estimate response time of an API generally, as well as for different data sizes in an API message.

Please remember that the response time of an API is calculated using the following equation:

Response time = Latency + Processing time

If we consider the processing time to be 4 ms, as estimated earlier, and the latency of a GET request to be 331.42 ms, as calculated earlier, the total response time of the GET request will be as follows:

Response time = 331.42 ms + 4 ms = 335.42 ms
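As a quick sanity check, the calculation above can be sketched in Python (the numbers are the ones quoted in the text):

```python
# Estimate the total response time of a GET request:
# response time = latency + processing time.
latency_ms = 331.42   # latency of the GET request, from the earlier calculation
processing_ms = 4     # estimated server processing time

response_time_ms = latency_ms + processing_ms
print(f"Total response time: {response_time_ms:.2f} ms")  # 335.42 ms
```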

Point to Ponder

1.

Let’s assume someone asks a team to build a WWW search engine. That team can run simple experiments to determine how much time a search API call takes on the Google server. Say it’s a few milliseconds. Now that we have a ballpark target number for a new product to compete with Google Search, why do we use a complex method to calculate the response time, as we’ve explained in this chapter?


Now, let’s estimate the response time of APIs for varying data sizes in the subsequent sections.

Motivation

In this section, we’ll perform some trials using the Postman tool. The goal is to formulate a method to calculate the average response time per kilobyte of a message. Once we have the average response time per KB, we can estimate the plausibility of our API designs in terms of latency. We know that it isn’t possible to calculate an exact average response time per KB; the time varies due to different factors involved in the request and response. These factors include the base time, the transfer start time (the sum of RTT and processing time), and the download time of a request. Network congestion, data transfer technologies, and service provider variations can all affect these factors. Moreover, we’ll use cold-start requests for every trial, so the base time isn’t reduced by caching and the server has to fetch all the information again.

Using our calculations, we’ll obtain a range of response times that can be used in our design problems. However, the estimated numbers can vary with changes in the factors mentioned earlier, mainly due to load conditions on the network or the back-end services. As a reminder, we calculate latency as follows:

Latency = Base time + Transfer start time + Download time
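This latency breakdown, summing the components listed above (base time, transfer start time, and download time), can be sketched as a small helper. The component values here are hypothetical placeholders, not measurements from this lesson:

```python
# Latency of a request as the sum of its timing components:
# base time (connection setup), transfer start time (RTT + processing),
# and download time. All values are in milliseconds.
def latency_ms(base_ms: float, transfer_start_ms: float, download_ms: float) -> float:
    return base_ms + transfer_start_ms + download_ms

# Hypothetical example values for illustration only:
print(latency_ms(base_ms=50.0, transfer_start_ms=120.0, download_ms=5.0))  # 175.0
```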

Let's look at how we estimate response time.

The GET request response time

For a GET request, the download time is the main factor that changes with varying data sizes, because the average value of RTT_get remains the same (for an optimized bandwidth) regardless of data size. We’ll focus on estimating how the download time is affected per KB. To do that, we repeated trials of a single request to obtain an average download time (a factor affecting the latency of a GET request). We calculated the mean and standard deviation of the download time for varying data sizes, as depicted in the following table:

GET Request Trial Results (each trial value is a download time)

| Response Size | Trial 1 | Trial 2 | Trial 3 | Trial 4 | Trial 5 | Mean with SD |
|---|---|---|---|---|---|---|
| 1.3 KB | 2.44 ms | 3.46 ms | 3.39 ms | 3.02 ms | 2.56 ms | 2.97 ± 0.47 ms |
| 2.5 KB | 5.15 ms | 4.65 ms | 2.45 ms | 2.51 ms | 3.19 ms | 3.59 ± 1.24 ms |
| 28 KB | 6.27 ms | 4.19 ms | 2.57 ms | 7.78 ms | 4.53 ms | 5.06 ± 2.01 ms |
| 155 KB | 60.63 ms | 67.14 ms | 61.6 ms | 64.82 ms | 57.11 ms | 62.26 ± 3.87 ms |

The following graph indicates how the download time changes with the increased size of the response:

The changes in the download time for GET request with varying response size
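The means and standard deviations in the table, along with a rough download time per KB, can be reproduced with a short script. The trial values are copied from the table; `statistics.stdev` computes the sample standard deviation, so the printed figures may differ from the table by a rounding step:

```python
import statistics

# Trial download times (ms) from the table, keyed by response size in KB.
trials = {
    1.3: [2.44, 3.46, 3.39, 3.02, 2.56],
    2.5: [5.15, 4.65, 2.45, 2.51, 3.19],
    28:  [6.27, 4.19, 2.57, 7.78, 4.53],
    155: [60.63, 67.14, 61.6, 64.82, 57.11],
}

for size_kb, times in trials.items():
    mean = statistics.mean(times)
    sd = statistics.stdev(times)  # sample standard deviation
    print(f"{size_kb} KB: {mean:.2f} ± {sd:.2f} ms "
          f"({mean / size_kb:.2f} ms per KB)")
```

For the largest payload, this gives roughly 62.26 ms / 155 KB ≈ 0.40 ms per KB, which is the kind of per-KB figure the next step derives.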

It is important to mention that to derive a practical number for a GET request, we have to find the download time per KB, which will be used as a standard number to find the time ...