What is the weighted round-robin load balancing technique?

Overview

The purpose of load balancers is to improve the performance of applications and decrease the burden by "efficiently"Efficiency depends on server selection. distributing the incoming traffic across a group of servers. For user-facing applications, this will result in improved response times.

Note: Here, we will mainly talk about application load balancers.

Now, let’s jump right into the details of the weighted round-robin load balancing technique.

About the technique

This technique is similar to the round-robin load balancer. But in the weighted round-robin load balancer, the network administrator assigns a numeric weight to all of the servers behind the load balancer. The weights can be assigned based on factors such as the server’s processing power or total bandwidth.

A server, say ServerA, with the most processing power will be assigned the maximum weight. It will also receive the maximum proportion of incoming requests from the load balancer.
A server, say ServerB, with half the processing capacity compared to ServerA will be assigned a weight that is half of the actual weight of ServerA. Additionally, it will receive the proportion of incoming requests from the load balancer accordingly.
A server, say ServerC, with the lowest specifications will be assigned the lowest weight, and it will receive the minimum proportion of incoming requests from the load balancer.

Note: The weighted round-robin load balancer is a static load balancer, as it does not modify the state of the servers while distributing incoming traffic.

Example

Let’s understand this with the help of an example:

Suppose we have three servers —ServerA, ServerB, ServerC— with weights (5, 2, 1) that are waiting to serve incoming requests behind the load balancer.

The load balancer will forward the first five requests to ServerA, the next two requests to ServerB, and then one request to ServerC.

If any of the other incoming requests arrive, the load balancer will forward those requests back to ServerA again for the next five incoming requests, then ServerB will get its turn, and after that the requests will be forwarded to ServerC. The cycle will continue on this way.

This cycle is shown in the illustration below:

Free Resources

Learn in-demand tech skills in half the time

PRODUCTS

Mock Interview

New

Courses

Skill Paths

Projects

Assessments

TRENDING TOPICS

Learn to Code

Tech Interview Prep

Generative AI

Data Science

Machine Learning

GitHub Students Scholarship

Early Access Courses

Blind 75

Layoffs

Pricing

For Individuals

Try for Free

Gift a Subscription

CONTRIBUTE

Become an Author

Become an Affiliate

Earn Referral Credits

RESOURCES

Blog

Cheatsheets

Webinars

Answers

ABOUT US

Our Team

Careers

Hiring

Frequently Asked Questions

Press

LEGAL

Cookie Policy

Business Terms of Service

Data Processing Agreement

INTERVIEW PREP COURSES

Grokking the Modern System Design Interview

Grokking the Product Architecture Design Interview

Grokking the Coding Interview Patterns

Machine Learning System Design

What is the weighted round-robin load balancing technique?

Overview

About the technique

Example

Algorithmic explanation

Algorithm

Advantages

Limitations

Real time load balancers