CAFT: Congestion-Aware Fault-Tolerant Load Balancing for Three-Tier Clos Data Centers

Sultan Alanazi, Bechir Hamdaoui

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

Production data centers operate under various workload sizes ranging from latency-sensitive mice flows to long-lived elephant flows. However, the predominant load balancing scheme in data center networks, equal-cost multi-path (ECMP), is agnostic to path conditions and performs poorly in asymmetric topologies, resulting in low throughput and high latencies. In this paper, we propose CAFT, a distributed congestion-aware fault-tolerant load balancing protocol for 3-tier data center networks. It first collects, in real time, the complete congestion information of two subsets from the set of all possible paths between any two hosts. Then, the best path congestion information from each subset is carried across the switches, during the Transport Control Protocol (TCP) connection process, to make path selection decision. Having two candidate paths improve the robustness of CAFT to asymmetries caused by link failures. Large-scale ns-3 simulations show that CAFT outperforms Expeditus in mean flow completion time (FCT) and network throughput for both symmetric and asymmetric scenarios.

Original languageEnglish
Title of host publication2020 International Wireless Communications and Mobile Computing, IWCMC 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1746-1751
Number of pages6
ISBN (Electronic)9781728131290
DOIs
Publication statusPublished - Jun 2020
Externally publishedYes
Event16th IEEE International Wireless Communications and Mobile Computing Conference, IWCMC 2020 - Limassol, Cyprus
Duration: 15 Jun 202019 Jun 2020

Publication series

Name2020 International Wireless Communications and Mobile Computing, IWCMC 2020

Conference

Conference16th IEEE International Wireless Communications and Mobile Computing Conference, IWCMC 2020
Country/TerritoryCyprus
CityLimassol
Period15/06/2019/06/20

Keywords

  • Load balancing
  • data center networks
  • distributed routing
  • network congestion

Fingerprint

Dive into the research topics of 'CAFT: Congestion-Aware Fault-Tolerant Load Balancing for Three-Tier Clos Data Centers'. Together they form a unique fingerprint.

Cite this