Phase II Amount
$1,600,000
Congestion control is at the heart of communication networks since the early days of the Internet. The primary goal of congestion control is to avoid congestion overload on the network while effectively utilizing the available transmission capacity for reliable end-to-end (e2e) transport. Congestion control in todays Internet is overwhelmingly based on Transmission Control Protocol (TCP) at the transport layer of the TCP/IP protocol stack. Many versions of TCP congestion control/avoidance methods have been researched and implemented over the years to achieve optimized performance. Although many of these versions served well in many cases, the emergence of new Internet applications (e.g., large-scale data transfer, replication, backup, real-time AR/VR communications, etc.) as well as very-high-speed communication (e.g., optical) links require innovative ideas to further enhance the transport-layer protocol. Many transport protocols try to optimize congestion-control algorithms by operating the congestion window in a range which maximizes inflight data. Some of these algorithms are based on loss rate, some are based on round-trip delays, and a recent one from Google is congestion based. Various studies show the shortcomings of these approaches in different scenarios. Thus, transport-layer congestion control needs an intelligent and dynamic evolution to support next-generation, high-throughput applications over e2e network paths with a wide range of network conditions (e.g., various packet-loss rates and/or Round-Trip Times (RTTs)). [Note that these e2e paths can include not only high-speed network links (with low or negligible loss rates) but also low-quality (i.e., lossy) access links.] Also, innovations on transport layer should be related to the TCP/IP stack, so that new solutions are backward-compatible. Accordingly, Ennetix is developing an experimental transport protocol, called Intelligent Transport Proto- col (ITP), which employs multiple learn-and-infer techniques (based on Machine Learning (ML)) in congestion control and network coding for forward-error correction at the transport layer (as traditional acknowledgement- based backward-error correction is too slow for high-throughput applications). ITP operates the congestion window at optimal level by estimating and inferring parameters based on historical data using ML techniques. ITP reduces (i.e., masks over) e2e loss rate (even if it is high) at transport stack by intelligently employing (a) network coding and (b) buffer management by predicting parameters based on estimated network conditions, thereby leading to a more linear response to changes in loss rate and RTT. During Phase I of this SBIR project, requirements analysis and design of the ITP platform architecture were conducted, a working prototype was developed, and evaluation studies have been performed to determine ITPs performance and feasibility to support the ultra-fast transport requirements of the next generation of Internet applications. These feasibility and performance evaluation studies have been accomplished over live networks at Google Cloud Platform (GCP) and Ennetix office. Outcomes of the Phase I R&D efforts and evaluation studies have confirmed the viability of ITP as a commercial-grade transport-protocol platform. In this Phase II project (as a continuation of Phase I), the goal is to significantly expand ITP with advanced performance models and forecasting methods in congestion-control algorithms, scalable and intel- ligent network coding, and programmable interfaces for extensibility. A commercial-grade ITP solution will be developed, with which network operators can build networking infrastructures for the next-generation Internet. Early field trials will demonstrate the functionalities and performance of ITP over live networks and pave the way for successful market entry and deployment on premier R&E networks such as ESnet. ITP will greatly benefit network users at DOE and other government organizations through an innova- tive transport protocol which will provide much higher throughput in todays cloud-based, dynamic, and distributed environments with various RTTs and loss rates. The wider benefits of this effort will extend well beyond the immediate DOE scientific community, and on to common Internet users, other enterprises, and service providers. In particular, many commercial cloud-service providers and enterprises can leverage ITP to implement new use cases and support the next generations of Internet applications.