Navy 2023 Deep Reinforcement Learning for Collaborative Multi-Robot Systems with Low-Latency Wireless Networking

Deep Reinforcement Learning for Collaborative Multi-Robot Systems with Low-Latency Wireless Networking
Award last edited on: 4/27/2024

Awarding Agency

DOD : Navy

Total Award Amount

$139,942

Award Phase

Solicitation Topic Code

N23B-T031

Principal Investigator

Amitav Mukherjee

Tiami LLC

10041 Wild Orchid Way
Elk Grove, CA 95757

(408) 687-8288

founders@tiaminetworks.com

www.tiaminetworks.com

Research Institution

University of Massachusetts - Boston

Phase I

Contract Number: N68335-23-C-0708
Start Date: 9/13/2023 Completed: 3/11/2024

Phase I year

2023

Phase I Amount

$139,942

In this Phase I effort, Tiami, LLC, aims to develop and demonstrate a hardware proof of concept for a collaborative multi-robot system (MRS) that leverages imitative augmented deep reinforcement learning (IADRL) amongst heterogeneous uncrewed systems (robots) to achieve a common task. Collaboration is based on low-latency machine-to-machine wireless links between robots that use both RF and optical wireless communications (OWC) for multimode resiliency. To accomplish this task, the Phase I personnel include a principal engineer from Tiami LLC with 12 years of experience in RF systems design and 124+ patents in low-latency 4G/5G wireless systems, and a subaward to the University of Massachusetts Boston to leverage Dr. Annavajjalas and Prof. Michael Rahaims expertise in optical wireless communications research and lab facilities for rapid software-defined radio (SDR) prototyping. A letter of support is provided from Lockheed Martin Missile Fire Control and Lockheed Martin Space, an anticipated Phase II/Phase III subcontractor to transition our technology to military and commercial users. The system objective is to track the desired target of interest while the adversary MRS attempts to disrupt the ally MRS cyber topology (i.e., communications and intelligence). Strategic motion planning will be implemented with the IADRL methodology, while real-time mission updates are intelligently distributed amongst nodes using a combination of RF and OWC. For strategic motion planning, our MRS will use a single node as a mission leader to aggregate information, implement the IADRL model, and distribute motion plans. The remaining agents continuously transmit their positional information (longitude, latitude, altitude/height, velocity, roll, pitch, yaw/heading, angular rates, acceleration) and status to the mission leader node. Each node acquires absolute positioning, navigation, and timing (PNT) information from an alternative signal-of-opportunity such as commercial LEO systems (Starlink) if GPS is denied. The system will adapt to network segmentation (or any loss of a mission leader) by dynamic reselection of the mission leader(s). For the communications component of our solution, individual nodes will actively decide to provide information on the RF or OWC network (or some combination thereof) for each round of sensor data aggregation. The M3P network utilizes post-quantum cryptographic encryption (e.g., CRYSTALS-DILITHIUM) for security. Traffic routing decisions will be based on channel state information for RF/OWC links and statistical characterization of each networks reliability (i.e., likelihood of connecting to the mission leader in a desired timeframe).

Phase II

Contract Number: ----------
Start Date: 00/00/00 Completed: 00/00/00

Phase II year

----

Phase II Amount

----

SBIR-STTR Award

Deep Reinforcement Learning for Collaborative Multi-Robot Systems with Low-Latency Wireless Networking
Award last edited on: 4/27/2024

Sponsored Program

Awarding Agency

Total Award Amount

Award Phase

Solicitation Topic Code

Principal Investigator

Company Information

Tiami LLC

Research Institution

Phase I

Phase I year

Phase I Amount

Phase II

Phase II year

Phase II Amount

New To Inknowvation.com?

SBIR-STTR Award

Deep Reinforcement Learning for Collaborative Multi-Robot Systems with Low-Latency Wireless NetworkingAward last edited on: 4/27/2024

Sponsored Program

Awarding Agency

Total Award Amount

Award Phase

Solicitation Topic Code

Principal Investigator

Company Information

Tiami LLC

Research Institution

Phase I

Phase I year

Phase I Amount

Phase II

Phase II year

Phase II Amount

Deep Reinforcement Learning for Collaborative Multi-Robot Systems with Low-Latency Wireless Networking
Award last edited on: 4/27/2024