Delay-aware TDMA Scheduling with Deep Reinforcement Learning in Tactical MANET

Gwangjin Wi, Sunghwa Son, Kyung Joon Park

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

In tactical networks, traffic should be delivered in a timely manner satisfying the quality of service (QoS) requirements for survivability and mission success. In this paper, we propose a centralized TDMA slot scheduling based on deep reinforcement learning (DRL) to guarantee the QoS requirements by minimizing end-to-end delay. We consider situations in which mission criticality of tactical traffic is dynamically changing. We introduce a DRL actor-critic algorithm to find a TDMA scheduling policy to minimize the weighted end-to-end delay which is a new metric reflecting the mission criticality of tactical traffic. The simulation results verify that the proposed scheduling policy can guarantee QoS requirements in tactical networks.

Original languageEnglish
Title of host publicationICTC 2020 - 11th International Conference on ICT Convergence
Subtitle of host publicationData, Network, and AI in the Age of Untact
PublisherIEEE Computer Society
Pages370-372
Number of pages3
ISBN (Electronic)9781728167589
DOIs
StatePublished - 21 Oct 2020
Event11th International Conference on Information and Communication Technology Convergence, ICTC 2020 - Jeju Island, Korea, Republic of
Duration: 21 Oct 202023 Oct 2020

Publication series

NameInternational Conference on ICT Convergence
Volume2020-October
ISSN (Print)2162-1233
ISSN (Electronic)2162-1241

Conference

Conference11th International Conference on Information and Communication Technology Convergence, ICTC 2020
Country/TerritoryKorea, Republic of
CityJeju Island
Period21/10/2023/10/20

Bibliographical note

Publisher Copyright:
© 2020 IEEE.

Keywords

  • TDMA slot scheduling
  • Tactical networks
  • deep reinforcement learning
  • quality of service

Fingerprint

Dive into the research topics of 'Delay-aware TDMA Scheduling with Deep Reinforcement Learning in Tactical MANET'. Together they form a unique fingerprint.

Cite this