Runtime-Safety-Guided Policy Repair

Weichao Zhou, Ruihan Gao, Baek Gyu Kim, Eunsuk Kang, Wenchao Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

We study the problem of policy repair for learning-based control policies in safety-critical settings. We consider an architecture where a high-performance learning-based control policy (e.g. one trained as a neural network) is paired with a model-based safety controller. The safety controller is endowed with the abilities to predict whether the trained policy will lead the system to an unsafe state, and take over control when necessary. While this architecture can provide added safety assurances, intermittent and frequent switching between the trained policy and the safety controller can result in undesirable behaviors and reduced performance. We propose to reduce or even eliminate control switching by ‘repairing’ the trained policy based on runtime data produced by the safety controller in a way that deviates minimally from the original policy. The key idea behind our approach is the formulation of a trajectory optimization problem that allows the joint reasoning of policy update and safety constraints. Experimental results demonstrate that our approach is effective even when the system model in the safety controller is unknown and only approximated.

Original languageEnglish
Title of host publicationRuntime Verification - 20th International Conference, RV 2020, Proceedings
EditorsJyotirmoy Deshmukh, Dejan Nickovic
PublisherSpringer Science and Business Media Deutschland GmbH
Pages131-150
Number of pages20
ISBN (Print)9783030605070
DOIs
StatePublished - 2020
Event20th International Conference on Runtime Verification, RV 2020 - Los Angeles, United States
Duration: 6 Oct 20209 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12399 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference20th International Conference on Runtime Verification, RV 2020
Country/TerritoryUnited States
CityLos Angeles
Period6/10/209/10/20

Bibliographical note

Publisher Copyright:
© 2020, Springer Nature Switzerland AG.

Fingerprint

Dive into the research topics of 'Runtime-Safety-Guided Policy Repair'. Together they form a unique fingerprint.

Cite this