COPA-LEADERBOARD

Updated 570 days ago
  • ID: 50958129/5
The goal of COPA is to systematically certify the robustness of different offline RL algorithms based on certification criteria including per-state action stability and the lower bound of cumulative reward. Specifically, we propose new partition and aggregation protocols (PARL, TPARL, DPARL) to obtain robust policies and provide certification methods for them... In COPA-leaderboard, we present the certification results in three RL environments under two certification criteria. Notably, we offer direct comparisons from multiple aspects to enable better understanding of different aggregation protocols and offline RL algorithms of subpolicies... Robustness certiï¬ cation for per-state action stability in Highway environment. We plot the cumulative histogram of the tolerable poisoning size K for all time steps. We provide the certification for different aggregation protocols (PARL, TPARL, DPARL) on three RL algorithms and different #sub-policies. The results are averaged over 20 runs..
  • 0
  • 0
Interest Score
1
HIT Score
0.00
Domain
copa-leaderboard.github.io

Actual
copa-leaderboard.github.io

IP
185.199.108.153, 185.199.109.153, 185.199.110.153, 185.199.111.153

Status
OK

Category
Company
0 comments Add a comment