CENTER FOR AI SAFETY

Updated 417 days ago
  • ID: 50894344/8
Allowed Methods (Trojan Detection Track): The use of features that are clearly loopholes is not allowed (e.g., metadata). We may not anticipate all loopholes and we encourage participants to alert us to their existence. Legitimate features that do not constitute loopholes include all features derived from the trained parameters of networks, the target strings, training triggers, and text datasets. Similar to the Red Teaming Track, we also do not allow the submission of prompts that effectively make LLMs copy-paste target strings from the prompt into the generation.
  • 0
  • 0
Interest Score
1
HIT Score
0.00
Domain
trojandetection.ai

Actual
trojandetection.ai

IP
185.199.108.153, 185.199.109.153, 185.199.110.153, 185.199.111.153

Status
OK

Category
Other
0 comments Add a comment