Companies similar to ADVERSARIALGLUE.GITHUB.IO
Sign up to Download

1-30 of 4,352,327 results

  • adversarialglue.github.io
  • 2
Adversarial GLUE Benchmark (AdvGLUE) is a comprehensive robustness evaluation benchmark that focuses on the adversarial robustness evaluation of language models. It covers five natural language understanding tasks from the famous GLUE tasks and is an..

Relevance: 84.708885
  • www.fluebenchmark.com
  • 11
  • 11
FLUE is an evaluation setup for French NLP systems similar to the popular GLUE benchmark. The goal is to enable further reproducible experiments in the future and to share models and progress on the French language. The tasks and data are obtained..

Relevance: 38.018272
  • value-benchmark.github.io
  • 1
VALUE is a evaluation benchmark built on 11 datasets across 3 tasks with the focus on multi-channel videos (video+subtitle) for Video-And-Language Understanding Evaluation... We sincerely thank all dataset contributors to VALUE benchmark, please..

Relevance: 34.907097
  • textattack.readthedocs.io
This guide was heavily inspired by the awesome transformers guide to contributing... Video recording of this talk: https://underline.io/events/192/sessions/7928/lecture/38377-towards-improving-adversarial-training-of-nlp-models... Some of our..

Relevance: 33.30581
  • adarobustness.github.io
We investigate the robustness of four mainstream adaptation methods: full fine-tuning, soft prompt, LoRA, and adapter-based methods including Adapter , Hyperformer, and Compacter. To better understand the robustness of these adaptation methods, we..

Relevance: 31.708563
  • supraconex.de
SupraCoNeX researches ways to improve management approaches to 802.11-based (wireless) network architectures using machine learning / AI methodologies and algorithms. The goal is to have an evaluation of existing statistical and adversarial multi-..

Relevance: 28.588955
  • llm4eval.github.io
  • 2
Large language models (LLMs) have demonstrated increasing task-solving abilities not present in smaller models. Utilizing the capabilities and responsibilities of LLMs for automated evaluation (LLM4Eval) has recently attracted considerable attention..

Relevance: 24.637527
  • abinayam02.github.io
  • 1
Datasets are foundational to many breakthroughs in modern artificial intelligence. Many recent achievements in the space of natural language processing (NLP) can be attributed to the finetuning of pre-trained models on a diverse set of tasks that..

Relevance: 24.17081
  • far.ai
  • 30
  • 24
FAR AI's mission is to ensure AI systems are trustworthy and beneficial to society. We incubate and accelerate research agendas that are too resource-intensive for academia but not yet ready for commercialisation by industry. Our research spans work..

Relevance: 23.330658
  • neural-network-verification.com
In the first half of the tutorial, we introduce Mathematical Backgrounds and Algorithms for neural network verification, including problem formulation, incomplete verification algorithms (with a focus on state-of-the-art bound propagation based..

Relevance: 23.248447
  • sheilaalemany.github.io
Hurricanes are cyclones circulating about a defined center whose closed wind speeds exceed 75 mph originating over tropical and subtropical waters. At landfall, hurricanes can result in severe disasters. The accuracy of predicting their tra- jectory..

Relevance: 23.142244
  • wears21.github.io
  • 1
  • 20
This year WEARS will be co-located with The International Conference on Evaluation and Assessment in Software Engineering (EASE)... Recommender systems in software engineering (RSSE) have gained momentum in recent years. Such systems assist..

Relevance: 22.712875
  • vidhur2k.github.io
  • 1
Hi! My name is Vidhur Kumar and I am currently pursuing my MS in Computer Science at Georgia Tech with a specialization in Computing Systems. Currently, I work with Jacob Abernathy as part of the GT Undergrad ML Theory Research team. Our research..

Relevance: 22.031246
  • kaustubhsridhar.github.io
  • 2
Adversarial training (AT) and its variants have spearheaded progress in improving neural network robustness to adversarial perturbations and common corruptions in the last few years. Algorithm design of AT and its variants are focused on training..

Relevance: 21.779879
  • www.wilddash.cc
Welcome to the WildDash 2 Benchmark. This website provides a dataset and benchmark for panoptic, semantic, and instance segmentation. We aim to improve the expressiveness of performance evaluation for computer vision algorithms in regard to their..

Relevance: 21.62467
  • vimalabs.github.io
Similar to GPT-3, a generalist robot agent should have an intuitive and expressive interface for human users to convey their intent. In this work, we introduce a novel multimodal prompting formulation that converts diverse robot manipulation tasks..

Relevance: 21.258955
  • sites.google.com
Despite recent successes of reinforcement learning (RL), it remains a common problem that agents fail to transfer their learned skills to related environments. To facilitate research addressing this challenge, we propose CausalWorld, a benchmark for..

Relevance: 20.698895
  • vgtomahawk.github.io
  • 1
A comprehensive survey on recent DA methods in NLP - we also try sensitizing the NLP community about lacunae, e.g w.r.t CV research and outline future challenges. We maintain a live git repo and arXiv - send us a PR to add your method onto both!.....

Relevance: 20.547348
  • livecodebench.github.io
LiveCodeBench is a holistic and contamination-free evaluation benchmark of LLMs for code that continuously collects new problems over time. Particularly, LiveCodeBench also focuses on broader code-related capabilities, such as self-repair, code..

Relevance: 20.50718
  • bldl.ii.uib.no
DMPL is a research project of the Bergen Language Design Laboratory, and centres around exploring ideas of flexibility, adaptability, genericity and robustness (in short, mouldability) in programming languages. As part of the project, we are..

Relevance: 20.462915
  • cyprienruffino.github.io
In this thesis, we explore the conditioning of Generative Adversarial Networks, that is influencing the generation process in order to control the content of a generated image. We focus on conditioning through auxiliary tasks, that is we explicitly..

Relevance: 20.34596
  • msingh27.github.io
  • 1
In this paper, we leverage models with interpretable perceptually-aligned features and show that adversarial training with low max-perturbation bound can improve the performance of models for zero-shot and weakly supervised localization tasks...

Relevance: 20.222223
  • goldblum.github.io
  • 1
Meta-learning algorithms produce feature extractors which achieve state-of-the-art performance on few-shot classification. While the literature is rich with meta-learning methods, little is known about why the resulting feature extractors perform so..

Relevance: 20.049639
  • sokcertifiedrobustness.github.io
booktitle={44th {IEEE} Symposium on Security and Privacy, {SP} 2023, San Francisco, CA, USA, 22-26 May 2023},... Benchmark: In the benchmark page, we provide full comparison results along with experimental setups of representative certifiably robust..

Relevance: 19.861694
  • vtjeng.com
I currently am working on efficient verification of robustness of neural networks. As an offshoot of my work, I'm also exploring using universal adversarial perturbations to identify neurons activated by particular objects, as well as approaches to..

Relevance: 19.830818
  • www.josefineolivia.com
  • 1
  • 1
As stated in my Identity, I often position myself as the "glue" in a team. I want my designs to be serving as "glue" in society. I want to connect people, help them understand each other and create communication systems that speak the language of all..

Relevance: 19.51931
  • nlp.usc.edu
We bring together natural language processing and robotics to connect language to the world (RoboNLP). Our lab is broadly interested in connecting language to agent perception and action, and lifelong learning through interaction... The AI,..

Relevance: 19.450394
  • asyml.io
  • 1
× Texar is a highly modularized and customizable toolkit to support a broad set of machine learning (ML), especially natural language processing (NLP) and text generation tasks... Texar provides comprehensive modules for data processing, model..

Relevance: 18.991907
  • albumentations.ai
Albumentations is a computer vision tool that boosts the performance of deep convolutional neural networks... Albumentations supports different computer vision tasks such as classification, semantic segmentation, instance segmentation, object..

Relevance: 18.866165
  • ptchallenge-workshop.github.io
  • 2
With the rise of large multimodal models (e.g. Flamingo, BeIT-3, GPT-4), integrated perception systems that can achieve human level scene understanding may be on the horizon. Making progress towards this ambitious goal requires robust and..

Relevance: 18.81217