GLUE Benchmark

The General Language Understanding Evaluation (GLUE) benchmark is a collection of resources for training, evaluating, and analyzing natural language understanding systems.


Leaderboard (MNLI-m is MultiNLI Matched-Accuracy; a "-" marks a value that is not reported)

| Rank | Team | Model | Score | CoLA | SST-2 | MRPC (F1/Acc) | STS-B (Pearson/Spearman) | QQP (F1/Acc) | MNLI-m | MNLI-mm | QNLI | RTE | WNLI | AX |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Microsoft Alexander v-team | Turing ULR v6 | 91.3 | 73.3 | 97.5 | 94.2/92.3 | 93.5/93.1 | 76.4/90.9 | 92.5 | 92.1 | 96.7 | 93.6 | 97.9 | 55.4 |
| 2 | JDExplore d-team | Vega v1 | 91.3 | 73.8 | 97.9 | 94.5/92.6 | 93.5/93.1 | 76.7/91.1 | 92.1 | 91.9 | 96.7 | 92.4 | 97.9 | 51.4 |
| 3 | Microsoft Alexander v-team | Turing NLR v5 | 91.2 | 72.6 | 97.6 | 93.8/91.7 | 93.7/93.3 | 76.4/91.1 | 92.6 | 92.4 | 97.9 | 94.1 | 95.9 | 57.0 |
| 4 | DIRL Team | DeBERTa + CLEVER | 91.1 | 74.7 | 97.6 | 93.3/91.1 | 93.4/93.1 | 76.5/91.0 | 92.1 | 91.8 | 96.7 | 93.2 | 96.6 | 53.3 |
| 5 | ERNIE Team - Baidu | ERNIE | 91.1 | 75.5 | 97.8 | 93.9/91.8 | 93.0/92.6 | 75.2/90.9 | 92.3 | 91.7 | 97.3 | 92.6 | 95.9 | 51.7 |
| 6 | AliceMind & DIRL | StructBERT + CLEVER | 91.0 | 75.3 | 97.7 | 93.9/91.9 | 93.5/93.1 | 75.6/90.8 | 91.7 | 91.5 | 97.4 | 92.5 | 95.2 | 49.1 |
| 7 | DeBERTa Team - Microsoft | DeBERTa / TuringNLRv4 | 90.8 | 71.5 | 97.5 | 94.0/92.0 | 92.9/92.6 | 76.2/90.8 | 91.9 | 91.6 | 99.2 | 93.2 | 94.5 | 53.2 |
| 8 | HFL iFLYTEK | MacALBERT + DKM | 90.7 | 74.8 | 97.0 | 94.5/92.6 | 92.8/92.6 | 74.7/90.6 | 91.3 | 91.1 | 97.8 | 92.0 | 94.5 | 52.6 |
| 9 | PING-AN Omni-Sinitic | ALBERT + DAAF + NAS | 90.6 | 73.5 | 97.2 | 94.0/92.0 | 93.0/92.4 | 76.1/91.0 | 91.6 | 91.3 | 97.5 | 91.7 | 94.5 | 51.2 |
| 10 | T5 Team - Google | T5 | 90.3 | 71.6 | 97.5 | 92.8/90.4 | 93.1/92.8 | 75.1/90.6 | 92.2 | 91.9 | 96.9 | 92.8 | 94.5 | 53.1 |
| 11 | Microsoft D365 AI & MSR AI & GATECH | MT-DNN-SMART | 89.9 | 69.5 | 97.5 | 93.7/91.6 | 92.9/92.5 | 73.9/90.2 | 91.0 | 90.8 | 99.2 | 89.7 | 94.5 | 50.2 |
| 12 | Huawei Noah's Ark Lab | NEZHA-Large | 89.8 | 71.7 | 97.3 | 93.3/91.0 | 92.4/91.9 | 75.2/90.7 | 91.5 | 91.3 | 96.2 | 90.3 | 94.5 | 47.9 |
| 13 | LG AI Research | ANNA | 89.8 | 68.7 | 97.0 | 92.7/90.1 | 93.0/92.8 | 75.3/90.5 | 91.8 | 91.6 | 96.0 | 91.8 | 95.9 | 51.8 |
| 14 | Zihang Dai | Funnel-Transformer (Ensemble B10-10-10H1024) | 89.7 | 70.5 | 97.5 | 93.4/91.2 | 92.6/92.3 | 75.4/90.7 | 91.4 | 91.1 | 95.8 | 90.0 | 94.5 | 51.6 |
| 15 | ELECTRA Team | ELECTRA-Large + Standard Tricks | 89.4 | 71.7 | 97.1 | 93.1/90.7 | 92.9/92.5 | 75.6/90.8 | 91.3 | 90.8 | 95.8 | 89.8 | 91.8 | 50.7 |
| 16 | David Kim | 2digit LANet | 89.3 | 71.8 | 97.3 | 92.4/89.6 | 93.0/92.7 | 75.5/90.5 | 91.8 | 91.6 | 96.4 | 91.1 | 88.4 | 54.6 |
| 17 | 怪仕文 | DropAttack-RoBERTa-large | 88.8 | 70.3 | 96.7 | 92.6/90.1 | 92.1/91.8 | 75.1/90.5 | 91.1 | 90.9 | 95.3 | 89.9 | 89.7 | 48.2 |
| 18 | Microsoft D365 AI & UMD | FreeLB-RoBERTa (ensemble) | 88.4 | 68.0 | 96.8 | 93.1/90.8 | 92.3/92.1 | 74.8/90.3 | 91.1 | 90.7 | 95.6 | 88.7 | 89.0 | 50.1 |
| 19 | Junjie Yang | HIRE-RoBERTa | 88.3 | 68.6 | 97.1 | 93.0/90.7 | 92.4/92.0 | 74.3/90.2 | 90.7 | 90.4 | 95.5 | 87.9 | 89.0 | 49.3 |
| 20 | Shiwen Ni | ELECTRA-large-M (bert4keras) | 88.3 | 69.3 | 95.8 | 92.2/89.6 | 91.2/91.1 | 75.1/90.5 | 91.1 | 90.9 | 93.8 | 87.9 | 91.8 | 48.2 |
| 21 | Facebook AI | RoBERTa | 88.1 | 67.8 | 96.7 | 92.3/89.8 | 92.2/91.9 | 74.3/90.2 | 90.8 | 90.2 | 95.4 | 88.2 | 89.0 | 48.7 |
| 22 | Microsoft D365 AI & MSR AI | MT-DNN-ensemble | 87.6 | 68.4 | 96.5 | 92.7/90.3 | 91.1/90.7 | 73.7/89.9 | 87.9 | 87.4 | 96.0 | 86.3 | 89.0 | 42.8 |
| 23 | GLUE Human Baselines | GLUE Human Baselines | 87.1 | 66.4 | 97.8 | 86.3/80.8 | 92.7/92.6 | 59.5/80.4 | 92.0 | 92.8 | 91.2 | 93.6 | 95.9 | - |
| 24 | kk xx | ELECTRA-Large-NewSCL(single) | 85.6 | 73.3 | 97.2 | 92.7/90.2 | 92.0/91.7 | 75.3/90.6 | 90.8 | 90.3 | 95.6 | 86.9 | 60.3 | 50.0 |
| 25 | Adrian de Wynter | Bort (Alexa AI) | 83.6 | 63.9 | 96.2 | 94.1/92.3 | 89.2/88.3 | 66.0/85.9 | 88.1 | 87.8 | 92.3 | 82.7 | 71.2 | 51.9 |
| 26 | Lab LV | ConvBERT base | 83.2 | 67.8 | 95.7 | 91.4/88.3 | 90.4/89.7 | 73.0/90.0 | 88.3 | 87.4 | 93.2 | 77.9 | 65.1 | 42.9 |
| 27 | Stanford Hazy Research | Snorkel MeTaL | 83.2 | 63.8 | 96.2 | 91.5/88.5 | 90.1/89.7 | 73.1/89.9 | 87.6 | 87.2 | 93.9 | 80.9 | 65.1 | 39.9 |
| 28 | XLM Systems | XLM (English only) | 83.1 | 62.9 | 95.6 | 90.7/87.1 | 88.8/88.2 | 73.2/89.8 | 89.1 | 88.5 | 94.0 | 76.0 | 71.9 | 44.7 |
| 29 | WATCH ME | ConvBERT-base-paddle-v1.1 | 83.1 | 66.3 | 95.4 | 91.6/88.6 | 90.0/89.2 | 73.9/90.0 | 88.2 | 87.7 | 93.3 | 78.2 | 65.1 | 9.2 |
| 30 | Zhuosheng Zhang | SemBERT | 82.9 | 62.3 | 94.6 | 91.2/88.3 | 87.8/86.7 | 72.8/89.8 | 87.6 | 86.3 | 94.6 | 84.5 | 65.1 | 42.4 |
| 31 | Jun Yu | mpnet-base-paddle | 82.9 | 60.5 | 95.9 | 91.6/88.9 | 90.8/90.3 | 72.5/89.7 | 87.6 | 86.6 | 93.3 | 82.4 | 65.1 | 9.2 |
| 32 | Danqi Chen | SpanBERT (single-task training) | 82.8 | 64.3 | 94.8 | 90.9/87.9 | 89.9/89.1 | 71.9/89.5 | 88.1 | 87.7 | 94.3 | 79.0 | 65.1 | 45.1 |
| 33 | GAL team | distilRoBERTa+GAL (6-layer transformer single model) | 82.6 | 60.0 | 95.3 | 91.9/89.2 | 90.0/89.6 | 73.3/90.0 | 87.4 | 86.5 | 92.7 | 81.8 | 65.1 | 0.0 |
| 34 | Kevin Clark | BERT + BAM | 82.3 | 61.5 | 95.2 | 91.3/88.3 | 88.6/87.9 | 72.5/89.7 | 86.6 | 85.8 | 93.1 | 80.4 | 65.1 | 40.7 |
| 35 | Nitish Shirish Keskar | Span-Extractive BERT on STILTs | 82.3 | 63.2 | 94.5 | 90.6/87.6 | 89.4/89.2 | 72.2/89.4 | 86.5 | 85.8 | 92.5 | 79.8 | 65.1 | 28.3 |
| 36 | LV NUS | LV-BERT-base | 82.1 | 64.0 | 94.7 | 90.9/87.9 | 89.4/88.8 | 72.3/89.5 | 86.6 | 86.1 | 92.6 | 77.0 | 65.1 | 39.5 |
| 37 | Jason Phang | BERT on STILTs | 82.0 | 62.1 | 94.3 | 90.2/86.6 | 88.7/88.3 | 71.9/89.4 | 86.4 | 85.6 | 92.7 | 80.1 | 65.1 | 28.3 |
| 38 | gao jie | 1 | 82.0 | 66.8 | 96.5 | 90.9/87.2 | 91.4/90.8 | 72.9/89.6 | 90.2 | 56.4 | 94.7 | 82.8 | 62.3 | 9.2 |
| 39 | Gino Tesei | RobustRoBERTa | 81.9 | 63.6 | 96.8 | 91.6/88.6 | 90.3/89.6 | 73.2/89.7 | 90.0 | 89.4 | 95.1 | 50.3 | 80.1 | 50.5 |
| 40 | Karen Hambardzumyan | WARP with RoBERTa | 81.6 | 53.9 | 96.3 | 88.2/83.9 | 89.5/88.8 | 68.6/87.7 | 88.0 | 88.2 | 93.5 | 84.3 | 65.1 | 41.2 |
| 41 | Junxiong Wang | Bigs-128-1000k | 81.5 | 64.4 | 94.9 | 88.7/84.2 | 87.8/87.5 | 71.2/89.2 | 86.1 | 85.0 | 91.6 | 77.6 | 65.1 | 36.2 |
| 42 | Huawei Noah's Ark Lab MTL | CombinedKD-TinyRoBERTa (6 layer 82M parameters, MATE-KD + AnnealingKD) | 81.5 | 58.6 | 95.1 | 91.2/88.1 | 88.5/88.4 | 73.0/89.7 | 86.2 | 85.6 | 92.4 | 76.6 | 65.1 | 20.2 |
| 43 | Richard Bai | segaBERT-large | 81.4 | 62.6 | 94.8 | 89.7/86.1 | 88.6/87.7 | 72.5/89.4 | 87.9 | 87.7 | 94.0 | 71.6 | 65.1 | 0.0 |
| 44 | ć»–äșż | u-PMLM-R (Huawei Noah's Ark Lab) | 81.3 | 56.9 | 94.2 | 90.7/87.7 | 89.7/89.1 | 72.2/89.4 | 86.1 | 85.4 | 92.1 | 78.5 | 65.1 | 40.0 |
| 45 | Xinsong Zhang | AMBERT-BASE | 81.0 | 60.0 | 95.2 | 90.6/87.1 | 86.3/88.2 | 72.2/89.5 | 87.2 | 86.5 | 92.6 | 72.6 | 65.1 | 39.4 |
| 46 | Mikita Sazanovich | Routed BERTs | 80.7 | 56.1 | 93.6 | 88.6/84.7 | 88.0/87.6 | 71.0/88.8 | 85.2 | 84.5 | 92.6 | 80.0 | 65.1 | 9.2 |
| 47 | USCD-AI4Health Team | CERT | 80.7 | 58.9 | 94.6 | 89.8/85.9 | 87.9/86.8 | 72.5/90.3 | 87.2 | 86.4 | 93.0 | 71.2 | 65.1 | 39.6 |
| 48 | Jacob Devlin | BERT: 24-layers, 16-heads, 1024-hidden | 80.5 | 60.5 | 94.9 | 89.3/85.4 | 87.6/86.5 | 72.1/89.3 | 86.7 | 85.9 | 92.7 | 70.1 | 65.1 | 39.6 |
| 49 | Chen Qian | KerasNLP XLM-R | 80.4 | 56.3 | 96.1 | 89.8/86.3 | 88.4/87.7 | 72.3/89.0 | 87.7 | 87.1 | 92.8 | 69.2 | 65.1 | 40.6 |
| 50 | Chen Qian | KerasNLP RoBERTa | 80.4 | 56.3 | 96.1 | 89.8/86.3 | 88.4/87.7 | 72.3/89.0 | 87.7 | 87.1 | 92.8 | 69.2 | 65.1 | 40.6 |
| 51 | Jinliang LU | MULTIPLE_ADAPTER_T5_BASE | 80.3 | 54.1 | 93.8 | 90.1/86.8 | 87.9/87.6 | 71.8/88.9 | 86.1 | 85.7 | 93.5 | 76.8 | 62.3 | 9.2 |
| 52 | Yoshitomo Matsubara | HF bert-large-uncased (default fine-tuning) | 80.2 | 61.5 | 94.6 | 89.2/85.2 | 86.4/85.0 | 72.2/89.3 | 86.4 | 85.7 | 92.4 | 68.9 | 65.1 | 36.9 |
| 53 | Neil Houlsby | BERT + Single-task Adapters | 80.2 | 59.2 | 94.3 | 88.7/84.3 | 87.3/86.1 | 71.5/89.4 | 85.4 | 85.0 | 92.4 | 71.6 | 65.1 | 9.2 |
| 54 | KI BERT | KI-BERT | 80.0 | 55.6 | 94.5 | 88.2/83.9 | 86.3/85.1 | 71.5/88.9 | 85.2 | 83.7 | 91.2 | 69.3 | 73.3 | 35.6 |
| 55 | Xiangyang Liu | elasticbert-large-12L | 79.9 | 57.0 | 92.9 | 89.4/86.0 | 89.7/88.6 | 72.7/89.6 | 85.4 | 84.9 | 92.3 | 71.8 | 62.3 | 9.2 |
| 56 | 战搑阳 | roberta-large-12L | 79.8 | 59.4 | 94.6 | 89.1/85.8 | 89.8/89.1 | 71.5/89.4 | 86.4 | 85.2 | 91.6 | 67.3 | 62.3 | 9.2 |
| 57 | Zhuohan Li | Macaron Net-base | 79.7 | 57.6 | 94.0 | 88.4/84.4 | 87.5/86.3 | 70.8/89.0 | 85.4 | 84.5 | 91.6 | 70.5 | 65.1 | 38.7 |
| 58 | shi To | GAT-bert-base | 79.6 | 56.8 | 94.0 | 89.4/85.3 | 87.9/86.8 | 72.4/89.4 | 85.7 | 84.5 | 91.8 | 70.5 | 62.3 | 9.2 |
| 59 | teerapong saelim | WT-VAT-BERT (Base) | 79.5 | 56.0 | 94.4 | 89.2/85.5 | 87.3/86.2 | 72.9/89.8 | 85.5 | 84.8 | 91.4 | 70.4 | 62.3 | 9.2 |
| 60 | Anshuman Singh | Bert-n-Pals | 79.1 | 52.2 | 93.4 | 89.5/85.6 | 86.6/85.9 | 71.4/89.0 | 84.1 | 83.5 | 90.6 | 75.4 | 62.3 | 33.8 |
| 61 | ANSHUMAN SINGH (RA1811003010460) | DeepPavlov Multitask PalBert | 78.8 | 48.1 | 93.4 | 88.9/85.6 | 87.0/86.7 | 71.4/89.0 | 83.9 | 83.4 | 90.8 | 76.7 | 62.3 | 33.8 |
| 62 | xiaok Liu | BERT-EMD(6-layer; Single model; No DA) | 78.7 | 47.5 | 93.3 | 89.8/86.4 | 87.6/86.8 | 72.0/89.3 | 84.7 | 83.5 | 90.7 | 71.7 | 65.1 | 9.2 |
| 63 | è˜‡ć€§éˆž | SesameBERT-Base | 78.6 | 52.7 | 94.2 | 88.9/84.8 | 86.5/85.5 | 70.8/88.8 | 83.7 | 83.6 | 91.0 | 67.6 | 65.1 | 35.8 |
| 64 | xinge ma | ReptileDistil | 78.5 | 47.9 | 92.8 | 89.2/85.4 | 87.1/85.9 | 71.0/89.0 | 83.6 | 82.9 | 90.4 | 73.5 | 65.1 | 33.2 |
| 65 | MobileBERT Team | MobileBERT | 78.5 | 51.1 | 92.6 | 88.8/84.5 | 86.2/84.8 | 70.5/88.3 | 84.3 | 83.4 | 91.6 | 70.4 | 65.1 | 34.3 |
| 66 | Linyuan Gong | StackingBERT-Base | 78.4 | 56.2 | 93.9 | 88.2/83.9 | 84.2/82.5 | 70.4/88.7 | 84.4 | 84.2 | 90.1 | 67.0 | 65.1 | 36.6 |
| 67 | TinyBERT Team | TinyBERT (6-layer; Single model) | 78.1 | 51.1 | 93.1 | 87.3/82.6 | 85.0/83.7 | 71.6/89.1 | 84.6 | 83.2 | 90.4 | 70.0 | 65.1 | 9.2 |
| 68 | SqueezeBERT Team | SqueezeBERT (4.3x faster than BERT-base on smartphone) | 78.1 | 46.5 | 91.4 | 89.5/86.0 | 87.0/86.3 | 71.5/89.0 | 82.0 | 81.1 | 90.1 | 73.2 | 65.1 | 35.3 |
| 69 | Anshuman Singh | CAMTL | 77.9 | 53.0 | 92.6 | 88.3/84.4 | 86.6/85.9 | 70.0/88.5 | 82.3 | 82.0 | 90.5 | 72.8 | 58.2 | 33.8 |
| 70 | ć‚…è–›æž— | KRISFU | 77.8 | 52.4 | 92.5 | 89.0/84.8 | 83.7/82.2 | 70.4/88.6 | 84.3 | 83.4 | 90.9 | 65.9 | 65.1 | 36.1 |
| 71 | 王侊 | s0 | 77.8 | 46.8 | 92.9 | 88.9/84.8 | 87.2/86.5 | 71.9/89.1 | 84.5 | 83.4 | 90.8 | 70.9 | 60.3 | 35.3 |
| 72 | Stark Tony | Pocket GLUE | 77.6 | 49.3 | 92.4 | 89.0/84.6 | 84.9/84.0 | 70.1/88.7 | 84.0 | 82.8 | 90.1 | 67.2 | 65.1 | 36.1 |
| 73 | Pavan Kalyan Reddy Neerudu | Pavan Neerudu - BERT | 77.6 | 56.1 | 93.5 | 87.6/83.2 | 85.3/83.8 | 70.6/88.8 | 84.0 | 83.4 | 90.8 | 64.0 | 60.3 | 34.6 |
| 74 | NLC MSR Asia | BERT-of-Theseus (6-layer; single model) | 77.1 | 47.8 | 92.2 | 87.6/83.2 | 85.6/84.1 | 71.6/89.3 | 82.4 | 82.1 | 89.6 | 66.2 | 65.1 | 9.2 |
| 75 | Hanxiong Huang | Hanxiong Huang | 75.9 | 49.3 | 93.3 | 87.1/81.9 | 83.3/81.7 | 71.5/89.1 | 84.8 | 83.8 | 91.0 | 64.1 | 53.4 | 9.2 |
| 76 | YeonTaek Oh | EL-BERT(6-Layer, Single model) | 75.6 | 47.7 | 91.0 | 87.8/83.0 | 81.2/80.2 | 69.9/88.1 | 81.8 | 81.0 | 90.2 | 59.9 | 65.1 | 31.8 |
| 77 | EVS Team | Anonymous | 74.7 | 52.6 | 93.4 | 87.6/83.2 | 61.2/59.1 | 71.8/89.3 | 83.7 | 83.2 | 89.9 | 65.0 | 62.3 | 35.6 |
| 78 | Chen Money | KerasNLP 12/05/2022 Trial 2 | 74.6 | 52.2 | 93.5 | 87.8/82.6 | 84.5/83.1 | 71.3/89.3 | 82.3 | 81.6 | 89.3 | 61.7 | 43.8 | 32.9 |
| 79 | Sinx | ZHIYUAN | 74.1 | 57.0 | 95.2 | 91.4/88.4 | 91.1/90.8 | 24.2/23.7 | 87.7 | 87.3 | 92.5 | 81.7 | 47.9 | 0.3 |
| 80 | Tirana Noor Fatyanosa | distilbert-base-uncased | 73.6 | 45.8 | 92.3 | 87.6/83.1 | 71.0/71.0 | 69.6/88.2 | 81.6 | 81.3 | 88.8 | 54.1 | 65.1 | 31.8 |
| 81 | Haiqin YANG | RefBERT | 73.1 | 47.9 | 92.9 | 86.9/81.9 | 75.0/76.3 | 61.6/84.4 | 80.9 | 80.3 | 87.3 | 61.7 | 54.8 | -10.3 |
| 82 | Haiqin Yang | RefBERT | 73.1 | 47.9 | 92.9 | 86.9/81.9 | 75.0/76.3 | 61.4/84.2 | 80.9 | 80.3 | 87.3 | 61.7 | 54.8 | -10.3 |
| 83 | Haiqin Yang | RefBERT | 71.8 | 36.3 | 92.9 | 86.9/81.9 | 75.0/76.3 | 61.6/83.8 | 80.9 | 80.3 | 87.3 | 61.7 | 54.8 | -10.3 |
| 84 | Haiqin Yang | RefBERT | 71.8 | 36.3 | 92.9 | 86.9/81.9 | 75.0/76.3 | 61.3/83.6 | 80.9 | 80.3 | 87.3 | 61.7 | 54.8 | -10.3 |
| 85 | ć…Źèƒœć…Źèƒœ | 1111 | 71.4 | 35.8 | 90.1 | 83.2/75.7 | 81.0/79.3 | 68.5/87.5 | 77.5 | 77.1 | 86.7 | 58.0 | 56.8 | 9.2 |
| 86 | Jack Hessel | Bag-of-words only BoW-BERT (Base) | 70.0 | 14.3 | 86.7 | 82.9/75.2 | 81.8/80.3 | 68.3/87.5 | 79.8 | 79.7 | 86.2 | 60.4 | 65.1 | 31.0 |
| 87 | GLUE Baselines | BiLSTM+ELMo+Attn | 70.0 | 33.6 | 90.4 | 84.4/78.0 | 74.2/72.3 | 63.1/84.3 | 74.1 | 74.5 | 79.8 | 58.9 | 65.1 | 21.7 |
| | | BiLSTM+ELMo | 67.7 | 32.1 | 89.3 | 84.7/78.0 | 70.3/67.8 | 61.1/82.6 | 67.2 | 67.9 | 75.5 | 57.4 | 65.1 | 21.3 |
| | | Single Task BiLSTM+ELMo+Attn | 66.5 | 35.0 | 90.2 | 80.2/68.8 | 55.5/52.5 | 66.1/86.5 | 76.9 | 76.7 | 76.7 | 50.3 | 65.1 | 27.9 |
| | | Single Task BiLSTM+ELMo | 66.4 | 35.0 | 90.2 | 80.8/69.0 | 64.0/60.2 | 65.6/85.7 | 72.9 | 73.4 | 71.7 | 50.1 | 65.1 | 19.5 |
| | | GenSen | 66.1 | 7.7 | 83.1 | 83.0/76.6 | 79.3/79.2 | 59.8/82.9 | 71.4 | 71.3 | 78.6 | 59.2 | 65.1 | 20.6 |
| | | BiLSTM+Attn | 65.6 | 18.6 | 83.0 | 83.9/76.2 | 72.8/70.5 | 60.1/82.4 | 67.6 | 68.3 | 74.3 | 58.4 | 65.1 | 17.8 |
| | | BiLSTM | 64.2 | 11.6 | 82.8 | 81.8/74.3 | 70.3/67.8 | 62.5/84.2 | 65.6 | 66.1 | 74.6 | 57.4 | 65.1 | 20.3 |
| | | InferSent | 63.9 | 4.5 | 85.1 | 81.2/74.1 | 75.9/75.3 | 59.1/81.7 | 66.1 | 65.7 | 72.7 | 58.0 | 65.1 | 18.3 |
| | | Single Task BiLSTM | 63.7 | 15.7 | 85.9 | 79.4/69.3 | 66.0/62.8 | 61.4/81.7 | 70.3 | 70.8 | 75.7 | 52.8 | 62.3 | 21.0 |
| | | Single Task BiLSTM+CoVe | 63.6 | 14.5 | 88.5 | 81.4/73.4 | 67.2/64.1 | 59.4/83.3 | 64.5 | 64.8 | 75.4 | 53.5 | 61.6 | 20.6 |
| | | BiLSTM+CoVe+Attn | 63.1 | 8.3 | 80.7 | 80.0/71.8 | 69.8/68.4 | 60.5/83.4 | 68.1 | 68.6 | 72.9 | 56.0 | 65.1 | 18.3 |
| | | Single Task BiLSTM+CoVe+Attn | 63.1 | 14.5 | 88.5 | 79.7/68.6 | 57.2/53.6 | 60.1/84.1 | 71.6 | 71.5 | 74.5 | 52.7 | 64.4 | 23.8 |
| | | BiLSTM+CoVe | 62.9 | 18.5 | 81.9 | 78.7/71.5 | 64.4/62.7 | 60.6/84.9 | 65.4 | 65.7 | 70.8 | 52.7 | 65.1 | 17.6 |
| | | Single Task BiLSTM+Attn | 62.8 | 15.7 | 85.9 | 80.3/68.5 | 59.3/55.8 | 62.9/83.5 | 74.2 | 73.8 | 77.2 | 51.9 | 55.5 | 24.9 |
| | | DisSent | 61.9 | 4.9 | 83.7 | 81.7/74.1 | 66.1/64.8 | 59.5/82.6 | 58.7 | 59.1 | 73.9 | 56.4 | 65.1 | 15.9 |
| | | Skip-Thought | 61.3 | 0.0 | 81.8 | 80.8/71.7 | 71.8/69.7 | 56.4/82.2 | 62.9 | 62.8 | 72.9 | 53.1 | 65.1 | 12.2 |
| | | CBOW | 58.6 | 0.0 | 80.0 | 81.5/73.4 | 61.2/58.7 | 51.4/79.1 | 56.0 | 56.4 | 72.1 | 54.1 | 62.3 | 9.2 |
| 88 | XLNet Team | XLNet (ensemble) | - | 70.2 | 97.1 | 92.9/90.5 | 93.0/92.6 | 74.7/90.4 | 90.9 | 90.9 | - | 88.5 | 92.5 | 48.4 |
| 89 | ALBERT-Team Google Language | ALBERT (Ensemble) | - | 69.1 | 97.1 | 93.4/91.2 | 92.5/92.0 | 74.2/90.5 | 91.3 | 91.0 | - | 89.2 | 91.8 | 50.2 |
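
The page does not say how the overall score is derived from the per-task values. The Python sketch below tests one plausible reading: an unweighted mean over nine tasks (CoLA, SST-2, MRPC, STS-B, QQP, MNLI, QNLI, RTE, WNLI, in the leaderboard's column order), where each two-metric task and the matched/mismatched MNLI pair is first averaged into a single number, and the final diagnostic value (AX) is left out. This is an assumption inferred from the numbers, not a documented formula; the function name `glue_score` and the dictionary keys are illustrative. The two rows are copied from the leaderboard entries for BERT (Jacob Devlin) and RoBERTa (Facebook AI).

```python
from statistics import mean


def glue_score(row):
    """Macro-average over nine tasks (assumed formula, see lead-in).

    Two-metric tasks are given as (metric1, metric2) tuples and averaged
    first; MNLI matched and mismatched are averaged into one task value.
    The diagnostic value (AX) is deliberately not part of `row`.
    """
    per_task = [
        row["CoLA"],
        row["SST-2"],
        mean(row["MRPC"]),
        mean(row["STS-B"]),
        mean(row["QQP"]),
        mean([row["MNLI-m"], row["MNLI-mm"]]),
        row["QNLI"],
        row["RTE"],
        row["WNLI"],
    ]
    return mean(per_task)


# Values copied from the leaderboard rows; listed scores are 80.5 and 88.1.
bert = {
    "CoLA": 60.5, "SST-2": 94.9, "MRPC": (89.3, 85.4), "STS-B": (87.6, 86.5),
    "QQP": (72.1, 89.3), "MNLI-m": 86.7, "MNLI-mm": 85.9,
    "QNLI": 92.7, "RTE": 70.1, "WNLI": 65.1,
}
roberta = {
    "CoLA": 67.8, "SST-2": 96.7, "MRPC": (92.3, 89.8), "STS-B": (92.2, 91.9),
    "QQP": (74.3, 90.2), "MNLI-m": 90.8, "MNLI-mm": 90.2,
    "QNLI": 95.4, "RTE": 88.2, "WNLI": 89.0,
}

print(round(glue_score(bert), 1))     # 80.5
print(round(glue_score(roberta), 1))  # 88.1
```

Both rows reproduce their listed scores under this reading, which supports the assumption but does not prove it for every entry. Rows that show "-" for a task, such as XLNet and ALBERT, cannot be scored this way.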