QueryGym
QueryGym Leaderboard
Reproducible benchmarks for LLM query reformulation.
← Datasets

DL 2020

msmarco-v1-passage.trecdl2020
All results produced by QueryGym · fully reproducible!

120 (method × LLM × retriever) configurations evaluated on this dataset.
Click any row or the + button to expand. The three steps (reformulate → retrieve → evaluate) for that run appear inline.

Retriever
Model
Method
120 / 120 runs
best in column
Method LLM Retriever nDCG@10 R@1k
csqe Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.66870.8722
methodcsqe llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
csqe Qwen2.5-72B-Instruct BM25 0.56060.8603
methodcsqe llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
csqe Qwen2.5-72B-Instruct SPLADE++ 0.57360.9052
methodcsqe llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
csqe Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.68850.8850
methodcsqe llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
csqe Qwen2.5-7B-Instruct BM25 0.60830.8596
methodcsqe llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
csqe Qwen2.5-7B-Instruct SPLADE++ 0.61640.9039
methodcsqe llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
csqe gpt-4.1 BGE-base-en-v1.5 0.71390.8968
methodcsqe llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
csqe gpt-4.1 BM25 0.65480.8871
methodcsqe llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
csqe gpt-4.1 SPLADE++ 0.67960.9397
methodcsqe llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
csqe gpt-4.1-nano BGE-base-en-v1.5 0.68730.8535
methodcsqe llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
csqe gpt-4.1-nano BM25 0.51420.8586
methodcsqe llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
csqe gpt-4.1-nano SPLADE++ 0.58830.9119
methodcsqe llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.
genqr Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.66800.8652
methodgenqr llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
genqr Qwen2.5-72B-Instruct BM25 0.42380.7919
methodgenqr llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
genqr Qwen2.5-72B-Instruct SPLADE++ 0.57510.8971
methodgenqr llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
genqr Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.63350.8395
methodgenqr llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
genqr Qwen2.5-7B-Instruct BM25 0.38570.7740
methodgenqr llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
genqr Qwen2.5-7B-Instruct SPLADE++ 0.61150.8989
methodgenqr llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
genqr gpt-4.1 BGE-base-en-v1.5 0.69030.8516
methodgenqr llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
genqr gpt-4.1 BM25 0.53680.8402
methodgenqr llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
genqr gpt-4.1 SPLADE++ 0.62600.9143
methodgenqr llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
genqr gpt-4.1-nano BGE-base-en-v1.5 0.65680.8485
methodgenqr llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
genqr gpt-4.1-nano BM25 0.43020.7701
methodgenqr llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
genqr gpt-4.1-nano SPLADE++ 0.60110.9074
methodgenqr llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.
genqr_ensemble Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.67740.8585
methodgenqr_ensemble llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
genqr_ensemble Qwen2.5-72B-Instruct BM25 0.42480.7820
methodgenqr_ensemble llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
genqr_ensemble Qwen2.5-72B-Instruct SPLADE++ 0.54470.8886
methodgenqr_ensemble llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
genqr_ensemble Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.67000.8582
methodgenqr_ensemble llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
genqr_ensemble Qwen2.5-7B-Instruct BM25 0.48960.8164
methodgenqr_ensemble llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
genqr_ensemble Qwen2.5-7B-Instruct SPLADE++ 0.63070.9020
methodgenqr_ensemble llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
genqr_ensemble gpt-4.1 BGE-base-en-v1.5 0.68260.8699
methodgenqr_ensemble llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
genqr_ensemble gpt-4.1 BM25 0.55280.8613
methodgenqr_ensemble llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
genqr_ensemble gpt-4.1 SPLADE++ 0.58570.9141
methodgenqr_ensemble llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
genqr_ensemble gpt-4.1-nano BGE-base-en-v1.5 0.66450.8620
methodgenqr_ensemble llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
genqr_ensemble gpt-4.1-nano BM25 0.47180.8158
methodgenqr_ensemble llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
genqr_ensemble gpt-4.1-nano SPLADE++ 0.60440.9194
methodgenqr_ensemble llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.
lamer Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.72760.9045
methodlamer llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
lamer Qwen2.5-72B-Instruct BM25 0.67110.8920
methodlamer llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
lamer Qwen2.5-72B-Instruct SPLADE++ 0.64830.9195
methodlamer llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
lamer Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.68250.8940
methodlamer llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
lamer Qwen2.5-7B-Instruct BM25 0.63220.8933
methodlamer llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
lamer Qwen2.5-7B-Instruct SPLADE++ 0.60760.9213
methodlamer llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
lamer gpt-4.1 BGE-base-en-v1.5 0.71480.9026
methodlamer llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
lamer gpt-4.1 BM25 0.65300.9002
methodlamer llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
lamer gpt-4.1 SPLADE++ 0.63900.9378
methodlamer llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
lamer gpt-4.1-nano BGE-base-en-v1.5 0.71350.8846
methodlamer llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
lamer gpt-4.1-nano BM25 0.65600.8865
methodlamer llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
lamer gpt-4.1-nano SPLADE++ 0.62540.9244
methodlamer llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.
mugi Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.71220.8894
methodmugi llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
mugi Qwen2.5-72B-Instruct BM25 0.62680.9015
methodmugi llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
mugi Qwen2.5-72B-Instruct SPLADE++ 0.64190.9165
methodmugi llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
mugi Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.68880.8823
methodmugi llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
mugi Qwen2.5-7B-Instruct BM25 0.60690.8882
methodmugi llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
mugi Qwen2.5-7B-Instruct SPLADE++ 0.55270.9104
methodmugi llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
mugi gpt-4.1 BGE-base-en-v1.5 0.72030.8950
methodmugi llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
mugi gpt-4.1 BM25 0.65780.8996
methodmugi llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
mugi gpt-4.1 SPLADE++ 0.65080.9199
methodmugi llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
mugi gpt-4.1-nano BGE-base-en-v1.5 0.71870.8911
methodmugi llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
mugi gpt-4.1-nano BM25 0.64730.9017
methodmugi llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
mugi gpt-4.1-nano SPLADE++ 0.64320.9203
methodmugi llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.
qa_expand Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.69160.8785
methodqa_expand llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
qa_expand Qwen2.5-72B-Instruct BM25 0.61520.8727
methodqa_expand llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
qa_expand Qwen2.5-72B-Instruct SPLADE++ 0.69830.9284
methodqa_expand llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
qa_expand Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.65410.8606
methodqa_expand llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
qa_expand Qwen2.5-7B-Instruct BM25 0.56540.8454
methodqa_expand llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
qa_expand Qwen2.5-7B-Instruct SPLADE++ 0.61560.8945
methodqa_expand llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
qa_expand gpt-4.1 BGE-base-en-v1.5 0.70740.8754
methodqa_expand llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
qa_expand gpt-4.1 BM25 0.64180.8787
methodqa_expand llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
qa_expand gpt-4.1 SPLADE++ 0.67390.9260
methodqa_expand llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
qa_expand gpt-4.1-nano BGE-base-en-v1.5 0.66120.8397
methodqa_expand llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
qa_expand gpt-4.1-nano BM25 0.60260.8649
methodqa_expand llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
qa_expand gpt-4.1-nano SPLADE++ 0.66280.9279
methodqa_expand llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (FS) Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.67920.8913
methodQ2D (FS) llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (ZS) Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.69820.8945
methodQ2D (ZS) llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (COT) Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.64110.8485
methodQ2D (COT) llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (ZS) Qwen2.5-72B-Instruct BM25 0.62070.8801
methodQ2D (ZS) llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
Q2D (COT) Qwen2.5-72B-Instruct BM25 0.56510.8549
methodQ2D (COT) llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
Q2D (FS) Qwen2.5-72B-Instruct BM25 0.62640.8907
methodQ2D (FS) llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
Q2D (FS) Qwen2.5-72B-Instruct SPLADE++ 0.64990.9234
methodQ2D (FS) llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (ZS) Qwen2.5-72B-Instruct SPLADE++ 0.61440.9161
methodQ2D (ZS) llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (COT) Qwen2.5-72B-Instruct SPLADE++ 0.60990.8857
methodQ2D (COT) llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (COT) Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.63020.8573
methodQ2D (COT) llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (FS) Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.64020.8578
methodQ2D (FS) llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (ZS) Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.66170.8566
methodQ2D (ZS) llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (COT) Qwen2.5-7B-Instruct BM25 0.58020.8684
methodQ2D (COT) llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
Q2D (FS) Qwen2.5-7B-Instruct BM25 0.54280.8691
methodQ2D (FS) llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
Q2D (ZS) Qwen2.5-7B-Instruct BM25 0.56850.8647
methodQ2D (ZS) llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
Q2D (COT) Qwen2.5-7B-Instruct SPLADE++ 0.59480.9019
methodQ2D (COT) llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (FS) Qwen2.5-7B-Instruct SPLADE++ 0.54920.9062
methodQ2D (FS) llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (ZS) Qwen2.5-7B-Instruct SPLADE++ 0.60960.9045
methodQ2D (ZS) llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (ZS) gpt-4.1 BGE-base-en-v1.5 0.73930.9056
methodQ2D (ZS) llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (COT) gpt-4.1 BGE-base-en-v1.5 0.67200.8756
methodQ2D (COT) llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (FS) gpt-4.1 BGE-base-en-v1.5 0.71410.8948
methodQ2D (FS) llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (ZS) gpt-4.1 BM25 0.66250.8942
methodQ2D (ZS) llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
Q2D (COT) gpt-4.1 BM25 0.62390.8781
methodQ2D (COT) llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
Q2D (FS) gpt-4.1 BM25 0.67460.8984
methodQ2D (FS) llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
Q2D (ZS) gpt-4.1 SPLADE++ 0.68750.9372
methodQ2D (ZS) llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (COT) gpt-4.1 SPLADE++ 0.65340.9089
methodQ2D (COT) llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (FS) gpt-4.1 SPLADE++ 0.67490.9389
methodQ2D (FS) llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (COT) gpt-4.1-nano BGE-base-en-v1.5 0.67440.8709
methodQ2D (COT) llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (FS) gpt-4.1-nano BGE-base-en-v1.5 0.69880.8742
methodQ2D (FS) llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (ZS) gpt-4.1-nano BGE-base-en-v1.5 0.70290.8743
methodQ2D (ZS) llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
Q2D (COT) gpt-4.1-nano BM25 0.60920.8846
methodQ2D (COT) llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
Q2D (FS) gpt-4.1-nano BM25 0.62270.8848
methodQ2D (FS) llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
Q2D (ZS) gpt-4.1-nano BM25 0.62680.8869
methodQ2D (ZS) llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
Q2D (COT) gpt-4.1-nano SPLADE++ 0.62710.9167
methodQ2D (COT) llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (FS) gpt-4.1-nano SPLADE++ 0.64710.9232
methodQ2D (FS) llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.
Q2D (ZS) gpt-4.1-nano SPLADE++ 0.62420.9219
methodQ2D (ZS) llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.
query2e Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.66060.8528
methodquery2e llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
query2e Qwen2.5-72B-Instruct BM25 0.55460.8609
methodquery2e llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2020
No run config available.
query2e Qwen2.5-72B-Instruct SPLADE++ 0.63530.9286
methodquery2e llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
query2e Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.64250.8443
methodquery2e llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
query2e Qwen2.5-7B-Instruct BM25 0.54040.8548
methodquery2e llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2020
No run config available.
query2e Qwen2.5-7B-Instruct SPLADE++ 0.53120.9001
methodquery2e llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2020
No run config available.
query2e gpt-4.1 BGE-base-en-v1.5 0.64220.8184
methodquery2e llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
query2e gpt-4.1 BM25 0.57590.8594
methodquery2e llmgpt-4.1 retrieverBM25 datasetDL 2020
No run config available.
query2e gpt-4.1 SPLADE++ 0.65220.9252
methodquery2e llmgpt-4.1 retrieverSPLADE++ datasetDL 2020
No run config available.
query2e gpt-4.1-nano BGE-base-en-v1.5 0.67060.8514
methodquery2e llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2020
No run config available.
query2e gpt-4.1-nano BM25 0.54750.8392
methodquery2e llmgpt-4.1-nano retrieverBM25 datasetDL 2020
No run config available.
query2e gpt-4.1-nano SPLADE++ 0.66050.9142
methodquery2e llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2020
No run config available.