QueryGym
QueryGym Leaderboard
Reproducible benchmarks for LLM query reformulation.
← Datasets

DL 2019

msmarco-v1-passage.trecdl2019
All results produced by QueryGym · fully reproducible!

120 (method × LLM × retriever) configurations evaluated on this dataset.
Click any row or the + button to expand. The three steps (reformulate → retrieve → evaluate) for that run appear inline.

Retriever
Model
Method
120 / 120 runs
best in column
Method LLM Retriever nDCG@10 R@1k
csqe Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.71790.8944
methodcsqe llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
csqe Qwen2.5-72B-Instruct BM25 0.63910.8608
methodcsqe llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
csqe Qwen2.5-72B-Instruct SPLADE++ 0.61890.9070
methodcsqe llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
csqe Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.71270.8803
methodcsqe llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
csqe Qwen2.5-7B-Instruct BM25 0.68730.8921
methodcsqe llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
csqe Qwen2.5-7B-Instruct SPLADE++ 0.65230.9089
methodcsqe llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
csqe gpt-4.1 BGE-base-en-v1.5 0.75510.9009
methodcsqe llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
csqe gpt-4.1 BM25 0.68990.9035
methodcsqe llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
csqe gpt-4.1 SPLADE++ 0.69360.9193
methodcsqe llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
csqe gpt-4.1-nano BGE-base-en-v1.5 0.73040.8749
methodcsqe llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
csqe gpt-4.1-nano BM25 0.54100.8221
methodcsqe llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
csqe gpt-4.1-nano SPLADE++ 0.61340.8900
methodcsqe llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.
genqr Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.67410.8618
methodgenqr llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
genqr Qwen2.5-72B-Instruct BM25 0.41980.7616
methodgenqr llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
genqr Qwen2.5-72B-Instruct SPLADE++ 0.61540.9030
methodgenqr llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
genqr Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.64160.8381
methodgenqr llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
genqr Qwen2.5-7B-Instruct BM25 0.43340.7860
methodgenqr llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
genqr Qwen2.5-7B-Instruct SPLADE++ 0.64490.8870
methodgenqr llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
genqr gpt-4.1 BGE-base-en-v1.5 0.70230.8650
methodgenqr llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
genqr gpt-4.1 BM25 0.54790.8282
methodgenqr llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
genqr gpt-4.1 SPLADE++ 0.70650.9333
methodgenqr llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
genqr gpt-4.1-nano BGE-base-en-v1.5 0.65870.8493
methodgenqr llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
genqr gpt-4.1-nano BM25 0.43890.7360
methodgenqr llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
genqr gpt-4.1-nano SPLADE++ 0.63510.9162
methodgenqr llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.
genqr_ensemble Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.68190.8825
methodgenqr_ensemble llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
genqr_ensemble Qwen2.5-72B-Instruct BM25 0.47390.7999
methodgenqr_ensemble llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
genqr_ensemble Qwen2.5-72B-Instruct SPLADE++ 0.59790.9053
methodgenqr_ensemble llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
genqr_ensemble Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.66610.8520
methodgenqr_ensemble llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
genqr_ensemble Qwen2.5-7B-Instruct BM25 0.45120.7952
methodgenqr_ensemble llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
genqr_ensemble Qwen2.5-7B-Instruct SPLADE++ 0.59480.8824
methodgenqr_ensemble llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
genqr_ensemble gpt-4.1 BGE-base-en-v1.5 0.70340.8870
methodgenqr_ensemble llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
genqr_ensemble gpt-4.1 BM25 0.55890.8685
methodgenqr_ensemble llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
genqr_ensemble gpt-4.1 SPLADE++ 0.68590.9020
methodgenqr_ensemble llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
genqr_ensemble gpt-4.1-nano BGE-base-en-v1.5 0.68830.8711
methodgenqr_ensemble llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
genqr_ensemble gpt-4.1-nano BM25 0.45790.8217
methodgenqr_ensemble llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
genqr_ensemble gpt-4.1-nano SPLADE++ 0.66170.9104
methodgenqr_ensemble llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.
lamer Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.72190.8859
methodlamer llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
lamer Qwen2.5-72B-Instruct BM25 0.66510.8666
methodlamer llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
lamer Qwen2.5-72B-Instruct SPLADE++ 0.66510.8956
methodlamer llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
lamer Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.71130.8668
methodlamer llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
lamer Qwen2.5-7B-Instruct BM25 0.66020.8553
methodlamer llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
lamer Qwen2.5-7B-Instruct SPLADE++ 0.64650.8654
methodlamer llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
lamer gpt-4.1 BGE-base-en-v1.5 0.70320.8888
methodlamer llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
lamer gpt-4.1 BM25 0.63680.8566
methodlamer llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
lamer gpt-4.1 SPLADE++ 0.68360.9065
methodlamer llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
lamer gpt-4.1-nano BGE-base-en-v1.5 0.72650.8894
methodlamer llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
lamer gpt-4.1-nano BM25 0.67310.8548
methodlamer llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
lamer gpt-4.1-nano SPLADE++ 0.69160.8975
methodlamer llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.
mugi Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.75120.9071
methodmugi llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
mugi Qwen2.5-72B-Instruct BM25 0.69110.9055
methodmugi llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
mugi Qwen2.5-72B-Instruct SPLADE++ 0.67460.9275
methodmugi llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
mugi Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.68690.8781
methodmugi llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
mugi Qwen2.5-7B-Instruct BM25 0.63940.8732
methodmugi llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
mugi Qwen2.5-7B-Instruct SPLADE++ 0.57730.8929
methodmugi llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
mugi gpt-4.1 BGE-base-en-v1.5 0.73510.8869
methodmugi llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
mugi gpt-4.1 BM25 0.69520.9005
methodmugi llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
mugi gpt-4.1 SPLADE++ 0.68590.9088
methodmugi llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
mugi gpt-4.1-nano BGE-base-en-v1.5 0.71690.8725
methodmugi llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
mugi gpt-4.1-nano BM25 0.68350.8915
methodmugi llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
mugi gpt-4.1-nano SPLADE++ 0.66110.8904
methodmugi llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.
qa_expand Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.69990.8733
methodqa_expand llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
qa_expand Qwen2.5-72B-Instruct BM25 0.61090.8396
methodqa_expand llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
qa_expand Qwen2.5-72B-Instruct SPLADE++ 0.67570.9005
methodqa_expand llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
qa_expand Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.67400.8469
methodqa_expand llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
qa_expand Qwen2.5-7B-Instruct BM25 0.55530.7976
methodqa_expand llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
qa_expand Qwen2.5-7B-Instruct SPLADE++ 0.65740.8890
methodqa_expand llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
qa_expand gpt-4.1 BGE-base-en-v1.5 0.73700.8936
methodqa_expand llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
qa_expand gpt-4.1 BM25 0.68320.8495
methodqa_expand llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
qa_expand gpt-4.1 SPLADE++ 0.73350.9170
methodqa_expand llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
qa_expand gpt-4.1-nano BGE-base-en-v1.5 0.65230.8486
methodqa_expand llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
qa_expand gpt-4.1-nano BM25 0.58190.8385
methodqa_expand llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
qa_expand gpt-4.1-nano SPLADE++ 0.68830.9010
methodqa_expand llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (FS) Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.74190.9027
methodQ2D (FS) llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (ZS) Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.72690.9092
methodQ2D (ZS) llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (COT) Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.71210.8712
methodQ2D (COT) llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (ZS) Qwen2.5-72B-Instruct BM25 0.65570.8807
methodQ2D (ZS) llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
Q2D (COT) Qwen2.5-72B-Instruct BM25 0.63780.8508
methodQ2D (COT) llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
Q2D (FS) Qwen2.5-72B-Instruct BM25 0.68750.8959
methodQ2D (FS) llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
Q2D (FS) Qwen2.5-72B-Instruct SPLADE++ 0.71510.9124
methodQ2D (FS) llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (ZS) Qwen2.5-72B-Instruct SPLADE++ 0.66820.9161
methodQ2D (ZS) llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (COT) Qwen2.5-72B-Instruct SPLADE++ 0.69410.9148
methodQ2D (COT) llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (COT) Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.65610.8397
methodQ2D (COT) llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (FS) Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.67760.8535
methodQ2D (FS) llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (ZS) Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.69070.8584
methodQ2D (ZS) llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (COT) Qwen2.5-7B-Instruct BM25 0.60740.8585
methodQ2D (COT) llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
Q2D (FS) Qwen2.5-7B-Instruct BM25 0.58840.8605
methodQ2D (FS) llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
Q2D (ZS) Qwen2.5-7B-Instruct BM25 0.60140.8467
methodQ2D (ZS) llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
Q2D (COT) Qwen2.5-7B-Instruct SPLADE++ 0.65130.9037
methodQ2D (COT) llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (FS) Qwen2.5-7B-Instruct SPLADE++ 0.60950.8612
methodQ2D (FS) llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (ZS) Qwen2.5-7B-Instruct SPLADE++ 0.60910.8665
methodQ2D (ZS) llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (ZS) gpt-4.1 BGE-base-en-v1.5 0.72810.8995
methodQ2D (ZS) llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (COT) gpt-4.1 BGE-base-en-v1.5 0.71250.8877
methodQ2D (COT) llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (FS) gpt-4.1 BGE-base-en-v1.5 0.72720.8890
methodQ2D (FS) llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (ZS) gpt-4.1 BM25 0.68730.8924
methodQ2D (ZS) llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
Q2D (COT) gpt-4.1 BM25 0.65280.8777
methodQ2D (COT) llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
Q2D (FS) gpt-4.1 BM25 0.69040.8861
methodQ2D (FS) llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
Q2D (ZS) gpt-4.1 SPLADE++ 0.70000.9142
methodQ2D (ZS) llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (COT) gpt-4.1 SPLADE++ 0.68770.9153
methodQ2D (COT) llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (FS) gpt-4.1 SPLADE++ 0.69320.9068
methodQ2D (FS) llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (COT) gpt-4.1-nano BGE-base-en-v1.5 0.67100.8530
methodQ2D (COT) llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (FS) gpt-4.1-nano BGE-base-en-v1.5 0.71570.8601
methodQ2D (FS) llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (ZS) gpt-4.1-nano BGE-base-en-v1.5 0.72020.8701
methodQ2D (ZS) llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
Q2D (COT) gpt-4.1-nano BM25 0.62540.8621
methodQ2D (COT) llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
Q2D (FS) gpt-4.1-nano BM25 0.66430.8527
methodQ2D (FS) llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
Q2D (ZS) gpt-4.1-nano BM25 0.67790.8862
methodQ2D (ZS) llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
Q2D (COT) gpt-4.1-nano SPLADE++ 0.65440.8954
methodQ2D (COT) llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (FS) gpt-4.1-nano SPLADE++ 0.63180.8839
methodQ2D (FS) llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.
Q2D (ZS) gpt-4.1-nano SPLADE++ 0.68770.8916
methodQ2D (ZS) llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.
query2e Qwen2.5-72B-Instruct BGE-base-en-v1.5 0.70690.8760
methodquery2e llmQwen2.5-72B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
query2e Qwen2.5-72B-Instruct BM25 0.58450.8501
methodquery2e llmQwen2.5-72B-Instruct retrieverBM25 datasetDL 2019
No run config available.
query2e Qwen2.5-72B-Instruct SPLADE++ 0.66860.9104
methodquery2e llmQwen2.5-72B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
query2e Qwen2.5-7B-Instruct BGE-base-en-v1.5 0.66460.8422
methodquery2e llmQwen2.5-7B-Instruct retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
query2e Qwen2.5-7B-Instruct BM25 0.57210.8431
methodquery2e llmQwen2.5-7B-Instruct retrieverBM25 datasetDL 2019
No run config available.
query2e Qwen2.5-7B-Instruct SPLADE++ 0.54740.8734
methodquery2e llmQwen2.5-7B-Instruct retrieverSPLADE++ datasetDL 2019
No run config available.
query2e gpt-4.1 BGE-base-en-v1.5 0.69700.8701
methodquery2e llmgpt-4.1 retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
query2e gpt-4.1 BM25 0.59350.8698
methodquery2e llmgpt-4.1 retrieverBM25 datasetDL 2019
No run config available.
query2e gpt-4.1 SPLADE++ 0.68120.9302
methodquery2e llmgpt-4.1 retrieverSPLADE++ datasetDL 2019
No run config available.
query2e gpt-4.1-nano BGE-base-en-v1.5 0.68020.8662
methodquery2e llmgpt-4.1-nano retrieverBGE-base-en-v1.5 datasetDL 2019
No run config available.
query2e gpt-4.1-nano BM25 0.58910.8474
methodquery2e llmgpt-4.1-nano retrieverBM25 datasetDL 2019
No run config available.
query2e gpt-4.1-nano SPLADE++ 0.63200.9104
methodquery2e llmgpt-4.1-nano retrieverSPLADE++ datasetDL 2019
No run config available.