← Datasets
DL 2019
msmarco-v1-passage.trecdl2019 All results produced by QueryGym · fully reproducible! 120 (method × LLM × retriever) configurations evaluated on this dataset.
Click any row or the + button to expand. The three steps
(reformulate → retrieve → evaluate) for that run appear inline.
Retriever
Model
Method
| Method | LLM | Retriever | nDCG@10 | R@1k | |
|---|---|---|---|---|---|
| csqe | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.7179 | 0.8944 | |
| No run config available. | |||||
| csqe | Qwen2.5-72B-Instruct | BM25 | 0.6391 | 0.8608 | |
| No run config available. | |||||
| csqe | Qwen2.5-72B-Instruct | SPLADE++ | 0.6189 | 0.9070 | |
| No run config available. | |||||
| csqe | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.7127 | 0.8803 | |
| No run config available. | |||||
| csqe | Qwen2.5-7B-Instruct | BM25 | 0.6873 | 0.8921 | |
| No run config available. | |||||
| csqe | Qwen2.5-7B-Instruct | SPLADE++ | 0.6523 | 0.9089 | |
| No run config available. | |||||
| csqe | gpt-4.1 | BGE-base-en-v1.5 | 0.7551 | 0.9009 | |
| No run config available. | |||||
| csqe | gpt-4.1 | BM25 | 0.6899 | 0.9035 | |
| No run config available. | |||||
| csqe | gpt-4.1 | SPLADE++ | 0.6936 | 0.9193 | |
| No run config available. | |||||
| csqe | gpt-4.1-nano | BGE-base-en-v1.5 | 0.7304 | 0.8749 | |
| No run config available. | |||||
| csqe | gpt-4.1-nano | BM25 | 0.5410 | 0.8221 | |
| No run config available. | |||||
| csqe | gpt-4.1-nano | SPLADE++ | 0.6134 | 0.8900 | |
| No run config available. | |||||
| genqr | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6741 | 0.8618 | |
| No run config available. | |||||
| genqr | Qwen2.5-72B-Instruct | BM25 | 0.4198 | 0.7616 | |
| No run config available. | |||||
| genqr | Qwen2.5-72B-Instruct | SPLADE++ | 0.6154 | 0.9030 | |
| No run config available. | |||||
| genqr | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6416 | 0.8381 | |
| No run config available. | |||||
| genqr | Qwen2.5-7B-Instruct | BM25 | 0.4334 | 0.7860 | |
| No run config available. | |||||
| genqr | Qwen2.5-7B-Instruct | SPLADE++ | 0.6449 | 0.8870 | |
| No run config available. | |||||
| genqr | gpt-4.1 | BGE-base-en-v1.5 | 0.7023 | 0.8650 | |
| No run config available. | |||||
| genqr | gpt-4.1 | BM25 | 0.5479 | 0.8282 | |
| No run config available. | |||||
| genqr | gpt-4.1 | SPLADE++ | 0.7065 | 0.9333 | |
| No run config available. | |||||
| genqr | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6587 | 0.8493 | |
| No run config available. | |||||
| genqr | gpt-4.1-nano | BM25 | 0.4389 | 0.7360 | |
| No run config available. | |||||
| genqr | gpt-4.1-nano | SPLADE++ | 0.6351 | 0.9162 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6819 | 0.8825 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-72B-Instruct | BM25 | 0.4739 | 0.7999 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-72B-Instruct | SPLADE++ | 0.5979 | 0.9053 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6661 | 0.8520 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-7B-Instruct | BM25 | 0.4512 | 0.7952 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-7B-Instruct | SPLADE++ | 0.5948 | 0.8824 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1 | BGE-base-en-v1.5 | 0.7034 | 0.8870 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1 | BM25 | 0.5589 | 0.8685 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1 | SPLADE++ | 0.6859 | 0.9020 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6883 | 0.8711 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1-nano | BM25 | 0.4579 | 0.8217 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1-nano | SPLADE++ | 0.6617 | 0.9104 | |
| No run config available. | |||||
| lamer | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.7219 | 0.8859 | |
| No run config available. | |||||
| lamer | Qwen2.5-72B-Instruct | BM25 | 0.6651 | 0.8666 | |
| No run config available. | |||||
| lamer | Qwen2.5-72B-Instruct | SPLADE++ | 0.6651 | 0.8956 | |
| No run config available. | |||||
| lamer | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.7113 | 0.8668 | |
| No run config available. | |||||
| lamer | Qwen2.5-7B-Instruct | BM25 | 0.6602 | 0.8553 | |
| No run config available. | |||||
| lamer | Qwen2.5-7B-Instruct | SPLADE++ | 0.6465 | 0.8654 | |
| No run config available. | |||||
| lamer | gpt-4.1 | BGE-base-en-v1.5 | 0.7032 | 0.8888 | |
| No run config available. | |||||
| lamer | gpt-4.1 | BM25 | 0.6368 | 0.8566 | |
| No run config available. | |||||
| lamer | gpt-4.1 | SPLADE++ | 0.6836 | 0.9065 | |
| No run config available. | |||||
| lamer | gpt-4.1-nano | BGE-base-en-v1.5 | 0.7265 | 0.8894 | |
| No run config available. | |||||
| lamer | gpt-4.1-nano | BM25 | 0.6731 | 0.8548 | |
| No run config available. | |||||
| lamer | gpt-4.1-nano | SPLADE++ | 0.6916 | 0.8975 | |
| No run config available. | |||||
| mugi | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.7512 | 0.9071 | |
| No run config available. | |||||
| mugi | Qwen2.5-72B-Instruct | BM25 | 0.6911 | 0.9055 | |
| No run config available. | |||||
| mugi | Qwen2.5-72B-Instruct | SPLADE++ | 0.6746 | 0.9275 | |
| No run config available. | |||||
| mugi | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6869 | 0.8781 | |
| No run config available. | |||||
| mugi | Qwen2.5-7B-Instruct | BM25 | 0.6394 | 0.8732 | |
| No run config available. | |||||
| mugi | Qwen2.5-7B-Instruct | SPLADE++ | 0.5773 | 0.8929 | |
| No run config available. | |||||
| mugi | gpt-4.1 | BGE-base-en-v1.5 | 0.7351 | 0.8869 | |
| No run config available. | |||||
| mugi | gpt-4.1 | BM25 | 0.6952 | 0.9005 | |
| No run config available. | |||||
| mugi | gpt-4.1 | SPLADE++ | 0.6859 | 0.9088 | |
| No run config available. | |||||
| mugi | gpt-4.1-nano | BGE-base-en-v1.5 | 0.7169 | 0.8725 | |
| No run config available. | |||||
| mugi | gpt-4.1-nano | BM25 | 0.6835 | 0.8915 | |
| No run config available. | |||||
| mugi | gpt-4.1-nano | SPLADE++ | 0.6611 | 0.8904 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6999 | 0.8733 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-72B-Instruct | BM25 | 0.6109 | 0.8396 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-72B-Instruct | SPLADE++ | 0.6757 | 0.9005 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6740 | 0.8469 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-7B-Instruct | BM25 | 0.5553 | 0.7976 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-7B-Instruct | SPLADE++ | 0.6574 | 0.8890 | |
| No run config available. | |||||
| qa_expand | gpt-4.1 | BGE-base-en-v1.5 | 0.7370 | 0.8936 | |
| No run config available. | |||||
| qa_expand | gpt-4.1 | BM25 | 0.6832 | 0.8495 | |
| No run config available. | |||||
| qa_expand | gpt-4.1 | SPLADE++ | 0.7335 | 0.9170 | |
| No run config available. | |||||
| qa_expand | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6523 | 0.8486 | |
| No run config available. | |||||
| qa_expand | gpt-4.1-nano | BM25 | 0.5819 | 0.8385 | |
| No run config available. | |||||
| qa_expand | gpt-4.1-nano | SPLADE++ | 0.6883 | 0.9010 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.7419 | 0.9027 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.7269 | 0.9092 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.7121 | 0.8712 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-72B-Instruct | BM25 | 0.6557 | 0.8807 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-72B-Instruct | BM25 | 0.6378 | 0.8508 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-72B-Instruct | BM25 | 0.6875 | 0.8959 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-72B-Instruct | SPLADE++ | 0.7151 | 0.9124 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-72B-Instruct | SPLADE++ | 0.6682 | 0.9161 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-72B-Instruct | SPLADE++ | 0.6941 | 0.9148 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6561 | 0.8397 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6776 | 0.8535 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6907 | 0.8584 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-7B-Instruct | BM25 | 0.6074 | 0.8585 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-7B-Instruct | BM25 | 0.5884 | 0.8605 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-7B-Instruct | BM25 | 0.6014 | 0.8467 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-7B-Instruct | SPLADE++ | 0.6513 | 0.9037 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-7B-Instruct | SPLADE++ | 0.6095 | 0.8612 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-7B-Instruct | SPLADE++ | 0.6091 | 0.8665 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1 | BGE-base-en-v1.5 | 0.7281 | 0.8995 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1 | BGE-base-en-v1.5 | 0.7125 | 0.8877 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1 | BGE-base-en-v1.5 | 0.7272 | 0.8890 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1 | BM25 | 0.6873 | 0.8924 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1 | BM25 | 0.6528 | 0.8777 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1 | BM25 | 0.6904 | 0.8861 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1 | SPLADE++ | 0.7000 | 0.9142 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1 | SPLADE++ | 0.6877 | 0.9153 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1 | SPLADE++ | 0.6932 | 0.9068 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6710 | 0.8530 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1-nano | BGE-base-en-v1.5 | 0.7157 | 0.8601 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1-nano | BGE-base-en-v1.5 | 0.7202 | 0.8701 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1-nano | BM25 | 0.6254 | 0.8621 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1-nano | BM25 | 0.6643 | 0.8527 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1-nano | BM25 | 0.6779 | 0.8862 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1-nano | SPLADE++ | 0.6544 | 0.8954 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1-nano | SPLADE++ | 0.6318 | 0.8839 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1-nano | SPLADE++ | 0.6877 | 0.8916 | |
| No run config available. | |||||
| query2e | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.7069 | 0.8760 | |
| No run config available. | |||||
| query2e | Qwen2.5-72B-Instruct | BM25 | 0.5845 | 0.8501 | |
| No run config available. | |||||
| query2e | Qwen2.5-72B-Instruct | SPLADE++ | 0.6686 | 0.9104 | |
| No run config available. | |||||
| query2e | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6646 | 0.8422 | |
| No run config available. | |||||
| query2e | Qwen2.5-7B-Instruct | BM25 | 0.5721 | 0.8431 | |
| No run config available. | |||||
| query2e | Qwen2.5-7B-Instruct | SPLADE++ | 0.5474 | 0.8734 | |
| No run config available. | |||||
| query2e | gpt-4.1 | BGE-base-en-v1.5 | 0.6970 | 0.8701 | |
| No run config available. | |||||
| query2e | gpt-4.1 | BM25 | 0.5935 | 0.8698 | |
| No run config available. | |||||
| query2e | gpt-4.1 | SPLADE++ | 0.6812 | 0.9302 | |
| No run config available. | |||||
| query2e | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6802 | 0.8662 | |
| No run config available. | |||||
| query2e | gpt-4.1-nano | BM25 | 0.5891 | 0.8474 | |
| No run config available. | |||||
| query2e | gpt-4.1-nano | SPLADE++ | 0.6320 | 0.9104 | |
| No run config available. | |||||