← Datasets
DL 2020
msmarco-v1-passage.trecdl2020 All results produced by QueryGym · fully reproducible! 120 (method × LLM × retriever) configurations evaluated on this dataset.
Click any row or the + button to expand. The three steps
(reformulate → retrieve → evaluate) for that run appear inline.
Retriever
Model
Method
| Method | LLM | Retriever | nDCG@10 | R@1k | |
|---|---|---|---|---|---|
| csqe | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6687 | 0.8722 | |
| No run config available. | |||||
| csqe | Qwen2.5-72B-Instruct | BM25 | 0.5606 | 0.8603 | |
| No run config available. | |||||
| csqe | Qwen2.5-72B-Instruct | SPLADE++ | 0.5736 | 0.9052 | |
| No run config available. | |||||
| csqe | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6885 | 0.8850 | |
| No run config available. | |||||
| csqe | Qwen2.5-7B-Instruct | BM25 | 0.6083 | 0.8596 | |
| No run config available. | |||||
| csqe | Qwen2.5-7B-Instruct | SPLADE++ | 0.6164 | 0.9039 | |
| No run config available. | |||||
| csqe | gpt-4.1 | BGE-base-en-v1.5 | 0.7139 | 0.8968 | |
| No run config available. | |||||
| csqe | gpt-4.1 | BM25 | 0.6548 | 0.8871 | |
| No run config available. | |||||
| csqe | gpt-4.1 | SPLADE++ | 0.6796 | 0.9397 | |
| No run config available. | |||||
| csqe | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6873 | 0.8535 | |
| No run config available. | |||||
| csqe | gpt-4.1-nano | BM25 | 0.5142 | 0.8586 | |
| No run config available. | |||||
| csqe | gpt-4.1-nano | SPLADE++ | 0.5883 | 0.9119 | |
| No run config available. | |||||
| genqr | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6680 | 0.8652 | |
| No run config available. | |||||
| genqr | Qwen2.5-72B-Instruct | BM25 | 0.4238 | 0.7919 | |
| No run config available. | |||||
| genqr | Qwen2.5-72B-Instruct | SPLADE++ | 0.5751 | 0.8971 | |
| No run config available. | |||||
| genqr | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6335 | 0.8395 | |
| No run config available. | |||||
| genqr | Qwen2.5-7B-Instruct | BM25 | 0.3857 | 0.7740 | |
| No run config available. | |||||
| genqr | Qwen2.5-7B-Instruct | SPLADE++ | 0.6115 | 0.8989 | |
| No run config available. | |||||
| genqr | gpt-4.1 | BGE-base-en-v1.5 | 0.6903 | 0.8516 | |
| No run config available. | |||||
| genqr | gpt-4.1 | BM25 | 0.5368 | 0.8402 | |
| No run config available. | |||||
| genqr | gpt-4.1 | SPLADE++ | 0.6260 | 0.9143 | |
| No run config available. | |||||
| genqr | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6568 | 0.8485 | |
| No run config available. | |||||
| genqr | gpt-4.1-nano | BM25 | 0.4302 | 0.7701 | |
| No run config available. | |||||
| genqr | gpt-4.1-nano | SPLADE++ | 0.6011 | 0.9074 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6774 | 0.8585 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-72B-Instruct | BM25 | 0.4248 | 0.7820 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-72B-Instruct | SPLADE++ | 0.5447 | 0.8886 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6700 | 0.8582 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-7B-Instruct | BM25 | 0.4896 | 0.8164 | |
| No run config available. | |||||
| genqr_ensemble | Qwen2.5-7B-Instruct | SPLADE++ | 0.6307 | 0.9020 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1 | BGE-base-en-v1.5 | 0.6826 | 0.8699 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1 | BM25 | 0.5528 | 0.8613 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1 | SPLADE++ | 0.5857 | 0.9141 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6645 | 0.8620 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1-nano | BM25 | 0.4718 | 0.8158 | |
| No run config available. | |||||
| genqr_ensemble | gpt-4.1-nano | SPLADE++ | 0.6044 | 0.9194 | |
| No run config available. | |||||
| lamer | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.7276 | 0.9045 | |
| No run config available. | |||||
| lamer | Qwen2.5-72B-Instruct | BM25 | 0.6711 | 0.8920 | |
| No run config available. | |||||
| lamer | Qwen2.5-72B-Instruct | SPLADE++ | 0.6483 | 0.9195 | |
| No run config available. | |||||
| lamer | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6825 | 0.8940 | |
| No run config available. | |||||
| lamer | Qwen2.5-7B-Instruct | BM25 | 0.6322 | 0.8933 | |
| No run config available. | |||||
| lamer | Qwen2.5-7B-Instruct | SPLADE++ | 0.6076 | 0.9213 | |
| No run config available. | |||||
| lamer | gpt-4.1 | BGE-base-en-v1.5 | 0.7148 | 0.9026 | |
| No run config available. | |||||
| lamer | gpt-4.1 | BM25 | 0.6530 | 0.9002 | |
| No run config available. | |||||
| lamer | gpt-4.1 | SPLADE++ | 0.6390 | 0.9378 | |
| No run config available. | |||||
| lamer | gpt-4.1-nano | BGE-base-en-v1.5 | 0.7135 | 0.8846 | |
| No run config available. | |||||
| lamer | gpt-4.1-nano | BM25 | 0.6560 | 0.8865 | |
| No run config available. | |||||
| lamer | gpt-4.1-nano | SPLADE++ | 0.6254 | 0.9244 | |
| No run config available. | |||||
| mugi | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.7122 | 0.8894 | |
| No run config available. | |||||
| mugi | Qwen2.5-72B-Instruct | BM25 | 0.6268 | 0.9015 | |
| No run config available. | |||||
| mugi | Qwen2.5-72B-Instruct | SPLADE++ | 0.6419 | 0.9165 | |
| No run config available. | |||||
| mugi | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6888 | 0.8823 | |
| No run config available. | |||||
| mugi | Qwen2.5-7B-Instruct | BM25 | 0.6069 | 0.8882 | |
| No run config available. | |||||
| mugi | Qwen2.5-7B-Instruct | SPLADE++ | 0.5527 | 0.9104 | |
| No run config available. | |||||
| mugi | gpt-4.1 | BGE-base-en-v1.5 | 0.7203 | 0.8950 | |
| No run config available. | |||||
| mugi | gpt-4.1 | BM25 | 0.6578 | 0.8996 | |
| No run config available. | |||||
| mugi | gpt-4.1 | SPLADE++ | 0.6508 | 0.9199 | |
| No run config available. | |||||
| mugi | gpt-4.1-nano | BGE-base-en-v1.5 | 0.7187 | 0.8911 | |
| No run config available. | |||||
| mugi | gpt-4.1-nano | BM25 | 0.6473 | 0.9017 | |
| No run config available. | |||||
| mugi | gpt-4.1-nano | SPLADE++ | 0.6432 | 0.9203 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6916 | 0.8785 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-72B-Instruct | BM25 | 0.6152 | 0.8727 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-72B-Instruct | SPLADE++ | 0.6983 | 0.9284 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6541 | 0.8606 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-7B-Instruct | BM25 | 0.5654 | 0.8454 | |
| No run config available. | |||||
| qa_expand | Qwen2.5-7B-Instruct | SPLADE++ | 0.6156 | 0.8945 | |
| No run config available. | |||||
| qa_expand | gpt-4.1 | BGE-base-en-v1.5 | 0.7074 | 0.8754 | |
| No run config available. | |||||
| qa_expand | gpt-4.1 | BM25 | 0.6418 | 0.8787 | |
| No run config available. | |||||
| qa_expand | gpt-4.1 | SPLADE++ | 0.6739 | 0.9260 | |
| No run config available. | |||||
| qa_expand | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6612 | 0.8397 | |
| No run config available. | |||||
| qa_expand | gpt-4.1-nano | BM25 | 0.6026 | 0.8649 | |
| No run config available. | |||||
| qa_expand | gpt-4.1-nano | SPLADE++ | 0.6628 | 0.9279 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6792 | 0.8913 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6982 | 0.8945 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6411 | 0.8485 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-72B-Instruct | BM25 | 0.6207 | 0.8801 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-72B-Instruct | BM25 | 0.5651 | 0.8549 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-72B-Instruct | BM25 | 0.6264 | 0.8907 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-72B-Instruct | SPLADE++ | 0.6499 | 0.9234 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-72B-Instruct | SPLADE++ | 0.6144 | 0.9161 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-72B-Instruct | SPLADE++ | 0.6099 | 0.8857 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6302 | 0.8573 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6402 | 0.8578 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6617 | 0.8566 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-7B-Instruct | BM25 | 0.5802 | 0.8684 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-7B-Instruct | BM25 | 0.5428 | 0.8691 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-7B-Instruct | BM25 | 0.5685 | 0.8647 | |
| No run config available. | |||||
| Q2D (COT) | Qwen2.5-7B-Instruct | SPLADE++ | 0.5948 | 0.9019 | |
| No run config available. | |||||
| Q2D (FS) | Qwen2.5-7B-Instruct | SPLADE++ | 0.5492 | 0.9062 | |
| No run config available. | |||||
| Q2D (ZS) | Qwen2.5-7B-Instruct | SPLADE++ | 0.6096 | 0.9045 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1 | BGE-base-en-v1.5 | 0.7393 | 0.9056 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1 | BGE-base-en-v1.5 | 0.6720 | 0.8756 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1 | BGE-base-en-v1.5 | 0.7141 | 0.8948 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1 | BM25 | 0.6625 | 0.8942 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1 | BM25 | 0.6239 | 0.8781 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1 | BM25 | 0.6746 | 0.8984 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1 | SPLADE++ | 0.6875 | 0.9372 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1 | SPLADE++ | 0.6534 | 0.9089 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1 | SPLADE++ | 0.6749 | 0.9389 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6744 | 0.8709 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6988 | 0.8742 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1-nano | BGE-base-en-v1.5 | 0.7029 | 0.8743 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1-nano | BM25 | 0.6092 | 0.8846 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1-nano | BM25 | 0.6227 | 0.8848 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1-nano | BM25 | 0.6268 | 0.8869 | |
| No run config available. | |||||
| Q2D (COT) | gpt-4.1-nano | SPLADE++ | 0.6271 | 0.9167 | |
| No run config available. | |||||
| Q2D (FS) | gpt-4.1-nano | SPLADE++ | 0.6471 | 0.9232 | |
| No run config available. | |||||
| Q2D (ZS) | gpt-4.1-nano | SPLADE++ | 0.6242 | 0.9219 | |
| No run config available. | |||||
| query2e | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.6606 | 0.8528 | |
| No run config available. | |||||
| query2e | Qwen2.5-72B-Instruct | BM25 | 0.5546 | 0.8609 | |
| No run config available. | |||||
| query2e | Qwen2.5-72B-Instruct | SPLADE++ | 0.6353 | 0.9286 | |
| No run config available. | |||||
| query2e | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.6425 | 0.8443 | |
| No run config available. | |||||
| query2e | Qwen2.5-7B-Instruct | BM25 | 0.5404 | 0.8548 | |
| No run config available. | |||||
| query2e | Qwen2.5-7B-Instruct | SPLADE++ | 0.5312 | 0.9001 | |
| No run config available. | |||||
| query2e | gpt-4.1 | BGE-base-en-v1.5 | 0.6422 | 0.8184 | |
| No run config available. | |||||
| query2e | gpt-4.1 | BM25 | 0.5759 | 0.8594 | |
| No run config available. | |||||
| query2e | gpt-4.1 | SPLADE++ | 0.6522 | 0.9252 | |
| No run config available. | |||||
| query2e | gpt-4.1-nano | BGE-base-en-v1.5 | 0.6706 | 0.8514 | |
| No run config available. | |||||
| query2e | gpt-4.1-nano | BM25 | 0.5475 | 0.8392 | |
| No run config available. | |||||
| query2e | gpt-4.1-nano | SPLADE++ | 0.6605 | 0.9142 | |
| No run config available. | |||||