120 (method × LLM × retriever) configurations were evaluated on this dataset.
Each run consists of three steps: reformulate → retrieve → evaluate.
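The count of 120 is consistent with a full grid over the labels that appear in the results table: 10 method labels (counting the three Q2D variants separately), 4 LLMs, and 3 retrievers. The sketch below enumerates that grid, assuming the runs form a complete Cartesian product. The variable and function names are illustrative and are not taken from the benchmark's own code.

```python
from itertools import product

# Labels copied from the results table. Splitting them into three
# independent axes is an assumption made for this illustration.
METHODS = [
    "csqe", "genqr", "genqr_ensemble", "lamer", "mugi", "qa_expand",
    "Q2D (FS)", "Q2D (ZS)", "Q2D (COT)", "query2e",
]
LLMS = ["Qwen2.5-72B-Instruct", "Qwen2.5-7B-Instruct", "gpt-4.1", "gpt-4.1-nano"]
RETRIEVERS = ["BGE-base-en-v1.5", "BM25", "SPLADE++"]


def config_grid():
    """Return every (method, LLM, retriever) triple in the grid."""
    return list(product(METHODS, LLMS, RETRIEVERS))


configs = config_grid()
print(len(configs))  # 10 * 4 * 3
```

Each triple corresponds to one row of the table, for example `("csqe", "gpt-4.1", "BM25")`.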
| Method | LLM | Retriever | nDCG@10 | R@1k | |
|---|---|---|---|---|---|
| csqe | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.3757 | 0.8531 | |
| csqe | Qwen2.5-72B-Instruct | BM25 | 0.2848 | 0.6998 | |
| csqe | Qwen2.5-72B-Instruct | SPLADE++ | 0.2857 | 0.8246 | |
| csqe | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3671 | 0.8348 | |
| csqe | Qwen2.5-7B-Instruct | BM25 | 0.3322 | 0.7913 | |
| csqe | Qwen2.5-7B-Instruct | SPLADE++ | 0.3025 | 0.8057 | |
| csqe | gpt-4.1 | BGE-base-en-v1.5 | 0.4144 | 0.8640 | |
| csqe | gpt-4.1 | BM25 | 0.3658 | 0.7873 | |
| csqe | gpt-4.1 | SPLADE++ | 0.3690 | 0.8341 | |
| csqe | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3516 | 0.8371 | |
| csqe | gpt-4.1-nano | BM25 | 0.2436 | 0.7327 | |
| csqe | gpt-4.1-nano | SPLADE++ | 0.2789 | 0.7872 | |
| genqr | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.3471 | 0.8144 | |
| genqr | Qwen2.5-72B-Instruct | BM25 | 0.2091 | 0.6822 | |
| genqr | Qwen2.5-72B-Instruct | SPLADE++ | 0.2916 | 0.7861 | |
| genqr | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3375 | 0.8235 | |
| genqr | Qwen2.5-7B-Instruct | BM25 | 0.2006 | 0.6458 | |
| genqr | Qwen2.5-7B-Instruct | SPLADE++ | 0.3386 | 0.8000 | |
| genqr | gpt-4.1 | BGE-base-en-v1.5 | 0.3870 | 0.8402 | |
| genqr | gpt-4.1 | BM25 | 0.2921 | 0.7434 | |
| genqr | gpt-4.1 | SPLADE++ | 0.3800 | 0.8488 | |
| genqr | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3586 | 0.8389 | |
| genqr | gpt-4.1-nano | BM25 | 0.1743 | 0.6575 | |
| genqr | gpt-4.1-nano | SPLADE++ | 0.3043 | 0.8408 | |
| genqr_ensemble | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.3543 | 0.8269 | |
| genqr_ensemble | Qwen2.5-72B-Instruct | BM25 | 0.2463 | 0.6975 | |
| genqr_ensemble | Qwen2.5-72B-Instruct | SPLADE++ | 0.2849 | 0.7823 | |
| genqr_ensemble | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3713 | 0.8356 | |
| genqr_ensemble | Qwen2.5-7B-Instruct | BM25 | 0.2429 | 0.7210 | |
| genqr_ensemble | Qwen2.5-7B-Instruct | SPLADE++ | 0.3292 | 0.8005 | |
| genqr_ensemble | gpt-4.1 | BGE-base-en-v1.5 | 0.3572 | 0.8633 | |
| genqr_ensemble | gpt-4.1 | BM25 | 0.2697 | 0.7775 | |
| genqr_ensemble | gpt-4.1 | SPLADE++ | 0.3047 | 0.8207 | |
| genqr_ensemble | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3579 | 0.8282 | |
| genqr_ensemble | gpt-4.1-nano | BM25 | 0.2154 | 0.6990 | |
| genqr_ensemble | gpt-4.1-nano | SPLADE++ | 0.3233 | 0.8400 | |
| lamer | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.4055 | 0.8453 | |
| lamer | Qwen2.5-72B-Instruct | BM25 | 0.3635 | 0.7820 | |
| lamer | Qwen2.5-72B-Instruct | SPLADE++ | 0.3648 | 0.8156 | |
| lamer | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3788 | 0.8315 | |
| lamer | Qwen2.5-7B-Instruct | BM25 | 0.3570 | 0.7633 | |
| lamer | Qwen2.5-7B-Instruct | SPLADE++ | 0.3280 | 0.7917 | |
| lamer | gpt-4.1 | BGE-base-en-v1.5 | 0.4120 | 0.8557 | |
| lamer | gpt-4.1 | BM25 | 0.3555 | 0.8065 | |
| lamer | gpt-4.1 | SPLADE++ | 0.3673 | 0.8246 | |
| lamer | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3759 | 0.8352 | |
| lamer | gpt-4.1-nano | BM25 | 0.3398 | 0.7697 | |
| lamer | gpt-4.1-nano | SPLADE++ | 0.3459 | 0.7969 | |
| mugi | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.3948 | 0.8548 | |
| mugi | Qwen2.5-72B-Instruct | BM25 | 0.3609 | 0.8122 | |
| mugi | Qwen2.5-72B-Instruct | SPLADE++ | 0.3260 | 0.8098 | |
| mugi | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3619 | 0.8495 | |
| mugi | Qwen2.5-7B-Instruct | BM25 | 0.3173 | 0.7707 | |
| mugi | Qwen2.5-7B-Instruct | SPLADE++ | 0.2642 | 0.8028 | |
| mugi | gpt-4.1 | BGE-base-en-v1.5 | 0.4038 | 0.8415 | |
| mugi | gpt-4.1 | BM25 | 0.3651 | 0.8216 | |
| mugi | gpt-4.1 | SPLADE++ | 0.3625 | 0.8111 | |
| mugi | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3903 | 0.8354 | |
| mugi | gpt-4.1-nano | BM25 | 0.3423 | 0.7924 | |
| mugi | gpt-4.1-nano | SPLADE++ | 0.3254 | 0.8105 | |
| qa_expand | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.3485 | 0.8498 | |
| qa_expand | Qwen2.5-72B-Instruct | BM25 | 0.3215 | 0.7876 | |
| qa_expand | Qwen2.5-72B-Instruct | SPLADE++ | 0.3347 | 0.8285 | |
| qa_expand | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3418 | 0.8267 | |
| qa_expand | Qwen2.5-7B-Instruct | BM25 | 0.2892 | 0.7746 | |
| qa_expand | Qwen2.5-7B-Instruct | SPLADE++ | 0.3143 | 0.8305 | |
| qa_expand | gpt-4.1 | BGE-base-en-v1.5 | 0.3739 | 0.8543 | |
| qa_expand | gpt-4.1 | BM25 | 0.3018 | 0.7570 | |
| qa_expand | gpt-4.1 | SPLADE++ | 0.3552 | 0.8034 | |
| qa_expand | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3688 | 0.8113 | |
| qa_expand | gpt-4.1-nano | BM25 | 0.3469 | 0.7480 | |
| qa_expand | gpt-4.1-nano | SPLADE++ | 0.3702 | 0.8506 | |
| Q2D (FS) | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.3845 | 0.8568 | |
| Q2D (ZS) | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.3954 | 0.8508 | |
| Q2D (COT) | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.3498 | 0.8236 | |
| Q2D (ZS) | Qwen2.5-72B-Instruct | BM25 | 0.3506 | 0.8002 | |
| Q2D (COT) | Qwen2.5-72B-Instruct | BM25 | 0.3075 | 0.7526 | |
| Q2D (FS) | Qwen2.5-72B-Instruct | BM25 | 0.3467 | 0.8020 | |
| Q2D (FS) | Qwen2.5-72B-Instruct | SPLADE++ | 0.3333 | 0.8206 | |
| Q2D (ZS) | Qwen2.5-72B-Instruct | SPLADE++ | 0.3200 | 0.8248 | |
| Q2D (COT) | Qwen2.5-72B-Instruct | SPLADE++ | 0.3016 | 0.8393 | |
| Q2D (COT) | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3391 | 0.8300 | |
| Q2D (FS) | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3628 | 0.8348 | |
| Q2D (ZS) | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3675 | 0.8255 | |
| Q2D (COT) | Qwen2.5-7B-Instruct | BM25 | 0.3044 | 0.7815 | |
| Q2D (FS) | Qwen2.5-7B-Instruct | BM25 | 0.3141 | 0.7724 | |
| Q2D (ZS) | Qwen2.5-7B-Instruct | BM25 | 0.3352 | 0.7763 | |
| Q2D (COT) | Qwen2.5-7B-Instruct | SPLADE++ | 0.2731 | 0.8239 | |
| Q2D (FS) | Qwen2.5-7B-Instruct | SPLADE++ | 0.2672 | 0.8116 | |
| Q2D (ZS) | Qwen2.5-7B-Instruct | SPLADE++ | 0.2904 | 0.8006 | |
| Q2D (ZS) | gpt-4.1 | BGE-base-en-v1.5 | 0.3786 | 0.8591 | |
| Q2D (COT) | gpt-4.1 | BGE-base-en-v1.5 | 0.3755 | 0.8505 | |
| Q2D (FS) | gpt-4.1 | BGE-base-en-v1.5 | 0.4074 | 0.8726 | |
| Q2D (ZS) | gpt-4.1 | BM25 | 0.3502 | 0.7811 | |
| Q2D (COT) | gpt-4.1 | BM25 | 0.3291 | 0.7737 | |
| Q2D (FS) | gpt-4.1 | BM25 | 0.3562 | 0.8042 | |
| Q2D (ZS) | gpt-4.1 | SPLADE++ | 0.3377 | 0.8389 | |
| Q2D (COT) | gpt-4.1 | SPLADE++ | 0.3308 | 0.8456 | |
| Q2D (FS) | gpt-4.1 | SPLADE++ | 0.3771 | 0.8396 | |
| Q2D (COT) | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3722 | 0.8367 | |
| Q2D (FS) | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3480 | 0.8374 | |
| Q2D (ZS) | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3683 | 0.8395 | |
| Q2D (COT) | gpt-4.1-nano | BM25 | 0.3320 | 0.7655 | |
| Q2D (FS) | gpt-4.1-nano | BM25 | 0.3358 | 0.7627 | |
| Q2D (ZS) | gpt-4.1-nano | BM25 | 0.3368 | 0.7832 | |
| Q2D (COT) | gpt-4.1-nano | SPLADE++ | 0.3426 | 0.8390 | |
| Q2D (FS) | gpt-4.1-nano | SPLADE++ | 0.3533 | 0.8005 | |
| Q2D (ZS) | gpt-4.1-nano | SPLADE++ | 0.3479 | 0.8092 | |
| query2e | Qwen2.5-72B-Instruct | BGE-base-en-v1.5 | 0.3744 | 0.8503 | |
| query2e | Qwen2.5-72B-Instruct | BM25 | 0.3148 | 0.7605 | |
| query2e | Qwen2.5-72B-Instruct | SPLADE++ | 0.3442 | 0.8328 | |
| query2e | Qwen2.5-7B-Instruct | BGE-base-en-v1.5 | 0.3521 | 0.8171 | |
| query2e | Qwen2.5-7B-Instruct | BM25 | 0.3101 | 0.7432 | |
| query2e | Qwen2.5-7B-Instruct | SPLADE++ | 0.3056 | 0.7882 | |
| query2e | gpt-4.1 | BGE-base-en-v1.5 | 0.3779 | 0.8306 | |
| query2e | gpt-4.1 | BM25 | 0.3446 | 0.7639 | |
| query2e | gpt-4.1 | SPLADE++ | 0.3518 | 0.8380 | |
| query2e | gpt-4.1-nano | BGE-base-en-v1.5 | 0.3609 | 0.8321 | |
| query2e | gpt-4.1-nano | BM25 | 0.3101 | 0.7665 | |
| query2e | gpt-4.1-nano | SPLADE++ | 0.3297 | 0.8143 | |
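
The two metric columns, nDCG@10 and R@1k, can be illustrated for a single query with the sketch below. It assumes one common formulation (linear gains with a log2 rank discount for nDCG, and binary relevance for recall); the source does not state which variant or evaluation tool produced the reported numbers, and the function names and the toy `qrels` dictionary are hypothetical.

```python
import math


def ndcg_at_k(ranked_ids, qrels, k=10):
    """nDCG@k for one query, using linear gains and a log2 discount.

    ranked_ids: document ids in retrieved order.
    qrels: dict mapping document id -> graded relevance (0 = not relevant).
    """
    dcg = sum(
        qrels.get(doc_id, 0) / math.log2(rank + 2)
        for rank, doc_id in enumerate(ranked_ids[:k])
    )
    ideal_gains = sorted(qrels.values(), reverse=True)[:k]
    idcg = sum(g / math.log2(rank + 2) for rank, g in enumerate(ideal_gains))
    return dcg / idcg if idcg > 0 else 0.0


def recall_at_k(ranked_ids, qrels, k=1000):
    """Fraction of relevant documents (relevance > 0) found in the top k."""
    relevant = {doc_id for doc_id, gain in qrels.items() if gain > 0}
    if not relevant:
        return 0.0
    return len(relevant & set(ranked_ids[:k])) / len(relevant)


# Toy example: two relevant documents, one retrieved at rank 1 and one at rank 3.
qrels = {"d1": 1, "d2": 1}
ranked = ["d1", "d3", "d2"]
print(ndcg_at_k(ranked, qrels), recall_at_k(ranked, qrels))
```

Per-query scores like these are typically averaged over all queries in the dataset to give one number per configuration.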