Reciprocal Rank Fusion

Reciprocal Rank Fusion (RRF) is a technique used in Retrieval-Augmented Generation (RAG) systems to combine multiple ranked lists of documents into a single, improved ranking. It's particularly useful for RAG-Fusion, which enhances RAG by generating multiple queries and then using RRF to re-rank the results.

Let me explain RRF like a beginner, then compare it with filter-based fan-out retrieval, and finally show how you can apply it to your multi-query PDF RAG system.

📚 What is Reciprocal Rank Fusion (RRF)?

It works the same as Parallel Query (Fan Out) Retrieval i.e. LLM would generate 3 to 5 different queries based on the original user query. Now imagine you searched 3 slightly different versions of your question in Qdrant and got relevant chunks:

Rewritten Query	Top Results (Ranked)
Query 1	📄A, 📄B, 📄C
Query 2	📄C, 📄D, 📄E
Query 3	📄F, 📄A, 📄D

Now, instead of:

Just combining all chunks (👎 duplicates possible)
Or filtering for unique ones (👎 loses score/rank info)

You fuse the results intelligently by their rank. This will give the Rank of each chunk. Hence putting the chunks in order based on the similarity with the generated query.

🔁 RRF vs Parallel Query (Fan Out) Retrieval (Comparison)

Feature	Parallel Query	Reciprocal Rank Fusion (RRF)
Keeps all chunks.	✅ Yes	✅ Yes
Uses rank from each search.	❌ No	✅ Yes
Gives weight to overlap.	❌ All equal	✅ Overlap = more score
Handles chunk quality.	❌ Not really	✅ Top-ranked chunks score better
Good when...?	Results are noisy	When ranks matter + you want diversity

💡 Why RRF Is Great for You

In fan-out RAG, different queries return different "views" of the same concept.
Some chunks show up across multiple queries = likely to be super relevant
RRF promotes these, without needing to manually guess which is best.

Code to get Rankings of each response

before understanding this, you need to understand the working of

from collections import defaultdict

def reciprocal_rank_fusion(results_list: list[list[str]], k: int = 60) -> list[str]:
    """
    results_list = [
      [chunk_id1, chunk_id2, chunk_id3],  # from query1
      [chunk_id3, chunk_id4, chunk_id5],  # from query2
      ...
    ]
    Returns: ranked list of unique chunk IDs
    """
    scores = defaultdict(float)

    for result in results_list:
        for rank, chunk_id in enumerate(result):
            scores[chunk_id] += 1 / (k + rank + 1)  

    return sorted(scores, key=scores.get, reverse=True)

results_list: This is a list of ranked lists.
scores = defaultdict(float): This creates a dictionary where:

Keys = chunk IDs
Values = scores (starts at 0.0 by default)

enumerate(result) gives us the rank (position) and the chunk_id(built-in function in Python)
We then calculate the score for each chunk_id
Let’s say k = 60:
- If chunk1 is at rank 0 → score = 1 / (60 + 0 + 1) = 1/61
- If it's at rank 1 → score = 1/62
- So, higher ranked items (closer to top) get more score.

💡 Also, if the same chunk_id appears in multiple lists, their scores get added up — this is how RRF merges and rewards consensus.

📚 What is Reciprocal Rank Fusion (RRF)?

📚 What is Reciprocal Rank Fusion (RRF)?

🔁 RRF vs Parallel Query (Fan Out) Retrieval (Comparison)

💡 Why RRF Is Great for You

Code to get Rankings of each response

Subscribe to my newsletter

hardik sehgal

hardik sehgal