RAPTOR: RECURSIVE ABSTRACTIVE PROCESSING FOR TREE-ORGANIZED RETRIEVAL
RAPTOR is a novel retrieval-augmented language model approach that constructs a hierarchical tree by recursively embedding, clustering, and summarizing text chunks. This lets the model integrate information across lengthy documents at different levels of abstraction. Controlled experiments show that RAPTOR significantly outperforms traditional retrieval-augmented LMs on complex question-answering tasks, achieving state-of-the-art results on benchmarks such as QuALITY, QASPER, and NarrativeQA.
Article Points:
1. RAPTOR builds a hierarchical tree via recursive embedding, clustering, and summarization.
2. Integrates information across documents at varying levels of abstraction for holistic understanding.
3. Outperforms traditional retrieval-augmented LMs on several complex QA tasks.
4. Achieves new state-of-the-art results on QuALITY, QASPER, and NarrativeQA datasets.
5. The 'collapsed tree' querying method consistently shows superior performance.
6. Upper-level nodes in the tree are crucial for thematic and multi-hop queries.
Problem Addressed

Limited context in RALMs

Lack of holistic document understanding

Expensive long contexts

Tree Construction Process

Segment text into chunks

Embed chunks using SBERT

Reduce dimensionality with UMAP, then softly cluster with Gaussian Mixture Models (GMMs)

Summarize clusters using LLM

Recursively build tree bottom-up
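The construction steps above can be sketched in Python. This is a structural sketch only: the embedding, clustering, and summarization functions below are toy stand-ins (the paper uses SBERT sentence embeddings, UMAP plus GMM soft clustering, and an LLM summarizer), and the threshold and layer-cap parameters are illustrative assumptions.

```python
import math
from dataclasses import dataclass, field

@dataclass
class Node:
    text: str
    embedding: list                      # unit vector for this node's text
    children: list = field(default_factory=list)

def embed(text):
    # Toy bag-of-letters embedding, L2-normalized; RAPTOR uses SBERT.
    vec = [0.0] * 26
    for ch in text.lower():
        if ch.isalpha():
            vec[ord(ch) - 97] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def cosine(a, b):
    # Embeddings are normalized, so the dot product is cosine similarity.
    return sum(x * y for x, y in zip(a, b))

def cluster(nodes, threshold=0.8):
    # Greedy similarity grouping as a stand-in for UMAP + GMM soft clustering.
    clusters = []
    for node in nodes:
        for c in clusters:
            if cosine(node.embedding, c[0].embedding) >= threshold:
                c.append(node)
                break
        else:
            clusters.append([node])
    return clusters

def summarize(texts):
    # Placeholder: RAPTOR prompts an LLM to abstractively summarize each cluster.
    return " ".join(texts)[:200]

def build_tree(chunks, max_layers=3):
    # Bottom-up recursion: leaves are raw chunks; each new layer holds
    # cluster summaries whose children are the clustered nodes below.
    layer = [Node(text=c, embedding=embed(c)) for c in chunks]
    layers = [layer]
    while len(layer) > 1 and len(layers) < max_layers:
        next_layer = []
        for group in cluster(layer):
            summary = summarize([n.text for n in group])
            next_layer.append(Node(text=summary, embedding=embed(summary), children=group))
        if len(next_layer) == len(layer):  # nothing merged; recursion has converged
            break
        layer = next_layer
        layers.append(layer)
    return layers  # layers[0] = leaf chunks, layers[-1] = most abstract summaries
```

Calling `build_tree(["chunk one", "chunk two", ...])` returns the full stack of layers, so a retriever can later draw from any level of abstraction.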

Querying Mechanisms

Collapsed Tree: evaluates all nodes across all layers simultaneously

Tree Traversal: selects nodes layer by layer, from the root down

Collapsed tree consistently performs better

Key Contributions

Multi-level abstraction for context

Semantic grouping, not just adjacency

Improved retrieval effectiveness

Linear scalability in cost and time

Achieved Performance

Outperforms traditional retrieval methods

New state-of-the-art on QA datasets

Significant accuracy gains with GPT-4