Enhancing Retrieval-Augmented Generation: A Study of Best Practices
This paper investigates components and configurations of Retrieval-Augmented Generation (RAG) systems to optimize their performance. It introduces novel RAG designs, including query expansion, new retrieval strategies, and Contrastive In-Context Learning RAG. By analyzing factors such as LLM size, prompt design, and knowledge base characteristics, the study provides actionable insights for building adaptable, high-performing RAG frameworks.
Article Points:

1. Contrastive In-Context Learning RAG significantly outperforms other RAG variants.
2. Focus Mode RAG is highly effective, emphasizing concise, high-precision retrieved documents.
3. Knowledge base quality and relevance are more critical than sheer size for RAG performance.
4. Prompt formulation remains crucial for optimizing RAG system performance.
5. Larger LLM size generally boosts RAG performance, especially on general-knowledge tasks.
6. Query expansion, multilingual KBs, document size, and retrieval stride showed minimal gains.
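To make point 1 concrete, here is a minimal sketch of how a contrastive in-context-learning prompt for RAG might be assembled: each demonstration pairs a correct answer with an incorrect one, so the model sees what to avoid as well as what to imitate. The function name, prompt template, and demonstration format are illustrative assumptions, not the paper's exact prompt.

```python
# Hypothetical sketch of a contrastive ICL prompt builder for RAG.
# Each demo shows a question with both a correct and an incorrect
# answer; the live question then follows with retrieved context.

def build_contrastive_prompt(question, context, demos):
    """demos: list of dicts with 'question', 'good', and 'bad' keys."""
    parts = []
    for d in demos:
        parts.append(f"Question: {d['question']}")
        parts.append(f"Correct answer: {d['good']}")
        parts.append(f"Incorrect answer: {d['bad']}")
        parts.append("")
    parts.append(f"Context: {context}")
    parts.append(f"Question: {question}")
    parts.append("Correct answer:")  # cue the model to continue here
    return "\n".join(parts)

demos = [{"question": "What is the capital of France?",
          "good": "Paris", "bad": "Lyon"}]
prompt = build_contrastive_prompt("What is the capital of Italy?",
                                  "Rome is the capital of Italy.", demos)
```

The key design choice is the explicit "Incorrect answer" line: a plain few-shot prompt shows only positives, whereas the contrastive variant surfaces a plausible wrong answer for each demonstration.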
Novel Contributions

- Query Expansion
- Contrastive In-Context Learning RAG
- Multilingual Knowledge Bases
- Focus Mode

Key Factors Investigated

- LLM Size
- Prompt Design
- Document Chunk Size
- Knowledge Base Size
- Retrieval Stride

RAG Architecture

- Query Expansion Module
- Retrieval Module
- Text Generation Module
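The three-module architecture above can be sketched as a toy pipeline. Every component here is a stand-in assumption: query expansion just appends paraphrase terms (the paper uses an LLM for this), retrieval ranks by keyword overlap rather than dense embeddings, and "generation" merely formats the prompt a real LLM would receive.

```python
# Illustrative three-module RAG pipeline: query expansion ->
# retrieval -> text generation. All components are toy stand-ins.
import re

def tokens(text):
    """Lowercased word set, punctuation stripped."""
    return set(re.findall(r"\w+", text.lower()))

def expand_query(query):
    # Query expansion module: a real system would have an LLM
    # rewrite the query; here we append hypothetical related terms.
    return query + " related terms definition explanation"

def retrieve(query, knowledge_base, k=2):
    # Retrieval module: rank documents by word overlap with the query
    # (a dense retriever would rank by embedding similarity instead).
    q = tokens(query)
    scored = sorted(knowledge_base,
                    key=lambda doc: len(q & tokens(doc)),
                    reverse=True)
    return scored[:k]

def generate(query, docs):
    # Text generation module: format the context-augmented prompt
    # that would be sent to the LLM.
    context = "\n".join(docs)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

kb = ["Paris is the capital of France.",
      "The Eiffel Tower is in Paris.",
      "Berlin is the capital of Germany."]
query = "capital of France"
prompt = generate(query, retrieve(expand_query(query), kb))
```

Note how "Focus Mode" would slot in here: rather than passing `k` whole documents to `generate`, it would pass only the most relevant sentences, trading context breadth for precision.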

Evaluation

Datasets

- TruthfulQA
- MMLU

Metrics

- ROUGE
- Embedding Cosine Similarity (ECS)
- MAUVE
- FActScore
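Of the metrics listed, Embedding Cosine Similarity (ECS) is simple enough to sketch: the generated answer and the reference are each embedded as vectors, and the cosine of the angle between them scores semantic closeness. The vectors below are placeholders; a real evaluation would obtain them from a sentence-embedding model.

```python
# Sketch of the Embedding Cosine Similarity (ECS) idea: scores near
# 1.0 mean the generated answer is semantically close to the reference.
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

generated = [0.20, 0.70, 0.10]  # placeholder embedding of generated answer
reference = [0.25, 0.65, 0.10]  # placeholder embedding of reference answer
score = cosine_similarity(generated, reference)  # close to 1.0 here
```

Unlike ROUGE, which rewards exact n-gram overlap, ECS can credit a paraphrase that shares no surface words with the reference.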

Key Findings

- Contrastive ICL excels
- Focus Mode effective
- KB quality over size
- Prompt design crucial
- Other factors less impactful

Limitations

- No combined approaches
- Limited LLM size study
- Limited multilingual scope