《Advanced RAG》-06-探索RAG技术 Query Rewriting

news2025/1/10 16:55:35

摘要

详细阐述了多种查询重写技术,这些技术用于在检索增强生成(RAG)中优化查询和文档之间的语义匹配。

首先,介绍了假设文档嵌入(HyDE)方法,它通过生成假设文档并将其与原始查询结合,以提高查询向量与实际文档的一致性。

接着,提出了重写-检索-读取(Rewrite-Retrieve-Read)框架,该框架通过重写查询来改善信息检索和回答生成的过程。

此外,还介绍了回退提示法(Step-Back Prompting)技术,它通过提出更抽象的问题来辅助大型语言模型进行更准确的推理。另一种技术是Query2doc,它通过使用大型语言模型生成的伪文档来扩展查询。

最后,ITER-RETGEN方法通过迭代检索和生成步骤来增强查询和回答的质量。网页还提供了相关技术的代码示例和实现细节,并讨论了这些方法的优势和局限性。

文章观点

  1. 查询重写对于提高RAG中的查询和文档语义一致性至关重要。
  2. HyDE通过生成假设文档来桥接原始查询与文档库之间的语义差距。
  3. 重写-检索-读取框架强调了查询重写在提升信息检索和回答生成流程中的作用。
  4. 回退提示法技术通过抽象化问题来帮助模型避免中间推理步骤中的错误。
  5. Query2doc通过生成伪文档来扩展查询,从而提高检索的精确度。
  6. ITER-RETGEN通过迭代的检索和生成过程来不断优化查询和回答的质量。
  7. 这些技术虽然提高了查询处理的效果,但也带来了性能权衡和实际应用中的挑战。
  8. 除了查询重写,还有其他预检索方法,如查询路由和分解查询,这些方法有望在未来被进一步探索和应用。

在检索增强生成(RAG)中,我们经常会遇到用户原始查询的问题,如措辞不当或缺乏语义信息。例如,"2020 年 NBA 总冠军是洛杉矶湖人队!告诉我什么是 langchain 框架?"这样的查询如果直接搜索,LLM 可能会给出错误或无法回答的回复。

因此,**必须将用户查询的语义空间与文档的语义空间相统一。**查询重写技术可以有效解决这一问题。它在 RAG 中的作用如图 1 所示:

图 1:RAG 中的查询重写(红色虚线框)

从位置的角度来看,查询重写是一种预检索方法。请注意,该图大致说明了查询重写在 RAG 中的位置。下面,将会提到到一些算法可以改进这一过程。

查询重写是调整查询和文档语义的关键技术。例如:

  1. Hypothetical Document Embeddings (HyDE) :通过假设文档来对齐查询和文档的语义空间。
  2. Rewrite-Retrieve-Read:提出了一个不同于传统的检索和阅读顺序的框架,专注于查询重写。
  3. Step-Back Prompting:允许LLM基于高级概念进行抽象推理和检索。
  4. Query2Doc:使用LLM的一些提示创建伪文档。然后,它将这些查询与原始查询合并,以构建一个新的查询。
  5. ITER-RETGEN:提出了一种将先前生成的结果与先前查询相结合的方法。随后检索相关文档并生成新结果。此过程重复多次以获得最终结果。

让我们深入了解一下这些方法的细节。

Hypothetical Document Embeddings (HyDE)

论文《Precise Zero-Shot Dense Retrieval without Relevance Labels》提出了一种基于假设文档嵌入(HyDE)的方法,其主要过程如图 2 所示。

图 2:HyDE 模型示意图。

该过程主要分为四个步骤:

  1. 使用 LLM 根据查询生成 k 个假设文档。这些生成的文档可能与事实不符,可能包含错误,但应该与相关文档相似。这一步的目的是通过 LLM 来解释用户的查询。
  2. 将生成的假定文档输入编码器,将其映射为密集向量 f(dk)。我们认为编码器具有过滤功能,可以过滤掉假设文档中的噪音。这里,dk 表示第 k 个生成的文档,f 表示编码器操作。
  3. 用给定的公式计算下列 k 个向量的平均值:

在这里插入图片描述

我们还可以将原始查询 q 视为一个可能的假设:

在这里插入图片描述

  1. 使用向量 v 从文档库中检索答案。如步骤 3 所述,该向量包含用户查询和所需答案模式的信息,可提高召回率。

我对 HyDE 的理解如图 3 所示。HyDE 的目标是生成假设文档,使最终查询向量 v 与向量空间中的实际文档尽可能接近。

在这里插入图片描述

LlamaIndex 和 Langchain 都实现了 HyDE。下面以 LlamaIndex 为例进行说明。

将此文件放到 YOUR_DIR_PATH。测试代码如下(我安装的 LlamaIndex 版本是 0.10.12):

import os
os.environ["OPENAI_API_KEY"] = "YOUR_OPENAI_API_KEY"from llama_index.core import VectorStoreIndex, SimpleDirectoryReader
from llama_index.core.indices.query.query_transform import HyDEQueryTransform
from llama_index.core.query_engine import TransformQueryEngine

# Load documents, build the VectorStoreIndex
dir_path = "YOUR_DIR_PATH"
documents = SimpleDirectoryReader(dir_path).load_data()
index = VectorStoreIndex.from_documents(documents)


query_str = "what did paul graham do after going to RISD"# Query without transformation: The same query string is used for embedding lookup and also summarization.
query_engine = index.as_query_engine()
response = query_engine.query(query_str)

print('-' * 100)
print("Base query:")
print(response)


# Query with HyDE transformation
hyde = HyDEQueryTransform(include_original=True)
hyde_query_engine = TransformQueryEngine(query_engine, hyde)
response = hyde_query_engine.query(query_str)

print('-' * 100)
print("After HyDEQueryTransform:")
print(response)

首先,看看 LlamaIndex 中默认的 HyDE 提示符:

############################################# HYDE##############################################

HYDE_TMPL = (
    "Please write a passage to answer the question\n""Try to include as many key details as possible.\n""\n""\n""{context_str}\n""\n""\n"'Passage:"""\n'
)

DEFAULT_HYDE_PROMPT = PromptTemplate(HYDE_TMPL, prompt_type=PromptType.SUMMARY)

HyDEQueryTransform 类的代码如下。

def _run 函数的目的是生成假设文档,我们在 def _run 函数中添加了三条调试语句,以监控假设文档的内容:

class HyDEQueryTransform(BaseQueryTransform):
    """Hypothetical Document Embeddings (HyDE) query transform.

    It uses an LLM to generate hypothetical answer(s) to a given query,
    and use the resulting documents as embedding strings.

    As described in `[Precise Zero-Shot Dense Retrieval without Relevance Labels]
    (https://arxiv.org/abs/2212.10496)`
    """def __init__(
        self,
        llm: Optional[LLMPredictorType] = None,
        hyde_prompt: Optional[BasePromptTemplate] = None,
        include_original: bool = True,
    ) -> None:
        """Initialize HyDEQueryTransform.

        Args:
            llm_predictor (Optional[LLM]): LLM for generating
                hypothetical documents
            hyde_prompt (Optional[BasePromptTemplate]): Custom prompt for HyDE
            include_original (bool): Whether to include original query
                string as one of the embedding strings
        """super().__init__()

        self._llm = llm or Settings.llm
        self._hyde_prompt = hyde_prompt or DEFAULT_HYDE_PROMPT
        self._include_original = include_original

    def _get_prompts(self) -> PromptDictType:
        """Get prompts."""return {"hyde_prompt": self._hyde_prompt}

    def _update_prompts(self, prompts: PromptDictType) -> None:
        """Update prompts."""if "hyde_prompt" in prompts:
            self._hyde_prompt = prompts["hyde_prompt"]

    def _run(self, query_bundle: QueryBundle, metadata: Dict) -> QueryBundle:
        """Run query transform."""# TODO: support generating multiple hypothetical docs
        query_str = query_bundle.query_str
        hypothetical_doc = self._llm.predict(self._hyde_prompt, context_str=query_str)
        embedding_strs = [hypothetical_doc]
        if self._include_original:
            embedding_strs.extend(query_bundle.embedding_strs)

        # The following three lines contain the added debug statements.print('-' * 100)
        print("Hypothetical doc:")
        print(embedding_strs)

        return QueryBundle(
            query_str=query_str,
            custom_embedding_strs=embedding_strs,
        )

测试代码操作如下

(llamaindex_010) Florian:~ Florian$ python /Users/Florian/Documents/test_hyde.py 
-----------------------------------------------------------------------------------------
Base query:
Paul Graham resumed his old life in New York after attending RISD. He became rich and continued his old patterns, but with new opportunities such as being able to easily hail taxis and dine at charming restaurants. He also started experimenting with a new kind of still life painting technique.
-----------------------------------------------------------------------------------------
Hypothetical doc:
["After attending the Rhode Island School of Design (RISD), Paul Graham went on to co-found Viaweb, an online store builder that was later acquired by Yahoo for $49 million. Following the success of Viaweb, Graham became an influential figure in the tech industry, co-founding the startup accelerator Y Combinator in 2005. Y Combinator has since become one of the most prestigious and successful startup accelerators in the world, helping launch companies like Dropbox, Airbnb, and Reddit. Graham is also known for his prolific writing on technology, startups, and entrepreneurship, with his essays being widely read and respected in the tech community. Overall, Paul Graham's career after RISD has been marked by innovation, success, and a significant impact on the startup ecosystem.", 'what did paul graham do after going to RISD']
------------------------------------------------------------------
After HyDEQueryTransform:
After going to RISD, Paul Graham resumed his old life in New York, but now he was rich. He continued his old patterns but with new opportunities, such as being able to easily hail taxis and dine at charming restaurants. He also started to focus more on his painting, experimenting with a new technique. Additionally, he began looking for an apartment to buy and contemplated the idea of building a web app for making web apps, which eventually led him to start a new company called Aspra.

embedding_strs 是一个包含两个元素的列表。第一个是生成的假设文档,第二个是原始查询。它们被合并成一个列表,以方便向量计算。

在这个例子中,HyDE 通过准确想象保罗-格雷厄姆在 RISD 之后的工作(见假设文档),大大提高了输出质量。这提高了嵌入质量和最终输出。

当然,HyDE 也有一些失败案例。感兴趣的读者可以访问该网页进行测试。

HyDE 似乎是无监督的,HyDE 中没有训练任何模型:生成模型和对比编码器都保持不变。

总之,虽然 HyDE 引入了一种新的查询重写方法,但它也有一些局限性。它不依赖于查询嵌入相似性,而是强调一个文档与另一个文档的相似性。但是,如果语言模型并不精通该主题,它可能不会总是产生最佳结果,从而可能导致错误增加。

Rewrite-Retrieve-Read

这一想法来自论文 “Query Rewriting for Retrieval-Augmented Large Language Models”。该论文认为,原始查询(尤其是在现实世界中)不一定总是最适合由大语言模型进行检索。

因此,本文建议首先使用 LLM 重写查询。然后再进行检索和生成答案,而不是直接从原始查询中检索内容和生成答案,如图 4 (b) 所示。

在这里插入图片描述

为了说明查询重写如何影响上下文检索和预测性能,请看下面这个例子:查询 "2020 年 NBA 总冠军是洛杉矶湖人队!告诉我什么是 langchain 框架?"的查询通过重写得到了准确处理。

这是通过 Langchain 实现的,安装所需的基本库如下:

pip install langchain
pip install openai
pip install langchainhub
pip install duckduckgo-search
pip install langchain_openai

环境配置和库导入

import os
os.environ["OPENAI_API_KEY"] = "YOUR_OPEN_AI_KEY"

from langchain_community.utilities import DuckDuckGoSearchAPIWrapper
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_core.runnables import RunnablePassthrough
from langchain_openai import ChatOpenAI

构建链并执行简单查询:

def june_print(msg, res):
    print('-' * 100)
    print(msg)
    print(res)


base_template = """Answer the users question based only on the following context:

<context>
{context}
</context>

Question: {question}
"""

base_prompt = ChatPromptTemplate.from_template(base_template)

model = ChatOpenAI(temperature=0)

search = DuckDuckGoSearchAPIWrapper()


def retriever(query):
    return search.run(query)

chain = (
    {"context": retriever, "question": RunnablePassthrough()}
    | base_prompt
    | model
    | StrOutputParser()
)

query = "The NBA champion of 2020 is the Los Angeles Lakers! Tell me what is langchain framework?"

june_print(
    'The result of query:', 
    chain.invoke(query)
)

june_print(
    'The result of the searched contexts:', 
    retriever(query)
)

操作结果如下:

(langchain) Florian:~ Florian$ python /Users/Florian/Documents/test_rewrite_retrieve_read.py 
----------------------------------------------------------------------------------------------------
The result of query:
I'm sorry, but the context provided does not mention anything about the langchain framework.
----------------------------------------------------------------------------------------------------
The result of the searched contexts:
The Los Angeles Lakers are the 2020 NBA Champions!Watch their championship celebration here!Subscribe to the NBA: https://on.nba.com/2JX5gSN Full Game Highli... Aug 4, 2023. The 2020 Los Angeles Lakers were truly one of the most complete teams over the decade. LeBron James' fourth championship was one of the biggest moments of his career. Only two players from the 2020 team remain on the Lakers. In the storied history of the NBA, few teams have captured the imagination of fans and left a lasting ... James had 28 points, 14 rebounds and 10 assists, and the Lakers beat the Miami Heat 106-93 on Sunday night to win the NBA finals in six games. James was also named Most Valuable Player of the NBA ... Portland Trail Blazers star Damian Lillard recently spoke about the 2020 NBA "bubble" playoffs and had an interesting perspective on the criticism the eventual winners, the Los Angeles Lakers, faced. But perhaps none were more surprising than Adebayo's opinion on the 2020 NBA Finals. The Heat were defeated by LeBron James and the Los Angeles Lakers in six games. Miller asked, "Tell me about ...

结果表明,根据搜索到的语境,有关 "langchain "的信息非常少。

现在就开始构建重写器,重写搜索查询。

rewrite_template = """Provide a better search query for web search engine to answer the given question, end the queries with ’**’. 
Question: {x} 
Answer:
"""
rewrite_prompt = ChatPromptTemplate.from_template(rewrite_template)


def _parse(text):
    return text.strip("**")

rewriter = rewrite_prompt | ChatOpenAI(temperature=0) | StrOutputParser() | _parse
june_print(
    'Rewritten query:', 
    rewriter.invoke({"x": query})
)

结果如下

----------------------------------------------------------------------------------------------------
Rewritten query:
What is langchain framework and how does it work?

构建 rewrite_retrieve_read_chain,并使用重写的查询。

rewrite_retrieve_read_chain = (
    {
        "context": {"x": RunnablePassthrough()} | rewriter | retriever,
        "question": RunnablePassthrough(),
    }
    | base_prompt
    | model
    | StrOutputParser()
)

june_print(
    'The result of the rewrite_retrieve_read_chain:', 
    rewrite_retrieve_read_chain.invoke(query)
)

操作结果如下:

-------------------------------------------------------------------------------------------
The result of the rewrite_retrieve_read_chain:
LangChain is a Python framework designed to help build AI applications powered by language models, particularly large language models (LLMs). It provides a generic interface to different foundation models, a framework for managing prompts, and a central interface to long-term memory, external data, other LLMs, and more. It simplifies the process of interacting with LLMs and can be used to build a wide range of applications, including chatbots that interact with users naturally.

至此,通过重写查询,我们成功获得了正确答案。

STEP-BACK PROMPTING

回退提示(STEP-BACK PROMPTING)是一种简单的提示词技术,它使LLMs能够进行抽象,从包含特定细节的实例中提炼出高级概念和基本原理。这个想法是将“回退问题”定义为从原始问题派生出的更抽象的问题。

例如,如果查询包含大量细节,LLM 就很难检索到相关事实来解决任务。如图 5 中的第一个示例所示,对于物理问题 “如果温度增加 2 倍,体积增加 8 倍,理想气体的压强 P 会发生什么变化?”,LLM 在推理时可能会偏离理想气体定律的第一原理。在直接推理该问题时,LLM 可能会偏离理想气体定律的第一原理。

同样,"埃斯特拉-利奥波德在 1954 年 8 月至 1954 年 11 月期间去了哪所学校?"这个问题也因具体时间范围的限制而难以直接回答。

图 5:通过概念和原理引导的抽象和推理两个步骤来说明 "后退式引导"。

在这两种情况下,提出一个更宽泛的问题可以帮助模型有效地回答具体的询问。

我们可以询问 “埃斯特拉-利奥波德的教育史”,而不是直接询问 “埃斯特拉-利奥波德在特定时间就读于哪所学校”。

这个更宽泛的话题包含了原问题,可以提供所有必要的信息来推断 “埃斯特拉-利奥波德在特定时间就读于哪所学校”。值得注意的是,这些更宽泛的问题通常比原来的具体问题更容易回答。

在图 5(左)所示的 "思维链 "中间步骤中,根据这些抽象概念进行推理有助于防止出错。

总之,后退式诱导包括两个基本步骤:

  • 抽象:最初,我们提示LLM提出一个关于高级概念或原则的广泛问题,而不是直接回应查询。然后,我们检索关于所述概念或原则的相关事实。
  • 推理:LLM可以根据这些关于高级概念或原则的事实推导出原始问题的答案。我们称之为抽象推理

为了说明后退提示如何影响上下文检索和预测性能,下面是使用 Langchain 实现的演示代码。

环境配置和import

import os
os.environ["OPENAI_API_KEY"] = "YOUR_OPEN_AI_KEY"

from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate, FewShotChatMessagePromptTemplate
from langchain_core.runnables import RunnableLambda
from langchain_openai import ChatOpenAI
from langchain_community.utilities import DuckDuckGoSearchAPIWrapper

构建链并执行原始查询:

def june_print(msg, res):
    print('-' * 100)
    print(msg)
    print(res)


question = "was chatgpt around while trump was president?"

base_prompt_template = """You are an expert of world knowledge. I am going to ask you a question. Your response should be comprehensive and not contradicted with the following context if they are relevant. Otherwise, ignore them if they are not relevant.

{normal_context}

Original Question: {question}
Answer:"""

base_prompt = ChatPromptTemplate.from_template(base_prompt_template)

search = DuckDuckGoSearchAPIWrapper(max_results=4)
def retriever(query):
    return search.run(query)

base_chain = (
    {
        # Retrieve context using the normal question (only the first 3 results)
        "normal_context": RunnableLambda(lambda x: x["question"]) | retriever,
        # Pass on the question
        "question": lambda x: x["question"],
    }
    | base_prompt
    | ChatOpenAI(temperature=0)
    | StrOutputParser()
)


june_print('The searched contexts of the original question:', retriever(question))
june_print('The result of base_chain:', base_chain.invoke({"question": question}) )

结果

(langchain) Florian:~ Florian$ python /Users/Florian/Documents/test_step_back.py 
----------------------------------------------------------------------------------------------------
The searched contexts of the original question:
While impressive in many respects, ChatGPT also has some major flaws. ... [President's Name]," refused to write a poem about ex-President Trump, but wrote one about President Biden ... The company said GPT-4 recently passed a simulated law school bar exam with a score around the top 10% of test takers. By contrast, the prior version, GPT-3.5, scored around the bottom 10%. The ... These two moments show how Twitter's choices helped former President Trump. ... With ChatGPT, which launched to the public in late November, users can generate essays, stories and song lyrics ... Donald Trump is asked a question—say, whether he regrets his actions on Jan. 6and he answers with something like this: " Let me tell you, there's nobody who loves this country more than me ...
----------------------------------------------------------------------------------------------------
The result of base_chain:
Yes, ChatGPT was around while Trump was president. ChatGPT is an AI language model developed by OpenAI and was launched to the public in late November. It has the capability to generate essays, stories, and song lyrics. While it may have been used to write a poem about President Biden, it also has the potential to be used in various other contexts, including generating responses from hypothetical scenarios involving former President Trump.

结果显然是不正确的。

开始构建 step_back_question_chainstep_back_chain,以获得正确结果。

# Few Shot Examples
examples = [
    {
        "input": "Could the members of The Police perform lawful arrests?",
        "output": "what can the members of The Police do?",
    },
    {
        "input": "Jan Sindel’s was born in what country?",
        "output": "what is Jan Sindel’s personal history?",
    },
]
# We now transform these to example messages
example_prompt = ChatPromptTemplate.from_messages(
    [
        ("human", "{input}"),
        ("ai", "{output}"),
    ]
)
few_shot_prompt = FewShotChatMessagePromptTemplate(
    example_prompt=example_prompt,
    examples=examples,
)

step_back_prompt = ChatPromptTemplate.from_messages(
    [
        (
            "system",
            """You are an expert at world knowledge. Your task is to step back and paraphrase a question to a more generic step-back question, which is easier to answer. Here are a few examples:""",
        ),
        # Few shot examples
        few_shot_prompt,
        # New question
        ("user", "{question}"),
    ]
)
step_back_question_chain = step_back_prompt | ChatOpenAI(temperature=0) | StrOutputParser()
june_print('The step-back question:', step_back_question_chain.invoke({"question": question}))
june_print('The searched contexts of the step-back question:', retriever(step_back_question_chain.invoke({"question": question})) )



response_prompt_template = """You are an expert of world knowledge. I am going to ask you a question. Your response should be comprehensive and not contradicted with the following context if they are relevant. Otherwise, ignore them if they are not relevant.

{normal_context}
{step_back_context}

Original Question: {question}
Answer:"""
response_prompt = ChatPromptTemplate.from_template(response_prompt_template)


step_back_chain = (
    {
        # Retrieve context using the normal question
        "normal_context": RunnableLambda(lambda x: x["question"]) | retriever,
        # Retrieve context using the step-back question
        "step_back_context": step_back_question_chain | retriever,
        # Pass on the question
        "question": lambda x: x["question"],
    }
    | response_prompt
    | ChatOpenAI(temperature=0)
    | StrOutputParser()
)

june_print('The result of step_back_chain:', step_back_chain.invoke({"question": question}) )

结果如下

----------------------------------------------------------------------------------------------------
The step-back question:
When did ChatGPT become available?
----------------------------------------------------------------------------------------------------
The searched contexts of the step-back question:
OpenAI released an early demo of ChatGPT on November 30, 2022, and the chatbot quickly went viral on social media as users shared examples of what it could do. Stories and samples included ... March 14, 2023 - Anthropic launched Claude, its ChatGPT alternative. March 20, 2023 - A major ChatGPT outage affects all users for several hours. March 21, 2023 - Google launched Bard, its ... The same basic models had been available on the API for almost a year before ChatGPT came out. In another sense, we made it more aligned with what humans want to do with it. A paid ChatGPT Plus subscription is available. (Image credit: OpenAI) ChatGPT is based on a language model from the GPT-3.5 series, which OpenAI says finished its training in early 2022.
----------------------------------------------------------------------------------------------------
The result of step_back_chain:
No, ChatGPT was not around while Trump was president. ChatGPT was released to the public in late November, after Trump's presidency had ended. The references to ChatGPT in the context provided are all dated after Trump's presidency, such as the release of an early demo on November 30, 2022, and the launch of ChatGPT Plus subscription. Therefore, it is safe to say that ChatGPT was not around during Trump's presidency.

我们可以看到,通过将原始查询 "后退 "到一个更抽象的问题,并同时使用抽象查询和原始查询进行检索,LLM 提高了其沿着正确的推理路径找到解决方案的能力。

正如 Edsger W. Dijkstra 所说:“抽象的目的不是为了模糊,而是为了创造一个新的语义层次,在这个层次上,人们可以做到绝对精确”。

Query2doc

Query2doc: Query Expansion with Large Language Models 介绍了query2doc。它使用LLMs的几个提示生成伪文档,然后如图6所示,将它们与原始查询结合起来创建一个新的查询。

图 6:query2doc 少量提示说明。

在密集检索中,新查询(用 q+ 表示)是原始查询(q)和伪文档(d’)的简单连接,并用 [SEP] 分隔:q+ = concat(q, [SEP], d’).

Query2doc 认为,HyDE 隐含地假定真实文档和伪文档用不同的词语表达了相同的语义,而这对于某些查询来说可能并不成立。

Query2doc 和 HyDE 之间的另一个区别是,Query2doc 训练的是一种有监督的密集检索器,正如论文中所概述的那样。

目前,在 Langchain 或 LlamaIndex 中,尚未发现 query2doc 的复制。

ITER-RETGEN

ITER-RETGEN方法 使用生成的内容来指导检索。它在检索-阅读-检索-阅读流程中迭代实现“检索增强生成”和“生成增强检索”

在这里插入图片描述

如图 7 所示,对于给定的问题 q 和检索语料 D = {d}(其中 d 代表一个段落),ITER-RETGEN 会连续执行 T 次检索生成迭代。

在每次迭代 t 中,我们首先使用前一次迭代的生成 yt-1,将其与 q 结合,并检索出前 k 个段落。接下来,我们提示 LLM M 生成输出 yt,将检索到的段落(表示为 Dyt-1||q)和 q 纳入提示中。因此,每次迭代可以表述如下:

在这里插入图片描述

最后的输出 yt 将作为最终响应。

与 Query2doc 类似,目前在 Langchain 或 LlamaIndex 中没有发现复制。

结论

本文介绍了各种查询重写技术,包括一些技术的代码演示。

在实践中,这些查询重写方法都可以尝试,至于使用哪种方法或方法组合,则取决于具体效果。

不过,无论采用哪种重写方法,调用 LLM 都会带来一些性能上的权衡,这需要在实际使用中加以考虑。

此外,还有一些方法,如查询路由、将查询分解成多个子问题等,它们不属于查询重写,但属于预检索方法,这些方法今后将有机会被引入。

本文为翻译,原文地址:https://medium.com/@florian_algo/advanced-rag-06-exploring-query-rewriting-23997297f2d1

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.coloradmin.cn/o/1987595.html

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈,一经查实,立即删除!

相关文章

卡码网KamaCoder 104. 建造最大岛屿

题目来源&#xff1a;104. 建造最大岛屿 C题解&#xff1a;先用深度优化算法计算每个孤岛的面积&#xff0c;然后给每个孤岛编号&#xff08;如果孤岛是U型的&#xff0c;防止面积加重复了&#xff09;&#xff0c;再去遍历每个海水区域&#xff0c;计算最大面积。 #include &…

jenkins+gitlab实现微服务的差异化部署

前景&#xff1a; Jenkins 和 GitLab 结合实现微服务的差异化部署具有多方面的意义和优势&#xff0c;主要包括以下几点&#xff1a; 自动化和持续集成&#xff1a;通过 Jenkins 可以实现从代码提交到部署的全自动化流程。这种自动化确保了每次代码变更都能够快速、可靠地构建…

【Python】数据类型之集合

集合是一个无序、可变、不允许元素重复的容器。 1、定义 v1{11,22,33} 1&#xff09;&#xff09;无序&#xff1a;集合无法通过索引取值。 2&#xff09;&#xff09;可变&#xff1a;可以添加和删除集合中的元素。 3&#xff09;&#xff09;集合不允许元素重复。 例如…

GD32 MCU碰到IIC总线卡死怎么办?

大家在使用MCU IIC通信时&#xff0c;若碰到设备复位或者总线干扰等情况&#xff0c;可能会导致IIC总线卡死&#xff0c;表现上总线上SDA或者SCL其中一根线为低电平&#xff0c;IIC总线一直处于busy状态。此时若代码上一直等待总线空闲&#xff0c;则可能导致软件死机&#xff…

PCL Chaikin曲线逼近型细分算法

文章目录 一、简介二、实现代码三、实现效果参考资料一、简介 Chaikin 逼近型细分算法是一种生成平滑曲线的方法,常用于计算机图形学和曲线建模。它基于 Paul Chaikin 于 1974 年提出的算法,通过迭代地插入新的控制点并移动原始控制点来细分曲线,从而逐渐逼近光滑曲线。具体…

Flamingo: a Visual Language Model for Few-Shot Learning

发表时间&#xff1a;NeurIPS 2022 论文链接&#xff1a;https://proceedings.neurips.cc/paper_files/paper/2022/file/960a172bc7fbf0177ccccbb411a7d800-Paper-Conference.pdf 作者单位&#xff1a;DeepMind Motivation&#xff1a;仅使用少量注释示例可以快速适应新任务…

谈基于大语言模型的图数据库路径检索

随着微软已经开源了GraphRAG项目的代码&#xff0c;基于图数据库的RAG 热度迅速升温。关注基于大语言模型与图模型数据库相结合的技术的人多了起来。 本文提出了一种类似人工搜索的“顺藤摸瓜”方法&#xff0c;实现图数据库的智能搜索方法。 本地私有数据存储和查询 本地私有…

大数据面试SQL(一):合并日期重叠的活动

文章目录 合并日期重叠的活动 一、题目 二、分析 三、SQL实战 四、样例数据参考 合并日期重叠的活动 一、题目 已知有表记录了每个品牌的活动开始日期和结束日期&#xff0c;每个品牌可以有多个活动。请编写一个SQL查询合并在同一个品牌举行的所有重叠的活动&#xff0c…

Canvas简历编辑器-图形绘制与状态管理(轻量级DOM)

Canvas简历编辑器-图形绘制与状态管理(轻量级DOM) 在前边我们聊了数据结构的设计和剪贴板的数据操作&#xff0c;那么这些操作都还是比较倾向于数据相关的操作&#xff0c;那么我们现在就来聊聊基本的图形绘制以及图形状态管理。 在线编辑: https://windrunnermax.github.io/…

树与二叉树、图的基本概念

一、树与二叉树的基本概念和性质 1、树的的性质&#xff1a; 1&#xff09;树中的结点数 n 等于所有结点的度数之和加 1 【说明】结点的度是指该结点的孩子数量&#xff0c;每个结点与其每个孩子都由唯一的边相连&#xff0c;因此树中所有结点的度数之和等于树中的边数之和。…

模型 MBTI(性格模型)

系列文章 分享 模型&#xff0c;了解更多&#x1f449; 模型_思维模型目录。探索真我&#xff0c;和谐人际。 1 MBTI性格模型的应用 1.1 跨国公司团队协作改进 ABC公司是一家全球性的科技公司&#xff0c;其研发团队由来自世界各地的工程师和设计师组成。尽管团队成员个个才华…

创意指南丨AR数学沉浸式空间体验

AR学习种类那么多&#xff0c;哪款最吸引你&#xff1f; 星河造梦坊和Unity联手打造的沉浸式空间AR无疑是其中的佼佼者。 这款应用不仅利用AR技术将抽象的数学概念变得生动有趣&#xff0c;还通过互动体验让学习者仿佛置身于一个充满奇幻色彩的数学世界中。 无论是学生还是教…

鸿蒙应用服务开发【自定义通知角标】

自定义通知角标 介绍 本示例主要展示了设定应用的桌面图标角标的功能&#xff0c;使用ohos.notificationManager接口&#xff0c;进行桌面角标的设置&#xff0c;通知的发送&#xff0c;获取等。 效果预览 使用说明 在主界面&#xff0c;可以看到当前应用的所有消息通知&am…

ts-node 报错 ERR_UNKNOWN_FILE_EXTENSION

问题 有个monorepo项目&#xff0c;在最外层一次性打包 3 个项目的脚本已经成功实现&#xff0c;如下&#xff1a; "build:test": "cross-env NODE_ENVtest vite build --mode test && esno ./build/script/postBuild.ts", "build:prod"…

一款基于RBAC模型的开源快速开发平台,支持权限粒度达到列级别,前后端分离,可免费用于商业

前言 在企业级应用开发中&#xff0c;权限管理和系统配置是两个核心问题。传统的开发模式往往面临权限控制不够灵活、系统配置难以管理等挑战。为了解-决这些问题&#xff0c;需要一款能够提供细致权限控制和灵活系统配置的软件。 现有的一些软件虽然提供了基本的权限管理功能…

索尼相机SD卡找不到视频怎么办?提供全面解决方案

在使用索尼相机拍摄美好瞬间时&#xff0c;SD卡作为存储介质&#xff0c;承载着珍贵的视频和照片。然而&#xff0c;有时我们可能会遇到SD卡中视频文件无法找到的问题&#xff0c;这无疑让人倍感焦虑。本文旨在为大家提供一套全面的解决方案&#xff0c;希望帮助大家快速找回丢…

Istio 金丝雀发布

转载&#xff1a;备考ICA-Istio 金丝雀实验4 环境清理 kubectl delete gw/helloworld-gateway vs/helloworld dr/helloworld-destination #测试 kubectl get svc,pods for i in {1..10};do curl $(kubectl get svc helloworld|grep helloworld|awk {print $3":"$5}|a…

谷粒商城实战笔记-138-商城业务-首页-渲染二级三级分类数据

本节的主要内容是在前一节的基础上&#xff0c;提供结构查询出所有的二级、三级分类数据。 一&#xff0c;构造响应体数据结构 后端返回给前端的数据结构是在开发详细设计中应该确定的内容。 分析前端需要的数据结构&#xff0c;后端要将所有一级分类包含的二级和三级分类信…

嵌入式学习之路 14(C语言基础学习——指针)

一、指针基础 指针的概念 地址表示内存单元的编号&#xff0c;也被称为指针。指针既是地址&#xff0c;也是一种专门用于处理地址数据的数据类型。 例如&#xff0c;变量a的地址或者十六进制表示的0x1000都可以视作指针。 指针变量的定义 语法&#xff1a;基类型 * 指针变…

Python面试宝典第28题:合并区间

题目 以数组 intervals 表示若干个区间的集合&#xff0c;其中单个区间为intervals[i] [starti, endi]&#xff0c;且endi大于starti。请合并所有重叠的区间&#xff0c;并返回一个不重叠的区间数组&#xff0c;该数组需恰好覆盖输入中的所有区间。 示例 1&#xff1a; 输入&…