Ошибка в PDF-вопросе-ответе: объект «dict» не имеет атрибута «поиск».

Ошибка в PDF-вопросе-ответе: объект «dict» не имеет атрибута «поиск». ⇐ Python

1 сообщение • Страница 1 из 1

Anonymous

Ошибка в PDF-вопросе-ответе: объект «dict» не имеет атрибута «поиск».

Цитата

Сообщение Anonymous » 02 окт 2024, 18:31

Я работаю над системой вопросов и ответов в формате PDF, используя LangChain и FAISS для векторного хранения. Однако во время процесса получения возникает ошибка:

Код: Выделить всё

Error during question answering: 'dict' object has no attribute 'search'

Полная обратная трассировка:

Код: Выделить всё

AttributeError                            Traceback (most recent call last)
 in ()
157 query = "What is the main topic of the document?"
158
--> 159 main(pdf_path, query)
160
161 # try:

13 frames
/usr/local/lib/python3.10/dist-packages/langchain_community/vectorstores/faiss.py in similarity_search_with_score_by_vector(self, embedding, k, filter, fetch_k, **kwargs)
424                 continue
425             _id = self.index_to_docstore_id[i]
--> 426             doc = self.docstore.search(_id)
427             if not isinstance(doc, Document):
428                 raise ValueError(f"Could not find document for id {_id}, got {doc}")

AttributeError: 'dict' object has no attribute 'search'

Вот фрагмент соответствующего кода, в котором, похоже, возникает проблема:

Код: Выделить всё

def create_retrieval_qa(document_text):
"""Create a RetrievalQA instance using the FAISS index(document text) and LLM."""
document = Document(page_content=document_text)

# Get embeddings for the document using embed_documents
embedding = embeddings.embed_documents([document.page_content])  # Use the LangChain embeddings
embedding = np.array(embedding)  # Convert to a NumPy array

# Check if embedding is correctly shaped
if embedding.ndim != 2 or embedding.shape[0] != 1:
raise ValueError("Embedding should be a 2D array with shape (1, embedding_dim).")

# Create FAISS index and add the document embedding
index = faiss.IndexFlatL2(embedding.shape[1])
index.add(embedding)  # Add the document embedding to the index

# Create docstore and index to docstore id mapping
docstore = {0: document}  # Mapping index to the Document
index_to_docstore_id = {0: 0}  # Document index mapping

# Create the FAISS vector store with the required embedding function
vector_store = FAISS(
index=index,
docstore=docstore,
index_to_docstore_id=index_to_docstore_id,
embedding_function=embeddings.embed_query  # Use embed_query for retrieval
)

# print("Vector Store:", vector_store)
retriever = vector_store.as_retriever(search_type="similarity")
# Check the retriever type
print("Retriever Type:", type(retriever))

'''
error you're encountering suggests that the retriever is not being set up correctly or that it's being used improperly in this context.
'''
retrieval_qa = RetrievalQA.from_chain_type(
llm=llm,
chain_type="stuff", # or "map_reduce"  or other types depending on your needs
retriever=retriever
)

return retrieval_qa, retriever

def answer_question(query, retrieval_qa):
"""Answer the question using the RetrievalQA instance."""
response = retrieval_qa({"query": query})
return response.get('output') or response.get('result')

def main(pdf_path, query):
"""Main function to ingest a PDF, create a QA instance, and answer a question."""
# Ingest the PDF and create an index
text = ingest_single_pdf(pdf_path)

# Create the retrieval chain
retrieval_qa, retriever = create_retrieval_qa(text)

# Test the vector store with the query
# test_vector_store(retrieval_qa.retriever.vectorstore, query)

try:
print(dir(retriever))
docs = retriever.get_relevant_documents(query)
print("Relevant Documents:", docs)  # Debugging
except Exception as e:
print("Error during document retrieval:", e)

# Answer the query
answer = answer_question(query, retrieval_qa)

print(f"Query: '{query}'")
print(f"Answer: {answer}")

# Example usage
pdf_path = "/content/my_pdf.pdf"

Ошибка возникает, когда я пытаюсь использовать средство извлечения для получения соответствующих документов для запроса.
Вопросы:< /p>

Правильно ли я настраиваю векторное хранилище и ретривер FAISS?
Есть ли что-то особенное в том, как я настраиваю пытаетесь получить документы, которые могут вызвать эту проблему?
Есть ли обновления или изменения в библиотеке LangChain, о которых мне следует знать, которые могут повлиять на то, как я это настроил?

Дополнительная информация:

Я подтвердил, что индекс FAISS заполнено правильно.
Модель внедрения работает должным образом.

Подробнее здесь: https://stackoverflow.com/questions/790 ... ute-search

1727883073

Anonymous

Я работаю над системой вопросов и ответов в формате PDF, используя LangChain и FAISS для векторного хранения.  Однако во время процесса получения возникает ошибка:
[code]Error during question answering: 'dict' object has no attribute 'search'
[/code]
Полная обратная трассировка:
[code]AttributeError                            Traceback (most recent call last)
 in ()
157 query = "What is the main topic of the document?"
158
--> 159 main(pdf_path, query)
160
161 # try:

13 frames
/usr/local/lib/python3.10/dist-packages/langchain_community/vectorstores/faiss.py in similarity_search_with_score_by_vector(self, embedding, k, filter, fetch_k, **kwargs)
424                 continue
425             _id = self.index_to_docstore_id[i]
--> 426             doc = self.docstore.search(_id)
427             if not isinstance(doc, Document):
428                 raise ValueError(f"Could not find document for id {_id}, got {doc}")

AttributeError: 'dict' object has no attribute 'search'
[/code]
Вот фрагмент соответствующего кода, в котором, похоже, возникает проблема:
[code]def create_retrieval_qa(document_text):
"""Create a RetrievalQA instance using the FAISS index(document text) and LLM."""
document = Document(page_content=document_text)

# Get embeddings for the document using embed_documents
embedding = embeddings.embed_documents([document.page_content])  # Use the LangChain embeddings
embedding = np.array(embedding)  # Convert to a NumPy array

# Check if embedding is correctly shaped
if embedding.ndim != 2 or embedding.shape[0] != 1:
raise ValueError("Embedding should be a 2D array with shape (1, embedding_dim).")

# Create FAISS index and add the document embedding
index = faiss.IndexFlatL2(embedding.shape[1])
index.add(embedding)  # Add the document embedding to the index

# Create docstore and index to docstore id mapping
docstore = {0: document}  # Mapping index to the Document
index_to_docstore_id = {0: 0}  # Document index mapping

# Create the FAISS vector store with the required embedding function
vector_store = FAISS(
index=index,
docstore=docstore,
index_to_docstore_id=index_to_docstore_id,
embedding_function=embeddings.embed_query  # Use embed_query for retrieval
)

# print("Vector Store:", vector_store)
retriever = vector_store.as_retriever(search_type="similarity")
# Check the retriever type
print("Retriever Type:", type(retriever))

'''
error you're encountering suggests that the retriever is not being set up correctly or that it's being used improperly in this context.
'''
retrieval_qa = RetrievalQA.from_chain_type(
llm=llm,
chain_type="stuff", # or "map_reduce"  or other types depending on your needs
retriever=retriever
)

return retrieval_qa, retriever

def answer_question(query, retrieval_qa):
"""Answer the question using the RetrievalQA instance."""
response = retrieval_qa({"query": query})
return response.get('output') or response.get('result')

def main(pdf_path, query):
"""Main function to ingest a PDF, create a QA instance, and answer a question."""
# Ingest the PDF and create an index
text = ingest_single_pdf(pdf_path)

# Create the retrieval chain
retrieval_qa, retriever = create_retrieval_qa(text)

# Test the vector store with the query
# test_vector_store(retrieval_qa.retriever.vectorstore, query)

try:
print(dir(retriever))
docs = retriever.get_relevant_documents(query)
print("Relevant Documents:", docs)  # Debugging
except Exception as e:
print("Error during document retrieval:", e)

# Answer the query
answer = answer_question(query, retrieval_qa)

print(f"Query: '{query}'")
print(f"Answer: {answer}")

# Example usage
pdf_path = "/content/my_pdf.pdf"
[/code]
Ошибка возникает, когда я пытаюсь использовать средство извлечения для получения соответствующих документов для запроса.
[b]Вопросы:[/b]< /p>
[list]
[*]Правильно ли я настраиваю векторное хранилище и ретривер FAISS?
[*]Есть ли что-то особенное в том, как я настраиваю пытаетесь получить документы, которые могут вызвать эту проблему?
[*]Есть ли обновления или изменения в библиотеке LangChain, о которых мне следует знать, которые могут повлиять на то, как я это настроил?
[/list]
[b]Дополнительная информация:[/b]
[list]
[*]Я подтвердил, что индекс FAISS заполнено правильно.
[*]Модель внедрения работает должным образом.
[/list] 

Подробнее здесь: [url]https://stackoverflow.com/questions/79036190/error-in-pdf-question-answering-dict-object-has-no-attribute-search[/url]

Ответить Пред. тема След. тема

1 сообщение • Страница 1 из 1

Быстрый ответ

Заголовок:

Имя пользователя:

Изменение регистра текста:

Смайлики

Ещё смайлики…

К этому ответу прикреплено по крайней мере одно вложение.

Если вы не хотите добавлять вложения, оставьте поля пустыми. Можно прикреплять файлы, перетаскивая их в окно сообщения.

Максимально разрешённый размер вложения: 15 МБ.

Имя файла:

Комментарий к файлу:

Имя файла	Комментарий к файлу	Размер	Статус

Похожие темы

Ответы

Просмотры

Последнее сообщение

FastAPI + Strawberry GraphQL: проблема с установкой файлов cookie в ответе (объект ответа не имеет атрибута)

Последнее сообщение Anonymous « 25 ноя 2024, 15:22
Добавлено в форуме Python

Anonymous » 25 ноя 2024, 15:22 » в форуме Python

Я использую FastAPI с Strawberry GraphQL и пытаюсь реализовать мутацию, которая устанавливает в ответе access_token иrefresh_token как файлы cookie. Однако я столкнулся со следующей ошибкой:
Объект «Ответ» не имеет атрибута «cookie»
from typing...

0 Ответы

20 Просмотры

Последнее сообщение Anonymous
25 ноя 2024, 15:22
Объект «dict» не имеет атрибута «добавить» ошибку

Последнее сообщение Anonymous « 31 июл 2024, 22:11
Добавлено в форуме Python

Anonymous » 31 июл 2024, 22:11 » в форуме Python

import json
from difflib import get_close_matches

def load_data(file_path: str) -> dict:
Load data from a JSON file.
with open(file_path, r ) as file:
data = json.load(file)
return data

def save_data(file_path: str, data: dict):
Save data to a...

0 Ответы

27 Просмотры

Последнее сообщение Anonymous
31 июл 2024, 22:11
AttributeError: объект «dict» не имеет атрибута «mask_padding» «model = Tacotron2(hparams)»

Последнее сообщение Anonymous « 03 авг 2024, 22:22
Добавлено в форуме Python

Anonymous » 03 авг 2024, 22:22 » в форуме Python

Я пытаюсь создать музыку, используя собственный голос, и я впервые использую его и пишу код, и понятия не имею, почему я получаю эту ошибку.
вот мои зависимости:
pip install --upgrade pip !pip install torch numpy matplotlib scipy !pip install...

0 Ответы

40 Просмотры

Последнее сообщение Anonymous
03 авг 2024, 22:22
Sqlalchemy select AttributeError: объект «dict» не имеет атрибута

Последнее сообщение Anonymous « 01 ноя 2024, 17:10
Добавлено в форуме Python

Anonymous » 01 ноя 2024, 17:10 » в форуме Python

Я создаю приложение fastapi + sqlalchemy
Мой код
sql.py
form sqlalchemy import Table, String, BigInteger, MetaData

metadata = MetaData()

custom_datasets = Table(
custom_datasets ,
metadata,
Column( project_id , String(500), primary_key=True),...

0 Ответы

21 Просмотры

Последнее сообщение Anonymous
01 ноя 2024, 17:10
Sqlalchemy select AttributeError: объект «dict» не имеет атрибута

Последнее сообщение Anonymous « 01 ноя 2024, 17:54
Добавлено в форуме Python

Anonymous » 01 ноя 2024, 17:54 » в форуме Python

Я создаю приложение fastapi + sqlalchemy
Мой код
sql.py
form sqlalchemy import Table, String, BigInteger, MetaData

metadata = MetaData()

custom_datasets = Table(
custom_datasets ,
metadata,
Column( project_id , String(500), primary_key=True),...

0 Ответы

29 Просмотры

Последнее сообщение Anonymous
01 ноя 2024, 17:54

Вернуться в «Python»