pandera: 0.18.3
pandas: 2.2.2
python: 3.9/3.11
Привет!
Я не могу настроить pandera для фрейма данных pandas, поскольку он жалуется:
File "/anaconda/envs/data_quality_env/lib/python3.9/site-packages/pandera/api/base/schema.py",
line 96, in get_backend
raise BackendNotFoundError(
pandera.errors.BackendNotFoundError: Backend not found for backend, class: (,
). Looked up the following base
classes: (, , , , , , )
Моя структура папок:
project/
data_validation/
schema/
case.py
validation/
validations.py
pipeline.py
case.py:
import pandas as pd
import pandera as pa
class CaseSchema(pa.DataFrameSchema):
case_id = pa.Column(pa.Int)
validations.py
import pandas as pd
from data_validation.schemas.case import CaseSchema
def validate_case_data(df: pd.DataFrame) -> pd.DataFrame:
"""Validate a DataFrame against the PersonSchema."""
schema = CaseSchema()
return schema.validate(df)
pipeline.py
import pandas as pd
from data_validation.validation.validations import validate_case_data
def validate_df(df: pd.DataFrame) -> pd.DataFrame:
"""Process data, validating it against the PersonSchema."""
validated_df = validate_case_data(df)
return validated_df
df = pd.DataFrame({
"case_id": [1, 2, 3]
})
processed_df = validate_df(df)
Подробнее здесь: https://stackoverflow.com/questions/784 ... -dataframe
Pandera.errors.BackendNotFoundError с Pandas Dataframe ⇐ Python
-
- Похожие темы
- Ответы
- Просмотры
- Последнее сообщение