Конфигурация склейки:
- тип рабочего: G1 X
- максимальное количество рабочих: 10
- клей версии 5.0
select_query = (
f"SELECT * FROM table1 "
f"WHERE year='{year}' AND month='{month}' AND day='{day}' AND hour='{hour}' AND col1 ='abc' "
f"AND col2='123' AND col3 in ('ABC12','CDE23','DEF34','GHI23', "
) AND col4='NEW' "
f"AND key IN ('val1', 'val2')"
)
data_df = spark.sql(select_query )
data_df.write.mode("append").parquet(athena_output_location)
Подробнее здесь: https://stackoverflow.com/questions/793 ... park-write