Модель всегда классифицирует изображения как кошек с высокой уверенностью, несмотря на настройку гиперпараметров

Модель всегда классифицирует изображения как кошек с высокой уверенностью, несмотря на настройку гиперпараметров ⇐ Python

Ответить Пред. тема След. тема

1 сообщение • Страница 1 из 1

Anonymous

Модель всегда классифицирует изображения как кошек с высокой уверенностью, несмотря на настройку гиперпараметров

Цитата

Сообщение Anonymous » 10 ноя 2024, 03:01

Я пытаюсь обучить модель на изображениях, используя приведенный ниже код. Моя структура каталогов следующая:

Папка PetImages размером примерно 1 ГБ расположена рядом с моим файлом main.py скрипт.
Внутри папки PetImages есть две подпапки: Cats и Dogs.
< li>Каждая из этих подпапок содержит 10 000 изображений кошек и собак соответственно.

Код: Выделить всё

import os
import tensorflow as tf
from keras import layers
from tensorflow import keras

# === Constants for easy hyperparameter tuning ===

BATCH_SIZE = 256  # Batch size for training.
EPOCHS = 80  # Number of training epochs.
LEARNING_RATE = 5e-4  # Learning rate for Adam optimizer.

# === End of constants ===

# Filter out corrupted images
def filter_corrupted_images():
num_skipped = 0
for folder_name in ("Cat", "Dog"):
folder_path = os.path.join("PetImages", folder_name)
for fname in os.listdir(folder_path):
fpath = os.path.join(folder_path, fname)
try:
with open(fpath, "rb") as fobj:
is_jfif = b"JFIF" in fobj.read(10)
except Exception:
is_jfif = False
if not is_jfif:
num_skipped += 1
os.remove(fpath)
print(f"Deleted {num_skipped} corrupted images.")

# Generate Dataset
def generate_datasets(image_size=(180, 180), batch_size=BATCH_SIZE):
train_ds, val_ds = keras.utils.image_dataset_from_directory(
"PetImages",
validation_split=0.2,
subset="both",
seed=1337,
image_size=image_size,
batch_size=batch_size,
)
return train_ds, val_ds

# Configure the Dataset for Performance
def configure_for_performance(ds):
AUGMENTATION = keras.Sequential([
layers.RandomFlip("horizontal"),
layers.RandomRotation(0.3),
layers.RandomZoom(0.2),
layers.RandomBrightness(0.2)
])
ds = ds.map(lambda x, y: (AUGMENTATION(x, training=True), y),
num_parallel_calls=tf.data.AUTOTUNE)
return ds.prefetch(buffer_size=tf.data.AUTOTUNE)

# Define Model Architecture with adjusted strides and fewer pooling layers
def make_model(input_shape, num_classes=2):
inputs = keras.Input(shape=input_shape)
x = layers.Rescaling(1.0 / 255)(inputs)

# Convolutional Layers with reduced stride for some layers
FILTER_SIZES = [32, 64, 128, 256, 512]
KERNEL_SIZE = (3, 3)
DROPOUT_RATE = 0.5
ACTIVATION_FUNCTION = "swish"

for i, size in enumerate(FILTER_SIZES):
x = layers.Conv2D(size, KERNEL_SIZE, strides=1 if i < 2 else 2, padding="same")(x)  # Use stride 1 for first two layers
x = layers.BatchNormalization()(x)
x = layers.Activation(ACTIVATION_FUNCTION)(x)
if i < 3:  # Apply MaxPooling only in the first three layers
x = layers.MaxPooling2D(pool_size=(2, 2))(x)

x = layers.GlobalAveragePooling2D()(x)
x = layers.Dropout(DROPOUT_RATE)(x)
outputs = layers.Dense(1 if num_classes == 2 else num_classes,
activation="sigmoid"  if num_classes == 2 else "softmax")(x)

model = keras.Model(inputs, outputs)
return model

# Train the Model
def train_model(model, train_ds, val_ds, epochs=EPOCHS):
model.compile(
optimizer=keras.optimizers.Adam(LEARNING_RATE),
loss="binary_crossentropy",
metrics=["accuracy"],
)
checkpoint_callback = keras.callbacks.ModelCheckpoint(
"model_best.keras", monitor="val_accuracy", save_best_only=True
)
early_stopping_callback = keras.callbacks.EarlyStopping(
monitor="val_loss", patience=5, restore_best_weights=True
)

model.fit(
train_ds,
validation_data=val_ds,
epochs=epochs,
callbacks=[checkpoint_callback, early_stopping_callback],
)

# Check if model exists and load it for continuing training
def load_model_if_exists(model_filepath="model_best.keras"):
if os.path.exists(model_filepath):
print(f"Loading existing model from {model_filepath}...")
model = keras.models.load_model(model_filepath)
else:
print("No existing model found, starting a new model...")
model = make_model(input_shape=(180, 180, 3))
return model

if __name__ == "__main__":
filter_corrupted_images()

# Check if a saved model exists, if yes, load it, if not, create a new one
model = load_model_if_exists("model_best.keras")

# Prepare the datasets for training
train_ds, val_ds = generate_datasets(image_size=(180, 180), batch_size=BATCH_SIZE)
train_ds = configure_for_performance(train_ds)
val_ds = configure_for_performance(val_ds)

# Continue training or start fresh
train_model(model, train_ds, val_ds, epochs=EPOCHS)

# Save the trained model in .keras format
model.save("model_best.keras")

После небольшого обучения я использую приведенный ниже код для проверки своей модели.

Код: Выделить всё

import numpy as np
from tensorflow import keras
from tensorflow.keras.preprocessing import image

# Load the trained model
model = keras.models.load_model("model_best.keras")

# Preprocess image for prediction
def preprocess_image(img_path, image_size=(180, 180)):
img = image.load_img(img_path, target_size=image_size)
img_array = image.img_to_array(img)
img_array = np.expand_dims(img_array, axis=0)
img_array = img_array / 255.0  # Normalize to match training
return img_array

# Predict single image
def predict_image(img_path):
img_array = preprocess_image(img_path)
predictions = model.predict(img_array)
print(f"The image at {img_path} is likely a Cat with confidence {1 - predictions[0][0]:.2f}")
print(f"The image at {img_path} is likely a Dog with confidence {predictions[0][0]:.2f}")

if __name__ == "__main__":
predict_image("Cat01.jpeg")
predict_image("Cat02.jpg")
predict_image("Cat03.jpg")
predict_image("Dog01.jpeg")
predict_image("Dog02.jpg")
predict_image("Dog03.jpg")

Это результат:

Код: Выделить всё

1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 642ms/step
The image at Cat01.jpeg is likely a Cat with confidence 0.91
The image at Cat01.jpeg is likely a Dog with confidence 0.09
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 31ms/step
The image at Cat02.jpg is likely a Cat with confidence 0.92
The image at Cat02.jpg is likely a Dog with confidence 0.08
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 37ms/step
The image at Cat03.jpg is likely a Cat with confidence 0.92
The image at Cat03.jpg is likely a Dog with confidence 0.08
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 44ms/step
The image at Dog01.jpeg is likely a Cat with confidence 0.92
The image at Dog01.jpeg is likely a Dog with confidence 0.08
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 30ms/step
The image at Dog02.jpg is likely a Cat with confidence 0.92
The image at Dog02.jpg is likely a Dog with confidence 0.08
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 31ms/step
The image at Dog03.jpg is likely a Cat with confidence 0.92
The image at Dog03.jpg is likely a Dog with confidence 0.08

Он каждый раз последовательно классифицирует изображения как кошек почти с одинаковым уровнем достоверности.
Я пробовал изменить размер пакета, слои и скорость обучения, но ничего не получилось работало до сих пор. В чем может быть проблема?

Подробнее здесь: https://stackoverflow.com/questions/791 ... perparamet

1731196861

Anonymous

Я пытаюсь обучить модель на изображениях, используя приведенный ниже код.  Моя структура каталогов следующая:
[list]
[*]Папка PetImages размером примерно 1 ГБ расположена рядом с моим файлом main.py скрипт.
[*]Внутри папки PetImages есть две подпапки: Cats и Dogs.
< li>Каждая из этих подпапок содержит 10 000 изображений кошек и собак соответственно.
[/list]
[code]import os
import tensorflow as tf
from keras import layers
from tensorflow import keras

# === Constants for easy hyperparameter tuning ===

BATCH_SIZE = 256  # Batch size for training.
EPOCHS = 80  # Number of training epochs.
LEARNING_RATE = 5e-4  # Learning rate for Adam optimizer.

# === End of constants ===

# Filter out corrupted images
def filter_corrupted_images():
num_skipped = 0
for folder_name in ("Cat", "Dog"):
folder_path = os.path.join("PetImages", folder_name)
for fname in os.listdir(folder_path):
fpath = os.path.join(folder_path, fname)
try:
with open(fpath, "rb") as fobj:
is_jfif = b"JFIF" in fobj.read(10)
except Exception:
is_jfif = False
if not is_jfif:
num_skipped += 1
os.remove(fpath)
print(f"Deleted {num_skipped} corrupted images.")

# Generate Dataset
def generate_datasets(image_size=(180, 180), batch_size=BATCH_SIZE):
train_ds, val_ds = keras.utils.image_dataset_from_directory(
"PetImages",
validation_split=0.2,
subset="both",
seed=1337,
image_size=image_size,
batch_size=batch_size,
)
return train_ds, val_ds

# Configure the Dataset for Performance
def configure_for_performance(ds):
AUGMENTATION = keras.Sequential([
layers.RandomFlip("horizontal"),
layers.RandomRotation(0.3),
layers.RandomZoom(0.2),
layers.RandomBrightness(0.2)
])
ds = ds.map(lambda x, y: (AUGMENTATION(x, training=True), y),
num_parallel_calls=tf.data.AUTOTUNE)
return ds.prefetch(buffer_size=tf.data.AUTOTUNE)

# Define Model Architecture with adjusted strides and fewer pooling layers
def make_model(input_shape, num_classes=2):
inputs = keras.Input(shape=input_shape)
x = layers.Rescaling(1.0 / 255)(inputs)

# Convolutional Layers with reduced stride for some layers
FILTER_SIZES = [32, 64, 128, 256, 512]
KERNEL_SIZE = (3, 3)
DROPOUT_RATE = 0.5
ACTIVATION_FUNCTION = "swish"

for i, size in enumerate(FILTER_SIZES):
x = layers.Conv2D(size, KERNEL_SIZE, strides=1 if i < 2 else 2, padding="same")(x)  # Use stride 1 for first two layers
x = layers.BatchNormalization()(x)
x = layers.Activation(ACTIVATION_FUNCTION)(x)
if i < 3:  # Apply MaxPooling only in the first three layers
x = layers.MaxPooling2D(pool_size=(2, 2))(x)

x = layers.GlobalAveragePooling2D()(x)
x = layers.Dropout(DROPOUT_RATE)(x)
outputs = layers.Dense(1 if num_classes == 2 else num_classes,
activation="sigmoid"  if num_classes == 2 else "softmax")(x)

model = keras.Model(inputs, outputs)
return model

# Train the Model
def train_model(model, train_ds, val_ds, epochs=EPOCHS):
model.compile(
optimizer=keras.optimizers.Adam(LEARNING_RATE),
loss="binary_crossentropy",
metrics=["accuracy"],
)
checkpoint_callback = keras.callbacks.ModelCheckpoint(
"model_best.keras", monitor="val_accuracy", save_best_only=True
)
early_stopping_callback = keras.callbacks.EarlyStopping(
monitor="val_loss", patience=5, restore_best_weights=True
)

model.fit(
train_ds,
validation_data=val_ds,
epochs=epochs,
callbacks=[checkpoint_callback, early_stopping_callback],
)

# Check if model exists and load it for continuing training
def load_model_if_exists(model_filepath="model_best.keras"):
if os.path.exists(model_filepath):
print(f"Loading existing model from {model_filepath}...")
model = keras.models.load_model(model_filepath)
else:
print("No existing model found, starting a new model...")
model = make_model(input_shape=(180, 180, 3))
return model

if __name__ == "__main__":
filter_corrupted_images()

# Check if a saved model exists, if yes, load it, if not, create a new one
model = load_model_if_exists("model_best.keras")

# Prepare the datasets for training
train_ds, val_ds = generate_datasets(image_size=(180, 180), batch_size=BATCH_SIZE)
train_ds = configure_for_performance(train_ds)
val_ds = configure_for_performance(val_ds)

# Continue training or start fresh
train_model(model, train_ds, val_ds, epochs=EPOCHS)

# Save the trained model in .keras format
model.save("model_best.keras")

[/code]
После небольшого обучения я использую приведенный ниже код для проверки своей модели.
[code]import numpy as np
from tensorflow import keras
from tensorflow.keras.preprocessing import image

# Load the trained model
model = keras.models.load_model("model_best.keras")

# Preprocess image for prediction
def preprocess_image(img_path, image_size=(180, 180)):
img = image.load_img(img_path, target_size=image_size)
img_array = image.img_to_array(img)
img_array = np.expand_dims(img_array, axis=0)
img_array = img_array / 255.0  # Normalize to match training
return img_array

# Predict single image
def predict_image(img_path):
img_array = preprocess_image(img_path)
predictions = model.predict(img_array)
print(f"The image at {img_path} is likely a Cat with confidence {1 - predictions[0][0]:.2f}")
print(f"The image at {img_path} is likely a Dog with confidence {predictions[0][0]:.2f}")

if __name__ == "__main__":
predict_image("Cat01.jpeg")
predict_image("Cat02.jpg")
predict_image("Cat03.jpg")
predict_image("Dog01.jpeg")
predict_image("Dog02.jpg")
predict_image("Dog03.jpg")

[/code]
Это результат:
[code]1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 642ms/step
The image at Cat01.jpeg is likely a Cat with confidence 0.91
The image at Cat01.jpeg is likely a Dog with confidence 0.09
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 31ms/step
The image at Cat02.jpg is likely a Cat with confidence 0.92
The image at Cat02.jpg is likely a Dog with confidence 0.08
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 37ms/step
The image at Cat03.jpg is likely a Cat with confidence 0.92
The image at Cat03.jpg is likely a Dog with confidence 0.08
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 44ms/step
The image at Dog01.jpeg is likely a Cat with confidence 0.92
The image at Dog01.jpeg is likely a Dog with confidence 0.08
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 30ms/step
The image at Dog02.jpg is likely a Cat with confidence 0.92
The image at Dog02.jpg is likely a Dog with confidence 0.08
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 31ms/step
The image at Dog03.jpg is likely a Cat with confidence 0.92
The image at Dog03.jpg is likely a Dog with confidence 0.08
[/code]
Он каждый раз последовательно классифицирует изображения как кошек почти с одинаковым уровнем достоверности.
Я пробовал изменить размер пакета, слои и скорость обучения, но ничего не получилось работало до сих пор. В чем может быть проблема? 

Подробнее здесь: [url]https://stackoverflow.com/questions/79173890/model-always-classifies-images-as-cats-with-high-confidence-despite-hyperparamet[/url]

Ответить Пред. тема След. тема

1 сообщение • Страница 1 из 1

Быстрый ответ

Заголовок:

Имя пользователя:

Изменение регистра текста:

Смайлики

Ещё смайлики…

К этому ответу прикреплено по крайней мере одно вложение.

Если вы не хотите добавлять вложения, оставьте поля пустыми. Можно прикреплять файлы, перетаскивая их в окно сообщения.

Максимально разрешённый размер вложения: 15 МБ.

Имя файла:

Комментарий к файлу:

Имя файла	Комментарий к файлу	Размер	Статус

Похожие темы

Ответы

Просмотры

Последнее сообщение

Почему моя ИНС на C# не сходится, несмотря на настройку гиперпараметров?

Последнее сообщение Anonymous « 22 май 2024, 02:33
Добавлено в форуме C#

Anonymous » 22 май 2024, 02:33 » в форуме C#

У меня есть ИНС, реализованная с нуля на C#. Однако не сходилось, в чем может быть проблема? Я пробовал разные функции активации, другой набор данных. Набор данных генерируется случайным образом, но ИНС в Python работает с тем же набором данных,...

0 Ответы

29 Просмотры

Последнее сообщение Anonymous
22 май 2024, 02:33
Почему моя модель, обученная MNIST, неправильно классифицирует пользовательское изображение в Python?

Последнее сообщение Anonymous « 02 дек 2024, 11:03
Добавлено в форуме Python

Anonymous » 02 дек 2024, 11:03 » в форуме Python

Я обучил модель нейронной сети с использованием набора данных MNIST распознаванию рукописных цифр. Модель достигает точности 97 % на тестовом наборе MNIST, но не может правильно предсказать цифры из пользовательского файла изображения. Например,...

0 Ответы

12 Просмотры

Последнее сообщение Anonymous
02 дек 2024, 11:03
Почему моя модель, обученная MNIST, неправильно классифицирует пользовательское изображение в Python?

Последнее сообщение Anonymous « 02 дек 2024, 14:37
Добавлено в форуме Python

Anonymous » 02 дек 2024, 14:37 » в форуме Python

Я обучил модель нейронной сети с использованием набора данных MNIST распознаванию рукописных цифр. Модель достигает точности 97 % на тестовом наборе MNIST, но не может правильно предсказать цифры из пользовательского файла изображения. Например,...

0 Ответы

13 Просмотры

Последнее сообщение Anonymous
02 дек 2024, 14:37
Почему моя модель, обученная MNIST, неправильно классифицирует пользовательское изображение?

Последнее сообщение Anonymous « 03 дек 2024, 03:22
Добавлено в форуме Python

Anonymous » 03 дек 2024, 03:22 » в форуме Python

Я обучил модель нейронной сети с использованием набора данных MNIST распознаванию рукописных цифр. Модель достигает точности 97 % на тестовом наборе MNIST, но не может правильно предсказать цифры из пользовательского файла изображения. Например,...

0 Ответы

13 Просмотры

Последнее сообщение Anonymous
03 дек 2024, 03:22
403 Запрещенная ошибка при получении изображений кошек

Последнее сообщение Anonymous « 13 окт 2024, 22:06
Добавлено в форуме Python

Anonymous » 13 окт 2024, 22:06 » в форуме Python

Я только что закончил программирование программы для фотографий собак, и после нескольких проблем она работает нормально. Я решил сделать модифицированную версию, которая вместо этого использует другой API для предоставления изображений кошек....

0 Ответы

18 Просмотры

Последнее сообщение Anonymous
13 окт 2024, 22:06

Вернуться в «Python»