Модель DETR (преобразователь обнаружения) не работает для моего набора данных [закрыто]

Модель DETR (преобразователь обнаружения) не работает для моего набора данных [закрыто] ⇐ Python

1 сообщение • Страница 1 из 1

Anonymous

Модель DETR (преобразователь обнаружения) не работает для моего набора данных [закрыто]

Цитата

Сообщение Anonymous » 14 окт 2024, 11:58

Я пытаюсь обучить свой набор данных DETR (преобразователю обнаружения), чтобы иметь возможность обнаруживать дефекты на бетонных мостах. у меня много ошибок, я пытался их исправить, но через некоторое время те же ошибки появляются снова.
Вот мой код, учитывая, что я имею дело с прямоугольными ограничивающими рамками.
Вот мой код, учитывая, что я имею дело с прямоугольными ограничивающими рамками.
р>

Код: Выделить всё

import os
import json
from PIL import Image
from torch.utils.data import Dataset, DataLoader
import torch
from torchvision import transforms
from transformers import DetrImageProcessor, DetrForObjectDetection
import torch.optim as optim
import cv2  # Ensure you have OpenCV installed

# Set your directories
train_images_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\train\img_sized'
val_images_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\val\img_sized'
test_images_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\test\img_sized'

train_annotations_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\train\ann_sized'
val_annotations_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\val\ann_sized'
test_annotations_dir = None  # No masks for test dataset

# Function to load annotations from JSON file
def load_annotations(json_file):
with open(json_file, 'r') as f:
data = json.load(f)

annotations = []
for obj in data['objects']:
class_id = obj['classId']
points = obj['points']['exterior']  # polygon points
annotations.append({'class_id': class_id, 'points': points})

return annotations

def create_mask_from_polygons(image_shape, annotations):
""" Create a binary mask from polygon annotations.  """
mask = torch.zeros(image_shape, dtype=torch.float32)  # Assuming image_shape is (height, width)

for ann in annotations:
if ann and 'points' in ann:
# Convert polygon points to a format compatible with drawing
points = torch.tensor(ann['points'], dtype=torch.int32)
# Fill the mask using polygon points
cv2.fillPoly(mask.numpy(), [points.numpy()], color=(1.0))  # Fill the polygon with 1

return mask

# Custom dataset class
class DefectDataset(Dataset):
def __init__(self, images_dir, annotations_dir):
self.images_dir = images_dir
self.annotations_dir = annotations_dir
self.image_files = os.listdir(images_dir)
self.transform = transforms.Compose([
transforms.Pad(padding=(10, 10, 10, 10), fill=0),  # Pad image by 10 pixels on all sides
transforms.Resize((224, 224)),  # Resize to 224x224
transforms.ToTensor()  # Convert image to tensor
])

def __len__(self):
return len(self.image_files)

def __getitem__(self, idx):
image_file = self.image_files[idx]
image_path = os.path.join(self.images_dir, image_file)

try:
image = Image.open(image_path).convert("RGB")
image = self.transform(image)  # Convert image to tensor

# Load corresponding JSON file
json_file = os.path.join(self.annotations_dir, f"{os.path.splitext(image_file)[0]}.json")
annotations = load_annotations(json_file)

return image, annotations
except Exception as e:
print(f"Error loading {image_file}: {e}")
return None, None  # Skip or handle as needed

# Custom collate function to handle variable-sized images and annotations
def custom_collate_fn(batch):
images, annotations = zip(*batch)

# Filter out any None items
images = [img for img in images if img is not None]
annotations = [ann for ann in annotations if ann is not None]

# Stack images into a single tensor (this assumes images are of the same size)
images = torch.stack(images, dim=0)  # Ensure all images have same size

return images, annotations

# Create datasets and dataloaders
train_dataset = DefectDataset(train_images_dir, train_annotations_dir)
val_dataset = DefectDataset(val_images_dir, val_annotations_dir)

train_loader = DataLoader(train_dataset, batch_size=5, shuffle=True, collate_fn=custom_collate_fn)
val_loader = DataLoader(val_dataset, batch_size=5, collate_fn=custom_collate_fn)

# Load Detr model
image_processor = DetrImageProcessor.from_pretrained("facebook/detr-resnet-50")
model = DetrForObjectDetection.from_pretrained("facebook/detr-resnet-50")

# Define optimizer
optimizer = optim.Adam(model.parameters(), lr=1e-5)

# Example placeholder loss function for object detection
# Updated compute_loss function
def compute_loss(outputs, annotations):
"""
Computes the loss for object detection using DETR outputs.

Args:
outputs: The model's output.
annotations: A list of annotations for each image in the batch, containing class labels and bounding boxes.

Returns:
The computed loss tensor.
"""

# Extract predicted class logits and bounding boxes
pred_logits = outputs.logits  # Shape: [batch_size, num_queries, num_classes + 1]
pred_boxes = outputs.pred_boxes  # Shape: [batch_size, num_queries, 4]

target_classes = []
target_boxes = []

for ann in annotations:
# Collect class IDs and bounding boxes from annotations
classes = [obj['class_id'] for obj in ann]
boxes = [obj['bbox'] for obj in ann]  # [x_min, y_min, x_max, y_max]

target_classes.append(torch.tensor(classes, dtype=torch.long))
target_boxes.append(torch.tensor(boxes, dtype=torch.float32))

# Convert to tensors
target_classes = torch.stack(target_classes)  # Shape: [batch_size, num_objects]
target_boxes = torch.stack(target_boxes)  # Shape:  [batch_size, num_objects, 4]

# Calculate classification and bounding box loss (DETR uses Hungarian matching)
loss_class = torch.nn.CrossEntropyLoss()(pred_logits.view(-1, pred_logits.shape[-1]), target_classes.view(-1))
loss_bbox = torch.nn.L1Loss()(pred_boxes.view(-1, 4), target_boxes.view(-1, 4))

# Total loss (you might want to weight these losses)
total_loss = loss_class + loss_bbox

return total_loss

# Training loop with loss computation and saving model weights
for epoch in range(5):  # Number of epochs
model.train()
total_loss = 0
for batch in train_loader:
images, annotations = batch

# Check if batch is empty or invalid (skip invalid images)
if len(images) == 0 or any([ann is None for ann in annotations]):
continue  # Skip this batch if there are no valid images or annotations

try:
# Forward pass
outputs = model(images)

# Compute loss (with image shape passed)
loss = compute_loss(outputs, annotations)  # Only pass two arguments

# If loss is NaN or invalid, skip this batch
if not torch.isfinite(loss):
print(f"Invalid loss for batch, skipping.")
continue

# Backward pass and optimization
optimizer.zero_grad()
loss.backward()
optimizer.step()

# Accumulate total loss
total_loss += loss.item()

except Exception as e:
# Handle any exceptions in processing a batch
print(f"Error processing batch: {e}")
continue  # Skip this batch on error

print(f"Epoch {epoch + 1} completed. Total Loss: {total_loss:.4f}")

# Save the trained model
torch.save(model.state_dict(), "detr_trained_weights.pth")
print("Model saved as detr_trained_weights.pth")

Я пытался исправить ошибки, я исправил некоторые, но другие ошибки продолжают появляться.
Я ожидаю, что процесс обучения может быть завершен и файл весов будет создан.

Подробнее здесь: https://stackoverflow.com/questions/790 ... my-dataset

1728896322

Anonymous

Я пытаюсь обучить свой набор данных DETR (преобразователю обнаружения), чтобы иметь возможность обнаруживать дефекты на бетонных мостах. у меня много ошибок, я пытался их исправить, но через некоторое время те же ошибки появляются снова.
Вот мой код, учитывая, что я имею дело с прямоугольными ограничивающими рамками.
Вот мой код, учитывая, что я имею дело с прямоугольными ограничивающими рамками.
р>
[code]import os
import json
from PIL import Image
from torch.utils.data import Dataset, DataLoader
import torch
from torchvision import transforms
from transformers import DetrImageProcessor, DetrForObjectDetection
import torch.optim as optim
import cv2  # Ensure you have OpenCV installed

# Set your directories
train_images_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\train\img_sized'
val_images_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\val\img_sized'
test_images_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\test\img_sized'

train_annotations_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\train\ann_sized'
val_annotations_dir = r'D:\Civil Engineering\PHD\Thesis\ImageProcessing\work\dataset\defect_detection\dacl10k-DatasetNinja\val\ann_sized'
test_annotations_dir = None  # No masks for test dataset

# Function to load annotations from JSON file
def load_annotations(json_file):
with open(json_file, 'r') as f:
data = json.load(f)

annotations = []
for obj in data['objects']:
class_id = obj['classId']
points = obj['points']['exterior']  # polygon points
annotations.append({'class_id': class_id, 'points': points})

return annotations

def create_mask_from_polygons(image_shape, annotations):
""" Create a binary mask from polygon annotations.  """
mask = torch.zeros(image_shape, dtype=torch.float32)  # Assuming image_shape is (height, width)

for ann in annotations:
if ann and 'points' in ann:
# Convert polygon points to a format compatible with drawing
points = torch.tensor(ann['points'], dtype=torch.int32)
# Fill the mask using polygon points
cv2.fillPoly(mask.numpy(), [points.numpy()], color=(1.0))  # Fill the polygon with 1

return mask

# Custom dataset class
class DefectDataset(Dataset):
def __init__(self, images_dir, annotations_dir):
self.images_dir = images_dir
self.annotations_dir = annotations_dir
self.image_files = os.listdir(images_dir)
self.transform = transforms.Compose([
transforms.Pad(padding=(10, 10, 10, 10), fill=0),  # Pad image by 10 pixels on all sides
transforms.Resize((224, 224)),  # Resize to 224x224
transforms.ToTensor()  # Convert image to tensor
])

def __len__(self):
return len(self.image_files)

def __getitem__(self, idx):
image_file = self.image_files[idx]
image_path = os.path.join(self.images_dir, image_file)

try:
image = Image.open(image_path).convert("RGB")
image = self.transform(image)  # Convert image to tensor

# Load corresponding JSON file
json_file = os.path.join(self.annotations_dir, f"{os.path.splitext(image_file)[0]}.json")
annotations = load_annotations(json_file)

return image, annotations
except Exception as e:
print(f"Error loading {image_file}: {e}")
return None, None  # Skip or handle as needed

# Custom collate function to handle variable-sized images and annotations
def custom_collate_fn(batch):
images, annotations = zip(*batch)

# Filter out any None items
images = [img for img in images if img is not None]
annotations = [ann for ann in annotations if ann is not None]

# Stack images into a single tensor (this assumes images are of the same size)
images = torch.stack(images, dim=0)  # Ensure all images have same size

return images, annotations

# Create datasets and dataloaders
train_dataset = DefectDataset(train_images_dir, train_annotations_dir)
val_dataset = DefectDataset(val_images_dir, val_annotations_dir)

train_loader = DataLoader(train_dataset, batch_size=5, shuffle=True, collate_fn=custom_collate_fn)
val_loader = DataLoader(val_dataset, batch_size=5, collate_fn=custom_collate_fn)

# Load Detr model
image_processor = DetrImageProcessor.from_pretrained("facebook/detr-resnet-50")
model = DetrForObjectDetection.from_pretrained("facebook/detr-resnet-50")

# Define optimizer
optimizer = optim.Adam(model.parameters(), lr=1e-5)

# Example placeholder loss function for object detection
# Updated compute_loss function
def compute_loss(outputs, annotations):
"""
Computes the loss for object detection using DETR outputs.

Args:
outputs: The model's output.
annotations: A list of annotations for each image in the batch, containing class labels and bounding boxes.

Returns:
The computed loss tensor.
"""

# Extract predicted class logits and bounding boxes
pred_logits = outputs.logits  # Shape: [batch_size, num_queries, num_classes + 1]
pred_boxes = outputs.pred_boxes  # Shape: [batch_size, num_queries, 4]

target_classes = []
target_boxes = []

for ann in annotations:
# Collect class IDs and bounding boxes from annotations
classes = [obj['class_id'] for obj in ann]
boxes = [obj['bbox'] for obj in ann]  # [x_min, y_min, x_max, y_max]

target_classes.append(torch.tensor(classes, dtype=torch.long))
target_boxes.append(torch.tensor(boxes, dtype=torch.float32))

# Convert to tensors
target_classes = torch.stack(target_classes)  # Shape: [batch_size, num_objects]
target_boxes = torch.stack(target_boxes)  # Shape:  [batch_size, num_objects, 4]

# Calculate classification and bounding box loss (DETR uses Hungarian matching)
loss_class = torch.nn.CrossEntropyLoss()(pred_logits.view(-1, pred_logits.shape[-1]), target_classes.view(-1))
loss_bbox = torch.nn.L1Loss()(pred_boxes.view(-1, 4), target_boxes.view(-1, 4))

# Total loss (you might want to weight these losses)
total_loss = loss_class + loss_bbox

return total_loss

# Training loop with loss computation and saving model weights
for epoch in range(5):  # Number of epochs
model.train()
total_loss = 0
for batch in train_loader:
images, annotations = batch

# Check if batch is empty or invalid (skip invalid images)
if len(images) == 0 or any([ann is None for ann in annotations]):
continue  # Skip this batch if there are no valid images or annotations

try:
# Forward pass
outputs = model(images)

# Compute loss (with image shape passed)
loss = compute_loss(outputs, annotations)  # Only pass two arguments

# If loss is NaN or invalid, skip this batch
if not torch.isfinite(loss):
print(f"Invalid loss for batch, skipping.")
continue

# Backward pass and optimization
optimizer.zero_grad()
loss.backward()
optimizer.step()

# Accumulate total loss
total_loss += loss.item()

except Exception as e:
# Handle any exceptions in processing a batch
print(f"Error processing batch: {e}")
continue  # Skip this batch on error

print(f"Epoch {epoch + 1} completed. Total Loss: {total_loss:.4f}")

# Save the trained model
torch.save(model.state_dict(), "detr_trained_weights.pth")
print("Model saved as detr_trained_weights.pth")

[/code]
Я пытался исправить ошибки, я исправил некоторые, но другие ошибки продолжают появляться.
Я ожидаю, что процесс обучения может быть завершен и файл весов будет создан. 

Подробнее здесь: [url]https://stackoverflow.com/questions/79084295/detr-detection-transformer-model-is-not-working-for-my-dataset[/url]

Ответить Пред. тема След. тема

1 сообщение • Страница 1 из 1

Быстрый ответ

Заголовок:

Имя пользователя:

Изменение регистра текста:

Смайлики

Ещё смайлики…

К этому ответу прикреплено по крайней мере одно вложение.

Если вы не хотите добавлять вложения, оставьте поля пустыми. Можно прикреплять файлы, перетаскивая их в окно сообщения.

Максимально разрешённый размер вложения: 15 МБ.

Имя файла:

Комментарий к файлу:

Имя файла	Комментарий к файлу	Размер	Статус

Похожие темы

Ответы

Просмотры

Последнее сообщение

Модель DETR (преобразователь обнаружения) не работает для моего набора данных

Последнее сообщение Anonymous « 14 окт 2024, 01:51
Добавлено в форуме Python

Anonymous » 14 окт 2024, 01:51 » в форуме Python

Я пытаюсь обучить свой набор данных DETR (преобразователю обнаружения), чтобы иметь возможность обнаруживать дефекты на бетонных мостах. у меня много ошибок, я пытался их исправить, но через некоторое время те же ошибки появляются снова.
Вот мой...

0 Ответы

11 Просмотры

Последнее сообщение Anonymous
14 окт 2024, 01:51
Как использовать модель DETR Facebook для обнаружения и маскировки воздушных кабелей на изображениях?

Последнее сообщение Anonymous « 05 май 2024, 21:42
Добавлено в форуме Python

Anonymous » 05 май 2024, 21:42 » в форуме Python

В настоящее время я работаю над проектом, в котором мне нужно обнаружить и замаскировать воздушные кабели на изображениях с помощью модели Facebook DETR (DEtection TRansformers). Я подготовил набор данных с использованием CVAT (инструмент аннотации...

0 Ответы

42 Просмотры

Последнее сообщение Anonymous
05 май 2024, 21:42
Ошибка ОС: невозможно загрузить модель для «facebook/detr-resnet-101».

Последнее сообщение Anonymous « 03 янв 2025, 15:47
Добавлено в форуме Python

Anonymous » 03 янв 2025, 15:47 » в форуме Python

этот код рендерится два раза
я получаю ожидаемый результат, но я буду рендерить снова, так что на этот раз я получу эту ошибку ОС
как решить эта ошибка?

Невозможно загрузить модель для «facebook/detr-resnet-101». Если вы пытались загрузить его с...

0 Ответы

15 Просмотры

Последнее сообщение Anonymous
03 янв 2025, 15:47
Я использую CNN (последовательную модель) для обнаружения глаз. Могу ли я сохранить обученную модель и переобучить ее, н

Последнее сообщение Гость « 29 окт 2023, 09:57
Добавлено в форуме Python

Гость » 29 окт 2023, 09:57 » в форуме Python

Мой графический процессор — Rtx 3050 4 ГБ. Из-за меньшего количества видеопамяти я уменьшил размер пакета, но это все равно занимало слишком много времени, почти 1 час для каждой эпохи. Могу ли я сохранить обученную модель (.h5) и переобучить ее без...

0 Ответы

116 Просмотры

Последнее сообщение Гость
29 окт 2023, 09:57
Я хочу развернуть модель Yolov5 на ESP32-CAM для обнаружения объектов. Тем не менее, модель не работает на устройстве

Последнее сообщение Anonymous « 21 фев 2025, 23:08
Добавлено в форуме Python

Anonymous » 21 фев 2025, 23:08 » в форуме Python

Сначала извините за мой простой английский
, поэтому у меня есть этот проект для обнаружения объектов, и я использую Yolov5 для обнаружения целевого объекта, и я делаю этот шаг, я попробую код и его Работа в кулачке в моем ноутбуке, и теперь я хочу...

0 Ответы

22 Просмотры

Последнее сообщение Anonymous
21 фев 2025, 23:08

Вернуться в «Python»