矿场工程车检测数据集 4900张工程车带标注voc yolo

news2025/4/2 7:29:50

矿场工程车检测数据集

数据集描述

该数据集旨在用于矿场工程车的检测和分类任务，涵盖了多种常见的工程车辆类型。数据集包含了大量的工程车图像及其对应的标注信息，可用于训练计算机视觉模型，以识别和定位矿场中的不同工程车辆。

数据规模

数据集共有4981张图像，标注了7798个对象，涵盖了10种不同的工程车辆类别。

类别及数量

数据集中的类别及数量如下：

挖掘机 (excavator)：1086张图像，标注了1356个对象。
自卸卡车 (dump truck)：1074张图像，标注了1816个对象。
压路机 (compactor)：538张图像，标注了583个对象。
移动起重机 (mobile crane)：529张图像，标注了589个对象。
塔式起重机 (tower crane)：169张图像，标注了235个对象。
轮式装载机 (wheel loader)：851张图像，标注了922个对象。
混凝土搅拌车 (concrete mixer truck)：376张图像，标注了427个对象。
反铲装载机 (backhoe loader)：666张图像，标注了693个对象。
推土机 (dozer)：492张图像，标注了536个对象。
平地机 (grader)：622张图像，标注了641个对象。

标注格式

数据集中的标注信息采用了VOC（Visual Object Classes）格式，每个图像都有一个对应的XML文件，记录了每个对象的位置信息（边界框坐标）和类别标签。

数据集结构

典型的数据集目录结构如下：

1mine_engineering_vehicle_dataset/
2├── Annotations/
3│   ├── img_0001.xml
4│   ├── img_0002.xml
5│   └── ...
6├── ImageSets/
7│   ├── Main/
8│   │   ├── train.txt
9│   │   ├── val.txt
10│   │   └── test.txt
11├── JPEGImages/
12│   ├── img_0001.jpg
13│   ├── img_0002.jpg
14│   └── ...
15└── labels/
16    ├── train/
17    │   ├── img_0001.txt
18    │   ├── img_0002.txt
19    └── val/
20        ├── img_0001.txt
21        ├── img_0002.txt

应用场景

该数据集可以用于以下应用场景：

工程车辆检测与分类：训练模型识别矿场中的不同工程车辆类型。
安全管理：实时监测矿场中的车辆活动，提高安全管理效率。
自动化调度：辅助矿场自动化调度系统，提高运营效率。
科研分析：用于研究矿场工程车辆的行为模式和发展趋势。

示例代码

以下是一个使用Python和相关库（如OpenCV、PyTorch等）来加载和展示数据集的简单示例代码：

1import os
2import cv2
3import xml.etree.ElementTree as ET
4from PIL import Image
5import numpy as np
6
7# 数据集路径
8dataset_path = 'path/to/mine_engineering_vehicle_dataset/'
9
10# 加载图像和标签
11def load_image_and_label(image_path, annotation_path):
12    # 读取图像
13    image = Image.open(image_path).convert('RGB')
14    # 解析XML文件
15    tree = ET.parse(annotation_path)
16    root = tree.getroot()
17    objects = []
18    for obj in root.findall('object'):
19        name = obj.find('name').text
20        bbox = obj.find('bndbox')
21        xmin = int(bbox.find('xmin').text)
22        ymin = int(bbox.find('ymin').text)
23        xmax = int(bbox.find('xmax').text)
24        ymax = int(bbox.find('ymax').text)
25        objects.append([xmin, ymin, xmax, ymax, name])
26    return image, objects
27
28# 展示图像
29def show_image_with_boxes(image, boxes):
30    img = np.array(image)
31    for box in boxes:
32        xmin, ymin, xmax, ymax, name = box
33        cv2.rectangle(img, (xmin, ymin), (xmax, ymax), (0, 255, 0), 2)
34        cv2.putText(img, name, (xmin, ymin - 10), cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 255, 0), 2)
35    cv2.imshow('Image with Boxes', img)
36    cv2.waitKey(0)
37    cv2.destroyAllWindows()
38
39# 主函数
40if __name__ == "__main__":
41    images_dir = os.path.join(dataset_path, 'JPEGImages')
42    annotations_dir = os.path.join(dataset_path, 'Annotations')
43    
44    # 获取图像列表
45    image_files = [f for f in os.listdir(images_dir) if f.endswith('.jpg')]
46    
47    # 随机选择一张图像
48    selected_image = np.random.choice(image_files)
49    image_path = os.path.join(images_dir, selected_image)
50    annotation_path = os.path.join(annotations_dir, selected_image.replace('.jpg', '.xml'))
51    
52    # 加载图像和标签
53    image, boxes = load_image_and_label(image_path, annotation_path)
54    
55    # 展示带有标注框的图像
56    show_image_with_boxes(image, boxes)

这段代码展示了如何加载图像和其对应的VOC XML标注文件，并在图像上绘制边界框和类别标签。您可以根据实际需求进一步扩展和修改这段代码，以适应您的具体应用场景。