使用labelme工具制作分割数据集

程序员文章站 2022-03-25 17:24:15

...

参考：图像语义分割标注工具labelme制作自己的数据集用于mask-rcnn训练

1.安装

labelme（标注mask数据集用的）github地址
python3环境

pip install pyqt5
pip install labelme

建议使用国内镜像：

pip install pyqt5 -i https://pypi.tuna.tsinghua.edu.cn/simple 
pip install labelme -i https://pypi.tuna.tsinghua.edu.cn/simple

2.使用

（1）打开软件
终端执行：

labelme

（2）标注图片
点击Open Dir，选择文件夹；点击Create Polygons开始标注；按字母D完成标注跳往下一张，这事会弹出保存界面，默认保存在图片所在文件夹。右键可选择取消当前标注点选项（ctrl+z）。
相同类的多个对象需要在类名后加数字。使用labelme工具制作分割数据集
（3）标注文件处理
标注完成后会产生一个对应的图片名.json文件

此时需要进入图片所指的文件夹，执行下面的程序生成下面的标注图：

labelme_json_to_dataset  <文件名>.json

使用labelme工具制作分割数据集
但是我发现我的程序没有这个程序，此时可以取github复制个文件过来，可以直接使用。
https://github.com/wkentaro/labelme/blob/master/labelme/cli/json_to_dataset.py

但是批量处理图片需要写脚本。
可使用下面的程序：唯一输入是只带.json文件的文件夹
参考：关于json_to_dataset时出现 module ‘labelme.utils’ has no attribute 'draw_label’的解决

# labelme_json_to_dataset_batch.py
import argparse
import base64
import json
import os
import os.path as osp

import imgviz
import PIL.Image

from labelme.logger import logger
from labelme import utils
import cv2
from math import *
import numpy as np
import random


def main():
    # 唯一输入:
    # 给出只带.json文件的文件夹
    label_path = r"E:/pics/json/"

    list_path = os.listdir(label_path)
    for i in range(0, len(list_path)):
            logger.warning('This script is aimed to demonstrate how to convert the'
                           'JSON file to a single image dataset, and not to handle'
                           'multiple JSON files to generate a real-use dataset.')

            parser = argparse.ArgumentParser()
            parser.add_argument('--json_file')
            parser.add_argument('-o', '--out', default=None)
            args = parser.parse_args()

            json_file = label_path + list_path[i]
            print(list_path[i])
            if args.out is None:
                out_dir = osp.basename(json_file).replace('.', '_')  # 返回文件名
                out_dir = osp.join(osp.dirname(json_file), out_dir)  # 把目录和文件名合成一个路径
            else:
                out_dir = args.out
            if not osp.exists(out_dir):
                os.mkdir(out_dir)  # 用于以数字权限模式创建目录

            data = json.load(open(json_file))
            imageData = data.get('imageData')

            if not imageData:
                imagePath = os.path.join(os.path.dirname(json_file), data['imagePath']) # os.path.dirname返回文件路径
                with open(imagePath, 'rb') as f:
                    imageData = f.read()
                    imageData = base64.b64encode(imageData).decode('utf-8')
            img = utils.img_b64_to_arr(imageData)

            label_name_to_value = {'_background_': 0}
            for shape in sorted(data['shapes'], key=lambda x: x['label']):
                label_name = shape['label']
                if label_name in label_name_to_value:
                    label_value = label_name_to_value[label_name]
                else:
                    label_value = len(label_name_to_value)
                    label_name_to_value[label_name] = label_value
            lbl, _ = utils.shapes_to_label(
                img.shape, data['shapes'], label_name_to_value
            )

            label_names = [None] * (max(label_name_to_value.values()) + 1)
            for name, value in label_name_to_value.items():
                label_names[value] = name

            lbl_viz = imgviz.label2rgb(
                label=lbl, img=imgviz.asgray(img), label_names=label_names, loc='rb'
            )

            PIL.Image.fromarray(img).save(osp.join(out_dir, 'img.png'))
            utils.lblsave(osp.join(out_dir, 'label.png'), lbl)
            PIL.Image.fromarray(lbl_viz).save(osp.join(out_dir, 'label_viz.png'))

            with open(osp.join(out_dir, 'label_names.txt'), 'w') as f:
                for lbl_name in label_names:
                    f.write(lbl_name + '\n')

            logger.info('Saved to: {}'.format(out_dir))



if __name__ == '__main__':
    main()

使用labelme工具制作分割数据集

1.安装

2.使用

自己制作机器学习训练和测试使用的二进制数据集（C++）

图像语义分割标注工具labelme制作自己的数据集用于mask-rcnn训练

语义分割数据集标注工具labelme

使用labelme标注工具制作图像分割标签数据

labelme 制作语义分割（Semantic Segmentation）数据，不同labels指定不同颜色

PaddleDetection——使用自己制作的VOC数据集进行模型训练的避坑指南

使用精灵标注助手制作yolov3训练数据集（附解析xml代码）

tensorflow版使用uNet进行医学图像分割(Skin数据集)

使用labelme工具制作分割数据集

python工具集pandas处理数据使用介绍