
UNet: semantic segmentation with PyTorch

A customized implementation of the U-Net model in PyTorch for Kaggle's Carvana Image Masking Challenge, applied to high-resolution images.

This model was trained from scratch on 5,000 images (with no data augmentation) and achieved a Dice coefficient of 0.988423 (511 out of 735 on the challenge leaderboard) on over 100k test images. This score could likely be improved with more training, more data augmentation, finer tuning of the model, tuning the CRF post-processing parameters, and weighting edge pixels more heavily.
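For reference, the Dice coefficient reported above measures the overlap between the predicted and ground-truth masks. A minimal NumPy sketch of the metric (the function name and smoothing term are illustrative, not taken from the repository):

```python
import numpy as np

def dice_coefficient(pred: np.ndarray, target: np.ndarray, eps: float = 1e-7) -> float:
    """Dice = 2*|A ∩ B| / (|A| + |B|) for binary masks; 1.0 means perfect overlap."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    # eps avoids division by zero when both masks are empty
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)

a = np.array([[1, 1, 0], [0, 1, 0]])
b = np.array([[1, 1, 0], [0, 0, 0]])
print(dice_coefficient(a, b))  # 2*2 / (3 + 2) ≈ 0.8
```

Identical masks score 1.0 and disjoint masks score approximately 0.0, which is why a value of 0.988423 indicates near-perfect segmentation.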

The Carvana data is available on the Kaggle website.

Usage

Note: Use Python 3

Prediction

You can easily test the output masks on your images via the CLI.

To predict a single image and save it:

python predict.py -i image.jpg -o output.jpg

To predict multiple images and show them without saving:

python predict.py -i image1.jpg image2.jpg --viz --no-save

python predict.py -h

usage: predict.py [-h] [--model FILE] --input INPUT [INPUT ...]
                  [--output INPUT [INPUT ...]] [--viz] [--no-save]
                  [--mask-threshold MASK_THRESHOLD] [--scale SCALE]

Predict masks from input images

optional arguments:
  -h, --help            show this help message and exit
  --model FILE, -m FILE
                        Specify the file in which the model is stored
                        (default: MODEL.pth)
  --input INPUT [INPUT ...], -i INPUT [INPUT ...]
                        Filenames of input images (default: None)
  --output INPUT [INPUT ...], -o INPUT [INPUT ...]
                        Filenames of output images (default: None)
  --viz, -v             Visualize the images as they are processed
                        (default: False)
  --no-save, -n         Do not save the output masks (default: False)
  --mask-threshold MASK_THRESHOLD, -t MASK_THRESHOLD
                        Minimum probability value to consider a mask pixel
                        white (default: 0.5)
  --scale SCALE, -s SCALE
                        Scale factor for the input images (default: 0.5)
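The --mask-threshold option corresponds to a simple per-pixel cut on the probabilities the network outputs: anything above the threshold becomes a white mask pixel. A hedged sketch of the idea (the function name is illustrative, not the repository's API):

```python
import numpy as np

def probs_to_mask(probs: np.ndarray, threshold: float = 0.5) -> np.ndarray:
    """Turn a map of per-pixel probabilities into a binary mask image.

    Pixels whose probability exceeds the threshold become white (255),
    all others become black (0).
    """
    return np.where(probs > threshold, 255, 0).astype(np.uint8)

probs = np.array([[0.9, 0.4],
                  [0.51, 0.1]])
print(probs_to_mask(probs))
# [[255   0]
#  [255   0]]
```

Raising the threshold makes the mask more conservative (fewer pixels marked as foreground); lowering it does the opposite.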

You can specify which model file to use with --model MODEL.pth.

Training

python train.py -h

usage: train.py [-h] [-e E] [-b [B]] [-l [LR]] [-f LOAD] [-s SCALE] [-v VAL]

Train the UNet on images and target masks

optional arguments:
  -h, --help            show this help message and exit
  -e E, --epochs E      Number of epochs (default: 5)
  -b [B], --batch-size [B]
                        Batch size (default: 1)
  -l [LR], --learning-rate [LR]
                        Learning rate (default: 0.1)
  -f LOAD, --load LOAD  Load model from a .pth file (default: False)
  -s SCALE, --scale SCALE
                        Downscaling factor of the images (default: 0.5)
  -v VAL, --validation VAL
                        Percent of the data that is used as validation (0-100)
                        (default: 15.0)

By default, the scale factor is 0.5. If you want better segmentation quality (at the cost of more memory), set it to 1.
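The scale factor simply shrinks both image dimensions before they reach the network, so memory and compute grow with the square of the scale. A back-of-envelope sketch (the function name is illustrative):

```python
def scaled_size(width: int, height: int, scale: float = 0.5) -> tuple:
    """Dimensions after applying the downscaling factor to both axes."""
    return int(width * scale), int(height * scale)

# Carvana images are 1918x1280; at the default scale of 0.5
# the network sees a quarter of the original pixels.
print(scaled_size(1918, 1280))  # (959, 640)
```

Going from scale 0.5 to 1 therefore quadruples the pixel count, which is why the full-resolution setting needs noticeably more memory.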

The input images and target masks must be placed within the data/imgs and data/masks directories, respectively.
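If you want to sanity-check your dataset layout before training, a small script can pair images with their masks. This sketch assumes masks share the image's base filename, which may not match the repository's exact naming convention (Carvana masks, for instance, carry a suffix):

```python
from pathlib import Path

def paired_files(img_dir: str = "data/imgs", mask_dir: str = "data/masks"):
    """Yield (image, mask) path pairs matched by filename.

    Assumes each mask uses the same filename as its image, which is an
    illustrative convention, not necessarily the repository's.
    """
    mask_root = Path(mask_dir)
    for img in sorted(Path(img_dir).glob("*")):
        mask = mask_root / img.name
        if mask.exists():
            yield img, mask
```

Any image without a matching mask is silently skipped here; in practice you would likely want to report such files instead.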

Tensorboard

Using TensorBoard, you can enable real-time visualization of both training and testing losses alongside model predictions.

tensorboard --logdir=runs

Notes on memory

The model was trained from scratch on a GTX 970M with 3GB of VRAM. Predicting images of size 1918x1280 takes about 1.5GB of memory; training takes approximately 3GB, so if you are slightly short on VRAM, try disabling all graphical output. This setup assumes the use of bilinear interpolation instead of transposed convolutions.
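For context, the raw input tensor is only a small fraction of that memory; the bulk goes to intermediate activations inside the network. A back-of-envelope calculation (the function name is illustrative):

```python
def tensor_megabytes(height: int, width: int, channels: int = 3,
                     bytes_per_value: int = 4) -> float:
    """Raw storage for one float32 image tensor, in MiB."""
    return height * width * channels * bytes_per_value / 1024**2

# A single 1918x1280 RGB float32 input is only ~28 MiB...
print(round(tensor_megabytes(1280, 1918), 1))  # 28.1
```

...so the ~1.5GB needed for prediction is dominated by the feature maps computed at each layer, not by the image itself. This is also why halving the scale factor helps so much: every activation map shrinks with it.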

The original paper by Olaf Ronneberger, Philipp Fischer, and Thomas Brox, "U-Net: Convolutional Networks for Biomedical Image Segmentation", is available on arXiv (1505.04597).
