Facebook Open-Sources a PyTorch 1.0 Benchmark for Mask R-CNN

Posted by calx on 2019-04-25 18:29 / 2,148 views

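Summary: Facebook AI Research recently open-sourced a PyTorch 1.0 benchmark implementation of Faster R-CNN and Mask R-CNN. mmdetection is a PyTorch-based open-source object detection toolkit recently released jointly by SenseTime and CUHK. You can also configure your own dataset paths.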

Facebook AI Research recently open-sourced MaskRCNN-Benchmark, a PyTorch 1.0 benchmark implementation of Faster R-CNN and Mask R-CNN. Compared with Detectron and mmdetection, MaskRCNN-Benchmark delivers comparable accuracy with faster training and lower GPU memory usage.

Project: https://github.com/facebookresearch/maskrcnn-benchmark

MaskRCNN-Benchmark object detection example.

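Detectron and mmdetection

Detectron is Facebook AI Research's software system implementing state-of-the-art object detection algorithms, including Mask R-CNN. It is written in Python and built on the Caffe2 deep learning framework. Detectron currently includes implementations of the following object detection algorithms: Mask R-CNN, RetinaNet, Faster R-CNN, RPN, Fast R-CNN, and R-FCN.

mmdetection is a PyTorch-based open-source object detection toolkit recently released jointly by SenseTime and the Chinese University of Hong Kong (CUHK). It supports Mask R-CNN and several other popular detection frameworks, so readers can test different pretrained models and train new detection and segmentation models in a PyTorch environment. Compared with Detectron, mmdetection offers slightly higher accuracy, slightly faster training, and slightly lower GPU memory requirements.

The first release of mmdetection implements RPN, Fast R-CNN, Faster R-CNN, and Mask R-CNN, with RetinaNet and Cascade R-CNN planned for release soon. More importantly, PyTorch-based code is a generation ahead of Caffe2-based code in ease of use: in the time it takes to install Detectron successfully, you could install mmdetection about a dozen times.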

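MaskRCNN-Benchmark highlights:

PyTorch 1.0: RPN, Faster R-CNN, and Mask R-CNN implementations that match or exceed Detectron's accuracy;

Very fast: trains twice as fast as Detectron and 1.3 times as fast as mmdetection;

Memory efficient: uses roughly 500 MB less GPU memory than mmdetection during training;

Multi-GPU training and inference;

Batched inference: can run inference on multiple images per batch on each GPU;

CPU support for inference: runs on the CPU at inference time;

Pretrained models for almost all reference Mask R-CNN and Faster R-CNN configurations with the 1x schedule.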

MaskRCNN-Benchmark Model Zoo: baseline model performance

Link: https://github.com/facebookresearch/maskrcnn-benchmark/blob/master/MODEL_ZOO.md

Hardware:

8 NVIDIA V100 GPUs

Software:

PyTorch version: 1.0.0a0+dd2c487

CUDA 9.2

CUDNN 7.1

NCCL 2.2.13-1

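End-to-end Mask R-CNN and Faster R-CNN baselines

All baselines use the same experimental setup as Detectron. As in Detectron, the detection model weights are initialized from the Caffe2 ImageNet weights. Pretrained models are available through the model id links in the table below.

Performance comparison with Detectron and mmdetection

Training speed

The figures in the table below are in seconds per iteration; lower is better. (The hardware noted for mmdetection differs from the hardware used for maskrcnn_benchmark.)

Training memory (lower is better)

Inference accuracy (higher is better)

Webcam and Jupyter notebook demo

The project provides a simple webcam demo that shows how to run inference with maskrcnn_benchmark: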

cd demo
# by default, it runs on the GPU
# for best results, use min-image-size 800
python webcam.py --min-image-size 800
# can also run it on the CPU
python webcam.py --min-image-size 300 MODEL.DEVICE cpu
# or change the model that you want to use
python webcam.py --config-file ../configs/caffe2/e2e_mask_rcnn_R_101_FPN_1x_caffe2.yaml --min-image-size 300 MODEL.DEVICE cpu
# in order to see the probability heatmaps, pass --show-mask-heatmaps
python webcam.py --min-image-size 300 --show-mask-heatmaps MODEL.DEVICE cpu

Installation

Tutorial: https://github.com/facebookresearch/maskrcnn-benchmark/blob/master/INSTALL.md

Requirements:

A nightly build of PyTorch 1.0; installation instructions: https://pytorch.org/get-started/locally/

torchvision

cocoapi

yacs

(Optional) OpenCV, for the webcam demo

# maskrcnn_benchmark and coco api dependencies
pip install ninja yacs cython

# follow PyTorch installation in https://pytorch.org/get-started/locally/
# we give the instructions for CUDA 9.0
conda install pytorch-nightly -c pytorch

# install torchvision
cd ~/github
git clone git@github.com:pytorch/vision.git
cd vision
python setup.py install

# install pycocotools
cd ~/github
git clone git@github.com:cocodataset/cocoapi.git
cd cocoapi/PythonAPI
python setup.py build_ext install

# install PyTorch Detection
cd ~/github
git clone git@github.com:facebookresearch/maskrcnn-benchmark.git
cd maskrcnn-benchmark
# the following will install the lib with
# symbolic links, so that you can modify
# the files if you want and won't need to
# re-build it
python setup.py build develop

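Inference in a few lines of code

The project provides a helper class that simplifies writing inference pipelines with pretrained models. Run the following code from the demo folder: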

from maskrcnn_benchmark.config import cfg
from predictor import COCODemo

config_file = "../configs/caffe2/e2e_mask_rcnn_R_50_FPN_1x_caffe2.yaml"

# update the config options with the config file
cfg.merge_from_file(config_file)
# manual override some options
cfg.merge_from_list(["MODEL.DEVICE", "cpu"])

coco_demo = COCODemo(
    cfg,
    min_image_size=800,
    confidence_threshold=0.7,
)
# load image and then run prediction
image = ...
predictions = coco_demo.run_on_opencv_image(image)
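
The merge_from_list call above takes a flat list of alternating keys and values, where a dotted key such as MODEL.DEVICE addresses a nested option. As a rough illustration of that idea only, the sketch below applies such a list to a plain nested dict. The apply_overrides helper and the default values are hypothetical, and this is not the yacs implementation, which does more (for example, type checking).

```python
import copy


def apply_overrides(config, overrides):
    """Return a copy of `config` with [key1, value1, key2, value2, ...] applied.

    Hypothetical helper for illustration; dotted keys address nested dicts.
    """
    if len(overrides) % 2 != 0:
        raise ValueError("overrides must contain key/value pairs")
    result = copy.deepcopy(config)
    for dotted_key, value in zip(overrides[0::2], overrides[1::2]):
        *parents, leaf = dotted_key.split(".")
        node = result
        for name in parents:
            node = node[name]  # raises KeyError if the section does not exist
        if leaf not in node:
            raise KeyError(f"unknown option: {dotted_key}")
        node[leaf] = value
    return result


# Assumed defaults, made up for this example.
defaults = {"MODEL": {"DEVICE": "cuda"}, "INPUT": {"MIN_SIZE_TEST": 800}}
overridden = apply_overrides(defaults, ["MODEL.DEVICE", "cpu"])
print(overridden["MODEL"]["DEVICE"])  # cpu
```

Rejecting unknown keys, as this sketch does, makes a mistyped option name fail loudly instead of being silently ignored.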

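Training on the COCO dataset

To run the examples below, you first need to install maskrcnn_benchmark. You also need to download the COCO dataset; we recommend symlinking the path to the COCO dataset into datasets/ as shown below. We use the minival and valminusminival sets from Detectron's GitHub repository.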

# symlink the coco dataset
cd ~/github/maskrcnn-benchmark
mkdir -p datasets/coco
ln -s /path_to_coco_dataset/annotations datasets/coco/annotations
ln -s /path_to_coco_dataset/train2014 datasets/coco/train2014
ln -s /path_to_coco_dataset/test2014 datasets/coco/test2014
ln -s /path_to_coco_dataset/val2014 datasets/coco/val2014

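You can also configure your own dataset paths. To do so, point maskrcnn_benchmark/config/paths_catalog.py to where your datasets are stored. Alternatively, create a new paths_catalog.py file that implements the same two classes and pass it as the PATHS_CATALOG config argument during training.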
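
To give a feel for what a paths catalog might look like, here is a minimal sketch of a dataset-lookup class that resolves a dataset name to an image folder and an annotation file under datasets/. It shows only one class, and the class name, attribute names, dataset names, and return format are all assumptions made for illustration; check the paths_catalog.py shipped with the project for the interface it actually expects.

```python
import os


class DatasetCatalog:
    """Hypothetical catalog mapping dataset names to on-disk locations."""

    DATA_DIR = "datasets"  # assumed root, matching the datasets/ symlinks
    DATASETS = {
        # dataset name -> paths relative to DATA_DIR (illustrative entries)
        "coco_2014_train": {
            "img_dir": "coco/train2014",
            "ann_file": "coco/annotations/instances_train2014.json",
        },
        "coco_2014_val": {
            "img_dir": "coco/val2014",
            "ann_file": "coco/annotations/instances_val2014.json",
        },
    }

    @staticmethod
    def get(name):
        """Resolve a dataset name to its image root and annotation file."""
        if name not in DatasetCatalog.DATASETS:
            raise KeyError(f"Dataset not available: {name}")
        entry = DatasetCatalog.DATASETS[name]
        return {
            "root": os.path.join(DatasetCatalog.DATA_DIR, entry["img_dir"]),
            "ann_file": os.path.join(DatasetCatalog.DATA_DIR, entry["ann_file"]),
        }


paths = DatasetCatalog.get("coco_2014_train")
print(paths["root"])
print(paths["ann_file"])
```

Keeping every path in one table like this means that moving a dataset only requires changing DATA_DIR or a single entry.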

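Single-GPU training

python /path_to_maskrcnn_benchmark/tools/train_net.py --config-file "/path/to/config/file.yaml"

Multi-GPU training

The project uses torch.distributed.launch internally to start multi-GPU training. This PyTorch utility spawns as many Python processes as the number of GPUs we want to use, and each Python process uses only a single GPU.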

export NGPUS=8

python -m torch.distributed.launch --nproc_per_node=$NGPUS /path_to_maskrcnn_benchmark/tools/train_net.py --
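
As a rough mental model of the one-process-per-GPU pattern described above, the sketch below plans one worker command per GPU and gives each worker its own rank. It only builds the commands and does not start anything. The plan_workers helper, the --local_rank argument, and the environment variable names are assumptions made for this sketch, not a description of the PyTorch launcher's actual behavior; consult the PyTorch documentation for that.

```python
import sys


def plan_workers(nproc_per_node, script, script_args=()):
    """Return one (command, env_overrides) pair per worker process.

    Hypothetical helper: each worker runs the same script and is told
    which GPU slot it owns through its rank.
    """
    plans = []
    for local_rank in range(nproc_per_node):
        command = [
            sys.executable,
            script,
            f"--local_rank={local_rank}",  # assumed flag name
            *script_args,
        ]
        env_overrides = {
            "WORLD_SIZE": str(nproc_per_node),  # total number of workers
            "RANK": str(local_rank),            # this worker's index
        }
        plans.append((command, env_overrides))
    return plans


plans = plan_workers(2, "train_net.py", ["--config-file", "file.yaml"])
for command, env_overrides in plans:
    print(command[1:], env_overrides)
```

Each planned command could then be handed to something like subprocess.Popen with the extra environment variables merged in, so that every process drives exactly one GPU.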

Adding your own dataset

The project supports COCO-style datasets. Support for training on a new dataset can be added as follows:

import torch

from maskrcnn_benchmark.structures.bounding_box import BoxList

class MyDataset(object):
    def __init__(self, ...):
        # as you would do normally

    def __getitem__(self, idx):
        # load the image as a PIL Image
        image = ...

        # load the bounding boxes as a list of list of boxes
        # in this case, for illustrative purposes, we use
        # x1, y1, x2, y2 order.
        boxes = [[0, 0, 10, 10], [10, 20, 50, 50]]
        # and labels
        labels = torch.tensor([10, 20])

        # create a BoxList from the boxes
        boxlist = BoxList(boxes, image.size, mode="xyxy")
        # add the labels to the boxlist
        boxlist.add_field("labels", labels)

        if self.transforms:
            image, boxlist = self.transforms(image, boxlist)

        # return the image, the boxlist and the idx in your dataset
        return image, boxlist, idx

    def get_img_info(self, idx):
        # get img_height and img_width. This is used if
        # we want to split the batches according to the asp

That's it. You can add extra fields to the boxlist, such as segmentation masks (using structures.segmentation_mask.SegmentationMask), or even your own instance type. For a complete example of how the COCO dataset is implemented, see: https://github.com/facebookresearch/maskrcnn-benchmark/blob/master/maskrcnn_benchmark/data/datasets/coco.py
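
The mode="xyxy" argument in the dataset example says that each box is stored as x1, y1, x2, y2 corner coordinates. Another common convention stores the top-left corner plus a width and a height. The sketch below converts between the two in plain Python. The function names are hypothetical, and the width is computed as a plain x2 - x1 with no pixel offset, which may differ from the convention BoxList uses internally.

```python
def xyxy_to_xywh(box):
    """Convert [x1, y1, x2, y2] corners to [x, y, width, height]."""
    x1, y1, x2, y2 = box
    return [x1, y1, x2 - x1, y2 - y1]


def xywh_to_xyxy(box):
    """Convert [x, y, width, height] back to [x1, y1, x2, y2] corners."""
    x, y, w, h = box
    return [x, y, x + w, y + h]


# Same illustrative boxes as in the dataset example.
boxes = [[0, 0, 10, 10], [10, 20, 50, 50]]
converted = [xyxy_to_xywh(b) for b in boxes]
print(converted)  # [[0, 0, 10, 10], [10, 20, 40, 30]]
```

Whichever convention your annotations use, the boxes and the mode string passed along with them need to agree.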

