OpenVINO Tutorial Series: Face Detection with MobileNetV2

conda create -n vino2021 python=3.8 -y
conda activate vino2021
pip install opencv-python==
pip install openvino==2021.4.1  # this version is recommended




  • Image preprocessing
  • Inference
  • Post-processing


import cv2
src = cv2.imread("d:/Data/15.jpg")  # OpenCV loads images in BGR order
# src_ = cv2.cvtColor(src, cv2.COLOR_BGR2RGB)  # not needed: the model expects BGR input
image = cv2.resize(src, (256, 256))  # resize to the model's input size
image = image.transpose(2, 0, 1)  # convert HWC to CHW
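
The layout change in the last preprocessing line is easy to get backwards, so here is a minimal sketch of it on a synthetic array. It assumes NumPy is installed; the zero-filled array and the marked pixel are made-up stand-ins for a real resized image.

```python
import numpy as np

# Hypothetical stand-in for a resized BGR image: H=256, W=256, C=3.
hwc = np.zeros((256, 256, 3), dtype=np.uint8)
hwc[10, 20] = (1, 2, 3)  # pixel at row 10, column 20: B=1, G=2, R=3

chw = hwc.transpose(2, 0, 1)         # HWC -> CHW
batch = np.expand_dims(chw, axis=0)  # CHW -> NCHW with N=1

print(chw.shape)    # (3, 256, 256)
print(batch.shape)  # (1, 3, 256, 256)
```

After the transpose, channel 0 holds all blue values, so `chw[0, 10, 20]` is the blue component of the pixel that was at `hwc[10, 20]`.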


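from openvino.inference_engine import IECore

ie = IECore()  # initialize the inference engine
# Read the model
model_xml = "data/face-detection-0200.xml"
model_bin = "data/face-detection-0200.bin"
net = ie.read_network(model=model_xml)
input_blob = next(iter(net.input_info))  # name of the input layer
# Load the model onto the CPU
exec_net = ie.load_network(network=net, device_name="CPU")
# Run inference (this feeds image into the inference engine)
res = exec_net.infer(inputs={input_blob: [image]})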


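output_blob = next(iter(net.outputs))  # name of the output layer
res = res[output_blob]
dets = res.reshape(-1, 7)
sh, sw, _ = src.shape
for det in dets:
    conf = det[2]
    if conf > 0.5:
        # class_id...
        # box coordinates are normalized, so scale them to the source image size
        xmin = int(det[3] * sw)
        ymin = int(det[4] * sh)
        xmax = int(det[5] * sw)
        ymax = int(det[6] * sh)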
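
The scaling above can produce coordinates slightly outside the image when a face touches the border. The following is a minimal sketch of the same conversion with clamping added. The helper name `to_pixel_box` and the sample row are hypothetical and not part of OpenVINO or the original code.

```python
def to_pixel_box(det, width, height):
    """Convert one 7-value detection row to a pixel box clamped to the image.

    det is [image_id, label, conf, x_min, y_min, x_max, y_max] with
    coordinates normalized to the 0..1 range.
    """
    _, _, conf, x_min, y_min, x_max, y_max = det

    def clamp(value, upper):
        # Keep the coordinate inside [0, upper].
        return max(0, min(int(value), upper))

    box = (
        clamp(x_min * width, width - 1),
        clamp(y_min * height, height - 1),
        clamp(x_max * width, width - 1),
        clamp(y_max * height, height - 1),
    )
    return box, conf


# Made-up detection that spills past the left and bottom edges.
box, conf = to_pixel_box([0, 0, 0.9, -0.1, 0.25, 0.5, 1.2], 640, 480)
print(box, conf)
```

The clamped box can be passed directly to a drawing call such as `cv2.rectangle` without risking out-of-range coordinates.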


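  • Step 1: Initialize the inference engine
  • Step 2: Read the network from the xml file and the weights from the bin file, or read both the network and the weights directly from an onnx file
  • Step 3: Configure the network input and output (image preprocessing)
  • Step 4: Load the model to the device
  • Step 5: Create the inference request
  • Step 6: Prepare the input
  • Step 7: Run inference
  • Step 8: Post-processing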


import cv2
from openvino.inference_engine import IECore
import numpy as np
from timeit import default_timer as timer

# ---------------------------Step 1. Initialize inference engine core--------------------------------------------------
ie = IECore()
device = "CPU"
# ---------------------------Step 2. Read a model in OpenVINO Intermediate Representation or ONNX format---------------
model_xml = "data/face-detection-0200.xml"
model_bin = "data/face-detection-0200.bin"
net = ie.read_network(model=model_xml)
# ---------------------------Step 3. Configure input & output----------------------------------------------------------
input_blob = next(iter(net.input_info))
output_blob = next(iter(net.outputs))
n, c, h, w = net.input_info[input_blob].input_data.shape
print("output shape = ", net.outputs[output_blob].shape)

src = cv2.imread("d:/Data/6.jpg")
#src_ = cv2.cvtColor(src, cv2.COLOR_BGR2RGB)
image = cv2.resize(src, (w, h))
image = image.transpose(2, 0, 1)
# ---------------------------Step 4. Loading model to the device-------------------------------------------------------
exec_net = ie.load_network(network=net, device_name=device)
# ---------------------------Step 5. Create infer request--------------------------------------------------------------
# ---------------------------Step 6. Prepare input---------------------------------------------------------------------
# ---------------------------Step 7. Do inference----------------------------------------------------------------------
tic = timer()
res = exec_net.infer(inputs={input_blob: [image]})
toc = timer()
print("inference time (ms): ", 1000*(toc - tic))
print("latency (ms):", exec_net.requests[0].latency)
# ---------------------------Step 8. Process output--------------------------------------------------------------------
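
A single timed call, as in Step 7, can be noisy, and the first call is often slower than later ones. A minimal sketch of averaging over several runs follows. The helper `average_latency_ms` and its `runs` and `warmup` parameters are hypothetical, and `infer_fn` stands for any zero-argument callable that performs one inference.

```python
from timeit import default_timer as timer


def average_latency_ms(infer_fn, runs=20, warmup=3):
    """Return the mean wall-clock time of infer_fn in milliseconds."""
    # Warm-up calls are executed but not timed.
    for _ in range(warmup):
        infer_fn()
    tic = timer()
    for _ in range(runs):
        infer_fn()
    toc = timer()
    return 1000 * (toc - tic) / runs


# Stand-in workload so the sketch runs without a model.
calls = []
mean_ms = average_latency_ms(lambda: calls.append(1), runs=5, warmup=2)
print("mean time (ms):", mean_ms)
```

With the script above, the callable could be `lambda: exec_net.infer(inputs={input_blob: [image]})`.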



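The backbone of this face detection model is MobileNetV2, and the detection head is the head of the SSD object detector. During training, the images were resized to 256x256. As the previous section showed, deploying a model takes only three steps: image preprocessing, inference, and post-processing. OpenVINO handles inference for us, so we only need to prepare the model input and process the model output.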


In the model file, the input is named `input`, the input image shape is `1, 3, 256, 256`, and the layout is `B, C, H, W`, where:

  • `B` - batch size
  • `C` - number of image channels, usually 3
  • `H` - image height
  • `W` - image width

The expected channel order of the input image is `BGR`.


The network output has shape `1, 1, 200, 7`, where 200 is the number of candidate detections. Each candidate is a 7-element vector stored in the order [`image_id`, `label`, `conf`, `x_min`, `y_min`, `x_max`, `y_max`], where:

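  • `image_id` - ID of the image within the batch; it can be ignored here because this article runs inference on a single image
  • `label` - predicted class ID (0 - face)
  • `conf` - confidence score
  • (`x_min`, `y_min`) - coordinates of the top-left corner of the bounding box
  • (`x_max`, `y_max`) - coordinates of the bottom-right corner of the bounding box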
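
The output layout described above can be decoded in a few vectorized steps. The sketch below assumes NumPy is installed and uses a synthetic array in place of a real network output. The function name `filter_detections` and the 0.5 default threshold are illustrative choices, not part of the model's API.

```python
import numpy as np


def filter_detections(output, width, height, threshold=0.5):
    """Keep detections above threshold and scale their boxes to pixels."""
    dets = output.reshape(-1, 7)            # (1, 1, 200, 7) -> (200, 7)
    keep = dets[dets[:, 2] > threshold]     # column 2 is conf
    scale = np.array([width, height, width, height])
    boxes = (keep[:, 3:7] * scale).astype(int)  # x_min, y_min, x_max, y_max
    return boxes, keep[:, 2]


# Synthetic output with one confident detection; the other rows stay zero.
output = np.zeros((1, 1, 200, 7), dtype=np.float32)
output[0, 0, 0] = [0, 0, 0.9, 0.25, 0.5, 0.75, 1.0]

boxes, scores = filter_detections(output, width=640, height=480)
print(boxes.tolist())  # [[160, 240, 480, 480]]
```

Note that `width` and `height` here are the size of the original image, not the 256x256 network input, because the coordinates are normalized.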



import cv2
from openvino.inference_engine import IECore
import numpy as np
from timeit import default_timer as timer

