NVR with realtime local object detection for IP cameras
Go to file
2019-02-09 20:38:11 -06:00
.dockerignore initial commit 2019-01-26 08:02:59 -06:00
detect_objects.py add support for a minimum object size 2019-02-09 20:38:11 -06:00
Dockerfile add imutils to container 2019-02-09 20:37:46 -06:00
LICENSE initial commit 2019-01-26 08:02:59 -06:00
README.md update readme 2019-02-09 07:23:54 -06:00

Realtime Object Detection for RTSP Cameras

This results in a MJPEG stream with objects identified that has a lower latency than directly viewing the RTSP feed with VLC.

  • Prioritizes realtime processing over frames per second. Dropping frames is fine.
  • OpenCV runs in a separate process so it can grab frames as quickly as possible to ensure there aren't old frames in the buffer
  • Object detection with Tensorflow runs in a separate process and ignores frames that are more than 0.5 seconds old
  • Uses shared memory arrays for handing frames between processes
  • Provides a url for viewing the video feed at a hard coded ~5FPS as an mjpeg stream
  • Frames are only encoded into mjpeg stream when it is being viewed
  • A process is created per detection region

Getting Started

Build the container with

docker build -t realtime-od .

Download a model from the zoo.

Download the cooresponding label map from here.

Run the container with

docker run -it --rm \
-v <path_to_frozen_detection_graph.pb>:/frozen_inference_graph.pb:ro \
-v <path_to_labelmap.pbtext>:/label_map.pbtext:ro \
-p 5000:5000 \
-e RTSP_URL='<rtsp_url>' \
-e REGIONS='<box_size_1>,<x_offset_1>,<y_offset_1>:<box_size_2>,<x_offset_2>,<y_offset_2>' \
realtime-od:latest

Access the mjpeg stream at http://localhost:5000

Tips

  • Lower the framerate of the RTSP feed on the camera to what you want to reduce the CPU usage for capturing the feed
  • Use SSDLite models

Future improvements

  • Look for a subset of object types
  • Try and simplify the tensorflow model to just look for the objects we care about
  • MQTT messages when detected objects change
  • Implement basic motion detection with opencv and only look for objects in the regions with detected motion
  • Dynamic changes to processing speed, ie. only process 1FPS unless motion detected
  • Parallel processing to increase FPS
  • Look into GPU accelerated decoding of RTSP stream
  • Send video over a socket and use JSMPEG

Building Tensorflow from source for CPU optimizations

https://www.tensorflow.org/install/source#docker_linux_builds used tensorflow/tensorflow:1.12.0-devel-py3

Optimizing the graph (cant say I saw much difference in CPU usage)

https://github.com/tensorflow/tensorflow/blob/master/tensorflow/tools/graph_transforms/README.md#optimizing-for-deployment

docker run -it -v ${PWD}:/lab -v ${PWD}/../back_camera_model/models/ssd_mobilenet_v2_coco_2018_03_29/frozen_inference_graph.pb:/frozen_inference_graph.pb:ro tensorflow/tensorflow:1.12.0-devel-py3 bash

bazel build tensorflow/tools/graph_transforms:transform_graph

bazel-bin/tensorflow/tools/graph_transforms/transform_graph \
--in_graph=/frozen_inference_graph.pb \
--out_graph=/lab/optimized_inception_graph.pb \
--inputs='image_tensor' \
--outputs='num_detections,detection_scores,detection_boxes,detection_classes' \
--transforms='
  strip_unused_nodes(type=float, shape="1,300,300,3")
  remove_nodes(op=Identity, op=CheckNumerics)
  fold_constants(ignore_errors=true)
  fold_batch_norms
  fold_old_batch_norms'