mirror of https://github.com/blakeblackshear/frigate.git synced 2025-08-22 13:47:29 +02:00

NVR with realtime local object detection for IP cameras

ai camera google-coral home-assistant home-automation homeautomation mqtt nvr object-detection realtime rtsp tensorflow

Go to file

Blake Blackshear 12c4cd77c5 only convert pix_fmt when necessary		2020-10-18 13:47:13 -05:00
.github	Update bug_report.md	2020-08-22 06:49:45 -05:00
config	use yuv420p pixel format for motion	2020-10-18 13:47:13 -05:00
docker	build ffmpeg in separate container	2020-10-18 13:47:13 -05:00
docs	update docs	2020-08-08 07:40:48 -05:00
frigate	only convert pix_fmt when necessary	2020-10-18 13:47:13 -05:00
.dockerignore	update dockerignore and debug option	2019-07-30 19:11:22 -05:00
.gitignore	move config example	2019-12-08 07:06:52 -06:00
benchmark.py	support multiple coral devices (fixes #100 )	2020-10-18 13:47:13 -05:00
detect_objects.py	only convert pix_fmt when necessary	2020-10-18 13:47:13 -05:00
diagram.png	update docs and add back benchmark	2020-03-03 20:26:53 -06:00
labelmap.txt	refactor and reduce false positives	2020-09-17 07:37:27 -05:00
LICENSE	switch to MIT license	2020-07-26 12:07:47 -05:00
Makefile	build ffmpeg in separate container	2020-10-18 13:47:13 -05:00
process_clip.py	upgrade to python3.8 and switch from plasma store to shared_memory	2020-10-18 13:47:13 -05:00
README.md	remove affiliate links	2020-10-08 07:26:02 -05:00

README.md

Frigate - NVR With Realtime Object Detection for IP Cameras

Uses OpenCV and Tensorflow to perform realtime object detection locally for IP cameras. Designed for integration with HomeAssistant or others via MQTT.

Use of a Google Coral Accelerator is optional, but highly recommended. On my Intel i7 processor, I can process 2-3 FPS with the CPU. The Coral can process 100+ FPS with very low CPU load.

Leverages multiprocessing heavily with an emphasis on realtime over processing every frame
Uses a very low overhead motion detection to determine where to run object detection
Object detection with Tensorflow runs in a separate process
Object info is published over MQTT for integration into HomeAssistant as a binary sensor
An endpoint is available to view an MJPEG stream for debugging, but should not be used continuously

Example video (from older version)

You see multiple bounding boxes because it draws bounding boxes from all frames in the past 1 second where a person was detected. Not all of the bounding boxes were from the current frame.

Getting Started

Run the container with

docker run --rm \
-name blakeblackshear/frigate:stable \
--privileged \
--shm-size=512m \ # should work for a 2-3 cameras
-v /dev/bus/usb:/dev/bus/usb \
-v <path_to_config_dir>:/config:ro \
-v /etc/localtime:/etc/localtime:ro \
-p 5000:5000 \
-e FRIGATE_RTSP_PASSWORD='password' \
blakeblackshear/frigate:stable

Example docker-compose:

  frigate:
    container_name: frigate
    restart: unless-stopped
    privileged: true
    shm_size: '1g' # should work for 5-7 cameras
    image: blakeblackshear/frigate:stable
    volumes:
      - /dev/bus/usb:/dev/bus/usb
      - /etc/localtime:/etc/localtime:ro
      - <path_to_config>:/config
      - <path_to_directory_for_clips>:/clips
    ports:
      - "5000:5000"
    environment:
      FRIGATE_RTSP_PASSWORD: "password"

A config.yml file must exist in the config directory. See example here and device specific info can be found here.

Recommended Hardware

Name	Inference Speed	Notes
Atomic Pi	16ms	Best option for a dedicated low power board with a small number of cameras.
Intel NUC NUC7i3BNK	8-10ms	Best possible performance. Can handle 7+ cameras at 5fps depending on typical amounts of motion.
BMAX B2 Plus	10-12ms	Good balance of performance and cost. Also capable of running many other services at the same time as frigate.
Minisforum GK41	9-10ms	Great alternative to a NUC. Easily handiles 4 1080p cameras.

ARM boards are not officially supported at the moment due to some python dependencies that require modification to work on ARM devices. The Raspberry Pi4 gets about 16ms inference speeds, but the hardware acceleration for ffmpeg does not work for converting yuv420 to rgb24. The Atomic Pi is x86 and much more efficient.

Users have reported varying success in getting frigate to run in a VM. In some cases, the virtualization layer introduces a significant delay in communication with the Coral. If running virtualized in Proxmox, pass the USB card/interface to the virtual machine not the USB ID for faster inference speed.

Integration with HomeAssistant

Setup a camera, binary_sensor, sensor and optionally automation as shown for each camera you define in frigate. Replace <camera_name> with the camera name as defined in the frigate config.yml (The frigate_coral_fps and frigate_coral_inference sensors only need to be defined once)

camera:
  - name: <camera_name> Last Person
    platform: mqtt
    topic: frigate/<camera_name>/person/snapshot
  - name: <camera_name> Last Car
    platform: mqtt
    topic: frigate/<camera_name>/car/snapshot

binary_sensor:
  - name: <camera_name> Person
    platform: mqtt
    state_topic: "frigate/<camera_name>/person"
    device_class: motion
    availability_topic: "frigate/available"

sensor:
  - platform: rest
    name: Frigate Debug
    resource: http://localhost:5000/debug/stats
    scan_interval: 5
    json_attributes:
      - <camera_name>
      - coral
    value_template: 'OK'  
  - platform: template
    sensors:
      <camera_name>_fps: 
        value_template: '{{ states.sensor.frigate_debug.attributes["<camera_name>"]["fps"] }}'
        unit_of_measurement: 'FPS'
      <camera_name>_skipped_fps: 
        value_template: '{{ states.sensor.frigate_debug.attributes["<camera_name>"]["skipped_fps"] }}'
        unit_of_measurement: 'FPS'
      <camera_name>_detection_fps: 
        value_template: '{{ states.sensor.frigate_debug.attributes["<camera_name>"]["detection_fps"] }}'
        unit_of_measurement: 'FPS'
      frigate_coral_fps: 
        value_template: '{{ states.sensor.frigate_debug.attributes["coral"]["fps"] }}'
        unit_of_measurement: 'FPS'
      frigate_coral_inference:
        value_template: '{{ states.sensor.frigate_debug.attributes["coral"]["inference_speed"] }}' 
        unit_of_measurement: 'ms'
        
automation:
  - alias: Alert me if a person is detected while armed away
    trigger: 
      platform: state
      entity_id: binary_sensor.camera_person
      from: 'off'
      to: 'on'
    condition:
      - condition: state
        entity_id: alarm_control_panel.home_alarm
        state: armed_away
    action:
      - service: notify.user_telegram
        data:
          message: "A person was detected."
          data:
            photo:
              - url: http://<ip>:5000/<camera_name>/person/best.jpg
                caption: A person was detected.

HTTP Endpoints

A web server is available on port 5000 with the following endpoints.

`/<camera_name>`

An mjpeg stream for debugging. Keep in mind the mjpeg endpoint is for debugging only and will put additional load on the system when in use.

You can access a higher resolution mjpeg stream by appending h=height-in-pixels to the endpoint. For example http://localhost:5000/back?h=1080. You can also increase the FPS by appending fps=frame-rate to the URL such as http://localhost:5000/back?fps=10 or both with ?fps=10&h=1000

`/<camera_name>/<object_name>/best.jpg[?h=300&crop=1]`

The best snapshot for any object type. It is a full resolution image by default.

Example parameters:

h=300: resizes the image to 300 pixes tall
crop=1: crops the image to the region of the detection rather than returning the entire image

`/<camera_name>/latest.jpg[?h=300]`

The most recent frame that frigate has finished processing. It is a full resolution image by default.

Example parameters:

h=300: resizes the image to 300 pixes tall

`/debug/stats`

Contains some granular debug info that can be used for sensors in HomeAssistant. See details below.

MQTT Messages

These are the MQTT messages generated by Frigate. The default topic_prefix is frigate, but can be changed in the config file.

frigate/available

Designed to be used as an availability topic with HomeAssistant. Possible message are: "online": published when frigate is running (on startup) "offline": published right before frigate stops

frigate/<camera_name>/<object_name>

Publishes ON or OFF and is designed to be used a as a binary sensor in HomeAssistant for whether or not that object type is detected.

frigate/<camera_name>/<object_name>/snapshot

Publishes a jpeg encoded frame of the detected object type. When the object is no longer detected, the highest confidence image is published or the original image is published again.

The height and crop of snapshots can be configured as shown in the example config.

frigate/<camera_name>/events/start

Message published at the start of any tracked object. JSON looks as follows:

{
    "label": "person",
    "score": 0.87890625,
    "box": [
        95,
        155,
        581,
        1182
    ],
    "area": 499122,
    "region": [
        0,
        132,
        1080,
        1212
    ],
    "frame_time": 1600208805.60284,
    "centroid": [
        338,
        668
    ],
    "id": "1600208805.60284-k1l43p",
    "start_time": 1600208805.60284,
    "top_score": 0.87890625,
    "zones": [],
    "score_history": [
        0.87890625
    ],
    "computed_score": 0.0,
    "false_positive": true
}

frigate/<camera_name>/events/end

Same as frigate/<camera_name>/events/start, but with an end_time property as well.

frigate/<zone_name>/<object_name>

Publishes ON or OFF and is designed to be used a as a binary sensor in HomeAssistant for whether or not that object type is detected in the zone.

Understanding min_score and threshold

min_score defines the minimum score for Frigate to begin tracking a detected object. Any single detection below min_score will be ignored as a false positive. threshold is based on the median of the history of scores for a tracked object. Consider the following frames when min_score is set to 0.6 and threshold is set to 0.85:

Frame	Current Score	Score History	Computed Score	Detected Object
1	0.7	0.0, 0, 0.7	0.0	No
2	0.55	0.0, 0.7, 0.0	0.0	No
3	0.85	0.7, 0.0, 0.85	0.7	No
4	0.90	0.7, 0.85, 0.95, 0.90	0.875	Yes
5	0.88	0.7, 0.85, 0.95, 0.90, 0.88	0.88	Yes
6	0.95	0.7, 0.85, 0.95, 0.90, 0.88, 0.95	0.89	Yes

In frame 2, the score is below the min_score value, so frigate ignores it and it becomes a 0.0. The computed score is the median of the score history (padding to at least 3 values), and only when that computed score crosses the threshold is the object marked as a true positive. That happens in frame 4 in the example.

Using a custom model or labels

Models for both CPU and EdgeTPU (Coral) are bundled in the image. You can use your own models with volume mounts:

CPU Model: /cpu_model.tflite
EdgeTPU Model: /edgetpu_model.tflite
Labels: /labelmap.txt

Customizing the Labelmap

The labelmap can be customized to your needs. A common reason to do this is to combine multiple object types that are easily confused when you don't need to be as granular such as car/truck. You must retain the same number of labels, but you can change the names. To change:

Download the COCO labelmap
Modify the label names as desired. For example, change 7 truck to 7 car
Mount the new file at /labelmap.txt in the container with an additional volume
```
-v ./config/labelmap.txt:/labelmap.txt
```

Recording Clips

Note: Previous versions of frigate included -vsync drop in input parameters. This is not compatible with FFmpeg's segment feature and must be removed from your input parameters if you have overrides set.

Frigate can save video clips without any CPU overhead for encoding by simply copying the stream directly with FFmpeg. It leverages FFmpeg's segment functionality to maintain a cache of 90 seconds of video for each camera. The cache files are written to disk at /cache and do not introduce memory overhead. When an object is being tracked, it will extend the cache to ensure it can assemble a clip when the event ends. Once the event ends, it again uses FFmpeg to assemble a clip by combining the video clips without any encoding by the CPU. Assembled clips are are saved to the /clips directory along with a json file containing the current information about the tracked object.

Global Configuration Options

max_seconds: This limits the size of the cache when an object is being tracked. If an object is stationary and being tracked for a long time, the cache files will expire and this value will be the maximum clip length for the end of the event. For example, if this is set to 300 seconds and an object is being tracked for 600 seconds, the clip will end up being the last 300 seconds. Defaults to 300 seconds.

Per-camera Configuration Options

pre_capture: Defines how much time should be included in the clip prior to the beginning of the event. Defaults to 30 seconds.
objects: List of object types to save clips for. Object types here must be listed for tracking at the camera or global configuration. Defaults to all tracked objects.

Google Coral Configuration

Frigate attempts to detect your Coral device automatically. If you have multiple Coral devices or a version that is not detected automatically, you can specify using the tensorflow_device config option.

Masks and limiting detection to a certain area

The mask works by looking at the bottom center of any bounding box (first image, red dot below) and comparing that to your mask. If that red dot falls on an area of your mask that is black, the detection (and motion) will be ignored. The mask in the second image would limit detection on this camera to only objects that are in the front yard and not the street.

The following types of masks are supported:

base64: Base64 encoded image file
poly: List of x,y points like zone configuration
image: Path to an image file in the config directory

base64 and image masks must be the same aspect ratio as your camera.

Zones

Zones allow you to define a specific area of the frame and apply additional filters for object types so you can determine whether or not an object is within a particular area. Zones cannot have the same name as a camera. If desired, a single zone can include multiple cameras if you have multiple cameras covering the same area. See the sample config for details on how to configure.

During testing, draw_zones can be set in the config to tell frigate to draw the zone on the frames so you can adjust as needed. The zone line will increase in thickness when any object enters the zone.

Debug Info

{
    /* Per Camera Stats */
    "back": {
        /***************
        * Frames per second being consumed from your camera. If this is higher
        * than it is supposed to be, you should set -r FPS in your input_args.
        * camera_fps = process_fps + skipped_fps
        ***************/
        "camera_fps": 5.0,
        /***************
        * Number of times detection is run per second. This can be higher than
        * your camera FPS because frigate often looks at the same frame multiple times
        * or in multiple locations
        ***************/
        "detection_fps": 1.5,
        /***************
        * PID for the ffmpeg process that consumes this camera
        ***************/
        "ffmpeg_pid": 27,
        /***************
        * Timestamps of frames in various parts of processing
        ***************/
        "frame_info": {
            /***************
            * Timestamp of the frame frigate is running object detection on.
            ***************/
            "detect": 1596994991.91426,
            /***************
            * Timestamp of the frame frigate is processing detected objects on.
            * This is where MQTT messages are sent, zones are checked, etc.
            ***************/
            "process": 1596994991.91426,
            /***************
            * Timestamp of the frame frigate last read from ffmpeg.
            ***************/
            "read": 1596994991.91426
        },
        /***************
        * PID for the process that runs detection for this camera
        ***************/
        "pid": 34,
        /***************
        * Frames per second being processed by frigate.
        ***************/
        "process_fps": 5.1,
        /***************
        * Timestamp when the detection process started looking for a frame. If this value stays constant
        * for a long time, that means there aren't any frames in the frame queue.
        ***************/
        "read_start": 1596994991.943814,
        /***************
        * Frames per second skip for processing by frigate.
        ***************/
        "skipped_fps": 0.0
    },
    /* Coral Stats */
    "coral": {
        /***************
        * Timestamp when object detection started. If this value stays non-zero and constant
        * for a long time, that means the detection process is stuck.
        ***************/
        "detection_start": 0.0,
        /***************
        * Frames per second of the Coral. This should be the sum of all detection_fps values from cameras.
        ***************/
        "fps": 6.9,
        /***************
        * Time spent running object detection in milliseconds.
        ***************/
        "inference_speed": 10.48,
        /***************
        * PID for the shared process that runs object detection on the Coral.
        ***************/
        "pid": 25321
    },
    "plasma_store_rc": null // Return code for the plasma store. This should be null normally.
}

Tips

Lower the framerate of the video feed on the camera to reduce the CPU usage for capturing the feed. Not as effective, but you can also modify the take_frame configuration for each camera to only analyze every other frame, or every third frame, etc.
Hard code the resolution of each camera in your config if you are having difficulty starting frigate or if the initial ffprobe for camerea resolution fails or returns incorrect info. Example:

cameras:
  back:
    ffmpeg:
      input: rtsp://<camera>
    height: 1080
    width: 1920

Additional logging is available in the docker container - You can view the logs by running docker logs -t frigate
Object configuration - Tracked objects types, sizes and thresholds can be defined globally and/or on a per camera basis. The global and camera object configuration is merged. For example, if you defined tracking person, car, and truck globally but modified your backyard camera to only track person, the global config would merge making the effective list for the backyard camera still contain person, car and truck. If you want precise object tracking per camera, best practice to put a minimal list of objects at the global level and expand objects on a per camera basis. Object threshold and area configuration will be used first from the camera object config (if defined) and then from the global config. See the example config for more information.

Troubleshooting

"ffmpeg didnt return a frame. something is wrong"

Turn on logging for the camera by overriding the global_args and setting the log level to info:

ffmpeg:
  global_args:
    - -hide_banner
    - -loglevel
    - info