blakeblackshear.frigate/frigate/detectors/detection_api.py

import logging
from abc import ABC, abstractmethod
from typing import List

import numpy as np

from frigate.detectors.detector_config import ModelTypeEnum

logger = logging.getLogger(__name__)


class DetectionApi(ABC):
    type_key: str
    supported_models: List[ModelTypeEnum]

    @abstractmethod
    def __init__(self, detector_config):
        self.detector_config = detector_config
        self.thresh = 0.5
        self.height = detector_config.model.height
        self.width = detector_config.model.width

    @abstractmethod
    def detect_raw(self, tensor_input):
        pass

    def post_process_yolonas(self, output):
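        """
        Post-process raw YOLO-NAS inference output into a fixed-size result array.

        @param output: output of inference
        expected shape: [np.array(1, N, 4), np.array(1, N, 80)]
        where N depends on the input size, e.g. N=2100 for 320x320 images.
        The first array holds boxes as (x1, y1, x2, y2) in pixels of the model
        input; the second holds one score per class for each box.

        @return: best results: np.array(20, 6) where each row is
        in this order (class_id, score, y1/height, x1/width, y2/height, x2/width).
        Only candidates scoring above self.thresh are kept, at most 20 of them.
        """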

        N = output[0].shape[1]

        boxes = output[0].reshape(N, 4)
        scores = output[1].reshape(N, 80)

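        # Keep only the highest-scoring class for each candidate box.
        class_ids = np.argmax(scores, axis=1)
        scores = scores[np.arange(N), class_ids]

        # Indices of candidates whose best score exceeds the threshold.
        args_best = np.argwhere(scores > self.thresh)[:, 0]

        num_matches = len(args_best)
        if num_matches == 0:
            return np.zeros((20, 6), np.float32)
        elif num_matches > 20:
            # Keep the 20 highest-scoring matches; argpartition does not sort them.
            args_best20 = np.argpartition(scores[args_best], -20)[-20:]
            args_best = args_best[args_best20]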

        boxes = boxes[args_best]
        class_ids = class_ids[args_best]
        scores = scores[args_best]

        # Reorder (x1, y1, x2, y2) to (y1, x1, y2, x2) and normalize to [0, 1].
        boxes = np.transpose(
            np.vstack(
                (
                    boxes[:, 1] / self.height,
                    boxes[:, 0] / self.width,
                    boxes[:, 3] / self.height,
                    boxes[:, 2] / self.width,
                )
            )
        )

        results = np.hstack(
            (class_ids[..., np.newaxis], scores[..., np.newaxis], boxes)
        )

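        # Pad with zero rows up to the fixed (20, 6) output shape. np.resize
        # would instead repeat existing detections to fill the array.
        padded = np.zeros((20, 6), np.float32)
        padded[: len(results)] = results
        return padded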

    def post_process(self, output):
        if self.detector_config.model.model_type == ModelTypeEnum.yolonas:
            return self.post_process_yolonas(output)
        else:
            raise ValueError(
                f'Model type "{self.detector_config.model.model_type}" is currently not supported.'
            )