Closed-loop Region of Interest Enabling High Spatial and Temporal Resolutions in Object Detection and Tracking via Wireless Camera

Abstract

The trade-off between spatial and temporal resolution remains a fundamental challenge in machine vision. A captured image often contains a significant amount of redundant information, and only a small region of interest (ROI) is necessary for object detection and tracking. In this paper, we first systematically characterize the effects of ROI on camera capturing, data transmission, and image processing. We then present the closed-loop ROI algorithm capable of high spatial and temporal resolution as well as wide scanning field of view (FOV) in single and multi-object detection and tracking via real-time wireless video streaming.With the feedback from real-time object tracking, the wireless camera is able to capture and transmit only the ROI which in turn enhances both the spatial and temporal resolution in object tracking. In addition, the proposed approach can still maintain a large FOV by processing regions outside of the ROI at lower spatial and temporal resolutions. When applied to a high spatial resolution wireless stream (5 MegaPixels), the closed-loop ROI algorithm improves the temporal resolution by up to 10x (from 2.4 FPS to 22.5 FPS). Specifically, camera processing is improved by up to 4.7x, data transmission is improved by up to 160x, and PC processing is improved by up to 2.5x. In a person tracking experiment, the closed-loop ROI algorithm enables a wide-angle camera to outperform both a normal wide-angle camera–which suffers from poor temporal resolution and motion blur–and a pan & tilt camera–which cannot automatically refresh tracking after the tracking is lost.

Publication
IEEE Access
Source Themes