论文标题
DeepSperm:在人口稠密的精液视频中,可靠,实时的实时牛精子孔检测
DeepSperm: A robust and real-time bull sperm-cell detection in densely populated semen videos
论文作者
论文摘要
背景和目标:对象检测是计算机视觉的主要研究兴趣。在人口稠密的公牛精液观察视频中,精子细胞检测提出了挑战,例如部分遮挡,单个视频框架中的大量对象,物体的微小大小,人工制品,低对比度和模糊物体,因为精子细胞的快速运动。这项研究提出了一种称为Deepsperm的体系结构,该体系结构解决了上述挑战,并且比最先进的架构更准确,更快。方法:在拟议的体系结构中,我们仅使用一个检测层,该检测层是针对小对象检测的。为了处理过度拟合和提高的精度,我们设置了更高的网络分辨率,使用辍学层,并对色相,饱和度和曝光进行数据增强。对几个超参数进行了调整以取得更好的性能。我们将提出的方法与传统图像处理方法的方法进行比较,您只能查看一次(Yolov3)和基于掩码区域的卷积神经网络(Mask R-CNN)。结果:在我们的实验中,我们在测试数据集上实现了86.91地图,处理速度为50.3 fps。与Yolov3相比,我们使用一个小型培训数据集的训练速度更快地增加了16.66地图点,测试速度更快3.26 x,其中包含40个视频帧。权重文件大小也大大减小,比Yolov3小16.94 x。此外,与Yolov3相比,它需要1.3 x的图形处理单元(GPU)存储器。结论:这项研究提出了深层植物,这是一种简单,有效,有效的结构,其超参数和构型可以实时稳健地检测牛头精子细胞。在我们的实验中,我们在准确性,速度和资源需求方面超越了艺术的状态。
Background and Objective: Object detection is a primary research interest in computer vision. Sperm-cell detection in a densely populated bull semen microscopic observation video presents challenges such as partial occlusion, vast number of objects in a single video frame, tiny size of the object, artifacts, low contrast, and blurry objects because of the rapid movement of the sperm cells. This study proposes an architecture, called DeepSperm, that solves the aforementioned challenges and is more accurate and faster than state-of-the-art architectures. Methods: In the proposed architecture, we use only one detection layer, which is specific for small object detection. For handling overfitting and increasing accuracy, we set a higher network resolution, use a dropout layer, and perform data augmentation on hue, saturation, and exposure. Several hyper-parameters are tuned to achieve better performance. We compare our proposed method with those of a conventional image processing-based object-detection method, you only look once (YOLOv3), and mask region-based convolutional neural network (Mask R-CNN). Results: In our experiment, we achieve 86.91 mAP on the test dataset and a processing speed of 50.3 fps. In comparison with YOLOv3, we achieve an increase of 16.66 mAP point, 3.26 x faster on testing, and 1.4 x faster on training with a small training dataset, which contains 40 video frames. The weights file size was also reduced significantly, with 16.94 x smaller than that of YOLOv3. Moreover, it requires 1.3 x less graphical processing unit (GPU) memory than YOLOv3. Conclusions: This study proposes DeepSperm, which is a simple, effective, and efficient architecture with its hyper-parameters and configuration to detect bull sperm cells robustly in real time. In our experiment, we surpass the state of the art in terms of accuracy, speed, and resource needs.