视觉导航中多任务学习和半监督学习方法研究毕业论文-毕业作品网站

您现在所在的位置：首页 >>计算机毕业设计 >> 文章内容

摘要

视觉导航是当前深度学习研究的热门领域。环境感知作为视觉导航中最重要的模块之一，包括目标检测、车道线检测、可行驶区域分割等，是一项极具挑战性并具有深远意义的任务。传统的深度学习算法通常只解决环境感知中的一个检测任务，无法满足视觉导航同时感知多种环境因素的需求，因此多任务学习引起了研究人员的广泛关注。深度学习是以数据驱动的，然而数据标注的过程极为繁琐，在多任务学习中该问题更加凸显。半监督学习可有效利用大量未标注图片的特征，丰富训练数据集，提高模型检测效果。

本文围绕视觉导航中多任务学习和半监督学习开展相关研究，主要工作和成果如下：

1. 针对视觉导航中现有多任务环境感知方法精度不高的问题，本文提出了一种基于 YOLOv5 的多任务环境感知算法，能够同时完成目标检测、车道线检测和可行驶区域分割任务。本算法编码部分以 CSPDarknet 作为骨干网络，通过多个池化层级联的方式，加快空间金字塔池化的速度。目标检测的检测头采用自上而下的多尺度结构，车道线和可行驶区域的检测头部分采用实时语义分割网络 ENet。本文为语义分割任务设计了单独的特征金字塔结构，提高了车道线检测准确率，损失函数引入了鲁棒性更强的 -IoU 计算目标检测的损失。实验表明，本文所提多任务环境感知算法在精度和速度上都优于现有的其他网络。

2. 针对多任务学习图像标注繁琐的问题，本文提出了一种基于伪标签的多任务半监督学习方法。首先本文对半监督训练方法 STAC 提出伪标签更新的改进，在训练过程中，教师模型和学生模型不断迭代，根据对未标注图片的推理结果对伪标签进行更新，其次本文结合一致性学习方法，采用两个结构相同初始化不同的网络进行交叉监督训练，引入一致性交叉损失计算损失函数。实验表明，本文所提基于伪标签的多任务半监督学习方法优于监督训练方法和自训练方法

STAC。

3. 为了验证本文所提算法在视觉导航中的有效性，本文在科大西区搭建了校园无人车测试平台。实验表明，本文所提算法在校园视觉导航平台中对直线行驶纠偏和路口转向行驶起到了重要作用，满足了实际的工程需求。

关键词：多任务；环境感知；半监督学习；伪标签；视觉导航

ABSTRACT

Visual navigation is a hot field of deep learning research. As one of the most important modules in visual navigation, environment perception is a very challenging and far-reaching task, including object detection, lane line detection, and drivable area segmentation. Traditional deep learning algorithms usually only solve one detection task in environmental perception, and cannot meet the needs of visual navigation to perceive multiple environmental factors at the same time. Therefore, the study of multi- task environmental perception has attracted extensive attention of researchers. Deep learning is data-driven, but the process of data labeling is extremely cumbersome, and this problem is even more prominent in multi-task learning. Semi-supervised learning can effectively utilize the features of a large number of unlabeled images, enrich the training data set, and improve the model detection effect.

This thesis focuses on multi-task learning and semi-supervised learning in visual navigation. The main work and results are as follows:

1. In view of the low accuracy of the existing multi-task environment perception methods in visual navigation, this thesis proposes a multi-task environment perception algorithm based on YOLOv5, which can simultaneously complete the tasks of target detection, lane line detection and drivable area segmentation. The coding part of this al- gorithm uses CSPDarknet as the backbone network. By concatenating multiple pooling layers, the speed of spatial pyramid pooling is accelerated. The detection head of target detection adopts a top-down multi-scale structure. The detection head part of lane lines and drivable areas adopts the real-time semantic segmentation network ENet. In this thesis, a separate feature pyramid structure is designed for the semantic segmentation task, which improves the accuracy of lane line detection. The loss function calculation

-IoU to calculate the loss of target detection. Experiments show that the proposed multi-task environment perception algorithm is superior to other existing networks in both accuracy and speed.

2. Aiming at the tedious problem of multi-task learning image labeling, this thesis proposes a multi-task semi-supervised learning method based on pseudo-labels. First of all, this thesis proposes an improvement to the pseudo-label update for the semi- supervised training method STAC. During the training process, the teacher model and the student model are iterated continuously, and the pseudo-label is updated accord- ing to the inference results of unlabeled pictures. Secondly, this thesis combines the

consistency learning method. Two networks with the same structure and different ini- tializations are used for cross-supervised training, and a consistent cross loss is intro- duced when calculating the loss function. Experiments show that the pseudo-label- based multi-task semi-supervised learning method proposed in this thesis outperforms the supervised training method and the self-training method STAC.

3. In order to verify the effectiveness of the algorithm proposed in this thesis in visual navigation, this thesis builds a campus unmanned vehicle test platform in the Western District of USTC. Experiments show that the algorithm proposed in this thesis plays an important role in the correction of straight-line driving and turning of inter- sections in the campus visual navigation platform, which meets the actual engineering needs.

Key Words: Multitasking; Environment Perception; Semi-supervised Learning; Pseudo Labels; Visual Navigation

摘要 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ I Abstract ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ III 第 1 章绪论ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 1

1.1 研究背景与意义 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 1

1.2 相关工作与研究现状ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 3

1.2.1 多任务环境感知方法 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 4

1.2.2 半监督学习方法 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 5

1.3 本文研究工作ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 6

1.4 本文组织结构ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 7

第 2 章相关技术ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 9

2.1 卷积神经网络ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 9

2.1.1 卷积神经网络的发展过程ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 9

2.1.2 常见的卷积神经网络结构ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 12

2.2 目标检测 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 15

2.2.1

两阶段目标检测

ꞏ

2.2.2

单阶段目标检测

ꞏ

2.3 语义分割 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 18

2.4 车道线检测 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 20

2.5 半监督学习方法 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 21

2.5.1

半监督目标检测

ꞏ

2.5.2

半监督语义分割

ꞏ

2.6 本章小节 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 23

第 3 章基于 YOLOv5 的多任务环境感知算法ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 25 3.1 问题描述 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 25

3.2 算法设计和相关内容 ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ ꞏ 26

3.2.1

特征提取器ꞏ ꞏ ꞏ

ꞏ

3.2.2

空间金字塔池化

ꞏ

全套毕业设计论文现成成品资料请咨询微信号：biyezuopinvvp QQ：1015083682 返回首页如转载请注明来源于www.biyezuopin.vip

打印本页 \| 关闭窗口
上一篇文章：基于区块链的分布式账本安全性设计文献综述	下一篇文章：马铃薯分拣机设计开题报告

本类最新文章

基于web的大学生综测系统设计与 …	在线图书阅读平台设计与开发毕业 …	基于SpringBoot和VUE …
基于深度学习的个性化推荐算法优化 …	基于LBS多人快速定位和导航AP …	面向财经新闻的文本挖掘系统设计与 …

| 关于我们 | 友情链接 | 毕业设计招聘 |

Email：biyeshejiba@163.com 微信号：biyezuopinvvp QQ：1015083682
本站毕业设计和毕业论文资料均属原创者所有,仅供学习交流之用,请勿转载并做其他非法用途.如有侵犯您的版权有损您的利益,请联系我们会立即改正或删除有关内容!

摘 要

ABSTRACT

目 录

摘要

目录