设计任务书文档开题答辩说明书格式模板外文翻译范文资料作品文献课程实习指导调研下载网络教育 计算机 网站网页 小程序 商城购物订餐电影安卓 Android Html Html5 SSM SSH Python 爬虫大数据 管理系统 图书校园网考试选题网络安全推荐系统机械模具夹具自动化数控车床汽车故障诊断电机建模 机械手 去壳机千斤顶变速器减速器图纸电气变电站电子 Stm32 单片机 物联网 监控密码锁 Plc 组态控制智能 Matlab 土木建筑结构框架教学楼住宅楼造价施工办公楼给水排水桥梁刚构桥水利重力坝水库采矿环境化工固废工厂 视觉传达 室内设计产品设计 电子商务 物流盈利案例分析评估报告营销报销会计

机械毕业设计

电子电气毕业设计

计算机毕业设计

土木工程毕业设计

视觉传达毕业设计

理工论文

文科论文

毕设资料

帮助中心

设计流程

您现在所在的位置：首页 >>计算机毕业设计 >> 文章内容

我们提供全套毕业设计和毕业论文服务，联系微信号：biyezuopinvvp QQ：1015083682

基于Java和H5的Pubmed文献数据挖掘及可视化系统毕业论文+答辩PPT+项目源码

文章来源：www.biyezuopin.vip 发布者：毕业作品网站

摘要

如今Pubmed文献检索系统上发表的医学文献的数量十分庞大，且数量逐年增加，研究人员如果想人工地去查看找出Pubmed文献里面的知识是绝对不可能，因此，人们转而利用计算机去获取文献里面的知识。

本篇论文介绍了如何借用文本挖掘技术去挖掘出Pubmed文献里面的知识，并且结合了目前文本挖掘技术，讲述了如何实现了一套蛋白质磷酸化修饰的文本信息挖掘系统。

本系统主要应用于挖掘出Pubmed文献里面蛋白质磷酸化的修饰的一些信息，包括被修饰的蛋白质，激酶，修饰位点，以及它们之间的关系。

本文详细叙述了整套系统瀑布流模型的软件过程，首先是需求，然后是设计，再是实现，依次展开。在实现的阶段里面又包含了文本预处理阶段，命名实体识别阶段，实体关系提取阶段，数据可视化阶段，其中着重介绍了文本挖掘技术中两个的关键也是核心阶段的原理：命名实体识别和关系提取。同时也介绍了Abner工具和Rlims-p工具的原理和应用。此外文献数据库的数量庞大，为了提高程序性能和用户体验，于是介绍了几种提高效率，提高用户体验的解决方案，其中有多线程处理，缓存机制，预处理机制。

关键词：文本挖掘；软件工程；Pubmed；多线程

Abstract

Nowadays, The number of published medical literature on Pubmed document retrieval system is very large, growth year after year. It is absolutely impossible to researchers manually discover knowledge among the Pubmed literatures. As a result, researchers turn to use the computer to acquire knowledge inside the literature.

This paper introduce how to acquire knowledge in Pubmed literature by using the technology of text mining and how to implement a text mining system for extracting protein phosphorylation information among Pubmed literature using the current text mining technology. This system is main used for extracting protein phosphorylation information among the Pubmed literatures such as substrate, kinases, sites, and relation among these substances. With the current text mining technology, I

This paper describes the waterfall model of software process about this system. First step is requirement, second step is design, and then implementation, step by step.In the step of implementation, there are four steps as follow: text preprocessing, named entity recognizeation, relationship extracting, Visualization. This paper highlights the principle and application of two very important steps in text mining: named entity recognization, and relationship extracting. At the same time, Abner tools and Rlims-p tools are also introduced in this paper. Moreover, the number of the Pubmed literatures in database is very large, and in order to improve the program performance and user experience，this paper had introduced several method for improve the program performance and user experience such as multithreading, caching, preprocessing.

Keyword: Text mining, Software engineering, Pubmed, Multithreading

摘要

Abstract

第一章引言

1.1 概述

1.2 选题的背景和意义

第二章入门概念

2.1 蛋白质以及翻译后修饰概述

2.2文本挖掘技术概述

2.3 Pubmed生物医学文献检索系统

第三章需求分析和系统设计

3.1需求

3.2 需求分析

3.3 用户用例分析

3.4 系统设计

3.4.1 系统工作流程和场景

3.4.2 模块化设计

3.4.3 软件开发架构

3.4.4 开发环境

第四章程序设计和系统实现

4.1 程序设计

4.2 文本数据源获取

4.3 文本预处理

4.4 命名实体识别和实体关系提取

4.4.1命名实体识别概述

4.4.2 ABNER命名实体识别工具

4.5 实体关系提取

4.5.1 关系提取概念

4.5.2 Rlims-p工具介绍及其工作原理

4.5.3 嵌入使用Rlims-p工具

4.6 多线程处理文档优化

4.7 文档预处理和缓存机制

第五章数据库设计和数据可视化

5.1数据库设计

5.2 数据可视化

第六章总结及展望

6.1 总结

6.2 展望

结束语

参考文献

全套毕业设计论文现成成品资料请咨询微信号：biyezuopinvvp QQ：1015083682 返回首页如转载请注明来源于www.biyezuopin.vip

打印本页 \| 关闭窗口
上一篇文章：基于Phonon+QT的音视频播放器设计与实现毕业论文+项目源码	下一篇文章：基于spring boot的邮件微服务消息中间件设计与实现毕业论文+系统功能图v1.0.vsdx+项目源码

本类最新文章

基于MatlabSimulink …	35kV输电线路继电保护的设计 …	分布式风电场低电压穿越故障建模与 …
大学生内容分享和社交平台的设计与 …	基于SSM框架的企业人事薪酬管理 …	基于大模型的代码注释自动生成与维 …

| 关于我们 | 友情链接 | 毕业设计招聘 |

Email：biyeshejiba@163.com 微信号：biyezuopinvvp QQ：1015083682
本站毕业设计和毕业论文资料均属原创者所有,仅供学习交流之用,请勿转载并做其他非法用途.如有侵犯您的版权有损您的利益,请联系我们会立即改正或删除有关内容!