大数据环境下图书馆公共媒体数据库建设与利用研究
发布时间:2018-09-07 19:10
【摘要】:在云技术和物联网的推动下,全球已经进入了大数据时代。数据信息是图书馆的核心和提供一切服务的基础。现有图书馆中的资源是静态的、结构化和少量半结构化的学术科研文献、基础常识文献、历史小说文献等正式出版物,而网络上的非正式出版物,尤其是公共媒体平台上的用户行为信息、社交网络灰色信息和政府非公开公布的公共管理信息严重缺损,致使图书馆的信息资源在这个大数据时代滞后和不完整。公共媒体信息不仅能完善图书馆的资源,而且能为图书馆提供新的知识服务,帮助图书馆在大数据时代掌握和整合更多的数据信息以增强其核心竞争力。采用传感器、网络爬虫等大数据采集技术从公共媒体平台上获取用户行为数据、社交网络灰色文献以及政府公共管理数据,并基于HBase数据库的存储理论和主题分类、知识地图等组织方法对采集到的三类数据进行整合,初步探讨由用户行为数据库、社交网络数据库和公共管理数据库三个子库组成的公共媒体数据库的规划。 本文从国内外大数据理论及图书馆数据库建设的实践两方面进行分析研究并进行借鉴,来开始探讨适合我国图书馆公共媒体数据库的建设。本文包含如下六个部分: 第一章内容是绪论,阐述选题背景、目的和意义。 第二章内容是国内外图书馆大数据研究现状以及图书馆建立公共媒体数据库必要性进行阐述。 第三章内容是阐述公共媒体资源建设的理论基础和大数据基础理论。 第四章内容是参照第二章中图书馆数据库的建设,在第三章大数据环境下公共媒体资源建设的方式方法上分别论述用户行为子数据库、社交网络文献子数据库和公共管理子数据库三个字库的建设。 第五章内容是针对构建的公共媒体数据库的利用描述。 第六章对本文的研究内容进行分析和总结,并提出今后需要进行关注的问题。
[Abstract]:In cloud technology and the Internet of things, the world has entered the big data era. Data information is the core of library and the basis of providing all services. The resources in the existing libraries are static, structured and few semi-structured academic research literature, basic knowledge literature, historical novel literature and other official publications, while the network of informal publications, Especially, the information of user behavior on the public media platform, the grey information of social network and the information of public management released by the government are seriously defective, which makes the information resources of the library lag behind and incomplete in this era of big data. The public media information can not only perfect the library's resources, but also provide new knowledge service for the library, help the library to master and integrate more data information in the era of big data in order to enhance its core competitiveness. Big data acquisition technology such as sensor, web crawler and so on is used to obtain user behavior data from public media platform, social network grey literature and government public management data, and based on HBase database storage theory and subject classification. The three kinds of data collected are integrated by knowledge map, and the planning of public media database composed of user behavior database, social network database and public management database is discussed preliminarily. This paper analyzes and studies the theory of big data at home and abroad and the practice of library database construction and uses it for reference to begin to explore the construction of library public media database suitable for our country. This paper includes six parts as follows: the first chapter is an introduction, explaining the background, purpose and significance of the topic. The second chapter is about the research status of big data and the necessity of establishing public media database. The third chapter expounds the theoretical basis of the construction of public media resources and big data's basic theory. The fourth chapter refers to the construction of library database in Chapter 2, and discusses the user behavior sub-database on the way and method of the construction of public media resources under the environment of big data in the third chapter. The construction of social network document sub-database and public management sub-database. The fifth chapter is a description of the use of the public media database. The sixth chapter analyzes and summarizes the research contents of this paper, and points out the problems that need to be paid attention to in the future.
【学位授予单位】:辽宁师范大学
【学位级别】:硕士
【学位授予年份】:2014
【分类号】:G250.74
[Abstract]:In cloud technology and the Internet of things, the world has entered the big data era. Data information is the core of library and the basis of providing all services. The resources in the existing libraries are static, structured and few semi-structured academic research literature, basic knowledge literature, historical novel literature and other official publications, while the network of informal publications, Especially, the information of user behavior on the public media platform, the grey information of social network and the information of public management released by the government are seriously defective, which makes the information resources of the library lag behind and incomplete in this era of big data. The public media information can not only perfect the library's resources, but also provide new knowledge service for the library, help the library to master and integrate more data information in the era of big data in order to enhance its core competitiveness. Big data acquisition technology such as sensor, web crawler and so on is used to obtain user behavior data from public media platform, social network grey literature and government public management data, and based on HBase database storage theory and subject classification. The three kinds of data collected are integrated by knowledge map, and the planning of public media database composed of user behavior database, social network database and public management database is discussed preliminarily. This paper analyzes and studies the theory of big data at home and abroad and the practice of library database construction and uses it for reference to begin to explore the construction of library public media database suitable for our country. This paper includes six parts as follows: the first chapter is an introduction, explaining the background, purpose and significance of the topic. The second chapter is about the research status of big data and the necessity of establishing public media database. The third chapter expounds the theoretical basis of the construction of public media resources and big data's basic theory. The fourth chapter refers to the construction of library database in Chapter 2, and discusses the user behavior sub-database on the way and method of the construction of public media resources under the environment of big data in the third chapter. The construction of social network document sub-database and public management sub-database. The fifth chapter is a description of the use of the public media database. The sixth chapter analyzes and summarizes the research contents of this paper, and points out the problems that need to be paid attention to in the future.
【学位授予单位】:辽宁师范大学
【学位级别】:硕士
【学位授予年份】:2014
【分类号】:G250.74
【参考文献】
相关期刊论文 前10条
1 杨海燕;;大数据时代的图书馆服务浅析[J];图书与情报;2012年04期
2 王忠;;美国推动大数据技术发展的战略价值及启示[J];中国发展观察;2012年06期
3 王天泥;;知识咨询:大数据时代图书馆的知识服务增长点[J];图书与情报;2013年02期
4 王珊;王会举;覃雄派;周p,
本文编号:2229165
本文链接:https://www.wllwen.com/guanlilunwen/gonggongguanlilunwen/2229165.html