早教吧 育儿知识 作业答案 考试题库 百科 知识分享

英语翻译随着信息化的快速发展,人们对于数据质量的要求越来越高,数据清洗技术越来越成为人们关注和研究的焦点.如何从海量的股市资讯中快速、准确地挖掘出有用信息,是一个富有开发潜

题目详情
英语翻译
随着信息化的快速发展,人们对于数据质量的要求越来越高,数据清洗技术越来越成为人们关注和研究的焦点.
如何从海量的股市资讯中快速、准确地挖掘出有用信息,是一个富有开发潜力的文本挖掘研究方向.目前的文本挖掘技术还难以快速、准确地识别信息中的错误或者不相关的“脏数据”.
一般而言,低位买入同系机构投资者持股占流通股比例较高(且远大于其他十大流通股东占流通股的比例)的股票,是一种较易获得高收益的投资行为,故本文以在股市资讯中挖掘“十大流通股东同系机构投资者占流通股的比例”这一具有现实意义的问题为具体研究对象,来研究如何利用数据清洗技术来解决这些“脏数据”问题.
本文结合在股市资讯中挖掘“十大流通股东同系机构投资者占流通股的比例”的应用实际,介绍了在股市资讯挖掘系统中数据清洗问题的研究背景,以及文本挖掘、数据清洗技术的国内外研究概况;概述了异常数据清洗相关知识,研究如何应用统计分析技术和人工智能方法来检测及清洗股市资讯挖掘系统中异常数据;进而在介绍重复记录清洗的意义、定义及基本流程的基础上,结合股市资讯挖掘的应用实际,分析研究了重复记录清洗流程中所涉及的算法,并提出了相关改进;最后从系统应用背景、源数据存在的问题,系统框架结构、实验过程与结果、系统评价与创新之处等方面介绍了以挖掘“十大流通股东同系机构投资者占流通股的比例”为主要功能的股市资讯挖掘系统.
▼优质解答
答案和解析
With the fast development of information technology, the data quality reqest is increasingly higher, data cleaning technique has increasingly become a focus of concern and research.
How from the mass market information fast, accurately mine the useful information, is a rich potential for development of text mining research direction. The present text mining technology is also difficult to fast, accurately identify the information in the error or not related to the " dirty data ".
In general, buy low homologous institutional investors holding shares of a higher proportion of ( and far greater than that of the other ten shareholders shares ratio) stock, is a relatively easy to obtain high yield investment behavior in the stock market information, so this thesis dig " ten big circulation stock East fellow investors shares the proportion of " the problem that has the realistic meaning for the specific object of study, to study how to use the data cleaning technique to solve the problem of "dirty data".
According to market information in mining "the ten shareholders of syngeneic institutional investors shares proportion" of the practical application, introduced in the stock market information mining system in data cleaning problems of the research background, text mining, data cleaning technique of the domestic and foreign research general situation; an overview of data cleaning related knowledge, to study how to application of statistical analysis and artificial intelligence technology to detect and cleaning market information mining system in abnormal data; and then in the duplicate records cleansing meaning, definition and basic flow on the basis of stock market information, combined with the mining practice, analysis of the duplicate records cleansing process involved in the algorithm, and put forward relevant improvement; finally from the system application background, source data problems, system framework, the experimental process and results, evaluation system and innovation etc have been introduced to dig "the ten largest shareholder in circulation from institutional investors shares ratio " as the main function of the stock market information mining system.
看了 英语翻译随着信息化的快速发展...的网友还看了以下:

在地球上的某一处,关于重力和质量的说法正确的是A质量越大,重力越大B重力越大,质量越大说一下理由.  2020-03-30 …

不要说数太小可忽略物理书中说在真空中不同质量物体从同一高度下落是同时落地,但我绝的是错了,依据万有  2020-04-25 …

莎士比亚说:“有很多良友,胜过很多财富。”这句话告诉我们[]A.有朋友就有财富B.朋友比财富更重要  2020-06-09 …

威武物产丰富,自古就是“人烟朴地桑拓稠”的富饶之地,其农业发展的优越的自然条件是()A.降水量大,  2020-06-17 …

读图“英国人均国民生产总值与生活质量”上述情形给我们的启示是A.财富和产品越多,生活质量越高B.财  2020-07-15 …

求作文一篇关于财富与社会地位现今社会,许多人认为财富与社会地位成正比,财富越多社会地位越高.你的看法  2020-11-21 …

地质图、地质剖面图如何表述出露的地层系统,识别不整合类型,并确定地质年代,而且确定构造类型、组合形式  2020-11-21 …

自从1985年发现了富勒烯以来,由于其具有独特的物理和化学性质,越来越受到人们的关注.(1)富勒烯(  2020-11-24 …

对于人造地球卫星下列说法中正确的是:A、卫星运行的速度和周期跟质量有关,质量越大则速度越大.周期越短  2021-01-12 …

关于密度下列说法正确的是()A.质量越大,密度越大B.体积越大,密度越大C.将一袋米从贵州运到北京密  2021-02-20 …