Question details
English translation
Extracting structured data from Web sites is not a trivial task.
Most of the information on the Web today is in the form of
Hypertext Markup Language (HTML) documents which are
viewed by humans with a browser. HTML documents are
sometimes written by hand, sometimes with the aid of HTML
tools. Given that the format of HTML documents is designed for
presentation purposes, not automated extraction, and the fact that
most of the HTML content on the Web is ill-formed (“broken”),
extracting data from such documents can be compared to the task
of extracting structure from unstructured documents.
Best answer
Answer and explanation
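Extracting structured data from Web sites is no small task. Most of the information on the Web today takes the form of HTML documents, which people view in a browser. HTML documents are sometimes written by hand and sometimes produced with the help of HTML tools. Because the HTML format is designed for presentation rather than automated extraction, and because most HTML content on the Web is ill-formed ("broken"), extracting data from such documents is comparable to extracting structure from unstructured documents.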