Search results: 1-15 of 16 records found for "Computational Linguistics Web". Query time: 0.073 seconds
Clustering and Diversifying Web Search Results with Graph-Based Word Sense Induction
Clustering and Diversifying Web Search Graph Word Sense Induction
2015/9/11
Web search result clustering aims to facilitate information search on the Web. Rather than the results of a query being presented as a flat list, they are grouped on the basis of their similarity and ...
Orthographic Errors in Web Pages: Toward Cleaner Web Corpora
Orthographic Errors Web Pages Cleaner Web Corpora
2015/9/1
Since the Web by far represents the largest public repository of natural language texts, recent experiments, methods, and tools in the area of corpus linguistics often use the Web as a corpus. For app...
Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites
Document Warehouses Dedicated Web Sites
2015/8/31
We present a method and a tool, OntoLearn, aimed at the extraction of domain ontologies from
Web sites, and more generally from documents shared among the members of virtual organizations. OntoLearn ...
The Web, teeming as it is with language data, of all manner of varieties and languages, in
vast quantity and freely available, is a fabulous linguists’ playground. This special issue of
Computationa...
Parallel corpora have become an essential resource for work in multilingual natural language
processing. In this article, we report on our work using the STRAND system for mining parallel
text on th...
Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval
Statistical Translation Models Cross-Language Information
2015/8/28
Although more and more language pairs are covered by machine translation (MT) services, there
are still many pairs that lack translation resources. Cross-language information retrieval (CLIR)
is an ...
wEBMT: Developing and Validating an Example-Based Machine Translation System Using the World Wide Web
Example-Based Machine Translation System
2015/8/28
We have developed an example-based machine translation (EBMT) system that uses the World
Wide Web for two different purposes: First, we populate the system’s memory with translations
gathered from r...
This article shows that the Web can be employed to obtain frequencies for bigrams that are unseen
in a given corpus. We describe a method for retrieving counts for adjective-noun, noun-noun,
and ver...
We describe an algorithm that combines lexical information (from WordNet 1.7) with Web directories (from the Open Directory Project) to associate word senses with such directories. Such
associations ...
Exploiting the Block Structure of the Web for Computing PageRank
the Block Structure the Web Computing PageRank
2015/6/12
The web link graph has a nested block structure: the vast majority of hyperlinks link pages on a host to other pages on the same host, and many of those that do not link pages within the same domain. ...
Clustering the Tagged Web
Clustering Tagged Web
2015/6/12
Automatically clustering web pages into semantic groups promises improved search and browsing on the web. In this paper, we demonstrate how user-generated tags from large-scale social bookmarking websi...
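The Semantic Web: A New Form of Web Content That Computers Can Understand
Semantic Web; Web content form; Computer
2009/3/12
The World Wide Web still has a great deal of room to evolve, expand, and improve; it could be said that the Web has not yet left its infancy. To take the Web to a new level, leaving immaturity behind and moving toward maturity and genuine intelligence, Tim Berners-Lee, chairman of the World Wide Web Consortium at the Massachusetts Institute of Technology, who invented the Internet hypertext system for us 10 years ago, is now working to develop a new generation of the World Wide Web (Internet), to which he has given an intuitive name: the "Semantic Web".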
A new mechanism of resource transmission, Web++, is proposed to further improve
Web performance. It includes three components: a URL scheme sttp for identifying resources;
the Structured Hypertext T...
An Overview of the Web++ Framework
World-Wide Web HTTP Performance Hypertext Resource transmission
2009/2/18
This paper presents an overview of the Web++ framework, a new mechanism of hypertext resource transmission specifically
designed to further improve Web performance. The three components of the framew...
A Brief Introduction of the Web++ Framework
World-Wide Web HTTP Performance Hypertext Resource transmission
2009/2/10
This paper presents an overview of the Web++ framework, a new mechanism of hypertext resource transmission specifically
designed to further improve Web performance. The major components of the framew...