QUALITY ANALYSIS OF DOMAIN-DEPENDENT INTERNET ARCHIVES

Web archives are powerful sources of data, providing access to a wealth of information applicable to a wide variety of disciplines. This article presents ways in which web search quality can be improved through fuller use of Internet archive capabilities, based on an information retrieval system for scientific conferences that is designed to access both current and archived data. This application was then verified on the conference portals of the Association for Computing Machinery (ACM) and the Institute of Electrical and Electronics Engineers (IEEE). In the second part of the article, quality metrics are defined that allow past web content held in archive repositories to be monitored. The metrics were then calculated for three “mainstream”, generally accessible World Wide Web (WWW) portals. The article concludes by demonstrating the degree to which the use of past web pages can significantly improve the quality of Internet searching and browsing.

Keywords: data quality; internet archive; past web pages; web archive; web content analysis

Document Type: Research Article

Affiliations: Institute of Informatics, Wroclaw University of Technology, Wroclaw, Poland

Publication date: January 1, 2010
