Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/174381
Title: A free database of university web links: data collection issues
Authors: Thelwall, Mike
Keywords: Web links; Web Impact Factor; Search engines; Web crawler
Issue Date: 2002
Publisher: Editorial CSIC
Citation: Cybermetrics 6/7(1): Paper 2 (2002-2003)
Abstract: This paper describes a free set of databases of the link structures of university web sites from a selection of countries, created by a specialist information science web crawler. With increasing interest in web links among information and computer scientists, this is an attempt to make raw data available for research that does not rely on the opaque techniques of commercial search engines. Basic tools for querying are also provided. The key issues in running an accurate web crawler are discussed. Access is given to the normally hidden crawler stop list, with the aim of making the crawl process more transparent. The necessity of having such a list is discussed, with the conclusion that fully automatic crawling is not socially or empirically desirable because of the existence of database-generated areas of the web and the proliferation of mirroring.
URI: http://hdl.handle.net/10261/174381
E-ISSN: 1137-5019
Appears in Collections: (CINDOC) Cybermetrics: International Journal of Scientometrics, Informetrics and Bibliometrics
Files in This Item: v6i1p2.pdf (155,72 kB, Adobe PDF)
WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.