Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/161410

Privacy-preserving data-mining through micro-aggregation for web-based e-commerce

Author: Navarro-Arribas, Guillermo; Torra, Vicenç
Keywords: Worldwide web
Publication date: 2010
Publisher: Emerald Group Publishing
Citation: Internet Research 20: 366-384 (2010)
Abstract: Purpose: The purpose of this paper is to anonymize web server log files used in e-commerce web mining processes. Design/methodology/approach: The paper applies statistical disclosure control (SDC) techniques to achieve this goal. More precisely, it introduces the micro-aggregation of web access logs. Findings: The experiments show that the proposed technique provides good results in general, and performs especially well on relatively small websites. Research limitations/implications: As in all SDC techniques, there is always a trade-off between privacy and utility or, in other words, between disclosure risk and information loss. The proposal bears this issue in mind, providing k-anonymity while preserving acceptable information accuracy. Practical implications: Web server logs are valuable information, used nowadays for user profiling and general data-mining analysis of a website in e-commerce and e-services. This proposal allows such logs to be anonymized so that they can be safely outsourced to other companies for marketing purposes, stored for further analysis, or made publicly available without risking customer privacy. Originality/value: Current solutions to this problem are poor and scarce, and are normally limited to removing sensitive information from the query strings of URLs. Moreover, to the authors' knowledge, SDC techniques have never been applied to the anonymization of web logs. © Emerald Group Publishing Limited.
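The abstract names micro-aggregation and k-anonymity without describing the procedure. The general idea can be illustrated with a minimal sketch of fixed-size univariate micro-aggregation: sort the records, partition them into groups of at least k, and replace each value with its group mean, so that every published value is shared by at least k records. This is not the authors' algorithm, which this record does not specify; the function name `microaggregate`, the single numeric attribute, and the sample data are assumptions made purely for illustration.

```python
from statistics import mean


def microaggregate(values, k):
    """Replace each value with the mean of its group of at least k sorted neighbours.

    Generic textbook-style sketch, not the method from the paper.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    n = len(values)
    if n < k:
        raise ValueError("need at least k records")

    # Order record indices by value so that similar records end up in the same group.
    order = sorted(range(n), key=lambda i: values[i])

    result = [0.0] * n
    start = 0
    while start < n:
        end = start + k
        # If fewer than k records would remain, fold them into the current
        # group so that every group holds at least k records.
        if n - end < k:
            end = n
        group = order[start:end]
        centroid = mean(values[i] for i in group)
        for i in group:
            result[i] = centroid
        start = end
    return result


# Hypothetical per-session page counts (illustrative data, not from the paper).
page_counts = [3, 41, 5, 38, 4, 40, 7]
anonymized = microaggregate(page_counts, k=3)
```

In this sketch a larger k lowers disclosure risk, because more records share each published value, but increases information loss, because values move further from the originals. This is the privacy/utility trade-off the abstract refers to.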
Identifiers: doi: 10.1108/10662241011050759
issn: 1066-2243
Appears in collections: (IIIA) Articles
Files in this item:
File: accesoRestringido.pdf; Size: 15.38 kB; Format: Adobe PDF

NOTE: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.