Scalable Dissemination: What's Hot and What's Not
DocUID: 2004-010
Author: Jonathan Beaver, Nicholas Morsillo, Kirk Pruhs, Panos K. Chrysanthis, Vincenzo Liberatore
Abstract: A major problem in web database applications and on the Internet in general is the scalable delivery of data. One proposed solution for this problem is a hybrid system that uses multicast push to scalably deliver the most popular data, and reserves traditional unicast pull for delivery of less popular data. However, such a hybrid scheme introduces a variety of data management problems at the server. In this paper we examine three of these problems: the push popularity problem, the document classification problem, and the bandwidth division problem. The push popularity problem is to estimate the popularity of the documents in the web site. The document classification problem is to determine which documents should be pushed and which documents must be pulled. The bandwidth division problem is to determine how much of the server bandwidth to devote to pushed documents and how much of the server bandwidth should be reserved for pulled documents. We propose simple and elegant solutions for these problems. We report on experiments with our system that validate our algorithms.
Published In: Proc. of the 7th Int'l Workshop on the Web and Databases
Pages: pp. 1-6
Year Published: 2004
Note: Held in conjunction with the ACM SIGMOD'04 Conference. DOI: 10.1145/1017074.1017084
Project: Others
Subject Area: Sensor Databases, Data Dissemination
Publication Type: Conference Paper
Sponsor: NSF ANI-0123705