Research Publications
Characterizing Web-based Video Sharing Workloads Video sharing services that allow ordinary Web
users to upload video clips of their choice and watch video clips
uploaded by others have recently become very popular.
This paper attempts to identify \emph{invariants} in
video sharing workloads, through comparison of the
workload characteristics of four popular video sharing services.
Our traces contain meta-data on approximately 1.8 million
videos which together have been viewed approximately 6 billion times.
Using these traces, we study the similarities and differences in use
of several Web 2.0 features such as ratings, comments, favorites,
and propensity of uploading content.
In general, we find that active contribution, such as video uploading
and rating of videos, is much less prevalent than passive use.
While uploaders in general are skewed with respect to the number
of videos they upload, the fraction of multi-time uploaders is
found to differ by a factor of two between two of the sites.
The distributions of life-time measures of video popularity are found
to have heavy-tailed forms that are similar across the four sites.
Finally, we consider implications for system design of the
identified invariants. To gain further insight into caching in
video sharing systems, and the relevance to caching of life-time
popularity measures, we gathered an additional data set tracking
views to a set of approximately 1.3 million videos from one of the
services, over a twelve week period.
We find that life-time popularity measures have some relevance for
large cache (hot set) sizes (i.e., a hot set defined according to one
of these measures is indeed relatively ``hot''), but that this
relevance substantially decreases as cache size decreases, owing to
churn in video popularity. Details
| Related Project
Related People |
