Skip Navigation

Colloquium Details

Distributed Data Replenishment

Author:Thinh Nguyen Oregon State University
Date:January 19, 2012
Time:15:30
Location:220 Deschutes
Host:Reza Rejaie

Abstract

We investigate a class of randomized peer-to-peer (P2P) approach to Internet-wide distributed data storage systems that promises to reduce the coordination complexity and increases performance scalability. The core of these randomized P2P data storage systems is the data replenishment mechanism. The data replenishment automates the process of maintaining a sufficient level of data redundancy to ensure the availability of data in presence of peer departures and failures. The dynamics of peers entering and leaving the network is modeled as a stochastic process. A novel analytical time-backward technique is proposed to bound the expected time for a piece of data to remain in P2P systems. Both theoretical and simulation results are in agreement, indicating that a proposed data replenishment via random linear network coding (RLNC) outperforms other strategies that employ popular repetition and channel coding techniques. Specifically, we show that the expected time for a piece of data to remain in a P2P system, the longer the better, is exponential in the redundancy amount for the RLNC-based strategy, while they are quadratic for other strategies. We will also present an experimental distributed video storage and streaming system which employs the proposed distributed replenishment.

Biography

Dr. Nguyen earned a B.S. from the University of Washington, and an M.S. and Ph.D. from U.C. Berkeley in 2000 and 2003, respectively. He co-authored several award-winning papers at major conferences. He is a recipient of the NSF CAREER Award and the Engelbrecht Young Faculty Award.

Dr. Nguyen is interested in theories and applications of all things stochastic. His current and past research projects span a number of application areas ranging from signal processing and video coding, to networking and communication, and recently quantum information theory.