Peer-to-Peer Storage Systems

Spring 2004

This is the home page of the seminar course in Peer-to-Peer Systems for the spring semester of 2004. Information on course goals etc. is available in the course base. Also see below for an abbreviated version of the course description.

Note that there also is a news group it-c.courses.IP2P. where students and teachers can discuss exercises, assignments, etc. online, cf. Sysadm's manual on news groups.

Time, place, and format

Time and place:

Format:

Weekly

and possibly one larger programming exercise.

Exam information

We now have a page with exam information (last updated: June 4nd, 2004).

Material and presentations

The reading material for the seminar course will be research papers in the subject area. The main material for each lecture is enclosed in brackets [ and ] below; supplementary material is enclosed in parenthesis ( and ). Please be sure to check if the material for a presentation has been updated as a consequence of the dry-run the wednesday before the presentation.

The following is a tentative plan of the presentations during the seminar.

DateTopicMaterialPresenters Exercises
1.6/2IntroductionHN
2.13/2Unstructured routing and file sharing: Gnutella and Freenet[Gnutella,CSWH2001]
(CMHSW2002,Rit2001)
Slides: .pdf, .ppt
NR,PFdS,MPT 2
3. 20/2Distributed file systems: Chord and CFS[SML+2002,DFK+2001]
Slides: .ppt
NT, ML, PSK 3
4.27/2Distributed file systems: Tapestry and Oceanstore[YHS+2004, KBC+2000] (REG+2003, RWE+2001)
Slides: II: .pdf, .ppt; III: .ppt
EMV, MF, PGH 4
5.5/3Cooperative webcaches: Pastry and SQUIRREL[RD2001, IRD2001] (FreePastry)
Slides: .ppt; .pdf
OGP, HS, ETL 5
6.12/3Multicast and group communication: Scribe[CDKR2002] (CJKplus2003)
Slides: .ppt
MB, KE 6
7.19/3SkipNet[HJS+2003, HJTW2003]
(HM200?, HDM+2002)
Slides: .ppt
CSM,PT 7
8.26/3Anononymity: Free Haven[DFM2000] (Din2000)
Slides: .ppt
MD, TK 8
9.2/4Searching and querying in P2P systems[HHH+2002, HHL+2003
Slides: .ppt
JM, SRA,AMM 9
9/4Easter break
10.16/4Multimedia streaming[CDK+2003]
Slides: .ppt
MiB, EAJ 10
11.23/4XML StoreSlides: .pptRK, CTH 11
12.30/4?? and evaluation[??]

Material

Gnutella & Freenet

[Gnutella]*
Distributed Search Solutions, "The Gnutella Protocol Sepcification v0.4".
http://rfc-gnutella.sourceforge.net/Development/GnutellaProtocol0_4-rev1_2.pdf
[CSWH2001]*
Ian Clarke, Oskar Sandberg, Brandon Wiley, and Theodore W. Hong, "Freenet: A Distributed Anonymous Information Storage and Retrieval System", in Hannes Federrath (ed), "Designing Privacy Enhancing Technologies: International Workshop on Design Issues in Anonymity and Unobservability", LNCS 2009, Springer, 2001.
http://www.doc.ic.ac.uk/~twh1/academic/papers/icsi-revised.pdf
[CMHSW2002]
Ian Clarke, Scott G. Miller, Theodore W.Hong, Oskar Sandberg, and Brandon Wiley, "Protecting Free Expression Online with Freenet", in IEEE Internet Computing, January/February 2002.
http://freenet.sourceforge.net/papers/freenet-ieee.pdf
[Rit2001]
Jordan Ritter, "Why Gnutella Can't Scale. No, Really.", 2001.
http://www.darkridge.com/~jpr5/doc/gnutella.html

Chord & CFS

[SML+2002]*
Ion Stoica, Robert Morris, David Liben-Nowell, David R. Karger, M. Frans Kaashoek, Frank Dabek, Hari Balakrishnan, "Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications". To Appear in IEEE/ACM Transactions on Networking.
http://www.pdos.lcs.mit.edu/chord/papers/paper-ton.pdf
[DFK+2001]*
Frank Dabek, M. Frans Kaashoek, David Karger, Robert Morris, and Ion Stoica, "Wide-area cooperative storage with CFS", in ACM SOSP 2001, Banff, October 2001.
http://www.pdos.lcs.mit.edu/papers/cfs:sosp01/cfs_sosp.pdf

Tapestry & OceanStore

[YHS+2004]*
Ben Y. Zhao, Ling Huang, Jeremy Stribling, Sean C. Rhea, Anthony D. Joseph, and John D. Kubiatowicz, "Tapestry: A Resilient Global-scale Overlay for Service Deployment", appears in IEEE Journal on Selected Areas in Communications, Vol 22, No. 1, January 2004
http://oceanstore.cs.berkeley.edu/publications/papers/pdf/tapestry_jsac.pdf
[KBC+2000]*
John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels, Ramakrishna Gummadi, Sean Rhea, Hakim Weatherspoon, Westley Weimer, Chris Wells, and Ben Zhao, "OceanStore: An Architecture for Global-Scale Persistent Storage", appears in Proceedings of the Ninth international Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2000), November 2000.
http://oceanstore.cs.berkeley.edu/publications/papers/pdf/asplos00.pdf
[REG+2003]
Sean Rhea, Patrick Eaton, Dennis Geels, Hakim Weatherspoon, Ben Zhao, and John Kubiatowicz, "Pond: the OceanStore Prototype", appears in Proceedings of the 2nd USENIX Conference on File and Storage Technologies (FAST '03), March 2003.
http://oceanstore.cs.berkeley.edu/publications/papers/pdf/fast2003-pond.pdf
[RWE+2001]
Sean Rhea, Chris Wells, Patrick Eaton, Dennis Geels, Ben Zhao, Hakim Weatherspoon, and John Kubiatowicz, "Maintenance-Free Global Data Storage", appears in IEEE Internet Computing , Vol 5, No 5, September/October 2001, pp 40-49.
http://oceanstore.cs.berkeley.edu/publications/papers/pdf/ieeeic.pdf

Pastry & SQUIRREL

[IRD2002]*
Sitaram Iyer, Antony Rowstron and Peter Druschel, "SQUIRREL: A decentralized, peer-to-peer web cache", appeared in Principles of Distributed Computing (PODC 2002), Monterey, CA.
http://research.microsoft.com/~antr/PAST/p-squirrel.pdf
[RD2001]*
Antony Rowstron and Peter Druschel, "Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems". IFIP/ACM International Conference on Distributed Systems Platforms (Middleware), Heidelberg, Germany, pages 329-350, November, 2001.
http://research.microsoft.com/~antr/PAST/pastry.pdf
[FreePastry]
Peter Druschel et. al, "FreePastry: a modular, open source implementation of the Pastry p2p routing and location substrate".
http://www.cs.rice.edu/CS/Systems/Pastry/FreePastry/

SCRIBE

[CDKR2002]*
Miguel Castro, Peter Druschel, Anne-Marie Kermarrec and Antony Rowstron, "SCRIBE: A large-scale and decentralised application-level multicast infrastructure", IEEE Journal on Selected Areas in Communications (JSAC) (Special issue on Network Support for Multicast Communications). 2002, to appear.
http://research.microsoft.com/~antr/PAST/jsac.pdf
[CJK+2003]
Miguel Castro, Michael B. Jones, Anne-Marie Kermarrec, Antony Rowstron, Marvin Theimer, Helen Wang and Alec Wolman, "An Evaluation of Scalable Application-level Multicast Built Using Peer-to-peer overlays", Infocom 2003, San Francisco, CA, April, 2003.
http://research.microsoft.com/~antr/PAST/infocom-compare.pdf

SkipNet

[HJS+2003]
Nicholas J.A. Harvey, Michael B. Jones, Stefan Saroiu, Marvin Theimer, and Alec Wolman, "SkipNet: A Scalable Overlay Network with Practical Locality Properties", in Proceedings of the Fourth USENIX Symposium on Internet Technologies and Systems (USITS '03), Seattle, WA, March 2003.
http://www.research.microsoft.com/sn/Herald/papers/USITS/SkipNet-Usits.pdf
[HJTW2003]
Nicholas J.A. Harvey, Michael B. Jones, Marvin Theimer, and Alec Wolman, "Efficient Recovery From Organizational Disconnects in SkipNet", In Proceedings of the 2nd International Workshop on Peer-to-Peer Systems (IPTPS '03), Berkeley, CA, February 2003.
http://www.research.microsoft.com/sn/Herald/papers/IPTPS/SkipNet-Iptps.pdf
[HM200?]
Nicholas J.A. Harvey, J. Ian Munro, "Deterministic SkipNet", to appear in Information Processing Letters.
http://theory.lcs.mit.edu/~nickh/IPL-DeterministicSkipNet.pdf
[HDM+2003]
Nicholas J.A. Harvey, John Dunagan, Michael B. Jones, Stefan Saroiu, Marvin Theimer, and Alec Wolman, "SkipNet: A Scalable Overlay Network with Practical Locality Properties", Microsoft Technical Report 2002-92. (Extended version of USITS paper).
http://www.research.microsoft.com/sn/Herald/papers/skipnet.pdf

FreeHaven

[]
Roger Dingledine, Michael J. Freedman, David Molnar, "The Free Haven Project: Distributed Anonymous Storage Service", in Proceedings of the Workshop on Design Issues in Anonymity and Unobservability, July 2000 (LNCS 2009).
http://www.freehaven.net/doc/berk/freehaven-berk.ps
[]
Roger Dingledine, "The Free Haven Project: Design and Deployment of an Anonymous Secure Data Haven", MIT Master's Thesis, June 2000.
http://www.freehaven.net/doc/freehaven.pdf

PIER and DHT-Based querying

[HHH+2002]
Matthew Harren, Joseph M. Hellerstein, Ryan Huebsch, Boon Thau Loo, Scott Shenker, and Ion Stoica, "Complex Queries in DHT-based Peer-to-Peer Networks", IPTPS, March 2002.
http://db.cs.berkeley.edu/papers/iptps02-dhtquery.pdf
[HHL+2003]
Ryan Huebsch, Joseph M. Hellerstein, Nick Lanham, Boon Thau Loo, Scott Shenker, Ion Stoica, "Querying the Internet with PIER.". In VLDB, 2003.
http://db.cs.berkeley.edu/papers/vldb03-pier.pdf

Multimedia streaming (SplitStream)

[CDK+2003]
M. Castro, P. Druschel, A-M. Kermarrec, A. Nandi, A. Rowstron and A. Singh, "SplitStream: High-bandwidth multicast in a cooperative environment", SOSP'03, Lake Bolton, New York, October, 2003.
http://research.microsoft.com/~antr/PAST/SplitStream-sosp.pdf

People

Teachers

Name Initials Office Telephone E-mail
Fritz Henglein FH N203 (DIKU) 35 32 14 63 henglein@diku.dk
Henning Niss HN 2.42 (ITU 38 16 89 55 hniss@itu.dk

Students

Name Initials E-mail Exam Exercises
2 3 4 5 6 7 8 9 10 11 Accept
Anton Michael MessingAMMtmessing Yes x x x x x x x
Christian Schmidt-MadsenCSMcsm Xes x x x x x x x x
Christian Theil HaveCTHcth Yes x x x x x x x
Eirik Todal LabergETLeinkl Xes x x x x x x x
Erik Aarslew-JensenEAJemv Yes x x x x x x x x
Erik Matthias VerdonerEMVemv Xes x x x x x x x
Håvard SemundsetHShavard Yes x x x x x x x
Jesper MunkJMjem Xes x / x x x x x x
Kasper EgdøKE?? Yes x x x x x x x
Mads DanquahMDdanquah Yes x x x x x x x x
Mads LundemannMLthenox Yes x x x x x x x x x x
Matthew Paul TravailleMPTmatre03 Xes x x x x x x x x
Mikkel BlannéMiB?? Yes x x x x x x x
Morten BjerreMoBmortenb Yes x x x x x x x
Morten FranckMFskyfer Xes x x x x x x x x x x x
Nicolai RitzNRnritz Yes x x x x x x x
Niels TeglsboNTteglsbo Yes x x x x x x x x x x
Orri Gautur PálssonOGPorri Yes x x x x x x x
Paul Frederick d'SouzaPFdSpfdsouza Yes x x x x x x x
Peter Gath HansenPGHgath Xes x x x x x x x x
Peter TiedemannPTpetert Yes x x x x x x x
Philip Skov KnudsenPSKskov Yes x x x x x x x
René KofoedRKrkofoed Yes x x x x x x x
Sravan Kumar AnanthulaSKAananthula3 Yes x x x x % x x
Troels KroghTKtok Xes x x x x x x x x

Description

Title:
Peer-to-Peer Storage Systems
Teachers:
Fritz Henglein and Henning Niss
Course ID:
IP2P/830
Time and place:
presentations: 9-12 (room 2.59); reading and exercise solving: 13-15:30 (room 2.59)
Format:
Weekly 1 hour wrap-up from last week, 2 hour presentations, lunch breaks, followed by 2-3 hours of reading and exercise solving. Possibly one larger programming exercise.
Credits:
7.5 ECTS points; oral exam with grade on 13-scale. To qualify for participation in the exam active participation in the seminar is required.
Goals:
In this course/seminar we study fundamental computer science (domain independent) issues in the design and implementation of mobile and distributed software systems, with an emphasis on data management and programming.

This seminar is intended to:

  • present you with fundamental problems that present and future distributed and mobile systems face;
  • explore known techniques for addressing those problems;
  • provide a survey of state-of-the-art systems embodying those techniques, including select distributed file systems and peer-to-peer based systems; and
  • shed fundamental insights into the interplay between programming models and distributed and mobile systems techniques.
Topics:

The focus of the course/seminar will be on fundamental issues such as

  • p2p routing and p2p overlays;
  • file sharing;
  • distributed/p2p file systems;
  • group communication

rather than on the systems addressing those issues. Of couse, we will illustrate these fundamental items by looking at how concrete system address them

  • file sharing systems (Napster, Gnutella, FastTrack, etc);
  • unstructured versus structured routing (fx: Gnutella flooding, FreeNet backtracking, Chord, Tapstry, Pastry, and Kademila content-based hashes)
  • distributed file system (fx: CFS, OceanStore, PAST);
  • group communication (fx: SCRIBE);
  • other applications (fx: XML Store).

This is not the outline of the course schedule!

The course will be followed up with opportunities for students projects of various lengths.

Course material:
Mainly research papers. Will be posted on the course home page.
Prerequisites:
Knowledge about
  • programming and programming languages,
  • distributed systems,
  • concurrency.
Web page (URL):
http://www.itu.dk/courses/IP2P/F2004/ and http://www.plan-x.org/
Additional remarks:
The course is held in English, unless all the participants are fluent in Danish and agree on switching to Danish.

Last updated: Feb 9, 2004 (hniss)