List of distributed computing projects

From Wikipedia, the free encyclopedia

Jump to: navigation, search

A list of distributed computing projects.

Contents

[edit] The fastest Distributed Computers

The PlayStation 3 Folding@home client displays a 3D model of the protein being simulated
  • Folding@Home is of April 2009 sustaining over 8.1 PFLOPS [1], the first computing project of any kind to cross the four petaFLOPS milestone. This level of performance is primarily enabled by the cumulative effort of a vast array of PlayStation 3 and powerful GPU units.[2]
  • The entire BOINC averages over 1.5 PFLOPS as of March 15, 2009[3].
  • SETI@Home computes data averages more than 528 TFLOPS[4]
  • Einstein@Home is crunching more than 150 TFLOPS[5]
  • As of August 2008, GIMPS is sustaining 27 TFLOPS.[6]

Intel Corporation has recently unveiled the experimental multi-core POLARIS chip, which achieves 1 TFLOPS at 3.13 GHz. The 80-core chip can increase this result to 2 TFLOPS at 6.26 GHz, although the thermal dissipation at this frequency exceeds 190 watts[7].

As of 2008, the fastest PC processors (quad-core) perform over 70 GFLOPS (Intel Core i7 965 XE) in double precision[8]. GPUs are considerably more powerful, for example, in the GeForce 200 Series the nVidia GTX 280 performs around 933 GFLOPS on 240 processing elements in single precision calculations[9], and that while GPUs are highly efficient at single precision calculations they are not as flexible as a general purpose CPUs in double precision operations.

[edit] Cycle Scavenging Infrastructure

[edit] Berkeley Open Infrastructure for Network Computing (BOINC)

Berkeley Open Infrastructure for Network Computing (BOINC), originally developed to support SETI@home, became useful as a platform for several distributed applications in areas as diverse as mathematics, medicine, molecular biology, climatology, and astrophysics.[10]

The Berkeley Open Infrastructure for Network Computing (BOINC) is currently the most popular volunteer-based distributed computing platform as of April 2009.[11][12]

[edit] Active Projects

[edit] Upcoming Projects

These projects are considered to be in the Alpha or Beta development stages.

  • Internet

Performance of BOINC projects:

  • over 1,300,000 participants
  • over 2,800,000 computers
  • over 1.2 PetaFLOPS (more than supercomputer Blue Gene) [1]
  • over 12 Petabytes of free disk space
  • SETI@home: 3.4 million years of computing time (January 2008)

[edit] Distributed.net

Distributed.net runs several projects:

[edit] World Community Grid

The World Community Grid is an IBM philanthropic initiative which aims to create the largest public computing grid benefiting humanity. It utilizes the BOINC platform.

[edit] Active Projects

[edit] Upcoming Projects

[edit] Completed Projects

[edit] Commercial Cycle Scavengers

[edit] Gomez Peer

Commercial distributed computing project that uses peers' computers to measure the real-world performance of websites. They pay the active peers 0.10$ for being online more than 20 hours a day and 0.0005$ for each processing minute. Each peer can install their program on more than one computer and sum up online and processing time. Payments are distributed by PayPal (you must have a minimum of $5 to get a payment).

[edit] Parabon Computation

The Parabon Computation client uses a Java VM technology, and is commercial in nature.

[edit] Custom/Uncategorized Platforms

Custom software encompasses distributed computing projects that do not make use of a third-party generic client-server infrastructure or which use one other than those listed above.

Note that not all of these projects use cycle-scavenging technology; some instead focus on more traditional remote-access-to-HPC approaches.

[edit] Active projects

  • Mathematics
  • Internet
    • AssessGRID [7] Addresses obstacles to a wide adoption of Grid technologies by bringing risk management and assessment to this field, enabling use of Grid computing in business and society.
    • A-Ware [8] will develop a stable, supported, commercially exploitable, high quality technology to give easy access to Grid resources.
    • BREIN — uses the Semantic Web and Multi-agent Systems to build simple and reliable Grid systems for business.
    • Cohesion Platform [9] is a Java-based modular Peer-to-Peer multi-application Desktop Grid computing platform for irregularly structured problems developed at the University of Tübingen (Germany).
    • DIMES — is a distributed computing project which maps the structure and evolution of the Internet infrastructure, allowing users to see how the Internet looks from their home.
    • Enabling Grids for E-sciencE
    • GridCOMP [10] — provides an advanced component platform for an effective invisible Grid.
    • GridECON [11] takes a user-oriented perspective and creates solutions to grid challenges to promote the widespread use of grids.
    • Hours — Ongoing project HarmOny and Useful Resource Sharing. Attempts to make use of the trust management and network economics to implement the heterogeneous resource sharing. Currently focusing on the resource allocation in the science grid like Teragrid and OSG. This project is run by the MIST group of Computer Science at Wayne State University.[12]
    • JHDC — Open source programmable Java distributed computing system.
    • Legion — Grid computing platform developed at the University of Virginia.
    • Majestic-12 — Uses a distributed web crawler program to index web sites for a distributed search engine.[13]
    • NESSI-GRID [14] aims to provide a unified view for European research in Services Architectures and Software Infrastructures that will define technologies, strategies and deployment policies fostering new, open, industrial solutions and societal applications that enhance the safety, security and well-being of citizens.
    • OMII-Europe is an EU-funded project which has been established to source key software components that can interoperate across several heterogeneous Grid middleware platforms.
    • OMII-UK provides free Open Source software and support to enable a sustained future for the UK e-Research community.
    • OurGrid — aims to deliver grid technology that can be used today by current users to solve present problems. To achieve this goal, OurGrid chooses a different trade-off compared to most grid projects. It forfeits supporting arbitrary applications in favor of supporting only Bag-of-Tasks applications.
    • ScottNet NCG — This is a distributed neural computing grid. A private commercial effort in continuous operation since 1995. This system performs a series of functions including data synchronization amongst databases, mainframe systems, and other data repositories. E-Commerce transaction processing, automated research and data retrieval, content analysis, web site monitoring, scripted and dynamic user emulation, shipping and fulfillment API integration and management, RSS and NNTP monitoring and analysis, real time security enforcement, and backup / restore functionality.[15]
  • Biology
  • Chemistry
  • Earth Sciences
    • Climateprediction.net — seeks to forecast the climate of the Earth in the 21st century. The original windows client is in process of being retired. At this time the windows client is used for Open University classes only.
  • Art
  • Physics
  • Cryptography
  • Miscellaneous

[edit] Completed projects

[edit] Abandoned projects

These projects were either abandoned outright or in some cases merged with other larger ongoing projects.

[edit] Upcoming projects

  • Storage@home — distributed storage infrastructure developed to solve the problem of backing up and sharing petabytes of scientific results using a distributed model of volunteer managed hosts. Data is maintained by a mixture of replication and monitoring, with repairs done as needed.

[edit] Volunteer distributed computing projects

Popular projects in volunteer distributed computing include:[49][not in citation given]

Project Start Affiliation Area Peak_#hosts Current status Computing power
GIMPS 1996 ? mathematics 10,000 active 27 TFLOPS
distributed.net 1997 U.S. non-profit organization cryptography 100,000 active ?
SETI@home 1999 University of California, Berkeley SETI 362,000 active 528 TFLOPS
Electric Sheep 1999 ? art 57,000 active ?
Folding@home 2000 Stanford University biology 406,000 active 8.1 PFLOPS
BOINC 2002 University of California, Berkeley biomedicine, other 550,000 active 1.5 PFLOPS
Grid.org 2002 philanthropic by United Devices biomedicine, other 3,734,000[50] closed ?
Climateprediction.net 2003 University of Oxford climate change 150,000 active ?
LHC@home 2004 CERN physics 60,000 active ?
World Community Grid 2004 philanthropic by IBM biomedicine, other 700,000[50] active ?
Einstein@home 2005 LIGO astrophysics 200,000 active 150 TFLOPS
Rosetta@home 2005 University of Washington biology 100,000 active ?

[edit] Physical infrastructure projects

These projects attempt to make large physical computation infrastructures available for researchers to use:

[edit] Other distributed computing software platforms

The following are generic software platforms or infrastructures used to implement some of the projects listed in the previous section.

  • Alchemi — A .NET-based system for building enterprise Grids and applications.
  • Amoeba — distributed operating system that is designed for distributed computing tasks.
  • Base One Foundation Component LibraryRAD framework for database-centric distributed computing.
  • Beowulf clustersLinux based parallel computing using commodity hardware.
  • Condor — a flexible high-throughput distributed computing scheduler
  • DCEZ — Simple to set up and use platform to perform distributed computing with a minimum of infrastructure.
  • Distributed objects — systems like CORBA, Microsoft DCOM, Java RMI, and others that try to map object oriented design onto the network.
  • DragonFly BSD — an operating system aiming to support SSI clustering
  • Globus Toolkit — an open source software toolkit used for building Grid systems and applications
  • GreenTea Software — a Java-based P2P generic distributed network computing platform that transmits code and data on-demand to run on heterogeneous OS's.
  • Gridbus Toolkit — an open source software toolkit used for building market-oriented Grid systems and applications
  • Grid MP — an infrastructure created by United Devices, used to run grid.org, and is one of the infrastructures used by World Community Grid.
  • JPPF — an open source computational grid toolkit focused on performance and ease of use
  • JSTM — uses a java Software Transactional Memory implementation for distributed object replication.
  • Popular Power — (Defunct) building a platform for Internet-wide distributed computing.
  • ProActive ProActive is a Java middleware (part of the ObjectWeb consortium, with Open Source code) for parallel, distributed and multi-threaded computing.
  • RPyC — Remote Python Call, a platform for building distributed applications.
  • Sun Grid Engine — a distributed resource management system, similar to Condor
  • SynfiniWayFujitsu's middleware with which a virtualised IT framework can be created that provides a uniform and global view of resources within a department, a company, or a company with its suppliers.
  • Terracotta — Open source Java clustering - Java extension that provides seamless clustering of plain java code. See also Terracotta Cluster.
  • UNICORE — an open source software platform for supporting Grid systems and applications
  • Vaakya — software developed by Vaakya Technologies Pvt. Ltd., a Bangalore-based company. It has its own language and different frameworks (e.g. business application components, handheld devices, 3D graphics) that allow ISVs to develop applications, particularly for businesses, that run entirely on premises on ordinary work stations, not expensive servers.
  • XGE — a Windows-based product which distributes tasks on a local network by virtualizing filesystem access.
  • Xgrid — software developed by Apple's Advanced Computation Group.

Comparison of cluster software

[edit] References

  1. ^ "Client statistics by OS". Folding@Home. 2009-01-22. http://fah-web.stanford.edu/cgi-bin/main.py?qtype=osstats. Retrieved on 2009-01-23. 
  2. ^ Staff (November 6, 2008). "Sony Computer Entertainment's Support for Folding@home™ Project on PlayStation®3 Receives This Year's "Good Design Gold Award"". Sony Computer Entertainment Inc.. Sony Computer Entertainment Inc. (Sony Computer Entertainment Inc.). http://www.scei.co.jp/corporate/release/081106de.html. Retrieved on December 11, 2008. 
  3. ^ "Credit overview". BOINC. http://www.boincstats.com/stats/project_graph.php?pr=bo. Retrieved on 2008-08-04. 
  4. ^ "SETI@Home Credit overview". BOINC. http://www.boincstats.com/stats/project_graph.php?pr=sah. Retrieved on 2008-08-04. 
  5. ^ "Server Status". Einstein@Home. http://einstein.phys.uwm.edu/server_status.php. Retrieved on 2008-07-08. 
  6. ^ Internet PrimeNet Server Parallel Technology for the Great Internet Mersenne Prime Search
  7. ^ http://www.bit-tech.net/hardware/2007/04/30/the_arrival_of_teraflop_computing/2
  8. ^ "Intel Core i7 Performance Preview". TECHGAGE. 2008-11-03. http://techgage.com/article/intel_core_i7_performance_preview/9. Retrieved on 2008-11-17. 
  9. ^ http://www.tomshardware.com/reviews/nvidia-gtx-280,1953-2.html
  10. ^ BOINC - Berkeley Open Infrastructure for Network Computing, Dr. David Anderson describes SETI@home, BOINC and Distributed Computing, youtube.com
  11. ^ BOINCStats, active hosts.
  12. ^ Folding@Home, active hosts.
  13. ^ "Cels@Home" website
  14. ^ "POEM@Home website
  15. ^ "BRaTS@Home" website
  16. ^ "Distributed Exact Cover Solver" website
  17. ^ Ramsey@Home website
  18. ^ Rectilinear Crossing Number website
  19. ^ "Project Neuron" website
  20. ^ "SHA-1 Collision Search" website
  21. ^ "Eternity2.net" website
  22. ^ "NQueens@Home" website
  23. ^ "pPot Tables" website
  24. ^ "Project Sudoku" website
  25. ^ "RenderFarm@Home" website
  26. ^ FreeHAL@home website
  27. ^ "Docking@Home" website
  28. ^ "Genetic Life" website
  29. ^ "GPUGRID.net" website
  30. ^ Hydrogen@Home website
  31. ^ "SciLINC" website
  32. ^ Botanicus.org description of SciLINC
  33. ^ "Superlink@Technion" website
  34. ^ "Virtual Prairie" website
  35. ^ "Milkyway@home" website
  36. ^ "AQUA@home" website
  37. ^ "Leiden Classical" website
  38. ^ LHC@home website
  39. ^ "Magnetism@home" website
  40. ^ "RND@home" website
  41. ^ "APS@Home" website
  42. ^ "Second Computing" website"
  43. ^ "Yoyo@home" website
  44. ^ "www.Bio4All.Tk" website
  45. ^ "sharkGrid" website
  46. ^ Xiaoyun Wang, Dengguo Feng, Xuejia Lai, Hongbo Yu: Collisions for Hash Functions MD4, MD5, HAVAL-128 and RIPEMD, Cryptology ePrint Archive Report 2004/199, 16 Aug 2004, revised 17 Aug 2004.
  47. ^ "DepSpid" website
  48. ^ 3x+1@home website
  49. ^ David P. Anderson (2005-05-23). A Million Years of Computing. http://boinc.berkeley.edu/talks/singapore_public.pdf. Retrieved on 2009-01-30. 
  50. ^ a b Host numbers from the UD platform represent unique installations, so are greater than the number of actual computers.

[edit] External links

Personal tools