Ganglia 3.0.5 (Louis) Released
The Ganglia development team is proud to release version 3.0.5 (Louis) of the popular Ganglia monitoring software. Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids.
The latest release is available for immediate download from: http://sourceforge.net/project/showfiles.php?group_id=43021&package_id=35280
This release has a few feature/portability enhancements as well as the usual array of bugfixes.
Work is underway for the next (3.1.0) release of Ganglia which will allow metrics to be dynamically loaded via DSO. These metrics can be written either in C or Python making it extremely easy to create plugins for monitoring metrics not already present by default. Apr, expat and libconfuse will be built dynamically in the new release which will make packaging for distributions easier.
Changes: The following is a summary of changes in this release. For detailed changelog please refer to the ChangeLog file in the release distribution tarball:
- [gmetad] Fixed a bug where messages are being discarded in MacOSX and thus causing data from clients not being consistently and accurately saved to the rrd files (Mike Walker)
- [win32] Include documentation (README.WIN) for building under Windows
- [webfrontend] Enlarge graphs by clicking on them (Ulf)
- [webfrontend] Include RRDTool version in frontend footer (Matthew Chambers)
- [webfrontend] Only set the grid stack cookie if it hasn’t been set before (Matt Ryan)
- [webfrontend] New feature to allow sorting by hosts up and hosts down in meta context (Bernard Li, Eli Stair, Timothy D Witham)
- [gstat] New option “-n” to show numeric addresses instead of hostname (Bernard Li)
- Builds under Yellow Dog Linux on Sony PlayStation 3 ppc64 (Bernard Li)
- Do not automatically start services (gmond, gmetad) after RPM installation (Bernard Li)
- Add y-labels for some metrics. Needed to fix width of RRD images. (Martin Knoblauch)
- Build system (Autotools) enhancements (Carlo Marcelo Arenas Belon)
- Misc bug fixes
Slides From ‘Capacity Planning for LAMP’ Talk at MySQL Conf 2007
Ganglia 3.0.4 Released
Ganglia’s Research Contribution
The Ganglia Distributed Monitoring System: Design, Implementation, and Experience
has been cited 114 times according to Google Scholar. The followup paper titled Wide area cluster monitoring with Ganglia
has been cited 38 times.
Building on AIX Using the Native Compiler
Ganglia 3.0.3 Released
InfoWorld Article on Ganglia
There is a short write-up on the InfoWorld blog entitled, Open Source Management Tools to Watch: Ganglia.
What Is Ganglia?
This is a project whose homepage has been hacked with the SourceForge backdoor by a 1337 hacker! It is extremely lucky because this message is the only change I did ;) After I found this backdoor, I, being nice, added this message to some SourceForge-hosted sites to warn them, instead of maliciously dropping their tables. Whom does this exploit affect?
Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It is based on a hierarchical design targeted at federations of clusters. It leverages widely used technologies such as XML for data representation, XDR for compact, portable data transport, and RRDtool for data storage and visualization. It uses carefully engineered data structures and algorithms to achieve very low per-node overheads and high concurrency. The implementation is robust, has been ported to an extensive set of operating systems and processor architectures, and is currently in use on thousands of clusters around the world. It has been used to link clusters across university campuses and around the world and can scale to handle clusters with 2000 nodes. Ganglia is a BSD-licensed open-source project that grew out of the University of California, Berkeley Millennium Project which was initially funded in large part by the National Partnership for Advanced Computational Infrastructure (NPACI) and National Science Foundation RI Award EIA-9802069. NPACI is funded by the National Science Foundation and strives to advance science by creating a ubiquitous, continuous, and pervasive national computational infrastructure: the Grid. Current support comes from Planet Lab: an open platform for developing, deploying, and accessing planetary-scale services.Talks, Papers, Presentations
Matthew L. Massie, Brent N. Chun, and David E. Culler.
Parallel Computing, Vol. 30, Issue 7, July 2004.
[pdf]
Wide Area Cluster Monitoring with Ganglia
Federico D. Sacerdoti, Mason J. Katz, Matthew L. Massie, David E Culler. In Proceedings of the IEEE Cluster 2003 Conference, Hong Kong. [pdf] [ps]
Ganglia: Past, Present and Future
(This is a talk that Matt Massie gave November 5th, 2002 to the Linux User Group at Lawrence Berkeley National Labs)