Ganglia

Monitoring clusters and Grids since the year 2000

Ganglia 3.0.6 (Foss) Released

The Ganglia development team is pleased to release Ganglia 3.0.6 (Foss) which is available for immediate download from: http://sourceforge.net/project/showfiles.php?group_id=43021&package_id=35280 This release includes a security fix for web frontend cross-scripting vulnerability. All Ganglia web frontend users are strongly recommended to upgrade to this version. In most cases the version of the frontend does not need to match the version of gmetad and/or gmond – if problem arises, please drop us a note at ganglia-general@lists.sourceforge.net. Special thanks to Romain Wartel at CERN for discovering the vulnerability and reporting it to us and to Alex Dean for stepping up with the fix so quickly.

Ganglia 3.0.5 (Louis) Released

The Ganglia development team is proud to release version 3.0.5 (Louis) of the popular Ganglia monitoring software. Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids.

The latest release is available for immediate download from: http://sourceforge.net/project/showfiles.php?group_id=43021&package_id=35280

This release has a few feature/portability enhancements as well as the usual array of bugfixes.

Work is underway for the next (3.1.0) release of Ganglia which will allow metrics to be dynamically loaded via DSO. These metrics can be written either in C or Python making it extremely easy to create plugins for monitoring metrics not already present by default. Apr, expat and libconfuse will be built dynamically in the new release which will make packaging for distributions easier.

Changes: The following is a summary of changes in this release. For detailed changelog please refer to the ChangeLog file in the release distribution tarball:

  • [gmetad] Fixed a bug where messages are being discarded in MacOSX and thus causing data from clients not being consistently and accurately saved to the rrd files (Mike Walker)
  • [win32] Include documentation (README.WIN) for building under Windows
  • [webfrontend] Enlarge graphs by clicking on them (Ulf)
  • [webfrontend] Include RRDTool version in frontend footer (Matthew Chambers)
  • [webfrontend] Only set the grid stack cookie if it hasn’t been set before (Matt Ryan)
  • [webfrontend] New feature to allow sorting by hosts up and hosts down in meta context (Bernard Li, Eli Stair, Timothy D Witham)
  • [gstat] New option “-n” to show numeric addresses instead of hostname (Bernard Li)
  • Builds under Yellow Dog Linux on Sony PlayStation 3 ppc64 (Bernard Li)
  • Do not automatically start services (gmond, gmetad) after RPM installation (Bernard Li)
  • Add y-labels for some metrics. Needed to fix width of RRD images. (Martin Knoblauch)
  • Build system (Autotools) enhancements (Carlo Marcelo Arenas Belon)
  • Misc bug fixes

Ganglia 3.0.4 Released

The Ganglia Development Team is pleased to announce the release of Ganglia 3.0.4 (Otto) which is available for immediate download from http://ganglia.info/downloads.php This release is based on SVN revision 695 and is mainly a bug fix release. Please see the ChangeLog in the Tarball for the complete list of fixes. There are a few new features that should be noted: * On Linux only filesystem mounted RW are counted in the disk statistics * Disk Statistics for Solaris * Added “host spoofing” capabilities to gmetrics/gmond This release has been tested on the following platforms: * Fedora FC4 / ia32 * Fedora Core 6 (i386) * Gentoo Linux 2006.1 (amd64) * Solaris 9 (sparc) * Solaris 10 (i386, amd64 and sparc) * NetBSD 2.0.2 (i386) * NetBSD 3.0 (i386) * NetBSD 3.1 (i386, amd64) * FreeBSD 6.1 (amd64) The development of Ganglia 3.0.5 is now open. Enjoy. The Ganglia team

Building on AIX Using the Native Compiler

Hi, this is basically the README.AIX file that will be in 3.0.4. It now has a better receipe for building with the native XLC compiler. It also describes what is needed to build “gmetad”. I thought it useful to publish this now. Using Ganglia on AIX ~~~~~~~~~~~~~~~~~~~~ This Version is tested on AIX 5.1, 5.2 and 5.3. AIX 4.3 might work as well, but it’s not tested by now. Installation ~~~~~~~~~~~~ You still need some “tricks” to use ganglia on a AIX system: 1. The AIX-Version should not be compiled with shared libraries You must add the “–disable-shared” and “–enable-static” configure flags if you running on AIX
./configure –disable-shared –enable-static
2. You should use “gcc”. xlc does not work out of the box. If you only have “xlc”, the following might work. Run configure first !! a) remove “-Wall” from all Makefiles, especially:
lib/Makefile gmond/gstat/Makefile gmond/Makefile gmetric/Makefile gmetad/Makefile (see below)
This should be done automatically, but automake/autoconf experts are needed. b) to actually build the binaries do:
make CC=”cc -qlanglvl=extc99”
c) To build “gmetad”, the following is needed: c1) install the following software, preferably from RPMs:
libart_lgpl-devel-2.3.16-1 freetype2-devel-2.1.7-2 zlib-1.2.2-4 libpng-1.2.1-6 freetype2-2.1.7-2 libart_lgpl-2.3.16-1 rrdtool-1.2.11.perl56-1 (cp /opt/freeware/include/rrd.h /usr/include/ )
c2) For Gmetad-3.0.3 or earlier: there is a conflict regarding the macro “FRAMESIZE”. In “gmetad/*” change all occurences of “FRAMESIZE” to “GMETAD_FRAMESIZE”. This will be fixed in version 3.0.4. Known problems and Limitations ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ - Occasionally Ganglia might report wrong network statistics, because there is no test for arithmetic overflow of the AIX counters by now. (Will be fixed soon, but might not make it in ganglia-3.0.2) - The following standard metrics are _not_ reported (reported as 0): mem_buffers (-), mem_shared (-), part_max_used(+), cpu_sintr (–), cpu_intr (–), cpu_aidle (+), cpu_nice (-) (–) cpu_nice, cpu_intr and cpu_sintr: There is no way to include this metric, because AIX dose not know anything about them (-) mem_buffers and mem_shared: libperfstat does not report his information, but maybe somebody knows another way. (+) part_max_used and cpu_aidle: it’s quite easy to do this metrics as well using libperfstat, but no body has written code so far.

Ganglia 3.0.3 Released

The Ganglia Development Team is pleased to announce the release of Ganglia 3.0.3 (Orwille) which is available for immediate download from http://ganglia.info/downloads.php This release fixes a bug that caused XML port output to be truncated, fixes FreeBSD compilation errors, makes gmetad more resilient to round-robin database problems, makes “gmond -t” output more complete, updates the underlying Apache runtime library, expands the maximum size of gmetric messages, provides various minor PHP bugfixes in the Web frontend, and adds 3D pie graph effects. This release has been tested on the following platforms:
* Fedora FC4 / ia32 * Solaris 8 / UltraSparc 64-Bit / gcc 3.3.1 –> “CC = gcc -m64 ./configure” * MacOS X * AIX-5.3 / PPC64 / XLC –> See separate post on this topic
The development of Ganglia 3.0.4 is now open. Enjoy. The Ganglia team

What Is Ganglia?

This is a project whose homepage has been hacked with the SourceForge backdoor by a 1337 hacker! It is extremely lucky because this message is the only change I did ;) After I found this backdoor, I, being nice, added this message to some SourceForge-hosted sites to warn them, instead of maliciously dropping their tables. Whom does this exploit affect?

Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It is based on a hierarchical design targeted at federations of clusters. It leverages widely used technologies such as XML for data representation, XDR for compact, portable data transport, and RRDtool for data storage and visualization. It uses carefully engineered data structures and algorithms to achieve very low per-node overheads and high concurrency. The implementation is robust, has been ported to an extensive set of operating systems and processor architectures, and is currently in use on thousands of clusters around the world. It has been used to link clusters across university campuses and around the world and can scale to handle clusters with 2000 nodes. Ganglia is a BSD-licensed open-source project that grew out of the University of California, Berkeley Millennium Project which was initially funded in large part by the National Partnership for Advanced Computational Infrastructure (NPACI) and National Science Foundation RI Award EIA-9802069. NPACI is funded by the National Science Foundation and strives to advance science by creating a ubiquitous, continuous, and pervasive national computational infrastructure: the Grid. Current support comes from Planet Lab: an open platform for developing, deploying, and accessing planetary-scale services.

Talks, Papers, Presentations

The Ganglia Distributed Monitoring System: Design, Implementation, and Experience.
Matthew L. Massie, Brent N. Chun, and David E. Culler.
Parallel Computing, Vol. 30, Issue 7, July 2004.
[pdf]

Wide Area Cluster Monitoring with Ganglia
Federico D. Sacerdoti, Mason J. Katz, Matthew L. Massie, David E Culler. In Proceedings of the IEEE Cluster 2003 Conference, Hong Kong. [pdf] [ps]

Ganglia: Past, Present and Future
(This is a talk that Matt Massie gave November 5th, 2002 to the Linux User Group at Lawrence Berkeley National Labs)