Jan 16, 2013 5:52 AM
Status of Zenoss HPC
Hello,
I'm interested in trying out Zenoss HPC, so I wanted to ask several questions.
Can you tell me the current status of Zenoss HPC? Is it still being actively developed?
And is this (http://dev.zenoss.org/trac/browser/branches/zenoss-2.1.3-hpc/RPMS) the latest version of Zenoss HPC?
Hi Vadim,
We are very happy to answer any questions you may have. We are still developing Zenoss and are currently looking at a major upgrade initiative. I believe we have a more recent version of the software.
I have moved into a new position, but I still oversee this effort, just in a different capacity. I would like to get our developers involved in answering any questions you may have. How would you like to proceed? Phone call? E-mail? Just let me know and I will work to get you what you need.
Regards,
Cindy Martin
Dear Cindy,
Thanks for your quick response!
I think email for now will be sufficient.
The main question for now is whether it's possible to get and try out
the latest version of Zenoss HPC (and if we could get some technical
documentation, that would be great!).
We are thinking about writing a cluster monitoring system ourselves, but
maybe Zenoss HPC can provide the functionality we need?
We've read some information about it, but it was more of a general overview. The
functionality of Zenoss HPC is promising, but more details would be
great: for example, how the historical model, root cause isolation, and
cooperation with resource schedulers are organized.
Also, it would be really interesting to get some more information about
using Zenoss HPC on such big systems as RoadRunner!
I'm asking for a lot of things, but if you can provide us with any of
this info, I'd really appreciate it!
Thanks in advance!
--
With regards,
Vadim Voevodin.
Dear Cindy,
Will it be possible to get answers to these questions, or am I just asking for too much? :)
Hi Vadim,
My apologies, I must have missed your response. Let me have one of my folks work with you through e-mail to get your questions answered. I will have him contact you today.
Regards,
Cindy
Dear Cindy,
Thanks a lot for your help!
Hello Vadim,
I'm the new guy, trying to pick things up where Cindy left off. I will try to answer your questions as best I can, and for those questions I am not sure about, I will refer them back to the team for clarification.
We are still using the 2.1.3 version of Zenoss. Upgrading to V4 is our plan. We need to provide a few more updates and some added functionality before we roll a final V2 RPM. I don't have a firm completion date for this, but it is the main focus.
Would a conference call work for you to discuss the details of your questions, such as how the historical model, root cause isolation, and cooperation with resource schedulers are organized?
Regards,
Anthony
Dear Anthony,
Since the new version is not available to us for now, I think the best option would be to try the 2.1.3 version in order to see it in practice. After that I'll ask you specific questions.
Should it work on CentOS 6?
Also, is it possible to get some information about Zenoss HPC usage on RoadRunner? Or is this info not publicly available?
UPD: Our admin tried to install 2.1.3 (got it here: http://dev.zenoss.org/trac/browser/branches/zenoss-2.1.3-hpc/RPMS) on CentOS release 6.3 (Final) and the following errors appeared:
"
Error: Package: zenoss-2.1.3-1.hpc.el5.x86_64 (/zenoss-2.1.3-1.hpc.el5.x86_64)
Requires: libmysqlclient.so.15()(64bit)
Error: Package: zenoss-2.1.3-1.hpc.el5.x86_64 (/zenoss-2.1.3-1.hpc.el5.x86_64)
Requires: libpq.so.4()(64bit)
Error: Package: zenoss-2.1.3-1.hpc.el5.x86_64 (/zenoss-2.1.3-1.hpc.el5.x86_64)
Requires: libgnutls.so.13(GNUTLS_1_3)(64bit)
Error: Package: zenoss-2.1.3-1.hpc.el5.x86_64 (/zenoss-2.1.3-1.hpc.el5.x86_64)
Requires: libmysqlclient.so.15(libmysqlclient_15)(64bit)
Error: Package: zenoss-2.1.3-1.hpc.el5.x86_64 (/zenoss-2.1.3-1.hpc.el5.x86_64)
Requires: libmysqlclient_r.so.15(libmysqlclient_15)(64bit)
Error: Package: zenoss-2.1.3-1.hpc.el5.x86_64 (/zenoss-2.1.3-1.hpc.el5.x86_64)
Requires: libgnutls.so.13()(64bit)
Error: Package: zenoss-2.1.3-1.hpc.el5.x86_64 (/zenoss-2.1.3-1.hpc.el5.x86_64)
Requires: libmysqlclient_r.so.15()(64bit)
"
Maybe you can tell us how this could be fixed?
Hi Vadim,
It looks to me like the rpm is looking for some libraries that have not been installed. Also, the rpm that is posted on the Zenoss Web site was created for installation on RHEL 5 machines. If you wish to install on a different machine, you may need to build your own rpm.
What kind of information about Zenoss HPC usage on Roadrunner are you looking for?
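To see at a glance which libraries are missing, the yum error text quoted above can be reduced to a list of unique library names. The following is only a minimal sketch: the function name `missing_requirements` is made up for illustration, and the sample contains just the `Requires:` lines from the post.

```python
import re

def missing_requirements(yum_output):
    """Return the unique library names from 'Requires:' lines, in first-seen order."""
    seen = []
    for line in yum_output.splitlines():
        # Capture everything up to the first "(", e.g. "libpq.so.4" from "libpq.so.4()(64bit)".
        match = re.match(r"\s*Requires:\s*(\S+?)\(", line)
        if match and match.group(1) not in seen:
            seen.append(match.group(1))
    return seen

# The Requires: lines copied from the error output in the thread.
sample = """\
Requires: libmysqlclient.so.15()(64bit)
Requires: libpq.so.4()(64bit)
Requires: libgnutls.so.13(GNUTLS_1_3)(64bit)
Requires: libmysqlclient.so.15(libmysqlclient_15)(64bit)
Requires: libmysqlclient_r.so.15(libmysqlclient_15)(64bit)
Requires: libgnutls.so.13()(64bit)
Requires: libmysqlclient_r.so.15()(64bit)
"""

missing = missing_requirements(sample)
print(missing)
```

The seven error lines boil down to four distinct shared libraries, all carrying the older version numbers that an el5 package would expect.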
Laura
Hi Laura,
Concerning RoadRunner there are several questions that are interesting for us:
I hope that is not too much to ask.
Thanks in advance!
UPD: We've tried to build our own rpm, but it seems that the spec file is designed for a different directory layout...
The following error occurs:
<...>
Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.P5JpNw
+ umask 022
+ cd /home/vadim/rpmbuild/BUILD
+ cd zenoss-2.1.3
+ LANG=C
+ export LANG
+ unset DISPLAY
+ rm -rf /home/vadim/rpmbuild/BUILDROOT/zenoss-2.1.3-1.hpc.el5.x86_64
+ export ZENHOME=/opt/zenoss
+ ZENHOME=/opt/zenoss
+ export ZOPEPASSWORD=zenoss
+ ZOPEPASSWORD=zenoss
+ export PYTHON=/usr/bin/python
+ PYTHON=/usr/bin/python
+ export DESTDIR=/home/vadim/rpmbuild/BUILDROOT/zenoss-2.1.3-1.hpc.el5.x86_64
+ DESTDIR=/home/vadim/rpmbuild/BUILDROOT/zenoss-2.1.3-1.hpc.el5.x86_64
+ ZEN_BUILD_DIR=/home/vadim/rpmbuild/BUILD/zenoss-2.1.3
+ . /home/vadim/rpmbuild/BUILD/zenoss-2.1.3/build-functions.sh
/var/tmp/rpm-tmp.P5JpNw: line 54: /home/vadim/rpmbuild/BUILD/zenoss-2.1.3/build-functions.sh: No such file or directory
error: Bad exit status from /var/tmp/rpm-tmp.P5JpNw (%build)
The needed build-functions.sh is located in a different directory (/home/vadim/rpmbuild/BUILD/zenoss-2.1.3/inst/build-functions.sh).
We've tried to change the spec file a bit, but then other errors occur...
Could you tell us what should be changed in this file (or maybe some preliminary settings should be made)?
Hi Vadim,
I will try to answer your questions concerning Roadrunner and see if I can get one of our sys admins to look at your rpm errors.
..........
What types of data are collected and processed by Zenoss HPC and how frequently? For example, CPU usage - once per minute, nodes network activity - every 10 seconds, room temperature - once per 5 minutes etc.
We collect ping up/ping down, cpu usage (at the node level) using our scheduler, network data from syslog, cpu temperature, correctable and uncorrectable memory errors from syslog.
..........
How many instances of Zenoss HPC are needed for monitoring RoadRunner? What hardware is used for that purpose?
Because of the size of the Roadrunner cluster, we have built a hierarchy of monitoring nodes. We have 9 at the bottom level, each of which monitors either one or two connected units (CUs). Then we have a cluster monitoring box which gathers filtered data from the CU monitoring boxes. Finally, we have a top-level box which gathers further filtered data from all the clusters.
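The three-level hierarchy described above can be pictured with a small sketch. Everything in it is an assumption made for illustration: the `Monitor` class, the numeric severity field, and the forwarding thresholds are hypothetical, since the post only says that each level passes "filtered" data upward and does not say how the filtering is done.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Monitor:
    name: str
    # Hypothetical filter: only events at or above this severity are forwarded upward.
    forward_min_severity: int = 0
    parent: Optional["Monitor"] = None
    received: List[dict] = field(default_factory=list)

    def handle(self, event):
        """Store the event locally, then forward it to the parent if it passes the filter."""
        self.received.append(event)
        if self.parent is not None and event["severity"] >= self.forward_min_severity:
            self.parent.handle(event)

# Top-level box, one cluster box, and nine CU-level boxes, as in the post.
top = Monitor("top")
cluster = Monitor("cluster", forward_min_severity=5, parent=top)
cu_monitors = [Monitor(f"cu-{i}", forward_min_severity=3, parent=cluster) for i in range(1, 10)]

for severity in (2, 4, 5):
    cu_monitors[0].handle({"node": "n001", "severity": severity})

print(len(cu_monitors[0].received), len(cluster.received), len(top.received))
```

Each level sees fewer events than the one below it, which is the point of the layout: the top-level box only has to deal with what the lower levels considered worth passing on.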
..........
As far as we understand, Zenoss HPC uses a set of rules for issue tracking, and at certain points (a new sensor value arrives, a timer fires) these rules are checked. How many such rules are you using for RoadRunner, and how often are they checked?
Mostly, the filters are applied to the event as it comes in. Sometimes immediate action is taken, such as informing operators and on-call staff of the problem. Other times we wait until a threshold is met, such as the count of correctable memory errors received within a week. In the latter case, when we reach the threshold, we can then inform the appropriate people and/or set a node to be drained automatically.
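The threshold case can be sketched as a per-node sliding window. This is not the actual Zenoss HPC rule, just an illustration of the idea: the class name `CorrectableErrorTracker` and the threshold of 3 are hypothetical, only the one-week window comes from the post, and events are assumed to arrive in time order.

```python
from collections import defaultdict, deque
from datetime import datetime, timedelta

WINDOW = timedelta(days=7)  # "within a week", from the post
THRESHOLD = 3               # hypothetical count; the real value is not given

class CorrectableErrorTracker:
    def __init__(self, threshold=THRESHOLD, window=WINDOW):
        self.threshold = threshold
        self.window = window
        self._events = defaultdict(deque)  # node name -> timestamps inside the window

    def record(self, node, when):
        """Record one correctable error; return True if the node has reached the threshold."""
        times = self._events[node]
        times.append(when)
        # Drop timestamps that have fallen out of the window.
        while times and when - times[0] > self.window:
            times.popleft()
        return len(times) >= self.threshold

tracker = CorrectableErrorTracker()
start = datetime(2013, 1, 1)
results = [tracker.record("n001", start + timedelta(days=d)) for d in (0, 1, 2, 12)]
print(results)
```

A `True` result is where a real system would notify people or mark the node for draining. The last event returns `False` again because the three earlier errors are more than a week old by then.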
..........
Do you consider Zenoss HPC as sufficient for RoadRunner? Do you use any other monitoring systems?
Yes.
Laura
First of all, thank you for the answers!
"I will try to answer your questions concerning Roadrunner and see if I can get one of our sys admins to look at your rpm errors."
Any help with the rpm issue would be great!
"We collect ping up/ping down, cpu usage (at the node level) using our scheduler, network data from syslog, cpu temperature, correctable and uncorrectable memory errors from syslog."
And how often do you collect such data? Once per second?
Thanks again!
Vadim
Hi Vadim,
Can you paste a copy of what you have in your .rpmmacros file?
Sam
Vadim,
Also, what is your BuildRoot path in the spec file?
Ours is:
BuildRoot: %{_tmppath}/%{name}-%{version}-%{release}-root
When the rpm is being built it sets the DESTDIR to
+ export DESTDIR=/var/tmp/zenoss-2.1.3-30.lanl.el5-root
DESTDIR=/var/tmp/zenoss-2.1.3-30.lanl.el5-root
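To make the relationship between the BuildRoot line and the resulting DESTDIR concrete, here is a small sketch that expands the `%{...}` references by simple substitution. It is not how rpm itself evaluates macros (rpm supports far more syntax); the helper `expand` is made up for illustration, and the macro values are inferred from the DESTDIR shown above.

```python
import re

def expand(template, macros):
    """Expand %{name} references; unknown macros are left untouched."""
    return re.sub(r"%\{(\w+)\}", lambda m: macros.get(m.group(1), m.group(0)), template)

# Values inferred from the DESTDIR line in the post.
macros = {
    "_tmppath": "/var/tmp",
    "name": "zenoss",
    "version": "2.1.3",
    "release": "30.lanl.el5",
}

build_root = expand("%{_tmppath}/%{name}-%{version}-%{release}-root", macros)
print(build_root)
```

On a real system, `rpm --eval '%{_tmppath}'` prints the value rpm actually uses, which is a quick way to check what a `.rpmmacros` file resolves to.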
From your output it looks like the BuildRoot has been changed but the _tmppath has not:
+ . /home/vadim/rpmbuild/BUILD/zenoss-2.1.3/build-functions.sh
/var/tmp/rpm-tmp.P5JpNw: line 54: /home/vadim/rpmbuild/BUILD/zenoss-2.1.3/build-functions.sh: No such file or directory
You will need to change the _tmppath in your .rpmmacros. Also, are you setting your RPM_BUILD_DIR variable?
Here is a good reference for setting up your .rpmmacros:
http://pmc.ucsc.edu/~dmk/notes/RPMs/Creating_RPMs.html
Please let me know if you have any questions.
Sam
Dear Sam,
Sorry for the late response, I couldn't answer sooner...
Answering your questions:
1. Here is a copy of the .rpmmacros file:
# cat ~/.rpmmacros
%make_build:
make
%_initdir:
/etc/init.d
%_tmppath:
/tmp
%_topdir %(echo $HOME)/rpmbuild
2. The BuildRoot path in the spec file is the same.
And I believe we haven't changed anything else in the spec file.
Could it be that the path to build-functions.sh is set incorrectly in the spec?
I'm asking because we are getting this error:
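As a side check on the pasted .rpmmacros, the sketch below flags lines that do not have the usual one-line `%name body` form of an rpm macro file. It assumes the file is literally as pasted; the forum may have reflowed the lines, in which case the result means nothing. The function name `suspicious_lines` is made up, and multi-line macros with backslash continuations are not handled.

```python
import re

# A macro definition: "%name", optional "(opts)", whitespace, then a non-empty body.
MACRO_LINE = re.compile(r"^%(\w+)(\([^)]*\))?\s+\S")

def suspicious_lines(text):
    """Return (line_number, line) pairs that are not blank, not comments, and not '%name body'."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue
        if not MACRO_LINE.match(stripped):
            problems.append((number, stripped))
    return problems

pasted = """%make_build:
make
%_initdir:
/etc/init.d
%_tmppath:
/tmp
%_topdir %(echo $HOME)/rpmbuild
"""

for number, line in suspicious_lines(pasted):
    print(number, line)
```

Only the `%_topdir` line passes. The odd `/:/rpm-tmp...` prefix in the error output might be related to how `%_tmppath` is written, though that is a guess rather than a confirmed diagnosis.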
"
+ . /root/rpmbuild/BUILD/zenoss-2.1.3/build-functions.sh
/:/rpm-tmp.mNfwGy: line 54: /root/rpmbuild/BUILD/zenoss-2.1.3/build-functions.sh: No such file or directory
error: Bad exit status from /:/rpm-tmp.mNfwGy (%build)
"
but as we can see, build-functions.sh is located not in the zenoss-2.1.3/ directory but in zenoss-2.1.3/inst/:
# grep build-functions log
-rw-r--r-- root/root 11025 2007-07-06 19:35 zenoss-2.1.3/inst/build-functions.sh
It seems that during the build we get to the right directory, but the needed build-functions.sh script is somehow one level deeper.
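The same check as the `grep` above can be done directly against a source archive. The sketch below builds a tiny in-memory tar file that mimics the reported layout (the real tarball is not reproduced here) and compares the member path with the path the %build script tries to source. The helper `find_member` is made up for illustration.

```python
import io
import posixpath
import tarfile

def find_member(archive, filename):
    """Return the paths of archive members whose base name equals filename."""
    return [m.name for m in archive.getmembers() if posixpath.basename(m.name) == filename]

# Stand-in archive with the single file at the location reported in the thread.
buffer = io.BytesIO()
with tarfile.open(fileobj=buffer, mode="w") as tar:
    payload = b"# placeholder\n"
    info = tarfile.TarInfo("zenoss-2.1.3/inst/build-functions.sh")
    info.size = len(payload)
    tar.addfile(info, io.BytesIO(payload))
buffer.seek(0)

with tarfile.open(fileobj=buffer, mode="r") as tar:
    matches = find_member(tar, "build-functions.sh")

# Path relative to BUILD/ that the %build script sources, per the error message.
expected = "zenoss-2.1.3/build-functions.sh"
print(matches)
print(expected in matches)
```

With the layout from the thread, the script is found only under `inst/`, so the path the spec file expects does not exist in the unpacked tree.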