Archived community.zenoss.org | full text search
Skip navigation
Currently Being Moderated

Dev chat 05/27/2010

VERSION 1 
Created on: Jun 8, 2010 5:33 PM by Matt Ray - Last Modified:  Jun 8, 2010 5:34 PM by Matt Ray

[27-May-2010 10:00:32] <mray> There are a couple of Zenoss developers here today, nzook is on the hook, but he's pretty new, so ask away

[27-May-2010 10:05:34] <Filbert> I'm seeing an issue where various component services are failing to start

[27-May-2010 10:05:44] <Filbert> they're failing with errors that seem to reference non-existant devices

[27-May-2010 10:06:13] <Filbert> I've tried a reindex, increased the zenhub connection timeouts for the most frequent offenders (I was getting timeout issues first)

[27-May-2010 10:06:23] <Filbert> but services still seem to fail with a big traceback in the logs

[27-May-2010 10:06:31] <Filbert> anyone got any advice on how I can go about debugging this further?

[27-May-2010 10:09:39] <mray> could you post the traceback on pastebin.org?

[27-May-2010 10:09:46] <Filbert> sure,one mo

[27-May-2010 10:10:27] <Filbert> http://pastebin.org/285881

[27-May-2010 10:10:53] <Filbert> ldnbh1gxendb02 is a decomissioned server

[27-May-2010 10:11:05] <Filbert> so should have been removed

[27-May-2010 10:11:09] <Filbert> I also can't find it anywhere in the webui

[27-May-2010 10:11:34] <chemist> tips for speeding up web ui anyone?

[27-May-2010 10:12:18] <rocket> chemist: what zenoss version?

[27-May-2010 10:12:35] <chemist> rocket: 2.5.2

[27-May-2010 10:13:05] <rocket> there are a few performance patches you should apply

[27-May-2010 10:13:15] pmcguire is now known as ptmcg

[27-May-2010 10:13:47] <mray> chemist: this seems to have a few good tips: docs/DOC-2521

[27-May-2010 10:14:14] <nzook> Filbert:  Have you looked for the server in dmd?

[27-May-2010 10:14:44] <rocket> @chemist: zenpatch 18589;zenpatch 18890;zenpatch 18908;zenpatch 18941

[27-May-2010 10:15:26] <Filbert> nzook: nope, is there a useful guide on how to do that?

[27-May-2010 10:15:49] <rocket> These patches were found with the following trac query.

[27-May-2010 10:15:51] <rocket> http://dev.zenoss.com/trac/query?status=closed&status=verification&group=patch_state&patch=2.5.3&col=id&col=summary&col=status&col=owner&col=priority&col=milestone&col=component&col=changetime&report=6&order=status

[27-May-2010 10:16:17] fOrsberg is now known as forsberg

[27-May-2010 10:18:00] <nzook> Filbert:  from the command line, type "zendmd".  This will get you a command prompt (it may take a bit).  Do find("ldnbh1gxendb02").

[27-May-2010 10:18:39] <Filbert> nothing returned...

[27-May-2010 10:19:14] <rocket> are you sure that was the managed name?

[27-May-2010 10:19:36] <Filbert> nope :(

[27-May-2010 10:19:44] <Filbert> it's not my original installation I'm afraid :(

[27-May-2010 10:19:49] <Filbert> lemme try by it's IP

[27-May-2010 10:19:54] * chemist applies patches

[27-May-2010 10:20:11] <Filbert> nothing returned for the IP either

[27-May-2010 10:20:23] <Filbert> or FQDN

[27-May-2010 10:20:28] <Filbert> it'd have been one of those

[27-May-2010 10:20:29] <rocket> Filbert: did you run zencheckrels to make sure your relationships are fixed?

[27-May-2010 10:20:40] <Filbert> nope

[27-May-2010 10:20:56] <Filbert> how do I do that?

[27-May-2010 10:21:09] <Filbert> oh, its' a comment

[27-May-2010 10:21:11] <Filbert> er, command

[27-May-2010 10:21:13] <rocket> Filbert: also make sure you are running 2.5.2 if you haven't upgraded .. earlier versions have problems with broken relationships

[27-May-2010 10:21:21] <Filbert> ah, we're on 2.5.1

[27-May-2010 10:21:28] <rocket> get to 2.5.2 asap

[27-May-2010 10:21:38] <rocket> 2.5.1 will eat your lunch....

[27-May-2010 10:21:52] <Filbert> lol

[27-May-2010 10:22:02] <Filbert> ok, was planning ona n upgrade earlier

[27-May-2010 10:22:14] <Filbert> but everything is such a mess here, I was hoping I could put it off for a few weeks ;)

[27-May-2010 10:22:50] <Filbert> zenchkrels running now

[27-May-2010 10:24:50] <Filbert> it seems to be fixing some stuff

[27-May-2010 10:31:42] * Filbert taps his fingers

[27-May-2010 10:31:47] <Filbert> not a very quick process... :/

[27-May-2010 10:31:53] <Filbert> although that might be the size of our ZenOss install

[27-May-2010 10:39:49] <rocket> unfortunately it may crap out if other things access the db at the same time

[27-May-2010 10:42:58] <Simon4> it only commits at the end?

[27-May-2010 10:43:21] <Simon4> or does it commit after each rel fix?

[27-May-2010 10:44:09] <rocket> cant remember .. looking...

[27-May-2010 10:45:49] <rocket> just at the end .. but it could be changed pretty easily

[27-May-2010 10:46:01] <rocket> Products/ZenUtils/CheckRelations.py

[27-May-2010 10:46:23] <Simon4> cool

[27-May-2010 10:46:48] <rocket> it does set savepoints though .. so it should resume ..

[27-May-2010 10:47:16] <rocket> Ideally you do this on a system where nothing is happening though .. no modelling all other daemons are down etc

[27-May-2010 10:47:25] <rocket> but it can take a long time

[27-May-2010 10:47:46] <Simon4> yeah, considering doing it on a dedicated spare box

[27-May-2010 10:47:57] <Simon4> jsut after zeoctl start

[27-May-2010 10:48:02] <mray> rocket: is zenchkrels documented anywhere?

[27-May-2010 10:48:32] <rocket> mray: I am not sure if it is .. ( from our perspective we dont use it too much as it is pretty slow )

[27-May-2010 10:48:58] <rocket> mray: but it is important to anyone who was running 2.5.1 to run it to fix their corrupt zodb

[27-May-2010 10:48:59] <mray> rocket: I'm gonna open a ticket for it

[27-May-2010 10:49:16] <Simon4> rocket: so suggestion is, run zenchkrels, then upgrade to 2.5.2?

[27-May-2010 10:49:20] <rocket> mray: we need to improve it ..

[27-May-2010 10:49:24] <rocket> Simon4: yes

[27-May-2010 10:49:27] <Simon4> before restarting any other daemons etc

[27-May-2010 10:49:33] <Simon4> awesome, will try early next week

[27-May-2010 10:49:43] * Simon4 would love to not have to repair relations every second breath

[27-May-2010 10:49:45] <mray> rocket: I didn't even know what it did

[27-May-2010 10:50:04] <rocket> mray: it checks zope relations and tries to repair them

[27-May-2010 10:50:31] <rocket> mray: eg there is a flaw in 2.5.1 where half of the relation could be deleted and the other half exists

[27-May-2010 10:50:45] <rocket> in certain instances

[27-May-2010 10:50:56] <rocket> specifically if making changes from zendmd in 2.5.1

[27-May-2010 10:51:29] <rocket> then when you try and follow certain relationships and it crosschecks you get tracebacks etc

[27-May-2010 10:52:00] <rocket> internally for enterprise customers we are a bit more surgical about fixing these relationships and have an adhoc tool to do so

[27-May-2010 10:52:28] <rocket> but we *always* recommend getting off 2.5.0/1 and moving to 2.5.2

[27-May-2010 10:52:43] <Filbert> looks like I know what I'm doing tomorrow then :)

[27-May-2010 10:53:24] <mray> rocket: feel free to spice up http://dev.zenoss.org/trac/ticket/6809

[27-May-2010 10:53:27] <rocket> 2.5.2 specifically fixes the issues as far as we know that the zodb can get into an inconsistent state

[27-May-2010 10:58:36] <theacolyte> Oh, hello

[27-May-2010 10:59:14] <theacolyte> Are there any plans of building WMI into core?

[27-May-2010 10:59:15] * Simon4 plots a 2.5.2 upgrade sooner than going to enterprise after reading those comments

[27-May-2010 11:00:18] <rocket> Simon4: have a support contract?

[27-May-2010 11:00:29] <mray> theacolyte: WMI is already in Core, that's how Windows event log and services are monitored

[27-May-2010 11:00:33] <Simon4> rocket: oui

[27-May-2010 11:00:42] <mray> theacolyte: or do you mean the WMI Data Source?

[27-May-2010 11:00:57] <Simon4> yes, even

[27-May-2010 11:01:19] <rocket> Simon4: let me look for a KB article for a tool that you should run instead

[27-May-2010 11:01:24] <Simon4> rocket: thanks

[27-May-2010 11:01:25] <theacolyte> mray: Ah, I thought event logs were provided by egor's plugin

[27-May-2010 11:01:38] <theacolyte> But regardless, yes the WMI datasource

[27-May-2010 11:01:53] <mray> theacolyte: nothing solid yet, but there have been discussions

[27-May-2010 11:02:13] <rocket> Simon4: "Using ZenFixIt to Correct Common Zenoss Issues"

[27-May-2010 11:02:40] * Simon4 goes hunting

[27-May-2010 11:02:43] <theacolyte> Hyperic really did it right for WMI I think - the counters are all in there, you just select them

[27-May-2010 11:02:45] <rocket> Simon4: thats a more surgical repair .. but may not catch everything

[27-May-2010 11:02:51] <theacolyte> No need to look up classes to remember where it's located

[27-May-2010 11:03:01] <Simon4> k, i"ll try it out on a copy of the db early next week

[27-May-2010 11:03:05] <rocket> zenchkrels is really intended to be the broad stroke brush

[27-May-2010 11:04:52] <rocket> mray: added some notes to that ticket for you

[27-May-2010 11:10:40] <mray> New FDS 389 LDAP Server ZenPack: docs/DOC-5889

Comments (0)