Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: nsp: juniper

SRX240H Cluster & SNMP

 

 

nsp juniper RSS feed   Index | Next | Previous | View Threaded


eric at atlantech

Aug 20, 2012, 8:51 AM

Post #1 of 5 (539 views)
Permalink
SRX240H Cluster & SNMP

All,
Is there a version above 11.2 where SNMP works properly in a cluster? Seems that when running various versions (11.2R7.4 and 11.4R4.4, so far) on a 240H cluster, SNMP doesn't work properly and starts spitting out 'noSuchObject' errors on perfectly valid queries like when querying the interfaces MIB. I should also mention that the OIDs it seems to have a problem with are primarily ones that have to do with the backup chassis in redundancy-group 0 (ge-5/0/0 through ge-5/0/15). JTAC has thus far been unsuccessful at assisting me.

I have downgraded to 10.4R10.7 on a non-production cluster and it's working successfully, but I really want to take advantage of the global address book. I can certainly live without it, but it does make things much easier.

Thanks in advance,
evt

_______________________________________________
juniper-nsp mailing list juniper-nsp [at] puck
https://puck.nether.net/mailman/listinfo/juniper-nsp


wayne at tuckerlabs

Aug 20, 2012, 11:56 AM

Post #2 of 5 (517 views)
Permalink
Re: SRX240H Cluster & SNMP [In reply to]

I have a couple of SRX240H clusters running 11.2R6. I also have an
SRX650 cluster running 11.2S6. I don't see anything in my logs to
indicate that I'm getting errors and none of my graphs show signs of
failed polls.

I doubt it matters, but I'm polling the devices through their loopback
interfaces. I also filter out some of the interfaces and filter
duplicates:

> show configuration snmp | display inheritance | except ##
filter-interfaces {
interfaces {
fxp2;
gre;
ipip;
lo0.16384;
lo0.16385;
lo0.32768;
lsi;
mtun;
pimd;
pime;
tap;
}
}
filter-duplicates;
[snip]

Does it seem to happen the most when there are lots of queries going
through? Are you doing row-based or column-based queries (one
interface at a time or the same counter across several interfaces)?
The former is supposed to perform better (so, for instance, an
snmpwalk is fairly processor intensive).

Any signs of trouble on your control or fabric interfaces?

Has JTAC already had you enable tracing for SNMP?

:w



On Mon, Aug 20, 2012 at 8:51 AM, Eric Van Tol <eric [at] atlantech> wrote:
> All,
> Is there a version above 11.2 where SNMP works properly in a cluster? Seems that when running various versions (11.2R7.4 and 11.4R4.4, so far) on a 240H cluster, SNMP doesn't work properly and starts spitting out 'noSuchObject' errors on perfectly valid queries like when querying the interfaces MIB. I should also mention that the OIDs it seems to have a problem with are primarily ones that have to do with the backup chassis in redundancy-group 0 (ge-5/0/0 through ge-5/0/15). JTAC has thus far been unsuccessful at assisting me.
>
> I have downgraded to 10.4R10.7 on a non-production cluster and it's working successfully, but I really want to take advantage of the global address book. I can certainly live without it, but it does make things much easier.
>
> Thanks in advance,
> evt
>
> _______________________________________________
> juniper-nsp mailing list juniper-nsp [at] puck
> https://puck.nether.net/mailman/listinfo/juniper-nsp

_______________________________________________
juniper-nsp mailing list juniper-nsp [at] puck
https://puck.nether.net/mailman/listinfo/juniper-nsp


eric at atlantech

Aug 20, 2012, 12:22 PM

Post #3 of 5 (527 views)
Permalink
Re: SRX240H Cluster & SNMP [In reply to]

Hi Wayne,
Answers inline.

> I doubt it matters, but I'm polling the devices through their
> loopback
> interfaces. I also filter out some of the interfaces and filter
> duplicates:

I do the same thing. Just for the Hell of it, I tried to poll through the fxp0 port, but the same thing happens.

> Does it seem to happen the most when there are lots of queries going
> through?

The issue is really just trying add the device to my NMS. The NMS sends out Get requests for all the interfaces to add them into its database. I have no problems doing this for a 3600 cluster or really any other Juniper devices.

> Any signs of trouble on your control or fabric interfaces?

Not that I can tell. No errors or drops.

> Has JTAC already had you enable tracing for SNMP?

They made me get a capture of the queries, which I sent to them, but because the SRX was sending get-response packets back, that seemed to indicate to the JTAC engineer that there was no problem. What he didn't do was actually look at the responses where the SRX is sending 'noSuchObject' back for valid interface objects. Performing a 'show snmp mib walk <oid>' for one of the OIDs for which a 'noSuchObject' was sent elicits an incredibly slow response time from the CLI with an eventual output of the information contained within that OID.

Maybe I'll try 11.2R6 and see if that version works. The SRX3600 cluster is running 11.2R7.4 and I'm not seeing the same problems. It's specifically related to the SRX240, from what I can tell, as both the production cluster and the lab cluster exhibit the same behavior.

-evt

> :w
>
>
>
> On Mon, Aug 20, 2012 at 8:51 AM, Eric Van Tol <eric [at] atlantech>
> wrote:
> > All,
> > Is there a version above 11.2 where SNMP works properly in a
> cluster? Seems that when running various versions (11.2R7.4 and
> 11.4R4.4, so far) on a 240H cluster, SNMP doesn't work properly and
> starts spitting out 'noSuchObject' errors on perfectly valid queries
> like when querying the interfaces MIB. I should also mention that
> the OIDs it seems to have a problem with are primarily ones that have
> to do with the backup chassis in redundancy-group 0 (ge-5/0/0 through
> ge-5/0/15). JTAC has thus far been unsuccessful at assisting me.
> >
> > I have downgraded to 10.4R10.7 on a non-production cluster and it's
> working successfully, but I really want to take advantage of the
> global address book. I can certainly live without it, but it does
> make things much easier.
> >
> > Thanks in advance,
> > evt
> >
> > _______________________________________________
> > juniper-nsp mailing list juniper-nsp [at] puck
> > https://puck.nether.net/mailman/listinfo/juniper-nsp

_______________________________________________
juniper-nsp mailing list juniper-nsp [at] puck
https://puck.nether.net/mailman/listinfo/juniper-nsp


mmk at one

Aug 20, 2012, 3:59 PM

Post #4 of 5 (520 views)
Permalink
Re: SRX240H Cluster & SNMP [In reply to]

Hi Eric,

I had the same issue on my srx240 cluster and a friendly soul found PR800735 for me that mentioned a workaround by doing "set snmp filter-interfaces interfaces gr-0/0/0" that made my Observium instance able to poll the cluster without timeouts.

/Mikkel

On Aug 20, 2012, at 21:22 , Eric Van Tol <eric [at] atlantech> wrote:

> Hi Wayne,
> Answers inline.
>
>> I doubt it matters, but I'm polling the devices through their
>> loopback
>> interfaces. I also filter out some of the interfaces and filter
>> duplicates:
>
> I do the same thing. Just for the Hell of it, I tried to poll through the fxp0 port, but the same thing happens.
>
>> Does it seem to happen the most when there are lots of queries going
>> through?
>
> The issue is really just trying add the device to my NMS. The NMS sends out Get requests for all the interfaces to add them into its database. I have no problems doing this for a 3600 cluster or really any other Juniper devices.
>
>> Any signs of trouble on your control or fabric interfaces?
>
> Not that I can tell. No errors or drops.
>
>> Has JTAC already had you enable tracing for SNMP?
>
> They made me get a capture of the queries, which I sent to them, but because the SRX was sending get-response packets back, that seemed to indicate to the JTAC engineer that there was no problem. What he didn't do was actually look at the responses where the SRX is sending 'noSuchObject' back for valid interface objects. Performing a 'show snmp mib walk <oid>' for one of the OIDs for which a 'noSuchObject' was sent elicits an incredibly slow response time from the CLI with an eventual output of the information contained within that OID.
>
> Maybe I'll try 11.2R6 and see if that version works. The SRX3600 cluster is running 11.2R7.4 and I'm not seeing the same problems. It's specifically related to the SRX240, from what I can tell, as both the production cluster and the lab cluster exhibit the same behavior.
>
> -evt
>
>> :w
>>
>>
>>
>> On Mon, Aug 20, 2012 at 8:51 AM, Eric Van Tol <eric [at] atlantech>
>> wrote:
>>> All,
>>> Is there a version above 11.2 where SNMP works properly in a
>> cluster? Seems that when running various versions (11.2R7.4 and
>> 11.4R4.4, so far) on a 240H cluster, SNMP doesn't work properly and
>> starts spitting out 'noSuchObject' errors on perfectly valid queries
>> like when querying the interfaces MIB. I should also mention that
>> the OIDs it seems to have a problem with are primarily ones that have
>> to do with the backup chassis in redundancy-group 0 (ge-5/0/0 through
>> ge-5/0/15). JTAC has thus far been unsuccessful at assisting me.
>>>
>>> I have downgraded to 10.4R10.7 on a non-production cluster and it's
>> working successfully, but I really want to take advantage of the
>> global address book. I can certainly live without it, but it does
>> make things much easier.
>>>
>>> Thanks in advance,
>>> evt
>>>
>>> _______________________________________________
>>> juniper-nsp mailing list juniper-nsp [at] puck
>>> https://puck.nether.net/mailman/listinfo/juniper-nsp
>
> _______________________________________________
> juniper-nsp mailing list juniper-nsp [at] puck
> https://puck.nether.net/mailman/listinfo/juniper-nsp


_______________________________________________
juniper-nsp mailing list juniper-nsp [at] puck
https://puck.nether.net/mailman/listinfo/juniper-nsp


eric at atlantech

Aug 21, 2012, 2:25 AM

Post #5 of 5 (518 views)
Permalink
Re: SRX240H Cluster & SNMP [In reply to]

> -----Original Message-----
> From: Mikkel Mondrup Kristensen [mailto:mmk [at] one]
> Sent: Monday, August 20, 2012 7:00 PM
> To: Eric Van Tol
> Cc: juniper-nsp [at] puck
> Subject: Re: [j-nsp] SRX240H Cluster & SNMP
>
> Hi Eric,
>
> I had the same issue on my srx240 cluster and a friendly soul found
> PR800735 for me that mentioned a workaround by doing "set snmp
> filter-interfaces interfaces gr-0/0/0" that made my Observium
> instance able to poll the cluster without timeouts.
>
> /Mikkel

Mikkel,
You are my hero. That worked, thanks!

-evt

_______________________________________________
juniper-nsp mailing list juniper-nsp [at] puck
https://puck.nether.net/mailman/listinfo/juniper-nsp

nsp juniper RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.