Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Users

ssh stonith

 

 

Linux-HA users RSS feed   Index | Next | Previous | View Threaded


rode at b1-systems

Nov 20, 2009, 5:52 PM

Post #1 of 4 (866 views)
Permalink
ssh stonith

Hi all,

i have a little problem with my ssh-stonith-device (yes i know that is
only for testing...)

To simluate a failure, i kill the openais daemon (and all the rest) on
node2. Then node1 is trying to reset node2, but unfortunately
unsuccessful :(

node2:~ # rpm -qa | grep -E "pace|openais"
libpacemaker3-1.0.3-4.1
pacemaker-1.0.3-4.1
libopenais2-0.80.3-26.1
openais-0.80.3-26.1

on a sles11 with hae.

Here is the entry from log for stonith:

Nov 21 02:35:35 node1 stonithd: [3952]: info: client tengine [pid: 3957]
requests a STONITH operation RESET on node node2
Nov 21 02:35:35 node1 stonithd: [3952]: info: we can't manage node2,
broadcast request to other nodes
Nov 21 02:35:35 node1 stonithd: [3952]: info: Broadcasting the message
succeeded: require others to stonith node node2.

Configuration:

# crm configure show
[...]
primitive st-ssh stonith:external/ssh \
params hostlist="node1-intern node2-intern"
clone fencing st-ssh
property $id="cib-bootstrap-options" \
dc-version="1.0.3-0080ec086ae9c20ad5c4c3562000c0ad68374f0a" \
expected-quorum-votes="2" \
stonith-enabled="true" \
no-quorum-policy="ignore" \
last-lrm-refresh="1258769996"


When i'm doing this on the console with

stonith -t ssh -p "node1-intern node2-intern" -T reset node2-intern

all works fine! So i hope someone can help me.

Thanks!

Michel
Attachments: signature.asc (0.19 KB)


andrew at beekhof

Nov 20, 2009, 11:28 PM

Post #2 of 4 (818 views)
Permalink
Re: ssh stonith [In reply to]

On Sat, Nov 21, 2009 at 2:52 AM, Michel Rode <rode [at] b1-systems> wrote:
> Hi all,
>
> i have a little problem with my ssh-stonith-device (yes i know that is
> only for testing...)
>
> To simluate a failure, i kill the openais daemon (and all the rest) on
> node2. Then node1 is trying to reset node2, but unfortunately
> unsuccessful :(


make sure atd is installed and started at boot.
also make sure that the nodes can ssh to one-another _without_ a
password (ie. a public key that requires no password).

>
> node2:~ # rpm -qa | grep -E "pace|openais"
> libpacemaker3-1.0.3-4.1
> pacemaker-1.0.3-4.1
> libopenais2-0.80.3-26.1
> openais-0.80.3-26.1
>
> on a sles11 with hae.
>
> Here is the entry from log for stonith:
>
> Nov 21 02:35:35 node1 stonithd: [3952]: info: client tengine [pid: 3957]
> requests a STONITH operation RESET on node node2
> Nov 21 02:35:35 node1 stonithd: [3952]: info: we can't manage node2,
> broadcast request to other nodes
> Nov 21 02:35:35 node1 stonithd: [3952]: info: Broadcasting the message
> succeeded: require others to stonith node node2.
>
> Configuration:
>
> # crm configure show
> [...]
> primitive st-ssh stonith:external/ssh \
>        params hostlist="node1-intern node2-intern"
> clone fencing st-ssh
> property $id="cib-bootstrap-options" \
>        dc-version="1.0.3-0080ec086ae9c20ad5c4c3562000c0ad68374f0a" \
>        expected-quorum-votes="2" \
>        stonith-enabled="true" \
>        no-quorum-policy="ignore" \
>        last-lrm-refresh="1258769996"
>
>
> When i'm doing this on the console with
>
> stonith -t ssh -p "node1-intern node2-intern" -T reset node2-intern
>
> all works fine! So i hope someone can help me.
>
> Thanks!
>
> Michel
>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


rode at b1-systems

Nov 21, 2009, 2:47 AM

Post #3 of 4 (822 views)
Permalink
Re: ssh stonith [In reply to]

Hey,

[...]
>
> make sure atd is installed and started at boot.
> also make sure that the nodes can ssh to one-another _without_ a
> password (ie. a public key that requires no password).
>
[...]
>> When i'm doing this on the console with
>>
>> stonith -t ssh -p "node1-intern node2-intern" -T reset node2-intern
>>
>> all works fine! So i hope someone can help me.

So i think that is okay ;)

Michel
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


lmb at suse

Nov 23, 2009, 4:39 AM

Post #4 of 4 (802 views)
Permalink
Re: ssh stonith [In reply to]

On 2009-11-21T02:52:06, Michel Rode <rode [at] b1-systems> wrote:

> node2:~ # rpm -qa | grep -E "pace|openais"
^^^^^

> Here is the entry from log for stonith:
>
> Nov 21 02:35:35 node1 stonithd: [3952]: info: client tengine [pid: 3957]
> requests a STONITH operation RESET on node node2
^^^^^^^
> Nov 21 02:35:35 node1 stonithd: [3952]: info: we can't manage node2,
> broadcast request to other nodes
> Nov 21 02:35:35 node1 stonithd: [3952]: info: Broadcasting the message
> succeeded: require others to stonith node node2.
>
> Configuration:
>
> # crm configure show
> [...]
> primitive st-ssh stonith:external/ssh \
> params hostlist="node1-intern node2-intern"
^^^^^^^^^^^^

The hostname you specified in the hostlist doesn't match the hostnames
the cluster actually uses, so when it goes to look them up, it doesn't
find them.


> When i'm doing this on the console with
>
> stonith -t ssh -p "node1-intern node2-intern" -T reset node2-intern
^^^^^^^^^^^^^^
>
> all works fine! So i hope someone can help me.

Unsurprising, since here, hostlist includes the node you actually want
to reset.


Regards,
Lars

--
Architect Storage/HA, OPS Engineering, Novell, Inc.
SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Linux-HA users RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.