Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Users

Help required to develop OCF Resource Agent Script for Master-Slave script

 

 

Linux-HA users RSS feed   Index | Next | Previous | View Threaded


radhamadhavi at huawei

Nov 22, 2009, 9:15 PM

Post #1 of 9 (1314 views)
Permalink
Help required to develop OCF Resource Agent Script for Master-Slave script

I have developed a OCF resource agent script for the master/slave
application. While testing the RA, i have faced following problems.

First, it is not able to promote once instance to the master, after i have
called
"crm_master -v 100 -l reboot" utility command in the notify function of
post-start branch, so in ha-log, there are following log information

tengine[4803]: 2009/11/19_15:21:43 info: extract_event: Aborting on
transient_attributes changes for "node of uuid"
tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
priority upgraded to 1000000
tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort action
0 superceeded by 2


Second issue, crm doesnot check the status of the instance on each node, so
that when i try to kill one instance on any node, crm_mon display is still
"started" for this instance on this node. In other words, crm can not call
the monitor function in the interval time specified in the cib.xml.

Please help me to resolve this problem. Thanks alot!!!!!
--
View this message in context: http://old.nabble.com/Help-required-to-develop-OCF-Resource-Agent-Script-for-Master-Slave-script-tp26472940p26472940.html
Sent from the Linux-HA mailing list archive at Nabble.com.

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


andrew at beekhof

Nov 23, 2009, 4:11 AM

Post #2 of 9 (1270 views)
Permalink
Re: Help required to develop OCF Resource Agent Script for Master-Slave script [In reply to]

On Mon, Nov 23, 2009 at 6:15 AM, Jessy <radhamadhavi [at] huawei> wrote:
>
> I have developed a OCF resource agent script for the master/slave
> application. While testing the RA, i have faced following problems.
>
> First, it is not able to promote once instance to the master, after i have
> called
> "crm_master -v 100 -l reboot" utility command in the notify function of
> post-start branch, so in ha-log, there are following log information
>
> tengine[4803]: 2009/11/19_15:21:43 info: extract_event: Aborting on
> transient_attributes changes for "node of uuid"
> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
> priority upgraded to 1000000
> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort action
> 0 superceeded by 2

Looks right to me. What version are you running?

> Second issue, crm doesnot check the status of the instance on each node,

Did you add monitor operation to the resource?

> so
> that when i try to kill one instance on any node, crm_mon display is still
> "started" for this instance on this node. In other words, crm can not call
> the monitor function in the interval time specified in the cib.xml.
>
> Please help me to resolve this problem. Thanks alot!!!!!
> --
> View this message in context: http://old.nabble.com/Help-required-to-develop-OCF-Resource-Agent-Script-for-Master-Slave-script-tp26472940p26472940.html
> Sent from the Linux-HA mailing list archive at Nabble.com.
>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


radhamadhavi at huawei

Nov 29, 2009, 8:26 PM

Post #3 of 9 (1200 views)
Permalink
Re: Help required to develop OCF Resource Agent Script for Master-Slave script [In reply to]

Andrew Beekhof-3 wrote:
>
> On Mon, Nov 23, 2009 at 6:15 AM, Jessy <radhamadhavi [at] huawei> wrote:
>>
>> I have developed a OCF resource agent script for the master/slave
>> application. While testing the RA, i have faced following problems.
>>
>> First, it is not able to promote once instance to the master, after i
>> have
>> called
>> "crm_master -v 100 -l reboot" utility command in the notify function of
>> post-start branch, so in ha-log, there are following log information
>>
>> tengine[4803]: 2009/11/19_15:21:43 info: extract_event: Aborting on
>> transient_attributes changes for "node of uuid"
>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>> priority upgraded to 1000000
>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>> action
>> 0 superceeded by 2
>
> Looks right to me. What version are you running?
>
> [Jessy] : Ther version i'm using is Heartbeat v2.0.8 on SUSE 10 SP1, the
> cluster constists of two nodes.
>
>
>> Second issue, crm doesnot check the status of the instance on each node,
>
> Did you add monitor operation to the resource?
>
> [Jessy] : Monitor function has been implemented in the script. The script
> also consists of start/stop/promote/demote/notify
> functions too.
>
>> so
>> that when i try to kill one instance on any node, crm_mon display is
>> still
>> "started" for this instance on this node. In other words, crm can not
>> call
>> the monitor function in the interval time specified in the cib.xml.
>>
>> Please help me to resolve this problem. Thanks alot!!!!!
>> --
>> View this message in context:
>> http://old.nabble.com/Help-required-to-develop-OCF-Resource-Agent-Script-for-Master-Slave-script-tp26472940p26472940.html
>> Sent from the Linux-HA mailing list archive at Nabble.com.
>>
>> _______________________________________________
>> Linux-HA mailing list
>> Linux-HA [at] lists
>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
>> See also: http://linux-ha.org/ReportingProblems
>>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
>

--
View this message in context: http://old.nabble.com/Help-required-to-develop-OCF-Resource-Agent-Script-for-Master-Slave-script-tp26472940p26569453.html
Sent from the Linux-HA mailing list archive at Nabble.com.

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


radhamadhavi at huawei

Nov 29, 2009, 9:55 PM

Post #4 of 9 (1205 views)
Permalink
Re: Help required to develop OCF Resource Agent Script for Master-Slave script [In reply to]

Andrew Beekhof-3 wrote:
>
> On Mon, Nov 23, 2009 at 6:15 AM, Jessy <radhamadhavi [at] huawei> wrote:
>>
>> I have developed a OCF resource agent script for the master/slave
>> application. While testing the RA, i have faced following problems.
>>
>> First, it is not able to promote once instance to the master, after i
>> have
>> called
>> "crm_master -v 100 -l reboot" utility command in the notify function of
>> post-start branch, so in ha-log, there are following log information
>>
>> tengine[4803]: 2009/11/19_15:21:43 info: extract_event: Aborting on
>> transient_attributes changes for "node of uuid"
>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>> priority upgraded to 1000000
>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>> action
>> 0 superceeded by 2
>
> Looks right to me. What version are you running?
>
> [Jessy] : Ther version i'm using is Heartbeat v2.0.8 on SUSE 10 SP1, the
> cluster constists of two nodes.
>
>
>> Second issue, crm doesnot check the status of the instance on each node,
>
> Did you add monitor operation to the resource?
>
> [Jessy] : Monitor function has been implemented in the script. The script
> also consists of start/stop/promote/demote/notify
> functions too.
>
>> so
>> that when i try to kill one instance on any node, crm_mon display is
>> still
>> "started" for this instance on this node. In other words, crm can not
>> call
>> the monitor function in the interval time specified in the cib.xml.
>>
>> Please help me to resolve this problem. Thanks alot!!!!!
>> --
>> View this message in context:
>> http://old.nabble.com/Help-required-to-develop-OCF-Resource-Agent-Script-for-Master-Slave-script-tp26472940p26472940.html
>> Sent from the Linux-HA mailing list archive at Nabble.com.
>>
>> _______________________________________________
>> Linux-HA mailing list
>> Linux-HA [at] lists
>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
>> See also: http://linux-ha.org/ReportingProblems
>>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
>

--
View this message in context: http://old.nabble.com/Help-required-to-develop-OCF-Resource-Agent-Script-for-Master-Slave-script-tp26472940p26569453.html
Sent from the Linux-HA mailing list archive at Nabble.com.

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


andrew at beekhof

Dec 1, 2009, 1:15 AM

Post #5 of 9 (1199 views)
Permalink
Re: Help required to develop OCF Resource Agent Script for Master-Slave script [In reply to]

On Mon, Nov 30, 2009 at 5:26 AM, Jessy <radhamadhavi [at] huawei> wrote:
>
>
>
> Andrew Beekhof-3 wrote:
>>
>> On Mon, Nov 23, 2009 at 6:15 AM, Jessy <radhamadhavi [at] huawei> wrote:
>>>
>>> I have developed a OCF resource agent script for the master/slave
>>> application. While testing the RA, i have faced following problems.
>>>
>>> First, it is not able to promote once instance to the master, after i
>>> have
>>> called
>>> "crm_master -v 100 -l reboot" utility command in the notify function of
>>> post-start branch, so in ha-log, there are following log information
>>>
>>> tengine[4803]: 2009/11/19_15:21:43 info: extract_event: Aborting on
>>> transient_attributes changes for "node of uuid"
>>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>>> priority upgraded to 1000000
>>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>>> action
>>> 0 superceeded by 2
>>
>> Looks right to me.  What version are you running?
>>
>> [Jessy] : Ther version i'm using is Heartbeat v2.0.8 on SUSE 10 SP1, the
>> cluster constists of two nodes.

Yikes!
You really need to update. No wonder its not working.

The latest for from SUSE for SLES10 is 2.1.4 which will be a lot better.

>>
>>
>>> Second issue, crm doesnot check the status of the instance on each node,
>>
>> Did you add monitor operation to the resource?
>>
>> [Jessy] : Monitor function has been implemented in the script. The script
>> also consists of                  start/stop/promote/demote/notify
>> functions too.

Yes, but did you add a monitor action to the resource's definition in
the configuration?
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


radhamadhavi at huawei

Dec 1, 2009, 8:23 PM

Post #6 of 9 (1190 views)
Permalink
Re: Help required to develop OCF Resource Agent Script for Master-Slave script [In reply to]

Andrew Beekhof-3 wrote:
>
> On Mon, Nov 30, 2009 at 5:26 AM, Jessy <radhamadhavi [at] huawei> wrote:
>>
>>
>>
>> Andrew Beekhof-3 wrote:
>>>
>>> On Mon, Nov 23, 2009 at 6:15 AM, Jessy <radhamadhavi [at] huawei> wrote:
>>>>
>>>> I have developed a OCF resource agent script for the master/slave
>>>> application. While testing the RA, i have faced following problems.
>>>>
>>>> First, it is not able to promote once instance to the master, after i
>>>> have
>>>> called
>>>> "crm_master -v 100 -l reboot" utility command in the notify function of
>>>> post-start branch, so in ha-log, there are following log information
>>>>
>>>> tengine[4803]: 2009/11/19_15:21:43 info: extract_event: Aborting on
>>>> transient_attributes changes for "node of uuid"
>>>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>>>> priority upgraded to 1000000
>>>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>>>> action
>>>> 0 superceeded by 2
>>>
>>> Looks right to me.  What version are you running?
>>>
>>> [Jessy] : Ther version i'm using is Heartbeat v2.0.8 on SUSE 10 SP1, the
>>> cluster constists of two nodes.
>
> Yikes!
> You really need to update. No wonder its not working.
>
>
> The latest for from SUSE for SLES10 is 2.1.4 which will be a lot better.
>
>>>
>>>
>>>> Second issue, crm doesnot check the status of the instance on each
>>>> node,
>>>
>>> Did you add monitor operation to the resource?
>>>
>>> [Jessy] : Monitor function has been implemented in the script. The
>>> script
>>> also consists of                  start/stop/promote/demote/notify
>>> functions too.
>
> Yes, but did you add a monitor action to the resource's definition in
> the configuration?
>
> [Jessy] : I have added monitor operation defination in cib.xml with
> certain interval time in cib.xml file as below.
> <op id="Op_ms_app_prog10" name="monitor" interval="10s" timeout="30s"
> role="Master"/>
> <op id="Op_ms_app_prog11" name="monitor" interval="20s" timeout="50s"
> role="Slave"/>
>
> More over, i've also added the defination of monitor action in the
> meta-data of RA 'MaSlApp' as follows:
> <actions>
> <action name="start" timeout="240"/>
> <action name="promote" timeout="90"/>
> <action name="demote" timeout="90"/>
> <action name="notify" timeout="90"/>
> <action name="stop" timeout="100"/>
> <action name="monitor" depth="0" timeout="20" interval="30"
> start-delay="50" role="Slave"/>
> <action name="monitor" depth="0" timeout="20" interval="10"
> start-delay="30" role="Master"/>
> <action name="meta-data" timeout="10" />
> <action name="validate-all" timeout="30" />
> </actions>
>
> Thanks in advance!!!
>
>
>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
>

--
View this message in context: http://old.nabble.com/Help-required-to-develop-OCF-Resource-Agent-Script-for-Master-Slave-script-tp26472940p26601657.html
Sent from the Linux-HA mailing list archive at Nabble.com.

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


radhamadhavi at huawei

Dec 2, 2009, 8:03 PM

Post #7 of 9 (1170 views)
Permalink
Re: Help required to develop OCF Resource Agent Script for Master-Slave script [In reply to]

Andrew Beekhof-3 wrote:
>
> On Mon, Nov 30, 2009 at 5:26 AM, Jessy <radhamadhavi [at] huawei> wrote:
>>
>>
>>
>> Andrew Beekhof-3 wrote:
>>>
>>> On Mon, Nov 23, 2009 at 6:15 AM, Jessy <radhamadhavi [at] huawei> wrote:
>>>>
>>>> I have developed a OCF resource agent script for the master/slave
>>>> application. While testing the RA, i have faced following problems.
>>>>
>>>> First, it is not able to promote once instance to the master, after i
>>>> have
>>>> called
>>>> "crm_master -v 100 -l reboot" utility command in the notify function of
>>>> post-start branch, so in ha-log, there are following log information
>>>>
>>>> tengine[4803]: 2009/11/19_15:21:43 info: extract_event: Aborting on
>>>> transient_attributes changes for "node of uuid"
>>>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>>>> priority upgraded to 1000000
>>>> tengine[4803]: 2009/11/19_15:21:43 info: update_abort_priority: Abort
>>>> action
>>>> 0 superceeded by 2
>>>
>>> Looks right to me.  What version are you running?
>>>
>>> [Jessy] : Ther version i'm using is Heartbeat v2.0.8 on SUSE 10 SP1, the
>>> cluster constists of two nodes.
>
> Yikes!
> You really need to update. No wonder its not working.
>
>
> The latest for from SUSE for SLES10 is 2.1.4 which will be a lot better.
>
>>>
>>>
>>>> Second issue, crm doesnot check the status of the instance on each
>>>> node,
>>>
>>> Did you add monitor operation to the resource?
>>>
>>> [Jessy] : Monitor function has been implemented in the script. The
>>> script
>>> also consists of                  start/stop/promote/demote/notify
>>> functions too.
>
> Yes, but did you add a monitor action to the resource's definition in
> the configuration?
>
> [Jessy] : I have added monitor operation defination in cib.xml with
> certain interval time in cib.xml file as below.
> <op id="Op_ms_app_prog10" name="monitor" interval="10s" timeout="30s"
> role="Master"/>
> <op id="Op_ms_app_prog11" name="monitor" interval="20s" timeout="50s"
> role="Slave"/>
>
> More over, i've also added the defination of monitor action in the
> meta-data of RA 'MaSlApp' as follows:
> <actions>
> <action name="start" timeout="240"/>
> <action name="promote" timeout="90"/>
> <action name="demote" timeout="90"/>
> <action name="notify" timeout="90"/>
> <action name="stop" timeout="100"/>
> <action name="monitor" depth="0" timeout="20" interval="30"
> start-delay="50" role="Slave"/>
> <action name="monitor" depth="0" timeout="20" interval="10"
> start-delay="30" role="Master"/>
> <action name="meta-data" timeout="10" />
> <action name="validate-all" timeout="30" />
> </actions>
>
> Thanks in advance!!!
>
>
>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
>

--
View this message in context: http://old.nabble.com/Help-required-to-develop-OCF-Resource-Agent-Script-for-Master-Slave-script-tp26472940p26601657.html
Sent from the Linux-HA mailing list archive at Nabble.com.

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


andrew at beekhof

Dec 8, 2009, 12:28 AM

Post #8 of 9 (1095 views)
Permalink
Re: Help required to develop OCF Resource Agent Script for Master-Slave script [In reply to]

On Wed, Dec 2, 2009 at 5:23 AM, Jessy <radhamadhavi [at] huawei> wrote:
>> Yes, but did you add a monitor action to the resource's definition in
>> the configuration?
>>
>> [Jessy] : I have added monitor operation defination in cib.xml with
>> certain interval time in cib.xml file as below.
>> <op id="Op_ms_app_prog10" name="monitor" interval="10s" timeout="30s"
>> role="Master"/>
>> <op id="Op_ms_app_prog11" name="monitor" interval="20s" timeout="50s"
>> role="Slave"/>
>>
>> More over, i've also added the defination of monitor action in the
>> meta-data of RA 'MaSlApp' as follows:
>>    <actions>
>>    <action name="start" timeout="240"/>
>>    <action name="promote" timeout="90"/>
>>    <action name="demote" timeout="90"/>
>>    <action name="notify" timeout="90"/>
>>    <action name="stop" timeout="100"/>
>>    <action name="monitor" depth="0" timeout="20" interval="30"
>> start-delay="50" role="Slave"/>
>>    <action name="monitor" depth="0" timeout="20" interval="10"
>> start-delay="30" role="Master"/>
>>    <action name="meta-data" timeout="10" />
>>    <action name="validate-all" timeout="30" />
>>    </actions>
>>
>> Thanks in advance!!!

Ok, and what happened?
Did you also upgrade?
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


radhamadhavi at huawei

Dec 15, 2009, 10:45 PM

Post #9 of 9 (966 views)
Permalink
Re: Help required to develop OCF Resource Agent Script for Master-Slave script [In reply to]

Andrew Beekhof-3 wrote:
>
> On Wed, Dec 2, 2009 at 5:23 AM, Jessy <radhamadhavi [at] huawei> wrote:
>>> Yes, but did you add a monitor action to the resource's definition in
>>> the configuration?
>>>
>>> [Jessy] : I have added monitor operation defination in cib.xml with
>>> certain interval time in cib.xml file as below.
>>> <op id="Op_ms_app_prog10" name="monitor" interval="10s" timeout="30s"
>>> role="Master"/>
>>> <op id="Op_ms_app_prog11" name="monitor" interval="20s" timeout="50s"
>>> role="Slave"/>
>>>
>>> More over, i've also added the defination of monitor action in the
>>> meta-data of RA 'MaSlApp' as follows:
>>>    <actions>
>>>    <action name="start" timeout="240"/>
>>>    <action name="promote" timeout="90"/>
>>>    <action name="demote" timeout="90"/>
>>>    <action name="notify" timeout="90"/>
>>>    <action name="stop" timeout="100"/>
>>>    <action name="monitor" depth="0" timeout="20" interval="30"
>>> start-delay="50" role="Slave"/>
>>>    <action name="monitor" depth="0" timeout="20" interval="10"
>>> start-delay="30" role="Master"/>
>>>    <action name="meta-data" timeout="10" />
>>>    <action name="validate-all" timeout="30" />
>>>    </actions>
>>>
>>> Thanks in advance!!!
>
> Ok, and what happened?
> Did you also upgrade?
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
>

--
View this message in context: http://old.nabble.com/Help-required-to-develop-OCF-Resource-Agent-Script-for-Master-Slave-script-tp26472940p26806893.html
Sent from the Linux-HA mailing list archive at Nabble.com.

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Linux-HA users RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.