Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Users

3.0.1 removed /etc/ha.d/shellfuncs but file still used in several places

 

 

Linux-HA users RSS feed   Index | Next | Previous | View Threaded


bschatz at yahoo

Nov 12, 2009, 2:34 PM

Post #1 of 2 (1067 views)
Permalink
3.0.1 removed /etc/ha.d/shellfuncs but file still used in several places

Hi,

I am new to Linux HA.

I am using the 3.0.1 version of heartbeat with the 1.0.6 stable build of pacemaker on a fresh fc11 system.

I installed everything as per:

http://clusterlabs.org/wiki/Install

# wget -O /etc/yum.repos.d/pacemaker.repo http://clusterlabs.org/rpm/fedora-11/clusterlabs.repo
# yum install -y pacemaker corosync heartbeat

When I attempted to start heartbeat, it failed as follows:

root [at] fc11-:# /etc/init.d/heartbeat start
/etc/init.d/heartbeat: line 51: /etc/ha.d/shellfuncs: No such file or directory

I commented out the call to shellfuncs and manually set HA_BIN and HA_NOARCHBIN as follows:

HA_BIN=/usr/lib64/heartbeat
HA_NOARCHBIN=/usr/share/heartbeat

When I attempted to start heartbeat again I received this error message:

root [at] fc11-:# /etc/init.d/heartbeat start
Starting High-Availability services: /etc/init.d/heartbeat: line 147: ha_parameter: command not found
/etc/init.d/heartbeat: line 182: ha_parameter: command not found
Done.

However, the crm was started.

Next, I attempted to bring an IPaddr resource online with the command:

root [at] fc11-# crm_resource -r SS-IP1 -p target_role -v started

My configuration information is:

root [at] fc11-# crm configure show
node $id="306cec3f-d583-4748-a4d5-98c8d745ee93" fc11-1
node $id="9f84dba6-90ae-4bc6-bd89-87f8b8c8828d" fc11-2
primitive SS-IP1 ocf:heartbeat:IPaddr \
params ip="172.16.80.250" target_role="started" \
op monitor interval="10s"
property $id="cib-bootstrap-options" \
dc-version="1.0.6-cebe2b6ff49b36b29a3bd7ada1c4701c7470febe" \
cluster-infrastructure="Heartbeat" \
stonith-enabled="false"

The log shows:

Nov 12 14:01:16 fc11-2 crmd: [5301]: info: run_graph: ====================================================
Nov 12 14:01:16 fc11-2 crmd: [5301]: notice: run_graph: Transition 23 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pengine/pe-warn-12.bz2): Complete
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: print_graph: ## Empty transition graph ##
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: te_graph_trigger: Transition 23 is now complete
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: notify_crmd: Processing transition completion in state S_TRANSITION_ENGINE
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: notify_crmd: Transition 23 status: done - <null>
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: s_crmd_fsa: Processing I_TE_SUCCESS: [ state=S_TRANSITION_ENGINE cause=C_FSA_INTERNAL origin=notify_crmd ]
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace: // A_LOG
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd ]
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: do_state_transition: Starting PEngine Recheck Timer
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: crm_timer_start: Started PEngine Recheck Timer (I_PE_CALC:900000ms), src=111
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace: // A_DC_TIMER_STOP
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace: // A_INTEGRATE_TIMER_STOP
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace: // A_FINALIZE_TIMER_STOP
Nov 12 14:01:16 fc11-2 pengine: [5346]: WARN: process_pe_message: Transition 23: WARNINGs found during PE processing. PEngine Input stored in: /var/lib/pengine/pe-warn-12.bz2
Nov 12 14:01:16 fc11-2 pengine: [5346]: info: process_pe_message: Configuration WARNINGs found during PE processing. Please run "crm_verify -L" to identify issues.
Nov 12 14:01:19 fc11-2 cib: [5297]: debug: cib_process_xpath: cib_query: //cib/configuration/resources//*[@id="SS-IP1"]//meta_attributes//nvpair[@name="target_role"] does not exist
Nov 12 14:01:19 fc11-2 cib: [5297]: debug: cib_process_xpath: Processing cib_query op for //cib/configuration/resources//*[@id="SS-IP1"]//instance_attributes//nvpair[@name="target_role"] (/cib/configuration/resources/primitive/instance_attributes/nvpair[2])
Nov 12 14:01:19 fc11-2 cib: [5297]: info: cib_process_request: Operation complete: op cib_modify for section resources (origin=local/crm_resource/5, version=0.14.11): ok (rc=0)
Nov 12 14:01:19 fc11-2 cib: [5297]: debug: xmlfromIPC: Peer disconnected
Nov 12 14:02:06 fc11-2 cib: [5297]: debug: cib_process_xpath: cib_query: //cib/configuration/resources//*[@id="SS-IP1"]//meta_attributes//nvpair[@name="target_role"] does not exist
Nov 12 14:02:06 fc11-2 cib: [5297]: debug: cib_process_xpath: Processing cib_query op for //cib/configuration/resources//*[@id="SS-IP1"]//instance_attributes//nvpair[@name="target_role"] (/cib/configuration/resources/primitive/instance_attributes/nvpair[2])
Nov 12 14:02:06 fc11-2 cib: [5297]: info: cib_process_request: Operation complete: op cib_modify for section resources (origin=local/crm_resource/5, version=0.14.11): ok (rc=0)

I then did crm_verify as follows:

root [at] fc11-:# crm_verify -L
crm_verify[13367]: 2009/11/12_14:03:14 ERROR: unpack_rsc_op: Hard error - SS-IP1_monitor_0 failed with rc=5: Preventing SS-IP1 from re-starting on fc11-2
crm_verify[13367]: 2009/11/12_14:03:14 ERROR: unpack_rsc_op: Hard error - SS-IP1_monitor_0 failed with rc=5: Preventing SS-IP1 from re-starting on fc11-1
Warnings found during check: config may not be valid
Use -V for more details

I tried to debug this problem by running the /etc/ha.d/resource.d/IPaddr script manually as follows:

root [at] fc11-:# sh -x /etc/ha.d/resource.d/IPaddr start
+ . /etc/ha.d/resource.d//hto-mapfuncs
++ . /etc/ha.d/shellfuncs
/etc/ha.d/resource.d//hto-mapfuncs: line 9: /etc/ha.d/shellfuncs: No such file or directory

I commented out line 9 and then ran into this problem:

root [at] fc11-:# sh -x /etc/ha.d/resource.d/IPaddr 172.16.80.250 start
+ . /etc/ha.d/resource.d//hto-mapfuncs
++ OCF_ROOT_DIR=/usr/lib/ocf
++ OCF_RA_DIR=/usr/lib/ocf/resource.d/
++ OCF_RA_VERSION_MAJOR=1
++ export OCF_RA_VERSION_MAJOR
++ OCF_RA_VERSION_MINOR=0
++ export OCF_RA_VERSION_MINOR
++ OCF_ROOT=/usr/lib/ocf
++ export OCF_ROOT
++ LEGAL_ACTIONS='(start|stop|status|usage|meta-data)'
+ '[' 2 '!=' 2 ']'
++ echo 172.16.80.250
++ sed 's%/.*%%'
+ BASEIP=172.16.80.250
+ OCF_RESKEY_ip=172.16.80.250
+ export OCF_RESKEY_ip
++ echo 172.16.80.250
++ sed 's%^172.16.80.250/*%%'
+ str=
+ '[' '!' -z '' ']'
+ case $2 in
+ OCF_TYPE=IPaddr
+ OCF_RESKEY_lvs_support=0
+ OCF_RESOURCE_INSTANCE=IPaddr_172.16.80.250
+ export OCF_TYPE OCF_RESOURCE_INSTANCE OCF_RESKEY_lvs_support
+ ra_execocf start
+ '[' xstart = x ']'
+ . /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs
/etc/ha.d/resource.d//hto-mapfuncs: line 35: /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs: No such file or directory

I am in the process of debug this problem.

Has anyone else already fixed these problems or other gotchas with heartbeat 3.0.1?


Thanks,

Bob



_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


lmb at suse

Nov 13, 2009, 9:46 AM

Post #2 of 2 (1053 views)
Permalink
Re: 3.0.1 removed /etc/ha.d/shellfuncs but file still used in several places [In reply to]

On 2009-11-12T14:34:00, Bob Schatz <bschatz [at] yahoo> wrote:

> Hi,
>
> I am new to Linux HA.
>
> I am using the 3.0.1 version of heartbeat with the 1.0.6 stable build of pacemaker on a fresh fc11 system.
>
> I installed everything as per:
>
> http://clusterlabs.org/wiki/Install
>
> # wget -O /etc/yum.repos.d/pacemaker.repo http://clusterlabs.org/rpm/fedora-11/clusterlabs.repo
> # yum install -y pacemaker corosync heartbeat
>
> When I attempted to start heartbeat, it failed as follows:

You need cluster-glue + resource-agents installed too, did they not end
up being installed auotmatically? If not, the fedora packages miss
dependencies.


> I then did crm_verify as follows:
>
> root [at] fc11-:# crm_verify -L
> crm_verify[13367]: 2009/11/12_14:03:14 ERROR: unpack_rsc_op: Hard error - SS-IP1_monitor_0 failed with rc=5: Preventing SS-IP1 from re-starting on fc11-2
> crm_verify[13367]: 2009/11/12_14:03:14 ERROR: unpack_rsc_op: Hard error - SS-IP1_monitor_0 failed with rc=5: Preventing SS-IP1 from re-starting on fc11-1

You posted only the pacemaker messages. What you need to look at
are the log messages from the IPAddr resource agent detailing why it
returned this error code, and fix those.

> root [at] fc11-:# sh -x /etc/ha.d/resource.d/IPaddr start
> + . /etc/ha.d/resource.d//hto-mapfuncs
> ++ . /etc/ha.d/shellfuncs
> /etc/ha.d/resource.d//hto-mapfuncs: line 9: /etc/ha.d/shellfuncs: No such file or directory

That again is a borked install. Please install
cluster-glue/resource-agents.

> /etc/ha.d/resource.d//hto-mapfuncs: line 35: /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs: No such file or directory

Same here.


Regards,
Lars

--
Architect Storage/HA, OPS Engineering, Novell, Inc.
SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Linux-HA users RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.