MPIO on AIX

Post Reply
dp-sg
Posts: 1
Joined: Mon Mar 01, 2010 2:00 pm

MPIO on AIX

Post by dp-sg »

We are in the process of migrating our storage from EMC to 3Par on AIX servers. I'm trying to find some readmes, if there are any, from 3Par regarding configuring mpio on aix. I guess my account with 3Par support is pending authorization or something because I'm not able to grab any from the 3Par site. Can anyone point me to some information?
Thanks
User avatar
Richard Siemers
Site Admin
Posts: 1333
Joined: Tue Aug 18, 2009 10:35 pm
Location: Dallas, Texas

Re: MPIO on AIX

Post by Richard Siemers »

As far as MPIO on AIX goes, I am a little confused by 3Par's naming and handling of this. It's my underdstanding that it's IBM's native MPIO for AIX, documentation for MPIO should be retrieved from IBM sources. Why 3Par calls this product "3Par Multipath I/O for IBM AIX" is a mystery to me. It appears to be nothing more than ODM definitions and scripts, but I am no AIX guru/admin. I do know that lspath is the command used to view the paths.

Attached is the readme that comes with the install CD, and the Implementation Guide for IBM AIX 6.1

aixmpio_readme_2.2.pdf
3PAR for AIX MPIO 2.2 Readme
(64.53 KiB) Downloaded 4282 times

3PAR_IG_AIX61_IBM_320-200106_RevB .pdf
3Par Implementation Guide for AIX 6.1
(291.67 KiB) Downloaded 4249 times
Richard Siemers
The views and opinions expressed are my own and do not necessarily reflect those of my employer.
User avatar
Richard Siemers
Site Admin
Posts: 1333
Joined: Tue Aug 18, 2009 10:35 pm
Location: Dallas, Texas

Re: MPIO on AIX

Post by Richard Siemers »

I wanted to follow up on this, since posting I have dabbled with AIX alot more.

I have encountered many who are under the impression that Powerpath and 3PAR MPIO can not co-exist on AIX. While certain specific windows environments cannot have the 2 applications co-exist, AIX is a different animal.

Lets be clear, MPIO belongs to AIX, it is part of the operating system. The software delivered by 3PAR is just ODM definitions despite it's fancy marketing name claiming "Multipath software". Since the inclusion of MPIO, it has been co-existing with Powerpath. In fact EMC supports using MPIO for Symmetrix connections simultaneously while connecting to Clariion with powerpath, and visa versa. The 3Par ODM defines it's LUNs as MPIO devices and Powerpath will not touch them.
Richard Siemers
The views and opinions expressed are my own and do not necessarily reflect those of my employer.
User avatar
Richard Siemers
Site Admin
Posts: 1333
Joined: Tue Aug 18, 2009 10:35 pm
Location: Dallas, Texas

Re: MPIO on AIX

Post by Richard Siemers »

More on AIX...

We recently brought some new ports online on the 3PAR and began rebalancing our connected hosts to the new ports. We assumed that with AIX MPIO we could kill then add a path hot and online, just as long as the system always had live paths to work with... the assumption was good, and we were able to move systems around without immediate interuption ... except... we had a server crash about 45 minutes after the moves were done!

IBM support reviewed the dump files and pointed the finger at 3PAR kernel extensions, however, the real problem was the result of the AIX system not having its FAST_FAIL and DYNAMIC_TRACKING settings on the HBA enabled per both AIX and 3PAR documentation (why these settings aren't on by default boggles me).

3PAR support responded with this well written, well referenced, accurate response to my issue. Compared to recent support I have been getting from both Symantec and Microsoft, I am pleased and impressed with the details of this response which made it crystal clear what my problem was, and how to fix it.


Case Analysis Report
SR#: 482626-181966951
Reported Symptom: 3par_pcmke kernel extension causes AIX servers to crash
Reported By: Pier 1 Services Company

Description:
After successfully adding new MPIO paths, then deleting the old ones to an AIX 5.3 server, about an hour later, that server crashed. Crash dumps were collected and sent to IBM and they responded with the following:

Subject: PMR 18921,004,000 3par_pcmke kernel extension
CRASH INFORMATION: CPU 0 CSA F00000002FF47600 at time of crash, error code for LEDs: 30000000
pvthread+03AB00 STACK: [04167B70]3par_pcmke:pcmSelectIoctlPath+0000DC (F1000110104AD350,
F100011010433800)
--
The problem is due to some issue in the 3par_pcmke kernel extension.
The owner of this kernel extension is 3PAR company.

Findings:
3PAR investigation shows that a similar kernel extension crash was reported to 3PAR engineering and was determined to be caused by HBA settings related to the dynamic tracking and fast fail attribute settings.

Per 3PAR engineering when dynamic tracking is not enabled in the HBA, the 3PAR MPIO path pointers can get null values which can cause problems similar to what you reported.

Further research with IBM reveals that , for hosts systems that run an AIX® 5.2 or later operating system, the fast fail and dynamic tracking attributes must be enabled.

See link:
IBM Aix Config for Fast Fail and Dynamic Tracking

Review of the log lsattr_fscsi.out you provided, we confirmed that the dynamic tracking and fast fail attributes are not enabled on
this host as recommended.

From lsattr_fscsi.out:

### fscsi0

attach switch How this adapter is CONNECTED False
dyntrk no Dynamic Tracking of FC Devices True
fc_err_recov delayed_fail FC Fabric Event Error RECOVERY Policy True
scsi_id 0xa30024 Adapter SCSI ID False
sw_fc_class 3 FC Class for Fabric True

…

### fscsi2

attach switch How this adapter is CONNECTED False
dyntrk no Dynamic Tracking of FC Devices True
fc_err_recov delayed_fail FC Fabric Event Error RECOVERY Policy True
scsi_id 0xa20020 Adapter SCSI ID False
sw_fc_class 3 FC Class for Fabric True


When dynamic tracking of FC devices is enabled, the FC adapter driver can detect when the Fiber Channel N_Port ID of a device changes and re-route traffic destined for that device to the new address while the devices are still online.

The 3PAR Implementation Guide for AIX also additional information on these settings including a list of events when the N_Port ID can change. See section 3.2.5 of the attached AIX implementation guide.

Solution :
The dynamic tracking and fast fail commands can be enabled by running these commands.

chdev -l fscsi0 -a fc_err_recov=fast_fail
chdev -l fscsi0 -a dyntrk=yes

Notes:
1. Change the settings on all applicable HBA in the system.
2. A Reboot may be required for these changes to take effect.

***Please follow all necessary pre-cautions before rebooting your host. ***

For further details and other considerations please refer to the IBM documentation on how to implement these changes.

IBM Aix Config Fast Fail and Dynamic Tracking
Richard Siemers
The views and opinions expressed are my own and do not necessarily reflect those of my employer.
Post Reply