E
dit
A
ttach
P
rintable
r18 - 2023-10-12 - 16:15:14 -
TimFeeney
You are here:
TWiki
>
Deployment Web
>
DeploymentPlanningAndDesign
>
ApproachesToImplementingHAAndDR
>
ImplementingApplicationHAUsingPowerHA
<div id="header-title" style="padding: 10px 15px; border-width:1px; border-style:solid; border-color:#FFD28C; background-image: url(<nop>https://jazz.net/wiki/pub/Deployment/WebPreferences/TLASE.jpg); background-size: cover; font-size:120%"> ---+!! Implementing Jazz application HA using !PowerHA %DKGRAY% Authors: Main.TimFeeney, Main.StevenBeard, Main.MikeDelargy, Main.MikeLyons <br> Build basis: Jazz products in CLM and SSE %ENDCOLOR%</div></sticky> <!-- Page contents top of page on right hand side in box --> <sticky><div style="float:right; border-width:1px; border-style:solid; border-color:#DFDFDF; background-color:#F6F6F6; margin:0 0 15px 15px; padding: 0 15px 0 15px;"> %TOC{title="Page contents"}% </div></sticky> <sticky><div style="margin:15px;"></sticky> The IBM !PowerHA !SystemMirror software provides a low-cost commercial computing environment that ensures quick recovery of mission-critical applications from hardware and software failures. This high availability system combines custom software with industry-standard hardware to minimize downtime by quickly restoring services when a system, component, or application fails. Although not instantaneous, the restoration of service is rapid, usually within 30 to 300 seconds. !PowerHA is designed to ensure application availability in the event of system failures, operator error or planned maintenance. Servers or LPARs are grouped into clusters that monitor each other; if a failure makes an application unavailable, it can be restarted on the backup hardware. Administrators can also deliberately move/activate an application on backup hardware in order to perform maintenance without affecting the application. Clustering two or more servers to back up critical applications is a cost-effective high availability option. Application clusters also provide a gradual, scalable growth path. It is easy to add a processor to the cluster to share the growing workload. You can also upgrade one or more of the processors in the cluster to a more powerful model. If you were using a fault tolerant strategy, you must add two processors, one as a redundant backup that does no processing during normal operations. <div style="text-align:center;"><img src="%ATTACHURLPATH%/PowerHA-2node.png" alt="PowerHA-2node.png" width="30%" height="30%" /><br><br> __A typical two node cluster__ </div> !PowerHA uses a heartbeat mechanism sent between nodes and to a shared disk to detect and diagnose failures. <div style="text-align:center;"><img src="%ATTACHURLPATH%/CLM_PowerHA.png" alt="CLM_PowerHA.png" width="75%" height="75%"/><br><br> __Recommended CLM !PowerHA topology__ </div> In this distributed enterprise topology, based off of the [[RecommendedCLMDeploymentTopologies][CLM-E1]] topology, each Jazz application runs on its own primary LPAR and has a secondary/standby LPAR with the same configuration (memory, CPUs, etc.). The secondary LPAR has at least the OS running and possibly WAS but not the secondary application instance. #JazzApplicationRecovery ---++ Jazz application recovery !PowerHA can further monitor applications in one of two ways: * Process Monitor determines the death of a process * Custom Monitor monitors health of the application using a monitor method you provide Decisions upon failure * Restart Can establish a number of restarts to restart locally. After a specified restart count, if app continues to fail you can escalate to a fallover. * Notify Send email notification * Fallover Move application and associated resource group to next candidate node. Application Monitoring can be suspended/resumed at anytime. A simple script using curl can be written to detect whether a Jazz application is available or not. For example, <verbatim> curl -s -o /dev/null -w "%{http_code}" http://www.example.org/ </verbatim> will return an HTTP code of 200 if the application at www.example.org is accessible. A script using that statement and some additional code could be written that looks for anything other than 200 or specific error codes and counts these occurrences to determine if the application is inaccessible. This could then be reported to !PowerHA and a recovery process activated. Configure !PowerHA recovery process to restart the failed Jazz application. If the application fails to restart after 2-3 times, restart WAS per [[WasRecovery]]. A simpler approach would be to avoid the restarts and failover to the secondary LPAR. #WasRecovery ---++ WAS recovery Configure !PowerHA [[http://publib.boulder.ibm.com/infocenter/aix/v7r1/topic/com.ibm.aix.powerha.smartassist/smartassist_pdf.pdf][Smart Assist for WebSphere]] to detect if WAS is available. If not, use the start/stop scripts provided by the Smart Assist to restart WAS up to two times and then the Jazz application. If WAS does not restart successfully, !PowerHA fails over to the secondary LPAR. #SingleLPARRecovery ---++ Single LPAR recovery If !PowerHA detects the loss of heartbeat packets failover occurs. * Shutdown the primary instance of the application * Transfer the SAN data partition with the Jazz indexes from the primary to secondary LPAR * On the DNS server, transfer the hostname resolution of the secondary LPAR (i.e. update the routing so the public URI resolves to the secondary LPAR) * Start-up the Jazz application on the secondary LPAR * Secondary LPAR has OS and WAS running ---++ Other considerations It is imperative that the configuration of the secondary application nodes be kept in sync with their primary. Any software upgrades, hotfixes, application configuration chages, etc., made on the primary node need to be applied as well to the secondary. Otherwise, you run the risk that the failover will fail or not provide the same level of service. ---++ Benefits of a !PowerHA based Application HA solution This solution is very similar to a cold standby solution. The key difference is the automation provided by !PowerHA. You could have a phased implementation that starts with a manual cold standby configuration and progressively evolves to a fully automated configuration. !PowerHA is a mature Power technology and is already in use at many enterprise customers for their application, data and storage tiers. * Offers automated failover with faster recovery time leading to more reliable, less error prone failover * Greater confidence in failover improves triaging options of SW or HW issues in production by using failover to secondary * Can be extended to support availability in the web tier (redundancy of the reverse proxy servers) * Has capabilities to support Disaster Recovery as well The two node configuration described above is a very common configuration, relatively fast to implement, low complexity and low risk. ---+++++!! Related topics: [[ApproachesToImplementingHAAndDR][Approaches to implementing high availability and disaster recovery for Rational Jazz environments]], [[HighAvailability][High availability principles]] ---+++++!! External links: * [[http://publib.boulder.ibm.com/infocenter/aix/v7r1/topic/com.ibm.aix.powerha.navigation/powerha_pdf.htm][PowerHA SystemMirror 7.1 for AIX PDFs]] * [[http://publib.boulder.ibm.com/infocenter/aix/v7r1/topic/com.ibm.aix.powerha.concepts/hacmpconcepts_pdf.pdf][PowerHA SystemMirror concepts]] * [[http://publib.boulder.ibm.com/infocenter/aix/v7r1/topic/com.ibm.aix.powerha.smartassist/smartassist_pdf.pdf][Smart Assist for WebSphere]] * [[http://www.ibm.com/developerworks/aix/library/au-powerhaintro/][Introduction to PowerHA]] ---+++++!! Additional contributors: None <sticky></div></sticky>
E
dit
|
A
ttach
|
P
rintable
|
V
iew topic
|
Backlinks:
We
b
,
A
l
l Webs
|
H
istory
: r18
<
r17
<
r16
<
r15
<
r14
|
M
ore topic actions
Deployment.ImplementingApplicationHAUsingPowerHA moved from Deployment.ImplementingApplicationHABasedUponPowerHA on 2014-03-29 - 16:27 by Main.sbeard
-
Deployment
Deployment web
Planning and design
Installing and upgrading
Migrating and evolving
Integrating
Administering
Monitoring
Troubleshooting
Community information and contribution guidelines
Create new topic
Topic list
Search
Advanced search
Notify
RSS
Atom
Changes
Statistics
Web preferences
NOTE: Please use the Sandbox web for testing
Status icon key:
To do
Under construction
New
Updated
Constant change
None - stable page
Smaller versions of status icons for inline text:
Copyright © by IBM and non-IBM contributing authors. All material on this collaboration platform is the property of the contributing authors.
Contributions are governed by our
Terms of Use.
Please read the following
disclaimer
.
Dashboards and work items are no longer publicly available, so some links may be invalid. We now provide similar information through other means. Learn more
here
.