Rational Design Manager performance and scalability 5.0
David Hirsch, Performance Lead, Design Management, IBM Rational
Last updated: Aug 28, 2014
Build basis: Rational Rhapsody Design Manager & Rational Software Architect Design Manager 5.0
Table of contents
Introduction
Summary
Overview of the Design Manager architecture
Performance test environment overview
Performance test topology
Workload model
Scalability
Test Results
Disk Performance
Performance tuning tips
Performance comparison between DM 5.0 and DM 4.0.6
Introduction
Performance and scalability are paramount in any multi-user client-server application. Because many users access the application remotely, providing good response times while being able to handle the load is one of the key factors in making a client-server application successful.
The Rational Design Manager (DM) application is a client-server application that allows users to manage and collaborate on design resources using one of its rich clients or a web browser.
In this article, we outline the performance and scalability results of our tests of the improvements made in DM 5.0. Following up on DM 4.0.6 and earlier releases, special emphasis has been put on tracking and improving the performance of the application, on both the client and the server side.
Major performance improvements have been achieved by adopting significant architectural changes, including resolving the performance bottlenecks of write scenarios.
The rest of this article describes the current scalability limits in terms of concurrent users and model data size. This is one more step in our ongoing effort to improve DM performance; watch for further improvements in upcoming releases.
Summary
This article describes the recommended scalability in terms of concurrent users and size of model data.
- DM 5.0 supports up to 300 concurrent users (an average of 1 operation per user per minute) working on a 3 GB model. Performance is expected to remain reasonable (below 2 seconds for a single action) in such an environment.
- There are three main bottlenecks:
- Index files: Data volume affects the size of the index files, which degrades index lookup performance. You can improve this by increasing process memory (so that the index files are loaded into memory) or by improving disk performance (SSD disks).
- Server CPU: CPU utilization on the server is the main bottleneck to supporting more users. Increasing the number of CPU cores on the servers helps DM handle a larger user load.
- Write scenarios: Write scenarios are heavy operations that affect performance. You should differentiate between actively managed and externally managed users, and you can split models into separate project areas to keep write-scenario performance degradation from affecting read scenarios.
- To maximize DM performance, refer to the performance tuning tips at the end of this article. Generally, you can size the hardware and network according to the expected number of concurrent users, increase the size of the thread pool used by the server’s web application container from the default, and set the size of the heap that the JVM uses appropriately.
- We also include a performance comparison table between the Design Manager 5.0 and Design Manager 4.0.6 releases.
Overview of the Design Manager architecture
The DM server is a set of Java EE applications, each contributing to the overall DM feature set. Figure 1 provides an overview of the main DM components as well as the DM rich clients.
Figure 1 – Design Manager main components
- Jazz Team Server: provides common services to the Jazz applications, like project area management, or user management and authentication.
- Configuration Application: Provides version and configuration management for Jazz applications.
- Design Management Application: The core of the Design Manager server. Provides design, collaboration (reviewing and commenting), and domain modeling capabilities.
- RSA Domain Extension Application: Used to create, edit and delete Rational Software Architect (RSA) based resources.
- RSA Rich Client: Extension that integrates RSA with DM. It provides editing and collaboration capabilities directly within RSA.
- Rhapsody Rich Client: Rich client that allows Rhapsody users to interact with DM. It provides editing and collaboration capabilities directly within Rhapsody.
Performance test environment overview
The tests typically run for one and a half hours with a frequency of one operation per user per minute. Results from the first and last 15 minutes are discarded to reduce noise. Because the tests exercise the DM server services directly, the results apply to both RSA DM and Rhapsody DM. The test results presented in this document are taken from a Rhapsody DM server run (see work item 46975); we see similar results when running the tests against an RSA Design Manager server.
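The trimming of warm-up and cool-down samples described above can be sketched as follows. This is an illustrative sketch only; the function name and sample format are not part of the actual test framework:

```python
# Sketch: compute a steady-state mean response time by discarding the
# first and last 15 minutes of a 90-minute run, as the DM tests do.
def steady_state_mean(samples, run_minutes=90, trim_minutes=15):
    """samples: list of (minute_of_run, response_time_seconds) pairs."""
    lo, hi = trim_minutes, run_minutes - trim_minutes
    kept = [rt for minute, rt in samples if lo <= minute < hi]
    return sum(kept) / len(kept)

# Warm-up and cool-down samples (minutes 5 and 85) are ignored.
samples = [(5, 9.0), (30, 2.0), (45, 2.0), (60, 2.0), (85, 9.0)]
print(steady_state_mean(samples))  # -> 2.0
```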
Performance test topology
As shown in figure 2, a three-tier topology was used, with two application servers running Tomcat 7.0 and one database server running DB2 9.7. The Jazz Team Server (JTS) was deployed on a different server from the Design Management, RSA Domain Extension, and Configuration applications. This improves scalability when integrating different Jazz-based applications, as each can be deployed on its own server, providing better separation and maximizing hardware resource usage. As we will see in the performance tuning tips section, this topology increases network usage, making overall performance more dependent on network latency.
Figure 2 – Performance test environment
The test clients are all built identically and are used to drive up to 300 concurrent users. Users are scaled up by adding new test clients, to avoid exhausting the test clients' hardware resources, which would lead to erroneous performance results. All the machines (test clients and servers) are VMware virtual machines with the specifications shown in figure 3.
Figure 3 – Machines specification
Workload model
The DM performance tests are performed using a custom-made automated performance testing framework capable of simulating an arbitrary number of users interacting with a DM server over an arbitrary period of time. Each user executes a configurable mix of user-level operations (use case scenarios). The framework drives the tests and records the response time of every user-level operation invoked during the test run. Because performance is affected by write scenarios, two different tests have been run: one simulating externally managed users, who typically update the server overnight and access it mostly to read data during the day, and one simulating actively managed users, who create and update resources more heavily during the day.
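The core of such a framework can be sketched as follows: each simulated user runs on its own thread, repeatedly picks an operation from a weighted scenario mix, and records its response time. All names, weights, and the `execute` callback are illustrative, not taken from the actual framework:

```python
import random
import threading
import time
from collections import defaultdict

# Hypothetical scenario mix: (name, weight) pairs.
SCENARIO_MIX = [("open_diagram", 0.05), ("expand_explorer", 0.11), ("create_comment", 0.11)]

def run_user(iterations, results, lock, execute):
    """One simulated user: pick weighted scenarios and time each one."""
    names = [n for n, _ in SCENARIO_MIX]
    weights = [w for _, w in SCENARIO_MIX]
    for _ in range(iterations):
        name = random.choices(names, weights=weights)[0]
        start = time.perf_counter()
        execute(name)  # a real run would issue the HTTP request here
        elapsed = time.perf_counter() - start
        with lock:
            results[name].append(elapsed)

def run_load(users=5, iterations=3, execute=lambda name: None):
    """Drive `users` concurrent simulated users; return recorded timings."""
    results, lock = defaultdict(list), threading.Lock()
    threads = [threading.Thread(target=run_user, args=(iterations, results, lock, execute))
               for _ in range(users)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results

if __name__ == "__main__":
    for name, times in run_load().items():
        print(name, len(times), sum(times) / len(times))
```

A real driver would also pace each user to the configured frequency (for example, one operation per minute) rather than running operations back to back.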
The list below provides an overview of the scenarios used in the performance tests, grouped in two categories, read and write scenarios.
- Read Scenarios:
- Open a UML Diagram: This scenario simulates a user opening a UML diagram.
- Expand a node in the explorer: This scenario involves fetching the children for one of the root nodes in the performance test model data.
- Open a resource: This scenario simulates a user opening a resource in the resource editor and reviewing the resource properties.
- Search a resource: This scenario simulates a user making a keyword search in all models and in all project areas.
- Search a diagram: This scenario simulates a user making a keyword search in all diagrams.
- Rich Client Scenarios:
- Expand a node in the explorer: This scenario simulates one of the rich clients fetching the children for one of the root nodes.
- Open properties for a UML resource: This scenario simulates one of the rich clients opening the properties for a resource.
- OSLC Get: This scenario simulates an OSLC client retrieving a resource.
- OSLC Query: This scenario simulates an OSLC Client (RM) querying DM for OSLC links on a certain resource.
- Write Scenarios:
- Create/Delete a comment: This scenario simulates the creation/deletion of a text comment on a resource.
- Create/Delete an OSLC Link: This scenario simulates the creation/deletion of an OSLC link between 2 DM resources.
- Create a resource: This scenario simulates a user creating an ontology resource (in the separated Users Domains Project).
- Save a resource: This scenario simulates a user updating the title of an ontology resource (in the separated Users Domains Project).
- Lock/Unlock a resource: This scenario simulates the locking/unlocking of a resource.
Scenario | Frequency (per user) | Distribution |
---|---|---|
Write Scenarios | 22% | |
Create comment | 1 run/1 min | 11% |
Create link | 1 run/30 min | 0% |
Delete comment | 1 run/1 min | 11% |
Delete link | 1 run/30 min | 0% |
Read Scenarios | 78% | |
Expand explorer | 1 run/1 min | 11% |
Get Link | 1 run/2 min | 5% |
Get Comment | 1 run/1 min | 11% |
Related Elements | 1 run/2 min | 2% |
OSLC Get | 1 run/1 min | 11% |
Open diagram | 1 run/2 min | 5% |
Open form | 1 run/2 min | 5% |
OSLC Query | 1 run/1 min | 11% |
Search diagrams | 1 run/1 min | 4% |
Search resources | 1 run/1 min | 11% |
Table 1 – Workload scenarios distribution, Externally Managed mode.
Scenario | Frequency (per user) | Distribution |
---|---|---|
Write Scenarios | 49% | |
Create comment | 1 run/1 min | 10% |
Create DM Resource | 1 run/1.33 min | 8% |
Save DM Resource | 1 run/1 min | 10% |
Lock Resource | 1 run/1 min | 10% |
Unlock Resource | 1 run/1 min | 10% |
Read Scenarios | 51% | |
Rich Client Expand Explorer | 1 run/1 min | 12% |
Expand Explorer | 1 run/1 min | 8% |
OSLC Get | 1 run/1 min | 10% |
Open Diagram | 1 run/2 min | 5% |
Open Form | 1 run/2 min | 5% |
Search Diagrams | 1 run/1 min | 4% |
Search Resources | 1 run/1 min | 6% |
Table 2 – Workload scenarios distribution, Actively Managed mode.
Scalability
The scalability of the system can be expressed in terms of user load and workspace load. The tests we ran were designed to identify the number of concurrent users and the workspace load the DM server could handle before performance degraded significantly. We marked a 2-second average response time per single action as acceptable performance. Server performance is affected by both scalability factors (user load and workspace load), so we ran different load scenarios to find the supported balance between the two.
User scalability
The first scalability test was designed to identify the number of concurrent users DM could handle before performance degraded significantly. The tests were performed with 100, 200, and 300 concurrent users on different workspace loads.
Changing the user transaction frequency in the test environment would affect the number of concurrent users supported. For example, running the same test with a scenario frequency of one transaction every three minutes resulted in the application being able to handle at least 300 concurrent users.
Workspace scalability
To identify the maximum model data size that DM can handle before performance starts to decrease significantly, we imported three different Rational Rhapsody models into Design Manager and ran the tests using the resources generated by the import. The tests were performed with 3 GB, 4 GB, and 5 GB of model data. The tests ran against the Rhapsody sample project model AMR System (20 MB). To simulate workspace load, we added additional models into the same workspace, for an overall 1 GB of model data per workspace. To scale to 3 GB, 4 GB, and 5 GB, we created additional project areas with 1 GB of model data each.
The workspaces contained 3 GB, 4 GB, and 5 GB of model data, respectively. Table 3 shows the resulting number of resources and elements in the Design Manager project area for each workspace.
Model size on disk (GB) | # Resources after import | # Triples after import |
---|---|---|
3 GB | 450,000 | 27,000,000 |
4 GB | 600,000 | 36,000,000 |
5 GB | 750,000 | 45,000,000 |
Table 3: Number of resources and elements by size of workspace
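Table 3 implies a roughly linear relationship of about 150,000 resources and 9,000,000 triples per GB of imported model data. A minimal sketch of this back-of-envelope sizing (the constants are derived from Table 3, not an official sizing formula):

```python
# Rough per-GB ratios observed in Table 3 (illustrative, not official).
RESOURCES_PER_GB = 150_000
TRIPLES_PER_GB = 9_000_000

def estimate_index_load(model_size_gb):
    """Estimate resource and triple counts for a given model size in GB."""
    return (model_size_gb * RESOURCES_PER_GB,
            model_size_gb * TRIPLES_PER_GB)

print(estimate_index_load(4))  # -> (600000, 36000000), matching the 4 GB row
```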
Note: The test itself runs against one workspace with a maximum of 1 GB of model data. Sizing the data is done by creating additional project areas.
Test Results
Externally Managed Test Results
Figure 4 shows that the tipping point, for our test environment with one user transaction per minute, is around 400 users and 4 GB of models.
You can see that increasing the user and workspace load from 300 users/3 GB to 400 users/4 GB does not have a big negative impact on performance (1.23 vs. 1.01 seconds). It is only when moving up to 500 users/5 GB that the CPU of the machine gets exhausted.
Figure 4 – Weighted mean response time for 300–500 users with a 3–5 GB model, externally managed test suite
Looking more closely at the response times of the individual scenarios in figures 5 and 6, you can see that the heaviest operations are the write scenarios of creating and deleting a link, where the response time jumps from 4 seconds to 5 seconds.
In figure 6, you can see that when the load is increased to 500 users/5 GB, the CPU gets exhausted, causing unacceptable regressions. Increasing from 2 to 8 CPU cores solves this issue.
Figure 5 – Weighted mean response time of single test scenarios for 300–400 users with a 3–4 GB model
Figure 6 – Weighted mean response time of single test scenarios for 300–500 users with a 3–5 GB model (2 cores vs. 8 cores)
Actively Managed Test Results
Figure 7 shows that the Actively Managed test suite was also able to perform acceptably with a 400 users/4 GB load.
Figure 7 – Weighted mean response time for 300–400 users with a 3–4 GB model, actively managed test suite
Disk Performance
One of the main bottlenecks for the performance and scalability of the Design Manager server is the RDF index files maintained by Jena. We have identified that performance and scalability are highly dependent on how quickly the index files can be accessed.
To improve index access, two options are available:
- Increase the server's RAM: Increasing the memory to 64 GB of RAM allows Jena to load the index files into memory, maximizing performance.
- Use SSD disks: Accessing the index files on disk becomes significantly faster.
In table 6 below, you can see the performance results of a test run on two sets of machines with similar hardware; the only difference between the machines is the disk, one using an SSD and the other not. We see an overall performance gain of approximately 30% on average simply by moving to an SSD disk.
Table 6 – Comparison between an SSD hard drive and a conventional hard drive, actively managed test suite
Performance tuning tips
Here are some performance tuning tips to help you maximize DM performance.
The performance tuning considerations for Design Manager are similar to those for other Jazz-based applications, in particular:
- Size your hardware and network according to the number of concurrent users you expect.
- Use SSD disks.
- Increase server RAM.
- Increase the size of the thread pool used by the server’s web application container from the default.
- Set the size of the heap the JVM uses appropriately.
Hardware sizing
To maximize DM performance, use a 64-bit architecture with at least 2 CPU cores and at least 8 GB of RAM. As noted above, the main resource contention observed during the tests on the DM server is CPU, so increasing the number of CPU cores of the DM server should help increase the number of concurrent users and the size of the model data that DM can handle. As seen above, moving to SSD disks or increasing RAM will boost performance significantly, allowing DM to scale to more workspace data.
Network connectivity
As previously described, in the test topology the DM application, the JTS application, and the database are installed on different machines. This increases the CPU, memory, and disk available to each application, but in return it puts more pressure on network connectivity, especially network latency. To mitigate this, it is recommended to locate the three servers on the same subnet.
Thread pool size
The size of the thread pool used by the DM server's web application container should be at least 2.5 times the expected active user load. For example, if you expect 100 concurrently active users, set the thread pool size to at least 250 on both the DM server and the JTS server.
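On Tomcat, the thread pool is controlled by the `maxThreads` attribute of the HTTP connector in `conf/server.xml`. The fragment below is a hypothetical illustration of the 2.5x rule for 100 active users; port and TLS attributes will differ in your deployment:

```xml
<!-- Hypothetical fragment of Tomcat's conf/server.xml:
     maxThreads raised to 2.5x the expected 100 active users. -->
<Connector port="9443" protocol="HTTP/1.1"
           SSLEnabled="true" scheme="https" secure="true"
           maxThreads="250" />
```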
JVM heap size
It is recommended to set the maximum JVM heap size to at least 6 GB; however, you can only do that if the server has at least 8 GB of RAM. As a rule of thumb, avoid setting the maximum JVM heap size to more than about 70-80% of the server's RAM, or you may experience poor performance due to thrashing.
If you are running Tomcat, edit the server startup script to change the default values of -Xms and -Xmx to the desired value (8 GB or more). Set both parameters to the same value to avoid the overhead of dynamic Java Virtual Machine (JVM) heap management. You will need to stop and restart the server for the changes to take effect.
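One common way to set these options is Tomcat's `bin/setenv.sh` (created if absent and picked up by the startup scripts). This sketch is illustrative; size the values to your own hardware:

```shell
# Hypothetical bin/setenv.sh for a Tomcat-based DM server.
# Equal -Xms/-Xmx avoid heap-resize overhead; keep the heap
# under roughly 70-80% of the server's physical RAM.
CATALINA_OPTS="$CATALINA_OPTS -Xms8g -Xmx8g"
export CATALINA_OPTS
```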
If you are running Websphere Application Server (WAS), see the “Java virtual machine settings” section in the WAS information center for instructions specific to your WAS deployment.
Performance comparison between DM 5.0 and DM 4.0.6
To conclude this article, here is a comparison between the Design Manager 5.0 performance results and the Design Manager 4.0.6 results. Thanks to the adoption of Jazz Foundation architectural changes, such as the JFS POJO API and the VVC configuration cache, response times have improved significantly. In particular, the write scenario bottleneck (for example, creating or deleting a link or a DM resource) has been resolved, resulting in more than 80% improvement.
Figure 8 – Comparison between DM 5.0 and DM 4.0.6, Actively Managed, 3 GB/300 users
Figure 9 – Comparison between DM 5.0 and DM 4.0.6, Externally Managed, 3 GB/300 users
Sources
IBM Rational Software Architect Design Manager 4.0 Performance And Sizing Report
Rational Design Manager performance and scalability 4.0.4
Rational Design Manager performance and scalability 4.0.6
About the author
David Hirsch is a senior developer in the Design Management development team. He was responsible for the Rational Rhapsody Design Manager before taking the lead on the DM effort to improve Design Manager performances. David can be contacted at davidhir@il.ibm.com.
© IBM – Rational Design Manager