Rational Design Manager performance and scalability 4.0.6

Table of contents

Introduction
Summary
Overview of the Design Manager architecture
Performance test environment overview
Performance test topology
Workload model
Scalability
Test Results
Disk Performance
Performance tuning tips

Introduction

Performance and scalability are paramount in any multi-user client-server application. Because many users access the application, often remotely, providing good response times while handling the load is one of the key factors in making a client-server application successful.

The Rational Design Manager (DM) application is a client-server application that allows users to manage and collaborate on design resources using a web browser or one of its rich clients.

In this article we outline the performance and scalability results of our performance tests based on the improvements made in DM 4.0.6. Following the DM 4.0.4 and DM 4.0.5 releases, special emphasis has again been put on tracking and improving the performance of the application, on both the client and server side.

The rest of this article describes the current scalability limitations in terms of concurrent users and size of the model data. This is only an additional step in our journey to improve DM performance. Stay tuned for more performance improvements in upcoming releases.

Summary

This article describes the recommended scalability in terms of concurrent users and size of model data.

  • DM 4.0.6 supports a load of up to 150-200 concurrent users (averaging 1 operation per user per minute) working on a 1 GB model. Performance is expected to be reasonable (below 2 sec. for a single action) in such an environment.
  • There are three main bottlenecks:
    • Write scenarios: Performance is highly affected by write scenarios. You should differentiate between actively managed users and externally managed users. You can split the models into separate project areas to reduce the performance degradation caused by write scenarios. (Planned to be addressed in DM 5.0.)
    • Index files: Data volume affects the size of the index files, decreasing index lookup performance. You can improve this by increasing process memory (so the index files are loaded into memory) or by improving disk performance (SSD disks).
    • Server CPU: Server CPU utilization is the main bottleneck when supporting more users. You can increase the number of CPUs on the servers to help DM handle a scaled user load.
  • To maximize DM performance, refer to the performance tuning tips at the end of this article. Generally, you can size the hardware and network according to the expected number of concurrent users, increase the size of the thread pool used by the server’s web application container from the default, and set the size of the heap that the JVM uses appropriately.

Overview of the Design Manager architecture

The DM server is a set of Java EE applications, each contributing to the overall DM feature set. Figure 1 provides an overview of the main DM parts as well as the DM rich clients.

Figure 1 – Design Manager main components

  • Jazz Team Server: provides common services to the Jazz applications, like project area management, or user management and authentication.
  • Configuration Application: Provides version and configuration management for Jazz applications.
  • Design Management Application: The core of the Design Manager server. Provides design, collaboration (reviewing and commenting), and domain modeling capabilities.
  • RSA Domain Extension Application: Used to create, edit and delete Rational Software Architect (RSA) based resources.
  • RSA Rich Client: Extension allowing RSA integration with DM. It provides editing and collaborative capabilities directly into RSA.
  • Rhapsody Rich Client: Rich client allowing Rhapsody users to interact with DM. It provides editing and collaborative capabilities directly into Rhapsody.

Performance test environment overview

The tests are usually run for an hour and a half with a frequency of one operation per user per minute. The results of the first and last 15 minutes are omitted to eliminate noise. Because the tests are performed at the DM server services level, the results are applicable to RSA DM as well as Rhapsody DM. The test results presented in this document are taken from a Rhapsody DM server run (see work item 43833). We see similar results when running the tests against an RSA Design Manager server.

Performance test topology

As shown in figure 2, a three-tier topology was used, with two application servers running Tomcat 7.0 and one database server running DB2 9.7. The Jazz Team Server (JTS) was deployed on a different server than the Design Management, RSA Domain Extension, and Configuration applications. This allows better scalability when integrating different Jazz-based applications, as each of them can be deployed on a different server, providing better separation and maximizing hardware resource usage. As we will see in the performance tuning tips section, this topology increases network usage, making the overall performance more dependent on network latency.

Figure 2 – Performance test environment

The test clients are all built from the same template, and each is used to run up to 150 concurrent users. Users are scaled up by adding new test clients, to avoid exhausting the hardware resources of the test clients, which would lead to erroneous performance results. All the machines (test clients and servers) are VMware virtual machines with the specifications shown in figure 3.

Figure 3 – Machines specification

Workload model

The DM performance tests are performed using a custom-made automated performance testing framework capable of simulating an arbitrary number of users interacting with a DM server over an arbitrary period of time. Each user executes a configurable mix of user-level operations (use-case scenarios). The framework drives the tests and records the response time of every user-level operation invoked during the test run. Because performance is highly affected by write scenarios, two different tests were run: one to simulate externally managed users, who typically update the server overnight and access the server to read data during the day, and one to simulate actively managed users, who create and update resources more heavily during the day.
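
The actual framework is internal and not shown here, but the minimal sketch below illustrates the general shape of such a driver: a fixed number of simulated users, each running roughly one randomly chosen operation per minute and recording its response time. All class names and the empty scenario mix are hypothetical illustrations, not the framework's real API.

    import java.util.ArrayList;
    import java.util.List;
    import java.util.Random;
    import java.util.concurrent.ExecutorService;
    import java.util.concurrent.Executors;
    import java.util.concurrent.TimeUnit;

    // Hypothetical sketch of a load driver: N simulated users, each running
    // about one randomly chosen scenario per minute and logging its response time.
    public class SimpleLoadDriver {

        interface Scenario {
            String name();
            void run() throws Exception;    // one user-level operation against the DM server
        }

        public static void main(String[] args) throws Exception {
            final int users = 100;                         // concurrent simulated users
            final long durationMs = 90 * 60_000L;          // 90-minute run, as in the tests
            final List<Scenario> mix = new ArrayList<>();  // fill per the distribution of Table 1 or 2

            ExecutorService pool = Executors.newFixedThreadPool(users);
            for (int u = 0; u < users; u++) {
                pool.submit(() -> {
                    Random rnd = new Random();
                    long end = System.currentTimeMillis() + durationMs;
                    while (System.currentTimeMillis() < end && !mix.isEmpty()) {
                        Scenario s = mix.get(rnd.nextInt(mix.size()));
                        long start = System.nanoTime();
                        try {
                            s.run();
                        } catch (Exception e) {
                            // a real driver would count failures separately
                        }
                        long elapsedMs = (System.nanoTime() - start) / 1_000_000;
                        System.out.printf("%s,%d%n", s.name(), elapsedMs);  // record the response time
                        Thread.sleep(60_000);              // roughly one operation per user per minute
                    }
                    return null;
                });
            }
            pool.shutdown();
            pool.awaitTermination(2, TimeUnit.HOURS);
        }
    }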

The list below provides an overview of the scenarios used in the performance tests, grouped in two categories, read and write scenarios.

  • Read Scenarios:
    • Open a UML Diagram: This scenario simulates a user opening a UML diagram
    • Expand a node in the explorer: This scenario involves fetching the children for one of the root nodes in the performance test model data.
    • Open a resource: This scenario simulates a user opening a resource in the resource editor and reviewing the resource properties.
    • Search a resource: This scenario simulates a user making a keyword search in all models and in all project areas.
    • Search a diagram: This scenario simulates a user making a keyword search in all diagrams.
    • Rich Client Scenarios:
      • Expand a node in the explorer: This scenario simulates one of the rich clients fetching the children for one of the root nodes.
      • Open properties for a UML resource: This scenario simulates one of the rich clients opening the properties for a resource.
      • OSLC Get: This scenario simulates an OSLC client retrieving a resource.
      • OSLC Query: This scenario simulates an OSLC Client (RM) querying DM for OSLC links on a certain resource.
  • Write Scenarios:
    • Create/Delete a comment: This scenario simulates the creation/deletion of a text comment on a resource.
    • Create/Delete an OSLC Link: This scenario simulates the creation/deletion of an OSLC link between 2 DM resources.
    • Create a resource: This scenario simulates a user creating an ontology resource (in the separate Users Domains project).
    • Save a resource: This scenario simulates a user updating the title of an ontology resource (in the separate Users Domains project).
    • Lock/Unlock a resource: This scenario simulates the locking/unlocking of a resource.

Table 1 shows the scenario distribution in the workload mix used during the externally managed performance tests. The workload is approximately 80% reads and 20% writes, which matches the common usage pattern where DM is used for reviewing resources and collaborating on them.

Scenario   Frequency (per user)   Distribution
Write Scenarios 22%
Create comment 1 run/1 min 11%
Create link 1 run/30 min 0%
Delete comment 1 run/1 min 11%
Delete link 1 run/30 min 0%
Read Scenarios 78%
Expand explorer 1 run/1 min 11%
Get Link 1 run/2 min 5%
Get Comment 1 run/1 min 11%
Related Elements 1 run/2 min 2%
OSLC Get 1 run/1 min 11%
Open diagram 1 run/2 min 5%
Open form 1 run/2 min 5%
OSLC Query 1 run/1 min 11%
Search diagrams 1 run/1 min 4%
Search resources 1 run/1 min 11%

Table 1 – Workload scenario distribution, externally managed mode.

Table 2 shows the scenario distribution in the workload mix used during the actively managed performance tests. The workload is 50% reads and 50% writes, which matches the common usage pattern where DM is used for editing resources and collaborating on them.

Scenario   Frequency (per user)   Distribution
Write Scenarios 49%
Create comment 1 run/1 min 10%
Create DM Resource 1 run/1.33 min 8%
Save DM Resource 1 run/1 min 10%
Lock Resource 1 run/1 min 10%
Unlock Resource 1 run/1 min 10%
Read Scenarios 51%
Rich Client Expand Explorer 1 run/1 min 12%
Expand Explorer 1 run/1 min 8%
OSLC Get 1 run/1 min 10%
Open Diagram 1 run/2 min 5%
Open Form 1 run/2 min 5%
Search Diagrams 1 run/1 min 4%
Search Resources 1 run/1 min 6%

Table 2 – Workload scenario distribution, actively managed mode.

Scalability

The scalability of the system can be expressed in terms of user load versus workspace load. The tests we ran were designed to identify the number of concurrent users and the workspace load that the DM server could handle before performance degraded significantly. We set a 2 sec. average response time per single action as the threshold for acceptable performance. Server performance is affected by both scalability factors (user load and workspace load), so we ran different load scenarios to find the supported balance between the two.
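
The results in the figures that follow are reported as a weighted mean response time. As a minimal sketch, assuming the metric is simply the per-scenario mean response times weighted by the distribution column of Tables 1 and 2 (the scenario names and numbers in main are made up for illustration):

    // Illustration only: combine per-scenario mean response times (seconds)
    // with their workload weights into a single weighted mean.
    public final class WeightedMeanResponseTime {

        static double weightedMean(double[] meanResponseTimes, double[] weights) {
            double sum = 0.0;
            double totalWeight = 0.0;
            for (int i = 0; i < meanResponseTimes.length; i++) {
                sum += weights[i] * meanResponseTimes[i];
                totalWeight += weights[i];
            }
            return sum / totalWeight;
        }

        public static void main(String[] args) {
            // Hypothetical per-scenario means (sec) and weights (fractions of the workload).
            double[] rt = { 1.4, 2.1, 0.9 };     // e.g. open diagram, create comment, search resources
            double[] w  = { 0.05, 0.11, 0.11 };
            System.out.printf("Weighted mean response time: %.2f sec%n", weightedMean(rt, w));
        }
    }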

Users scalability

The first scalability test we ran was to identify the number of concurrent users DM could handle before performance degraded significantly. The tests were performed with 100, 200, and 300 users on different workspace loads.

Changing the test environment or the user transaction frequency would affect the number of concurrent users supported. For example, running the same test with a scenario frequency of one transaction every three minutes resulted in the application being able to handle at least 200 concurrent users.

Workspace scalability

To identify the maximum model data size that DM can handle before performance starts to decrease significantly, we imported three different Rational Rhapsody models into Design Manager and ran the tests using the resources generated by the import. The tests were performed with 0.5 GB, 1 GB, and 1.5 GB of model data. The tests ran against the Rhapsody sample project model AMR System (20 MB). To simulate workspace load, we added an additional 450 MB model into the same workspace, for a total of 470 MB of model data in the workspace. To scale to 1 GB and 1.5 GB, we created additional project areas containing the same two models.

Table 3 shows the resulting number of resources and triples in the Design Manager project area for each workspace size.

Model size on disk # Resources after import # Triples after import
0.5 GB 75,000 4,500,000
1 GB 150,000 9,000,000
1.5 GB 225,000 13,500,000

Table 3 – Number of resources and triples by workspace size

Note: The test itself runs against one workspace containing at most 500 MB of model data. The data is scaled up by creating additional project areas.

Test Results

Externally Managed Test Results

Figure 4 shows that the tipping point, for our test environment and with one user transaction per minute, is around 200 users and 1 GB of model data.

You can see that increasing the user load from 100 to 200 users does not have a big negative impact on performance (1.81 sec vs. 2.29 sec, a 26% increase, and 1.18 sec vs. 1.24 sec, a 5% increase). It is only when moving up to 300 users that the CPU of the machine becomes exhausted.

Increasing the workspace load, on the other hand, has a much bigger impact (1.18 sec vs. 1.81 sec, a 50% increase, and 1.24 sec vs. 2.29 sec, an 85% increase). Running with a 1.5 GB model degraded performance drastically.

Figure 4 – Weighted mean response time for 100 to 300 users with a 0.5 GB to 1 GB model, externally managed test suite

Looking more closely at the response time of each scenario in figures 5 and 6, you can see that the main bottleneck is the write scenarios that create and delete a link, where the response time jumps from 27 sec to 40 sec.

This is due to an architectural constraint: updating a configuration is highly affected by the workspace load. During such an update, all read scenarios are affected as well, resulting in read response-time peaks during those periods. This constraint is planned to be addressed in the future DM 5.0 release.

Figure 5 – Weighted mean response time of individual test scenarios for 100 to 200 users with a 0.5 GB to 1 GB model

Figure 6 – Weighted mean response time of the create/delete OSLC link scenario for 100 to 200 users with a 0.5 GB to 1 GB model

Actively Managed Test Results

Figure 7 shows that the tipping point, for our test environment and with one user transaction per minute, is around 150 users and 1 GB of model data.

You can see that increasing the user load from 100 to 150 users already has a big negative impact on performance (1.11 sec vs. 13.14 sec and 1.26 sec vs. 14.32 sec, both increases of more than 1000%).

Regarding the workspace load, we see that moving to 1 GB does not significantly affect performance (1.11 sec vs. 1.26 sec, a 15% increase). Running with a 1.5 GB model degraded performance drastically.

Figure 7 – Weighted mean response time for 100 to 200 users with a 0.5 GB to 1.5 GB model, actively managed test suite

Disk Performance

One of the main bottlenecks for the performance and scalability of the Design Manager server is the RDF index files maintained by the Jena database. We have identified that performance and scalability are highly dependent on how quickly the index files can be accessed.

To improve index accessibility, two options are available:

  1. Increase the server's RAM: Increasing the memory to 64 GB of RAM allows the Jena index files to be cached in memory, maximizing performance.
  2. Use SSD disks: The performance of accessing the index files on disk improves significantly. In Table 6 below you can see an overall improvement of approximately 30% from simply moving to an SSD disk; a quick way to check the raw disk throughput of the index volume is sketched after this list.
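
As a rough check of whether the disk hosting the index files is a limiting factor, a simple sequential write/read test on that volume can be compared between the candidate machines. The path below is a placeholder for the directory that holds your Design Manager index files.

    # Rough sequential throughput test on the volume that hosts the index files.
    # /dm/indexes is a placeholder path; adjust the size to your needs.
    dd if=/dev/zero of=/dm/indexes/ddtest.tmp bs=1M count=1024 oflag=direct
    dd if=/dm/indexes/ddtest.tmp of=/dev/null bs=1M iflag=direct
    rm /dm/indexes/ddtest.tmp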

Table 6 below shows the performance results of a test run on two sets of machines with similar hardware settings. The only difference between the machines is the disk used: one set uses an SSD disk while the other does not. We see an overall average performance gain of approximately 30%.

Table 6 – Comparison between an SSD hard drive and a conventional hard drive, actively managed test suite

Performance tuning tips

To conclude this article, here are some performance tuning tips that will help you maximize the performance of DM.

The performance tuning considerations for Design Manager are similar to those for other Jazz-based applications, in particular:

  • Size your hardware and network according to the number of concurrent users you expect.
  • Use SSD disks.
  • Increase server RAM.
  • Increase the size of the thread pool used by the server’s web application container from the default.
  • Set the size of the heap the JVM uses appropriately.

Hardware sizing

To maximize DM performance, use a 64-bit architecture with at least 2 CPU cores and at least 8 GB of RAM. As noted above, the main resource contention observed in the DM server during the tests is on the CPU cores, so increasing the number of CPU cores on the DM server should help increase the number of concurrent users and the size of the model data that DM can handle. As seen above, moving to SSD disks or increasing RAM will boost performance significantly, allowing DM to scale to more workspace data.

Network connectivity

In the test topology described previously, the DM application, the JTS application, and the database are installed on different machines. This increases the CPU, memory, and disk available to each application, but in return it puts more pressure on the network connectivity, especially the network latency. To mitigate this, it is recommended to locate the three servers on the same subnet.
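
A quick way to verify the latency between the tiers is to ping the other servers from the DM application server; on the same subnet the round-trip time should typically be well under a millisecond. The host names below are placeholders for your own JTS and database servers.

    # Run from the DM application server; host names are placeholders.
    ping -c 20 jts-server.example.com
    ping -c 20 db-server.example.com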

Thread pool size

The size of the thread pool used by the DM server’s web application container should be at least 2.5 times the expected active user load. For example, if you expect to have 100 concurrently active users, like in the tests described above, set the thread pool size to at least 250 for both the DM Server and the JTS server.
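
With the Tomcat 7.0 container used in the test topology, the thread pool size is controlled by the maxThreads attribute of the connector in conf/server.xml (the Tomcat default is 200). The snippet below is a minimal sketch; the port and protocol shown are illustrative, so keep the other attributes of your existing connector and only raise maxThreads.

    <!-- conf/server.xml: raise maxThreads on the connector that serves DM traffic. -->
    <!-- Port and protocol are illustrative; keep your existing connector settings. -->
    <Connector port="9443" protocol="HTTP/1.1" maxThreads="250" />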

JVM heap size

It is recommended to set the maximum JVM heap size to at least 6 GB. However, you can only do that if the server has at least 8 GB of RAM. As a rule of thumb, avoid setting the maximum JVM heap size to more than about 70-80% of the amount of RAM in the server, or you may experience poor performance due to thrashing.

If you are running Tomcat, you need to edit the server startup script to change the default values of -Xms and -Xmx to the desired value (8 GB or more). Set both parameters to the same value to avoid the overhead of dynamic Java Virtual Machine (JVM) heap management. You must stop and restart the server for the changes to take effect.
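
As a minimal sketch, assuming a standard Tomcat layout where extra JVM options can be supplied through a bin/setenv.sh file (your Jazz server startup script may instead define -Xms and -Xmx directly), and assuming the server has enough RAM to keep an 8 GB heap within the 70-80% guideline above:

    # bin/setenv.sh (create it if it does not exist); restart the server afterwards.
    # Initial and maximum heap are set to the same value to avoid dynamic heap resizing.
    JAVA_OPTS="$JAVA_OPTS -Xms8G -Xmx8G"
    export JAVA_OPTS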

If you are running Websphere Application Server (WAS), see the “Java virtual machine settings” section in the WAS information center for instructions specific to your WAS deployment.

Sources

IBM Rational Software Architect Design Manager 4.0 Performance And Sizing Report

Rational Design Manager performance and scalability

About the author

David Hirsch is a senior developer in the Design Management development team. He was responsible for Rational Rhapsody Design Manager before taking the lead on the DM effort to improve Design Manager performance. David can be contacted at davidhir@il.ibm.com.
