Jazz Forum Welcome to the Jazz Community Forum Connect and collaborate with IBM Engineering experts and users

Does RTC have issues with a very large number fo files?

Wehave a project with two million files (~200Gb) and 100K directoriesDeveloper workspaces require approximately 200K files (six to eight Gb) in 25K directories. The developers are in remote locations, and we had planned to use the ClearCase remote client. However, updating a snapshot view with this many files takes > 30 min, even when there have been no changes. We now want to consider using RTC instead.

Does anyone have a project of this size?  Can you tell me if RTC would have similar issues with a very large no. of files?

2 votes

Comments

For ClearCase (may be different with RTC) we verified that network bandwidth and/or latency was not the issue.  Rather it is checking done on the very large number of files in the workspace.

We considered re-structuring the source, but found we could not sufficiently reduce the number of files without consolidating/combining them.  This would make updates cumbersome and time consuming, as the fragments would have to be extracted and then re-inserted in the consolidated files.


Accepted answer

Permanent link
I use the Eclipse client, and usually load about 30K-60K files in my sandbox ... it takes a while to download all the files initially from the repository to my local disk, but after that an "update" is instantaneous ... the RTC eclipse plug-in uses Eclipse editor notifications to notice when I've changed files, and the server protocol for what has changed on the server is proportional to the number of files that have changed, and not to the size of the workspace.  And even when I say "refresh" (in case I've modified a file using some external editor), the "refresh" is just a minute or two.   So you should be fine. 
Judy Beegle selected this answer as the correct answer

4 votes


2 other answers

Permanent link
RTC will be able to handle your situation. Based on what you have mentioned, you should organize your files into components and keeping the size of components as small as possible. A component is a collection of logically related files/directories. You can create streams by adding components to it. For a remote access situations, you could use distributed scm. You can find more information here - https://jazz.net/library/article/535/

1 vote


Permanent link
you will have similar problems. in my prior job, the company had remote builds from central RTC storage.
basically consumed our entire network bandwidth.

distr SCM is not easy to manage, it requires a complete RTC complex install in the remote location too.
and double license usage during cross repository synch operations.

see my design to mitigate this problem here
https://jazz.net/forum/questions/112889/caching-server-useful-for-rtc-workitems

0 votes

Comments

Sam: You are talking about the performance of actually downloading the file content from the server to the local sandbox, right?   I believe Judy is asking about the performance of detecting changes in the sandbox and on the server (based on her comment of "even when there have been no changes").

Your answer

Register or log in to post your answer.

Dashboards and work items are no longer publicly available, so some links may be invalid. We now provide similar information through other means. Learn more here.

Search context
Follow this question

By Email: 

Once you sign in you will be able to subscribe for any updates here.

By RSS:

Answers
Answers and Comments
Question details

Question asked: Aug 01 '13, 3:21 p.m.

Question was seen: 4,180 times

Last updated: Aug 02 '13, 4:30 p.m.

Confirmation Cancel Confirm