It's all about the answers!

Ask a question

Querying the Jazz repository for contributors and teams


Theo van Oostrum (14) | asked Mar 10 '09, 5:27 p.m.
Dear Jazz participant,

I am working on a research project at UC Irvine involving mining the Jazz repository (the repository we are using is the anonimized version made available to the academic community). At this point, I am trying to deduce two pieces of data as relationships:

TEAM - TEAM MEMBER
TEAM MEMBER - LOCATION

When I look at the database structure, and examine the users in table REPOSITORY.CONTRIBUTORS, I get 13,000 contributors. This strikes me as a highly unlikely number. A subset of these have real e-mail addresses attached, though there does not seem to be a particular scheme why (the subset involves 79 users). At the same time, I cannot find any location info on contributors (e.g., their geographical location).

Could anyone help me with the following questions:

1. userids in Jazz, are they capturing both real users and internal
Jazz processes or anything like it?

2. Are only the userids with real e-mail addresses attached real
users, or only those users who are actually linked to a team?

Thanks,
Theo van Oostrum

One answer



permanent link
Geoffrey Clemm (29.9k23035) | answered Mar 10 '09, 11:29 p.m.
FORUM ADMINISTRATOR / FORUM MODERATOR / JAZZ DEVELOPER
All 13000 accounts were created by humans (it's a popular system :-).
We do not collect information like geographical location, for privacy
reasons. I don't know why a few of the accounts in the anonomized
version still have email addresses attached.

Cheers,
Geoff

gt.oostrum wrote:
Dear Jazz participant,

I am working on a research project at UC Irvine involving mining the
Jazz repository (the repository we are using is the anonimized
version made available to the academic community). At this point, I
am trying to deduce two pieces of data as relationships:

TEAM - TEAM MEMBER
TEAM MEMBER - LOCATION

When I look at the database structure, and examine the users in table
REPOSITORY.CONTRIBUTORS, I get 13,000 contributors. This strikes me
as a highly unlikely number. A subset of these have real e-mail
addresses attached, though there does not seem to be a particular
scheme why (the subset involves 79 users). At the same time, I
cannot find any location info on contributors (e.g., their
geographical location).

Could anyone help me with the following questions:

1. userids in Jazz, are they capturing both real users and internal
Jazz processes or anything like it?

2. Are only the userids with real e-mail addresses attached real
users, or only those users who are actually linked to a team?

Thanks,
Theo van Oostrum

Your answer


Register or to post your answer.