Querying the Jazz repository for contributors and teams
![](http://jazz.net/_images/myphoto/128ca04817fda61660844390333a0c4d.jpg)
Dear Jazz participant,
I am working on a research project at UC Irvine involving mining the Jazz repository (the repository we are using is the anonimized version made available to the academic community). At this point, I am trying to deduce two pieces of data as relationships:
TEAM - TEAM MEMBER
TEAM MEMBER - LOCATION
When I look at the database structure, and examine the users in table REPOSITORY.CONTRIBUTORS, I get 13,000 contributors. This strikes me as a highly unlikely number. A subset of these have real e-mail addresses attached, though there does not seem to be a particular scheme why (the subset involves 79 users). At the same time, I cannot find any location info on contributors (e.g., their geographical location).
Could anyone help me with the following questions:
1. userids in Jazz, are they capturing both real users and internal
Jazz processes or anything like it?
2. Are only the userids with real e-mail addresses attached real
users, or only those users who are actually linked to a team?
Thanks,
Theo van Oostrum
I am working on a research project at UC Irvine involving mining the Jazz repository (the repository we are using is the anonimized version made available to the academic community). At this point, I am trying to deduce two pieces of data as relationships:
TEAM - TEAM MEMBER
TEAM MEMBER - LOCATION
When I look at the database structure, and examine the users in table REPOSITORY.CONTRIBUTORS, I get 13,000 contributors. This strikes me as a highly unlikely number. A subset of these have real e-mail addresses attached, though there does not seem to be a particular scheme why (the subset involves 79 users). At the same time, I cannot find any location info on contributors (e.g., their geographical location).
Could anyone help me with the following questions:
1. userids in Jazz, are they capturing both real users and internal
Jazz processes or anything like it?
2. Are only the userids with real e-mail addresses attached real
users, or only those users who are actually linked to a team?
Thanks,
Theo van Oostrum
One answer
![](http://jazz.net/_images/myphoto/128ca04817fda61660844390333a0c4d.jpg)
All 13000 accounts were created by humans (it's a popular system :-).
We do not collect information like geographical location, for privacy
reasons. I don't know why a few of the accounts in the anonomized
version still have email addresses attached.
Cheers,
Geoff
gt.oostrum wrote:
We do not collect information like geographical location, for privacy
reasons. I don't know why a few of the accounts in the anonomized
version still have email addresses attached.
Cheers,
Geoff
gt.oostrum wrote:
Dear Jazz participant,
I am working on a research project at UC Irvine involving mining the
Jazz repository (the repository we are using is the anonimized
version made available to the academic community). At this point, I
am trying to deduce two pieces of data as relationships:
TEAM - TEAM MEMBER
TEAM MEMBER - LOCATION
When I look at the database structure, and examine the users in table
REPOSITORY.CONTRIBUTORS, I get 13,000 contributors. This strikes me
as a highly unlikely number. A subset of these have real e-mail
addresses attached, though there does not seem to be a particular
scheme why (the subset involves 79 users). At the same time, I
cannot find any location info on contributors (e.g., their
geographical location).
Could anyone help me with the following questions:
1. userids in Jazz, are they capturing both real users and internal
Jazz processes or anything like it?
2. Are only the userids with real e-mail addresses attached real
users, or only those users who are actually linked to a team?
Thanks,
Theo van Oostrum