summaryrefslogtreecommitdiff
path: root/README
blob: 0cc060946751bd39a20c2777170b853fda6dffa6 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
This is the configuration we use to extract changesets and run gitdm on them.

* Tweaking your affiliation

This mostly happens in domain_map; there should be no need to touch
the 'oracle' and 'go-oo' files, which tag legacy account affiliation.

* Running mkstats.sh

To run this, you will also need a patched gitdm next to this repo;
something like this:

cd ..
git clone git://git.lwn.net/gitdm.git
cd gitdm
for p in ../gitdm-config/gitdm-patches/*; do
	patch -p1 < $p
done

Then - simply run:

./mkstats.sh

(which will clone a lot of repos under ./checkouts, make sure the
filesystem has some GB of space)

This no longer handles the old split-repo source setup, which was ~impossible
to extract accurate statistics out of, anyway. As an upshot, it now
handles almost every git repo that is under the TDF umbrella.

Having got that right - you should end up with a stats-$year-$week
directory containing the required data.

	Best of luck interpreting it.

Authors:
	Cedric Bosdonnat
	Michael Meeks
    Thorsten Behrens



File information:

domain-name, contains all companies who have a blanket license
aliases, contains know email aliases
mr_configs, contains the mr configurations
audit, contains the audit history
licensePersonal.csv, contain person licenses
licenseCompany.csv, contain company blanket licenses


Chart generation (AB etc)
Here's the generating howto:
- run ./mkstats.sh in gitdm-config
- take the old spreadsheets (in charts)
- insert sheet from category dir, file: git-hackers-data.csv
- data->pivot table->refresh on both Commits.. sheets
  (make sure the new months are added at the bottom)
- adapt the data ranges of the charts to the new rows & columns