crowdtruth.models.job - Code Metrics - CrowdTruth/CrowdTruth-core - Measure and Improve Code Quality continuously with Scrutinizer

crowdtruth.models.job A
last analyzed 2019-05-10 09:08 UTC

↳ Parent: Project

Complexity

Total Complexity

Size/Duplication

Total Lines	47
Duplicated Lines	0 %

Test Coverage

Coverage

100%

Importance

Changes

Metric	Value
wmc	3
eloc	20
dl	0
loc	47
ccs	14
cts	14
cp	1
rs	10
c	0
b	0
f	0

1 Method

Rating	Name	Duplication	Size	Complexity
A	Job.aggregate()	0	37	3

"""
Job initialization.
"""

class Job():
    """
    Performs general statistics over the crowdsourcing jobs.
    """

    @staticmethod
    def aggregate(units, judgments, config):
        """
        Aggregates information about the total number of units, total number of judgments,
        total number of workers that provided annotations and the total duration of the job.

        Args:
            units: Units contained in the job.
            judgments: Judgments contained in the job.
            config: Job configuration as provided as input for the metrics.

        Returns:
            A dataframe of one row that stores general stats on the crowdsourcing jobs.
        """
        agg = {
            'unit' : 'nunique',
            'judgment' : 'nunique',
            'worker' : 'nunique',
            'duration' : 'mean'
        }
        job = judgments.groupby('job').agg(agg)

        # compute job runtime
        runtime = (max(judgments['submitted']) - min(judgments['started']))
        job['runtime'] = runtime #float(runtime.days) * 24 + float(runtime.seconds) / 3600
        job['runtime.per_unit'] = job['runtime'] / job['unit']
        job['judgments.per.worker'] = job['judgment'] / job['worker']

        metrics = ['unique_annotations', 'annotations']
        for metric in metrics:
            for col in config.output.values():
                # aggregate unit metrics
                job[col+'.'+metric] = units[col+'.'+metric].mean()

        job = job.reindex(sorted(job.columns), axis=1)

        return job


1		"""
2		Job initialization.
3		"""
4
5	1	class Job():
6		"""
7		Performs general statistics over the crowdsourcing jobs.
8		"""
9
10	1	@staticmethod
11		def aggregate(units, judgments, config):
12		"""
13		Aggregates information about the total number of units, total number of judgments,
14		total number of workers that provided annotations and the total duration of the job.
15
16		Args:
17		units: Units contained in the job.
18		judgments: Judgments contained in the job.
19		config: Job configuration as provided as input for the metrics.
20
21		Returns:
22		A dataframe of one row that stores general stats on the crowdsourcing jobs.
23		"""
24	1	agg = {
25		'unit' : 'nunique',
26		'judgment' : 'nunique',
27		'worker' : 'nunique',
28		'duration' : 'mean'
29		}
30	1	job = judgments.groupby('job').agg(agg)
31
32		# compute job runtime
33	1	runtime = (max(judgments['submitted']) - min(judgments['started']))
34	1	job['runtime'] = runtime #float(runtime.days) * 24 + float(runtime.seconds) / 3600
35	1	job['runtime.per_unit'] = job['runtime'] / job['unit']
36	1	job['judgments.per.worker'] = job['judgment'] / job['worker']
37
38	1	metrics = ['unique_annotations', 'annotations']
39	1	for metric in metrics:
40	1	for col in config.output.values():
41		# aggregate unit metrics
42	1	job[col+'.'+metric] = units[col+'.'+metric].mean()
43
44	1	job = job.reindex(sorted(job.columns), axis=1)
45
46		return job
47

CrowdTruth / CrowdTruth-core

GitHub Access Token became invalid

crowdtruth.models.job A last analyzed 2019-05-10 09:08 UTC

Complexity

Size/Duplication

Test Coverage

Importance

1 Method

Duplication Side-by-Side

Filter issues like

crowdtruth.models.job A
last analyzed 2019-05-10 09:08 UTC