diff_classifier.utils - Code Metrics - ccurtis7/diff_classifier

diff_classifier.utils A
last analyzed 2019-07-12 06:15 UTC

Size/Duplication

Total Lines	69
Duplicated Lines	84.06 %

Metric	Value
wmc	5
eloc	35
dl	58
loc	69
rs	10
c	0
b	0
f	0
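
The duplication percentage reported above appears to follow from the metric table, assuming `dl` is the number of duplicated lines and `loc` the total line count (the report does not define these abbreviations, so this reading is an assumption). A minimal check of the arithmetic:

```python
# Values copied from the metric table; the meaning of the
# abbreviations (dl = duplicated lines, loc = lines of code) is assumed.
dl = 58
loc = 69

# Share of duplicated lines as a percentage, rounded to two decimals.
duplicated_pct = round(dl / loc * 100, 2)
print(f"Duplicated Lines: {duplicated_pct} %")
```

Under that assumption the result matches the 84.06 % shown in the Size/Duplication summary.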

1 Function

Rating	Name	Duplication	Size	Complexity
B	csv_to_pd()	58	58	5

"""Utility functions used throughout diff_classifier.

This module includes general functions for tasks such as importing files and
converting between data types. Currently only includes a function to generate
pandas dataframes for csv output from TrackMate.

"""
import pandas as pd


def csv_to_pd(csvfname):

    """Reads TrackMate csv output file and converts to pandas dataframe.

    A specialized function designed specifically for TrackMate output files.
    This edits out the header at the beginning of the file.

    Parameters
    ----------
    csvfname : string
        Path to an output csv file similar to trackmate_template.  Must
        include the line 'Data starts here.\n' in order to parse correctly.

    Returns
    -------
    data : pandas DataFrame
        Contains all trajectories from csvfname.

    Examples
    --------
    >>> data = csv_to_pd('../data/test.csv')

    """
    csvfile = open(csvfname)

    try:
        line = 'test'
        counter = 0
        while line != 'Data starts here.\n':
            line = csvfile.readline()
            counter = counter + 1
            if counter > 2000:
                break

        data = pd.read_csv(csvfname, skiprows=counter)
        # sort_values returns a new DataFrame, so keep the result.
        data = data.sort_values(['Track_ID', 'Frame'], ascending=[True, True])
        data = data.astype('float64')

        partids = data.Track_ID.unique()
        counter = 0
        for partid in partids:
            data.loc[data.Track_ID == partid, 'Track_ID'] = counter
            counter = counter + 1
    except Exception:
        print('No data in csv file.')
        rawd = {'Track_ID': [],
                'Spot_ID': [],
                'Frame': [],
                'X': [],
                'Y': [],
                'Quality': [],
                'SN_Ratio': [],
                'Mean_Intensity': []}
        cols = ['Track_ID', 'Spot_ID', 'Frame', 'X', 'Y', 'Quality', 'SN_Ratio', 'Mean_Intensity']
        data = pd.DataFrame(data=rawd, index=[])
        data = data[cols]
        data = data.astype('float64')
    finally:
        # Close the file handle whether or not parsing succeeded.
        csvfile.close()

    return data
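
The header-skipping step in `csv_to_pd` can be illustrated in isolation. The sketch below is not the project's code: it uses only the standard library, and the helper name `count_header_lines` and the sample file contents are hypothetical. It counts lines up to and including the 'Data starts here.' marker, then parses the remainder as csv.

```python
import csv
import io

MARKER = 'Data starts here.\n'


def count_header_lines(lines, marker=MARKER, limit=2000):
    """Return the number of lines up to and including the marker line.

    Returns None if the marker is not found within `limit` lines.
    (Hypothetical helper, written for illustration only.)
    """
    for count, line in enumerate(lines, start=1):
        if line == marker:
            return count
        if count >= limit:
            break
    return None


# Hypothetical file contents; real TrackMate exports may differ.
sample = io.StringIO(
    'Some header text\n'
    'Data starts here.\n'
    'Track_ID,Frame,X,Y\n'
    '0,1,2.5,3.5\n'
)

skip = count_header_lines(sample)

# Rewind and parse everything after the header as csv rows.
sample.seek(0)
rows = list(csv.DictReader(sample.readlines()[skip:]))
```

Returning None when the marker is missing makes that case explicit, whereas the function above relies on the exception handler to produce an empty DataFrame.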


Duplication in csv_to_pd() introduced 2018-08-16 17:54 UTC.
