BradleyOpenMPConverter - Code Metrics - richlewis42/scikit-chem - Measure and Improve Code Quality continuously with Scrutinizer

BradleyOpenMPConverter A
last analyzed 2016-09-01 14:43 UTC

↳ Parent: Project

Complexity

Total Complexity

Size/Duplication

Total Lines	44
Duplicated Lines	0 %

Test Coverage

Coverage

16.13%

Importance

Changes	1
Bugs	0	Features	1

Metric	Value
wmc	6
c	1
b	0
f	1
dl	0
loc	44
ccs	5
cts	31
cp	0.1613
rs	10

5 Methods

Rating	Name	Size	Complexity
B	__init__()	25	3
A	filter_bad()	6	1
A	parse_smiles()	5	2
A	parse_data()	4	1
A	fix_mp()	4	1

#! /usr/bin/env python
class SomeClass:
    def some_method(self):
        """Do x and return foo."""
#
# Copyright (C) 2016 Rich Lewis <[email protected]>
# License: 3-clause BSD

import os
import logging
logger = logging.getLogger(__name__)


import pandas as pd
# .scrutinizer.yml
before_commands:
    - sudo pip install abc # Python2
    - sudo pip3 install abc # Python3

from .base import Converter, default_pipeline, contiguous_order
from ...core import Mol
from ...cross_validation import SimThresholdSplit

class BradleyOpenMPConverter(Converter):
class SomeClass:
    def some_method(self):
        """Do x and return foo."""

    def __init__(self, directory, output_directory, output_filename='bradley_open_mp.h5'):
class SomeParent:
    def __init__(self):
        self.x = 1

class SomeChild(SomeParent):
    def __init__(self):
        # Initialize the super class
        SomeParent.__init__(self)

        output_path = os.path.join(output_directory, output_filename)
        data = self.parse_data(os.path.join(directory, 'bradley_melting_point_dataset.xlsx'))
        data = self.filter_bad(data)

        def parse_smiles(smi):
class SomeClass:
    def some_method(self):
        """Do x and return foo."""
            try:
                return Mol.from_smiles(smi)

            except ValueError:
                return None

        data['structure'] = data.smiles.apply(parse_smiles)
        data = data[data.structure.notnull()]
        ms, y = data.structure, self.fix_mp(data)


        pipeline = default_pipeline()
        ms, y = pipeline.transform_filter(ms, y)


        cv = SimThresholdSplit(min_threshold=0.6, n_jobs=-1).fit(ms)

        train, valid, test = cv.split((70, 15, 15))
        (ms, y, train, valid, test) = contiguous_order((ms, y, train, valid, test), (train, valid, test))

        splits = (('train', train), ('valid', valid), ('test', test))

        self.run(ms, y, output_path=output_path, splits=splits)

    @staticmethod
    def parse_data(path):
class SomeClass:
    def some_method(self):
        """Do x and return foo."""
        logger.info('Parsing data at %s...', path)
        return pd.read_excel(path, index_col=0)

    @staticmethod
    def filter_bad(data):
class SomeClass:
    def some_method(self):
        """Do x and return foo."""
        logger.info('Removing manually annotated errors...')
        bad_data = data.donotuse.notnull()
        logger.debug('Removed %s', bad_data.sum())
        return data[~bad_data]

    @staticmethod
    def fix_mp(data):
class SomeClass:
    def some_method(self):
        """Do x and return foo."""
        logger.info('Converting temperature to Kelvin...')
        return data.mpC + 278.15

if __name__ == '__main__':
    logging.basicConfig(level=logging.DEBUG)
    LOGGER.info('Converting Bradley Open Melting Point Dataset...')

    BradleyOpenMPConverter.convert()


BradleyOpenMPConverter A
last analyzed 2016-09-01 14:43 UTC

Complexity

Size/Duplication

Test Coverage

Importance

5 Methods

1. Missing Dependencies

2. Missing init.py files

1		#! /usr/bin/env python
		0 ignored issues – show Coding Style introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report This module should have a docstring. The coding style of this project requires that you add a docstring to this code element. Below, you find an example for methods: class SomeClass: def some_method(self): """Do x and return foo.""" If you would like to know more about docstrings, we recommend to read PEP-257: Docstring Conventions. Loading history...
2		#
3		# Copyright (C) 2016 Rich Lewis <[email protected]>
4		# License: 3-clause BSD
5
6	1	import os
7	1	import logging
8	1	logger = logging.getLogger(__name__)
		0 ignored issues – show Coding Style Naming introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report The name `logger` does not conform to the constant naming conventions (`(([A-Z_][A-Z0-9_])\|(__.__))$`). This check looks for invalid names for a range of different identifiers. You can set regular expressions to which the identifiers must conform if the defaults do not match your requirements. If your project includes a Pylint configuration file, the settings contained in that file take precedence. To find out more about Pylint, please refer to their site. Loading history...
9
10	1	import pandas as pd
		0 ignored issues – show Configuration introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report The import `pandas` could not be resolved. This can be caused by one of the following: 1. Missing Dependencies This error could indicate a configuration issue of Pylint. Make sure that your libraries are available by adding the necessary commands. # .scrutinizer.yml before_commands: - sudo pip install abc # Python2 - sudo pip3 install abc # Python3 Tip: We are currently not using virtualenv to run pylint, when installing your modules make sure to use the command for the correct version. 2. Missing __init__.py files This error could also result from missing `__init__.py` files in your module folders. Make sure that you place one file in each sub-folder. Loading history...
11
12	1	from .base import Converter, default_pipeline, contiguous_order
13	1	from ...core import Mol
14	1	from ...cross_validation import SimThresholdSplit
15
16	1	class BradleyOpenMPConverter(Converter):
		0 ignored issues – show Coding Style introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report This class should have a docstring. The coding style of this project requires that you add a docstring to this code element. Below, you find an example for methods: class SomeClass: def some_method(self): """Do x and return foo.""" If you would like to know more about docstrings, we recommend to read PEP-257: Docstring Conventions. Loading history...
17
18	1	def __init__(self, directory, output_directory, output_filename='bradley_open_mp.h5'):
		0 ignored issues – show Bug introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report The `__init__` method of the super-class `Converter` is not called. It is generally advisable to initialize the super-class by calling its `__init__` method: class SomeParent: def __init__(self): self.x = 1 class SomeChild(SomeParent): def __init__(self): # Initialize the super class SomeParent.__init__(self) Loading history...
19
20		output_path = os.path.join(output_directory, output_filename)
21		data = self.parse_data(os.path.join(directory, 'bradley_melting_point_dataset.xlsx'))
22		data = self.filter_bad(data)
23
24		def parse_smiles(smi):
		0 ignored issues – show Coding Style introduced 2016-08-05 16:20 UTC by Report Bug Copy Issue Report This function should have a docstring. The coding style of this project requires that you add a docstring to this code element. Below, you find an example for methods: class SomeClass: def some_method(self): """Do x and return foo.""" If you would like to know more about docstrings, we recommend to read PEP-257: Docstring Conventions. Loading history...
25		try:
26		return Mol.from_smiles(smi)
		0 ignored issues – show Bug introduced 2016-08-05 16:20 UTC by Report Bug Copy Issue Report The Class `Mol` does not seem to have a member named `from_smiles`. This check looks for calls to members that are non-existent. These calls will fail. The member could have been renamed or removed. Loading history...
27		except ValueError:
28		return None
29
30		data['structure'] = data.smiles.apply(parse_smiles)
31		data = data[data.structure.notnull()]
32		ms, y = data.structure, self.fix_mp(data)
		0 ignored issues – show Coding Style Naming introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report The name `ms` does not conform to the variable naming conventions (`[a-z_][a-z0-9_]{2,30}$`). This check looks for invalid names for a range of different identifiers. You can set regular expressions to which the identifiers must conform if the defaults do not match your requirements. If your project includes a Pylint configuration file, the settings contained in that file take precedence. To find out more about Pylint, please refer to their site. Loading history... Coding Style Naming introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report The name `y` does not conform to the variable naming conventions (`[a-z_][a-z0-9_]{2,30}$`). This check looks for invalid names for a range of different identifiers. You can set regular expressions to which the identifiers must conform if the defaults do not match your requirements. If your project includes a Pylint configuration file, the settings contained in that file take precedence. To find out more about Pylint, please refer to their site. Loading history...
33
34		pipeline = default_pipeline()
35		ms, y = pipeline.transform_filter(ms, y)
		0 ignored issues – show Coding Style Naming introduced 2016-08-05 16:20 UTC by Report Bug Copy Issue Report The name `ms` does not conform to the variable naming conventions (`[a-z_][a-z0-9_]{2,30}$`). This check looks for invalid names for a range of different identifiers. You can set regular expressions to which the identifiers must conform if the defaults do not match your requirements. If your project includes a Pylint configuration file, the settings contained in that file take precedence. To find out more about Pylint, please refer to their site. Loading history... Coding Style Naming introduced 2016-08-05 16:20 UTC by Report Bug Copy Issue Report The name `y` does not conform to the variable naming conventions (`[a-z_][a-z0-9_]{2,30}$`). This check looks for invalid names for a range of different identifiers. You can set regular expressions to which the identifiers must conform if the defaults do not match your requirements. If your project includes a Pylint configuration file, the settings contained in that file take precedence. To find out more about Pylint, please refer to their site. Loading history...
36
37		cv = SimThresholdSplit(min_threshold=0.6, n_jobs=-1).fit(ms)
		0 ignored issues – show Coding Style Naming introduced 2016-08-05 16:20 UTC by Report Bug Copy Issue Report The name `cv` does not conform to the variable naming conventions (`[a-z_][a-z0-9_]{2,30}$`). This check looks for invalid names for a range of different identifiers. You can set regular expressions to which the identifiers must conform if the defaults do not match your requirements. If your project includes a Pylint configuration file, the settings contained in that file take precedence. To find out more about Pylint, please refer to their site. Loading history...
38		train, valid, test = cv.split((70, 15, 15))
39		(ms, y, train, valid, test) = contiguous_order((ms, y, train, valid, test), (train, valid, test))
		0 ignored issues – show Coding Style introduced 2016-08-05 16:20 UTC by Report Bug Copy Issue Report This line is too long as per the coding-style (105/100). This check looks for lines that are too long. You can specify the maximum line length. Loading history... Coding Style Naming introduced 2016-08-05 16:20 UTC by Report Bug Copy Issue Report The name `ms` does not conform to the variable naming conventions (`[a-z_][a-z0-9_]{2,30}$`). This check looks for invalid names for a range of different identifiers. You can set regular expressions to which the identifiers must conform if the defaults do not match your requirements. If your project includes a Pylint configuration file, the settings contained in that file take precedence. To find out more about Pylint, please refer to their site. Loading history... Coding Style Naming introduced 2016-08-05 16:20 UTC by Report Bug Copy Issue Report The name `y` does not conform to the variable naming conventions (`[a-z_][a-z0-9_]{2,30}$`). This check looks for invalid names for a range of different identifiers. You can set regular expressions to which the identifiers must conform if the defaults do not match your requirements. If your project includes a Pylint configuration file, the settings contained in that file take precedence. To find out more about Pylint, please refer to their site. Loading history...
40		splits = (('train', train), ('valid', valid), ('test', test))
41
42		self.run(ms, y, output_path=output_path, splits=splits)
43
44	1	@staticmethod
45		def parse_data(path):
		0 ignored issues – show Coding Style introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report This method should have a docstring. The coding style of this project requires that you add a docstring to this code element. Below, you find an example for methods: class SomeClass: def some_method(self): """Do x and return foo.""" If you would like to know more about docstrings, we recommend to read PEP-257: Docstring Conventions. Loading history...
46		logger.info('Parsing data at %s...', path)
47		return pd.read_excel(path, index_col=0)
48
49	1	@staticmethod
50		def filter_bad(data):
		0 ignored issues – show Coding Style introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report This method should have a docstring. The coding style of this project requires that you add a docstring to this code element. Below, you find an example for methods: class SomeClass: def some_method(self): """Do x and return foo.""" If you would like to know more about docstrings, we recommend to read PEP-257: Docstring Conventions. Loading history...
51		logger.info('Removing manually annotated errors...')
52		bad_data = data.donotuse.notnull()
53		logger.debug('Removed %s', bad_data.sum())
54		return data[~bad_data]
55
56	1	@staticmethod
57		def fix_mp(data):
		0 ignored issues – show Coding Style introduced 2016-06-12 20:23 UTC by Report Bug Copy Issue Report This method should have a docstring. The coding style of this project requires that you add a docstring to this code element. Below, you find an example for methods: class SomeClass: def some_method(self): """Do x and return foo.""" If you would like to know more about docstrings, we recommend to read PEP-257: Docstring Conventions. Loading history...
58		logger.info('Converting temperature to Kelvin...')
59		return data.mpC + 278.15
60
61	1	if __name__ == '__main__':
62		logging.basicConfig(level=logging.DEBUG)
63		LOGGER.info('Converting Bradley Open Melting Point Dataset...')
		0 ignored issues – show Comprehensibility Best Practice introduced 2016-07-18 17:48 UTC by Report Bug Copy Issue Report Undefined variable 'LOGGER' Loading history...
64		BradleyOpenMPConverter.convert()
65

richlewis42 / scikit-chem

BradleyOpenMPConverter A last analyzed 2016-09-01 14:43 UTC

Complexity

Size/Duplication

Test Coverage

Importance

5 Methods

1. Missing Dependencies

2. Missing __init__.py files

Duplication Side-by-Side

Filter issues like

BradleyOpenMPConverter A
last analyzed 2016-09-01 14:43 UTC

2. Missing init.py files