pocketutils.tools.unit_tools.UnitTools.round_to_sigfigs() - Code Metrics - Inspection of "feat: add new batteries" - dmyersturnbull/pocketutils - Measure and Improve Code Quality continuously with Scrutinizer

Passed

Push — main ( 15d22f...4e3485 )

by Douglas

created 2023-07-09 01:50 UTC

UnitTools.round_to_sigfigs() A

↳ Parent: pocketutils.tools.unit_tools

Size

Total Lines	22
Code Lines	11

Duplication

Lines	0
Ratio	0 %

Metric	Value
cc	4
eloc	11
nop	3
dl	0
loc	22
rs	9.85
c	0
b	0
f	0

import logging
import math
from datetime import date, datetime, timedelta
from typing import SupportsFloat

import regex
from pint import Quantity, UnitRegistry
from pint.errors import PintTypeError

from pocketutils.core.exceptions import OutOfRangeError, StringPatternError
from pocketutils.tools.string_tools import StringTools

logger = logging.getLogger("pocketutils")
_UNIT_REG = UnitRegistry()


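class UnitTools:
    """Utilities for formatting and parsing numbers, times, and concentrations with units."""

    @classmethod
    def format_approx_big_number(cls, n: int) -> str:
        """Abbreviates a large integer with a suffix of k, M, B, or T."""
        # Integer thresholds keep ``n // k`` an int; float keys such as 1e3 would yield "2.0k".
        # Values >= 10**15 are divided but get an empty suffix, as in the original mapping.
        for k, v in {10**15: "", 10**12: "T", 10**9: "B", 10**6: "M", 10**3: "k"}.items():
            if n >= k:
                return str(n // k) + v
        return str(n)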

    @classmethod
    def approx_time_wrt(
        cls,
        now: date | datetime,
        then: date | datetime,
        *,
        skip_today: bool = False,
        sig: int = 3,
    ) -> str:
        """
        Describes ``then`` with higher resolution for smaller differences to ``now``.
        Pass two dates or two datetimes; a ``date`` cannot be compared with a ``datetime``.

        Examples:
            - ``approx_time_wrt(date(2021, 1, 12), date(1996, 10, 1))  # "1996"``
            - ``approx_time_wrt(date(2021, 1, 12), date(2021, 10, 1))  # "2021-10"``
            - ``approx_time_wrt(datetime(2021, 10, 1), datetime(2021, 10, 1, 11, 55))  # "2021-10-01 11:55"``
            - ``approx_time_wrt(datetime(2021, 10, 1), datetime(2021, 10, 1, 0, 0, 30, 222222))  # "2021-10-01 00:00:30"``
            - ``approx_time_wrt(datetime(2021, 10, 1), datetime(2021, 10, 1, 0, 0, 2, 222222))  # "2021-10-01 00:00:02.222"``
            - ``approx_time_wrt(datetime(2021, 10, 1), datetime(2021, 10, 1, 0, 0, 0, 22))  # "2021-10-01 00:00:00.000022"``
        """
        delta = now - then if now > then else then - now
        tot_days = delta.days + (delta.seconds / 86400) + (delta.microseconds / 86400 / 10**6)
        tot_secs = tot_days * 86400
        _today = "" if skip_today and then.date() == now.date() else "%Y-%m-%d "
        if tot_days > sig * 365.24219:
            return str(then.year)
        elif tot_days > sig * 30.437:
            return then.strftime("%Y-%m")
        elif tot_days > sig:
            return then.strftime("%Y-%m-%d")
        elif tot_secs > sig * 60:
            return then.strftime(_today + "%H:%M")
        elif tot_secs > sig:
            return then.strftime(_today + "%H:%M:%S")
        elif tot_secs > sig / 1000:
            # zero-pad the milliseconds so that 22 ms is written ".022" rather than ".22"
            return then.strftime(_today + "%H:%M:%S") + f".{then.microsecond // 1000:03d}"
        else:
            return then.strftime(_today + "%H:%M:%S.%f")

    @classmethod
    def delta_time_to_str(cls, delta_sec: float | timedelta, *, space: str = "") -> str:
        """
        Returns a pretty string from a difference in time in seconds.
        Rounds hours and minutes to 2 decimal places, and seconds to 1.
        Uses hours above 3 hours, minutes above 3 minutes, and seconds otherwise.

        Example:
            ``delta_time_to_str(313)  # "5.22min"``

        Args:
            delta_sec: The time in seconds, or a ``timedelta``
            space: Space char between digits and units;
                   good choices are empty, ASCII space, Chars.narrownbsp, Chars.thinspace,
                   and Chars.nbsp.

        Returns:
            A string with units 'hr', 'min', or 's'
        """
        if isinstance(delta_sec, timedelta):
            delta_sec = delta_sec.total_seconds()
        if abs(delta_sec) > 60 * 60 * 3:
            return (
                StringTools.strip_empty_decimal(str(round(delta_sec / 60 / 60, 2))) + space + "hr"
            )
        elif abs(delta_sec) > 180:
            return StringTools.strip_empty_decimal(str(round(delta_sec / 60, 2))) + space + "min"
        else:
            return StringTools.strip_empty_decimal(str(round(delta_sec, 1))) + space + "s"

    @classmethod
    def ms_to_minsec(cls, ms: int, space: str = "") -> str:
        """
        Converts a number of milliseconds to one of the following formats:
            - 10ms         if < 1 sec
            - 10:15        if < 1 hour
            - 10:15:33     if < 1 day
            - 5d:10:15:33  if >= 1 day
        Prepends a minus sign (−) if negative.

        Args:
            ms: The milliseconds
            space: Space char between digits and 'ms' (if used);
                   good choices are empty, ASCII space, Chars.narrownbsp,
                   Chars.thinspace, and Chars.nbsp.

        Returns:
            A string of one of the formats above
        """
        is_neg = ms < 0  # record the sign before taking the absolute value
        ms = abs(int(ms))
        seconds = int((ms / 1000) % 60)
        minutes = int((ms / (1000 * 60)) % 60)
        hours = int((ms / (1000 * 60 * 60)) % 24)
        days = int(ms / (1000 * 60 * 60 * 24))
        z_hr = str(hours).zfill(2)
        z_min = str(minutes).zfill(2)
        z_sec = str(seconds).zfill(2)
        sgn = "−" if is_neg else ""
        if ms < 1000:
            return f"{sgn}{ms}{space}ms"
        elif days >= 1:
            return f"{sgn}{days}d:{z_hr}:{z_min}:{z_sec}"
        elif hours >= 1:
            return f"{sgn}{z_hr}:{z_min}:{z_sec}"
        else:
            return f"{sgn}{z_min}:{z_sec}"

    @classmethod
    def round_to_sigfigs(cls, num: SupportsFloat, sig_figs: int | None) -> float:
        """
        Rounds to the specified number of significant figures.

        Args:
            num: A Python or Numpy float or something that supports __float__
            sig_figs: The number of significant figures, non-negative;
                      if None, ``num`` is converted to a float without rounding

        Returns:
            A Python float

        Raises:
            OutOfRangeError: If ``sig_figs`` is negative
        """
        if sig_figs is None:
            return float(num)
        if sig_figs < 0:
            raise OutOfRangeError(f"sig_figs {sig_figs} is negative", minimum=0)
        num = float(num)
        if num != 0:
            digits = -int(math.floor(math.log10(abs(num))) - (sig_figs - 1))
            return round(num, digits)
        else:
            return 0.0  # can't take the log of 0

    @classmethod
    def format_micromolar(
        cls,
        micromolar: float,
        n_sigfigs: int | None = 5,
        *,
        adjust_units: bool = True,
        use_sigfigs: bool = True,
        space: str = "",
    ) -> str:
        """
        Returns a concentration with units, with the units scaled as needed.
        Can handle molar, millimolar, micromolar, nanomolar, picomolar, and femtomolar.

        Args:
            micromolar: Value
            n_sigfigs: For rounding; no rounding if None
            adjust_units: If False, will always use micromolar
            use_sigfigs: If True, rounds to a number of significant figures;
                         otherwise rounds to decimal places
            space: Space char between digits and units;
                   good choices are empty, ASCII space,
                   :attr:`pocketutils.core.chars.Chars.narrownbsp`,
                   :attr:`pocketutils.core.chars.Chars.thinspace`,
                   and :attr:`pocketutils.core.chars.Chars.nbsp`.

        Returns:
            The concentration with a suffix of M, mM, µM, nM, pM, or fM
        """
        d = micromolar
        m = abs(d)
        unit = "µM"
        if adjust_units:
            if m < 1e-6:
                d *= 1e9
                unit = "fM"
            elif m < 1e-3:
                d *= 1e6
                unit = "pM"
            elif m < 1:
                d *= 1e3
                unit = "nM"
            elif m >= 1e6:
                d /= 1e6
                unit = "M"
            elif m >= 1e3:
                d /= 1e3
                unit = "mM"
        if n_sigfigs is None:
            pass
        elif use_sigfigs:
            d = cls.round_to_sigfigs(d, n_sigfigs)
        else:
            d = round(d, n_sigfigs)
        if round(d) == d and str(d).endswith(".0"):
            return str(d)[:-2] + space + unit
        else:
            return str(d) + space + unit

    @classmethod
    def split_species_micromolar(cls, text: str) -> tuple[str, float | None]:
        """
        Splits a name into a chemical/concentration pair, falling back to the full name.
        Ex: "abc 3.5uM" → (abc, 3.5)
        Ex: "abc 3.5 µM" → (abc, 3.5)
        Ex: "abc (3.5mM)" → (abc, 3500.0)
        Ex: "abc 3.5mM" → (abc, 3500.0)
        Ex: "3.5mM" → (3.5mM, None)  # an edge case: don't pass in only units
        Uses a moderately strict pattern for the drug and dose:
            - The dose must terminate the string, except for end parenthesis or whitespace.
            - The drug and dose must be separated by at least one non-alphanumeric, non-dot, non-hyphen character.
            - Units must follow the digits, separated by at most whitespace, and are case-sensitive.
        """
        # lazy ops in the first group and in the non-(alphanumeric/dot/dash) separator
        # between the drug and dose; the decimal point is escaped so it matches only "."
        pat = regex.compile(
            r"^\s*(.*?)(?:[^A-Za-z0-9.\-]+?[\s(\[{]*(\d+(?:\.\d*)?)\s*([mµunpf]M)\s*[)\]}]*)?\s*$",
            flags=regex.V1,
        )
        match = pat.fullmatch(text)
        if match is None:
            raise StringPatternError(f"Text {text} couldn't be parsed", value=text, pattern=pat)
        if match.group(2) is None:
            return text.strip(), None
        else:
            drug = match.group(1).strip("([{)]}")
            dose = UnitTools.concentration_to_micromolar(float(match.group(2)), match.group(3))
            return drug, dose

    @classmethod
    def extract_micromolar(cls, text: str) -> float | None:
        """
        Returns what looks like a concentration with units, converted to micromolar.
        Accepts one of: mM, µM, uM, nM, pM, fM.
        Searches pretty flexibly.
        If no matches are found, returns None.
        If multiple distinct matches are found, warns and returns None.
        """
        # we need to make sure that a unit (e.g. mM) isn't part of a larger name
        pat1 = regex.compile(r"(\d+(?:\.\d*)?)\s*([mµunpf]M)\s*[)\]}]*", flags=regex.V1)

        def find(pat):
            return {
                UnitTools.concentration_to_micromolar(float(match.group(1)), match.group(2))
                for match in pat.finditer(text)
            }

        matches = find(pat1)
        if len(matches) == 1:
            return next(iter(matches))
        elif len(matches) > 1:
            logger.warning(f"Found {len(matches)} potential doses: {matches}. Returning None.")
        return None

    @classmethod
    def concentration_to_micromolar(cls, digits: SupportsFloat, units: str) -> float:
        """
        Converts a concentration with units to micromolar.

        Args:
            digits: Float or float-compatible value
            units: Units that ``digits`` are in

        Example:
            .. code-block::

                concentration_to_micromolar(53, 'nM')  # returns 0.053

        See Also:
            :meth:`extract_micromolar`
        """
        return (
            float(digits)
            * {
                "M": 1e6,
                "mM": 1e3,
                "µM": 1,
                "uM": 1,
                "nM": 1e-3,
                "pM": 1e-6,
                "fM": 1e-9,
            }[units]
        )

    @classmethod
    def canonicalize_quantity(cls, s: str, dimensionality: str) -> Quantity:
        """
        Returns a quantity in reduced units from a magnitude with units.

        Args:
            s: The string to parse; e.g. ``"1 m/s^2"``.
               Unit names and symbols are permitted, and spaces may be omitted.
            dimensionality: The resulting Quantity is checked against this;
                            e.g. ``"[length]/[time]^2"``

        Returns:
            A pint ``Quantity``

        Raises:
            PintTypeError: If the dimensionality is inconsistent
        """
        q = _UNIT_REG.Quantity(s).to_reduced_units()
        if not q.is_compatible_with(dimensionality):
            raise PintTypeError(f"{s} not of dimensionality {dimensionality}")
        return q


__all__ = ["UnitTools"]
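
The rounding rule in round_to_sigfigs() can be tried out without installing pocketutils. The following is a minimal sketch, not the library's code: round_sigfigs is a hypothetical standalone name, and, as an assumption of this sketch, it rejects sig_figs below 1 with a plain ValueError instead of OutOfRangeError.

```python
import math


def round_sigfigs(num: float, sig_figs: int) -> float:
    """Round ``num`` to ``sig_figs`` significant figures (hypothetical helper)."""
    if sig_figs < 1:
        raise ValueError("sig_figs must be at least 1")
    num = float(num)
    if num == 0:
        return 0.0  # log10(0) is undefined
    # Power of ten of the leading digit: 2 for 123.4, -3 for 0.00123
    exponent = math.floor(math.log10(abs(num)))
    # Decimal places to keep; a negative value rounds left of the decimal point
    return round(num, sig_figs - 1 - exponent)


print(round_sigfigs(123.456, 2))    # 120.0
print(round_sigfigs(0.0012345, 3))  # 0.00123
```

The number of decimal places passed to round() is the requested number of figures minus one, minus the exponent of the leading digit, which is the same arithmetic as the `digits` expression in the method.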

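
The format selection documented for ms_to_minsec() can also be written with divmod. This is a sketch under stated assumptions rather than the library's implementation: format_ms is a hypothetical name, the `space` parameter is omitted, and the day and hour formats are assumed to apply from exactly 1 day and 1 hour upward.

```python
def format_ms(ms: int) -> str:
    """Format milliseconds as ms, MM:SS, HH:MM:SS, or Dd:HH:MM:SS (hypothetical helper)."""
    sign = "−" if ms < 0 else ""  # record the sign before taking abs()
    ms = abs(int(ms))
    if ms < 1000:
        return f"{sign}{ms}ms"
    # Peel off seconds, minutes, and hours with integer arithmetic only
    minutes, seconds = divmod(ms // 1000, 60)
    hours, minutes = divmod(minutes, 60)
    days, hours = divmod(hours, 24)
    if days >= 1:
        return f"{sign}{days}d:{hours:02d}:{minutes:02d}:{seconds:02d}"
    if hours >= 1:
        return f"{sign}{hours:02d}:{minutes:02d}:{seconds:02d}"
    return f"{sign}{minutes:02d}:{seconds:02d}"


print(format_ms(615_000))     # 10:15
print(format_ms(36_933_000))  # 10:15:33
```

Integer divmod avoids the float division used in the method, so very large inputs do not lose precision.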

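
The search in extract_micromolar() combines a digits-plus-unit pattern with the conversion table from concentration_to_micromolar(). A minimal sketch of that combination follows. It swaps in the standard-library `re` module for `regex`, and find_micromolar and TO_MICROMOLAR are hypothetical names that do not exist in pocketutils.

```python
import re
from typing import Optional

# Factors to micromolar for the units the pattern accepts
TO_MICROMOLAR = {"mM": 1e3, "µM": 1.0, "uM": 1.0, "nM": 1e-3, "pM": 1e-6, "fM": 1e-9}

# Digits with an optional escaped decimal point, optional whitespace, case-sensitive unit
DOSE = re.compile(r"(\d+(?:\.\d*)?)\s*([mµunpf]M)")


def find_micromolar(text: str) -> Optional[float]:
    """Return the single concentration in ``text`` in micromolar, else None."""
    # A set collapses repeated mentions of the same concentration
    found = {float(m.group(1)) * TO_MICROMOLAR[m.group(2)] for m in DOSE.finditer(text)}
    return next(iter(found)) if len(found) == 1 else None


print(find_micromolar("abc (3.5mM)"))  # 3500.0
```

Unlike the method, this sketch returns None silently when several distinct concentrations are found instead of logging a warning.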
Issues reported in the source listing, by line (listing shown through line 29):

Line 1 (import logging): Missing module docstring. Introduced 2021-01-25 05:25 UTC.
Line 6 (import regex): Unable to import 'regex'. Introduced 2021-09-09 00:35 UTC.
Line 7 (from pint import Quantity, UnitRegistry): Unable to import 'pint'. Introduced 2021-10-11 02:12 UTC.
Line 8 (from pint.errors import PintTypeError): Unable to import 'pint.errors'. Introduced 2021-10-11 02:12 UTC.
Line 10 (from pocketutils.core.exceptions import OutOfRangeError, StringPatternError): Bug: The name `core` does not seem to exist in module `pocketutils`. Introduced 2021-10-28 02:10 UTC.
Line 17 (class UnitTools:): Missing class docstring. Introduced 2021-01-25 05:25 UTC.
Line 19 (def format_approx_big_number(cls, n: int) -> str:): Coding Style, Naming: Argument name "n" doesn't conform to snake_case naming style. Introduced 2023-07-09 01:49 UTC.
Line 19 (def format_approx_big_number(cls, n: int) -> str:): Missing function or method docstring. Introduced 2023-07-09 01:49 UTC.
Line 20 (for k, v in {...}.items():): Coding Style, Naming: Variable name "v" doesn't conform to snake_case naming style. Introduced 2023-07-09 01:49 UTC.
Line 26 (def approx_time_wrt(): best-practice: Too many return statements (7/6). Introduced 2021-10-11 23:10 UTC.
Line 27 (cls,): Coding Style: Wrong hanging indentation before block (add 4 spaces). Introduced 2021-01-25 05:25 UTC.
Line 28 (now: date | datetime,): Coding Style: Wrong hanging indentation before block (add 4 spaces). Introduced 2021-01-25 05:25 UTC.
