Reference

textcase #

Text case conversion.

UNDERSCORE `module-attribute` #

UNDERSCORE = Boundary.from_delimiter('_')

Splits on underscore, consuming the character on segmentation.

Examples:

>>> UNDERSCORE.match("_")
True

>>> UNDERSCORE.match("_brown")
True

>>> UNDERSCORE.match(".brown")
False

>>> UNDERSCORE.start
0

>>> UNDERSCORE.length
1

HYPHEN `module-attribute` #

HYPHEN = Boundary.from_delimiter('-')

Splits on hyphen, consuming the character on segmentation.

Examples:

>>> HYPHEN.match("-")
True

>>> HYPHEN.match("-brown")
True

>>> HYPHEN.match(".brown")
False

>>> HYPHEN.start
0

>>> HYPHEN.length
1

SPACE `module-attribute` #

SPACE = Boundary.from_delimiter(' ')

Splits on space, consuming the character on segmentation.

Examples:

>>> SPACE.match(" ")
True

>>> SPACE.match(" brown")
True

>>> SPACE.match(".brown")
False

>>> SPACE.start
0

>>> SPACE.length
1

INTERPUNCT `module-attribute` #

INTERPUNCT = Boundary.from_delimiter('·')

Splits on interpunct, consuming the character on segmentation.

Examples:

>>> INTERPUNCT.match("·")
True

>>> INTERPUNCT.match("·brown")
True

>>> INTERPUNCT.match(".brown")
False

>>> INTERPUNCT.start
0

>>> INTERPUNCT.length
1

LOWER_UPPER `module-attribute` #

LOWER_UPPER = Boundary(match=lambda s: s[:1].islower() and s[1:2].isupper(), start=1)

Splits where a lowercase letter is followed by an uppercase letter.

This is seldom used, and is not included in the default boundaries.

Examples:

>>> LOWER_UPPER.match("aA")
True

>>> LOWER_UPPER.match("Aa")
False

>>> LOWER_UPPER.start
1

>>> LOWER_UPPER.length
0

UPPER_LOWER `module-attribute` #

UPPER_LOWER = Boundary(match=lambda s: s[:1].isupper() and s[1:2].islower(), start=1)

Splits where an uppercase letter is followed by a lowercase letter.

Examples:

>>> UPPER_LOWER.match("Aa")
True

>>> UPPER_LOWER.match("aA")
False

>>> UPPER_LOWER.start
1

>>> UPPER_LOWER.length
0

ACRONYM `module-attribute` #

ACRONYM = Boundary(match=lambda s: s[:1].isupper() and s[1:2].isupper() and s[2:3].islower(), start=1)

Acronyms are identified by two uppercase letters followed by a lowercase letter.

The word boundary is between the two uppercase letters. For example, "HTTPRequest" would have an acronym boundary identified at "PRe" and split into "HTTP" and "Request".

Examples:

>>> ACRONYM.match("AAa")
True

>>> ACRONYM.match("1Aa")
False

>>> ACRONYM.match("AAA")
False

>>> ACRONYM.start
1

>>> ACRONYM.length
0

LOWER_DIGIT `module-attribute` #

LOWER_DIGIT = Boundary(match=lambda s: s[:1].islower() and s[1:2].isdigit(), start=1)

Splits where a lowercase letter is followed by a digit.

Examples:

>>> LOWER_DIGIT.match("a1")
True

>>> LOWER_DIGIT.match("1a")
False

>>> LOWER_DIGIT.start
1

>>> LOWER_DIGIT.length
0

UPPER_DIGIT `module-attribute` #

UPPER_DIGIT = Boundary(match=lambda s: s[:1].isupper() and s[1:2].isdigit(), start=1)

Splits where an uppercase letter is followed by a digit.

Examples:

>>> UPPER_DIGIT.match("A1")
True

>>> UPPER_DIGIT.match("1A")
False

>>> UPPER_DIGIT.start
1

>>> UPPER_DIGIT.length
0

DIGIT_LOWER `module-attribute` #

DIGIT_LOWER = Boundary(match=lambda s: s[:1].isdigit() and s[1:2].islower(), start=1)

Splits where digit is followed by a lowercase letter.

Examples:

>>> DIGIT_LOWER.match("1a")
True

>>> DIGIT_LOWER.match("1A")
False

>>> DIGIT_LOWER.start
1

>>> DIGIT_LOWER.length
0

DIGIT_UPPER `module-attribute` #

DIGIT_UPPER = Boundary(match=lambda s: s[:1].isdigit() and s[1:2].isupper(), start=1)

Splits where digit is followed by an uppercase letter.

Examples:

>>> DIGIT_UPPER.match("1A")
True

>>> DIGIT_UPPER.match("1a")
False

>>> DIGIT_UPPER.start
1

>>> DIGIT_UPPER.length
0

snake `module-attribute` #

snake = Case(delimiter='_', transform=lambda words: map(str.lower, words))

Snake case strings are delimited by underscores and are all lowercase.

Examples:

>>> snake("Hello, world!")
'hello_world'

>>> snake.match("hello_world")
True

>>> snake.match("Hello, world!")
False

constant `module-attribute` #

constant = Case(delimiter='_', transform=lambda words: map(str.upper, words))

Constant case strings are delimited by underscores and are all uppercase.

Examples:

>>> constant("Hello, world!")
'HELLO_WORLD'

>>> constant.match("HELLO_WORLD")
True

>>> constant.match("Hello, world!")
False

kebab `module-attribute` #

kebab = Case(delimiter='-', transform=lambda words: map(str.lower, words))

Kebab case strings are delimited by hyphens and are all lowercase.

Examples:

>>> kebab("Hello, world!")
'hello-world'

>>> kebab.match("hello-world")
True

>>> kebab.match("Hello, world!")
False

middot `module-attribute` #

middot = Case(delimiter='·', transform=lambda words: map(str.lower, words))

Middot case strings are delimited by interpuncts and are all lowercase.

Examples:

>>> middot("Hello, world!")
'hello·world'

>>> middot.match("hello·world")
True

>>> middot.match("Hello, world!")
False

camel `module-attribute` #

camel = Case(transform=lambda words: word.lower() if i == 0 else word.capitalize() for (i, word) in enumerate(words))

Camel case strings are lowercase, but for every word except the first the first letter is capitalized.

Examples:

>>> camel("Hello, world!")
'helloWorld'

>>> camel.match("helloWorld")
True

>>> camel.match("Hello, world!")
False

pascal `module-attribute` #

pascal = Case(transform=lambda words: map(str.capitalize, words))

Pascal case strings are lowercase, but for every word the first letter is capitalized.

Examples:

>>> pascal("Hello, world!")
'HelloWorld'

>>> pascal.match("HelloWorld")
True

>>> pascal.match("Hello, world!")
False

lower `module-attribute` #

lower = Case(delimiter=' ', transform=lambda words: map(str.lower, words))

Lowercase strings are delimited by spaces and all characters are lowercase.

Examples:

>>> lower("Hello, world!")
'hello world'

>>> lower.match("hello world")
True

>>> lower.match("Hello, world!")
False

upper `module-attribute` #

upper = Case(delimiter=' ', transform=lambda words: map(str.upper, words))

Uppercase strings are delimited by spaces and all characters are uppercase.

Examples:

>>> upper("Hello, world!")
'HELLO WORLD'

>>> upper.match("HELLO WORLD")
True

>>> upper.match("Hello, world!")
False

title `module-attribute` #

title = Case(delimiter=' ', transform=lambda words: map(str.capitalize, words))

Title case strings are delimited by spaces. Only the leading character of each word is uppercase.

No inferences are made about language, so words like "as", "to", and "for" will still be capitalized.

Examples:

>>> title("Hello, world!")
'Hello World'

>>> title.match("Hello World")
True

>>> title.match("Hello, world!")
False

sentence `module-attribute` #

sentence = Case(delimiter=' ', transform=lambda words: word.capitalize() if i == 0 else word.lower() for (i, word) in enumerate(words))

Sentence case strings are delimited by spaces. Only the leading character of the first word is uppercase.

Examples:

>>> sentence("Hello, world!")
'Hello world'

>>> sentence.match("Hello world")
True

>>> sentence.match("Hello, world!")
False

Boundary `dataclass` #

Boundary(match: Callable[[str], bool], start: int = 0, length: int = 0)

Represents a condition for splitting an identifier into words.

Some boundaries, HYPHEN, UNDERSCORE, SPACE, and INTERPUNCT consume the character they split on, whereas the other boundaries do not.

Examples:

>>> DOT = Boundary(match=lambda text: text[:1] == ".", length=1)

>>> DOT.match(".brown")
True

>>> DOT.match("_brown")
False

>>> DOT.start
0

>>> DOT.length
1

match `instance-attribute` #

match: Callable[[str], bool]

A function that determines if this boundary is present in the string.

start `class-attribute` `instance-attribute` #

start: int = 0

Where the beginning of the boundary is.

length `class-attribute` `instance-attribute` #

length: int = 0

The length of the boundary. This is the number of graphemes that are removed when splitting.

from_delimiter `staticmethod` #

from_delimiter(delimiter: str) -> Boundary

Create a new boundary instance from a delimiter string.

This method makes it easier to create basic boundaries like UNDERSCORE, HYPHEN, SPACE, and INTERPUNCT.

PARAMETER	DESCRIPTION
`delimiter`	A string to be used as the delimiter for creating the boundary. TYPE: `str`

RETURNS	DESCRIPTION
`Boundary`	A new boundary instance, configured to match the provided delimiter.

Examples:

>>> DOT = Boundary.from_delimiter(".")

>>> DOT.match(".brown")
True

>>> DOT.match("_brown")
False

>>> DOT.start
0

>>> DOT.length
1

Source code in src/textcase/__init__.py

@staticmethod
def from_delimiter(delimiter: str) -> "Boundary":
    """Create a new boundary instance from a delimiter string.

    This method makes it easier to create basic boundaries like `UNDERSCORE`, `HYPHEN`, `SPACE`, and `INTERPUNCT`.

    Args:
        delimiter: A string to be used as the delimiter for creating the boundary.

    Returns:
        A new boundary instance, configured to match the provided delimiter.

    Examples:

        >>> DOT = Boundary.from_delimiter(".")

        >>> DOT.match(".brown")
        True

        >>> DOT.match("_brown")
        False

        >>> DOT.start
        0

        >>> DOT.length
        1
    """
    return Boundary(match=lambda s: s.startswith(delimiter), length=len(delimiter))

Case `dataclass` #

Case(delimiter: str = '', transform: Callable[[Iterable[str]], Iterable[str]] = lambda words: words)

Represents a text case style.

Each case instance defines how to split and transform text into a specific case style.

Examples:

>>> dot = Case(delimiter=".", transform=lambda words: map(str.lower, words))

>>> dot("Dot case var")
'dot.case.var'

>>> dot.match("dot.case.var")
True

>>> dot.match("Dot case var")
False

>>> dot.delimiter
'.'

delimiter `class-attribute` `instance-attribute` #

delimiter: str = ''

The string used to join the transformed words together.

transform `class-attribute` `instance-attribute` #

transform: Callable[[Iterable[str]], Iterable[str]] = lambda words: words

A callable that defines how to transform the split words into the desired case format.

match #

match(
    text: str,
    *,
    boundaries: Iterable[Boundary] = (
        UNDERSCORE,
        HYPHEN,
        SPACE,
        INTERPUNCT,
        LOWER_UPPER,
        LOWER_DIGIT,
        UPPER_DIGIT,
        DIGIT_LOWER,
        DIGIT_UPPER,
        ACRONYM,
    ),
    strip_punctuation: bool = True,
) -> bool

Check if the given string matches the specified text case style.

This method compares the input string with its converted version.

PARAMETER	DESCRIPTION
`text`	The input string to be checked. TYPE: `str`
`boundaries`	The boundaries that define how to split the given string. TYPE: `Iterable[Boundary]` DEFAULT: `(UNDERSCORE, HYPHEN, SPACE, INTERPUNCT, LOWER_UPPER, LOWER_DIGIT, UPPER_DIGIT, DIGIT_LOWER, DIGIT_UPPER, ACRONYM)`
`strip_punctuation`	Whether to remove punctuation during conversion. TYPE: `bool` DEFAULT: `True`

RETURNS	DESCRIPTION
`bool`	`True` if the given string matches the specified text case style, and `False` otherwise.

Source code in src/textcase/__init__.py

def match(
    self,
    text: str,
    *,
    boundaries: Iterable[Boundary] = (
        UNDERSCORE,
        HYPHEN,
        SPACE,
        INTERPUNCT,
        LOWER_UPPER,
        LOWER_DIGIT,
        UPPER_DIGIT,
        DIGIT_LOWER,
        DIGIT_UPPER,
        ACRONYM,
    ),
    strip_punctuation: bool = True,
) -> bool:
    """Check if the given string matches the specified text case style.

    This method compares the input string with its converted version.

    Args:
        text: The input string to be checked.
        boundaries: The boundaries that define how to split the given string.
        strip_punctuation: Whether to remove punctuation during conversion.

    Returns:
        `True` if the given string matches the specified text case style, and `False` otherwise.
    """
    return self(text, boundaries=boundaries, strip_punctuation=strip_punctuation) == text

call #

__call__(
    text: str,
    *,
    boundaries: Iterable[Boundary] = (
        UNDERSCORE,
        HYPHEN,
        SPACE,
        INTERPUNCT,
        LOWER_UPPER,
        LOWER_DIGIT,
        UPPER_DIGIT,
        DIGIT_LOWER,
        DIGIT_UPPER,
        ACRONYM,
    ),
    strip_punctuation: bool = True,
) -> str

Convert the given string to the specified case format.

PARAMETER	DESCRIPTION
`text`	The input string to be converted. TYPE: `str`
`boundaries`	The boundaries that define how to split the given string. TYPE: `Iterable[Boundary]` DEFAULT: `(UNDERSCORE, HYPHEN, SPACE, INTERPUNCT, LOWER_UPPER, LOWER_DIGIT, UPPER_DIGIT, DIGIT_LOWER, DIGIT_UPPER, ACRONYM)`
`strip_punctuation`	Whether to remove punctuation during conversion. TYPE: `bool` DEFAULT: `True`

RETURNS	DESCRIPTION
`str`	The input string converted to the specified case format.

Source code in src/textcase/__init__.py

def __call__(
    self,
    text: str,
    *,
    boundaries: Iterable[Boundary] = (
        UNDERSCORE,
        HYPHEN,
        SPACE,
        INTERPUNCT,
        LOWER_UPPER,
        LOWER_DIGIT,
        UPPER_DIGIT,
        DIGIT_LOWER,
        DIGIT_UPPER,
        ACRONYM,
    ),
    strip_punctuation: bool = True,
) -> str:
    """Convert the given string to the specified case format.

    Args:
        text: The input string to be converted.
        boundaries: The boundaries that define how to split the given string.
        strip_punctuation: Whether to remove punctuation during conversion.

    Returns:
        The input string converted to the specified case format.
    """
    words: list[str] = []
    text_length = len(text)
    last_boundary_end = 0

    for i in range(text_length):
        for boundary in boundaries:
            if boundary.match(text[i:]):
                boundary_start = i + boundary.start
                if last_boundary_end < boundary_start:
                    words.append(text[last_boundary_end:boundary_start])
                last_boundary_end = boundary_start + boundary.length
                break

    if last_boundary_end < text_length:
        words.append(text[last_boundary_end:])

    text = self.delimiter.join(self.transform(words))

    if strip_punctuation:
        text = text.strip(self.delimiter).translate(str.maketrans("", "", punctuation.replace(self.delimiter, "")))

    return text

Reference

textcase #

UNDERSCORE module-attribute #

HYPHEN module-attribute #

SPACE module-attribute #

INTERPUNCT module-attribute #

LOWER_UPPER module-attribute #

UPPER_LOWER module-attribute #

ACRONYM module-attribute #

LOWER_DIGIT module-attribute #

UPPER_DIGIT module-attribute #

DIGIT_LOWER module-attribute #

DIGIT_UPPER module-attribute #

snake module-attribute #

constant module-attribute #

kebab module-attribute #

middot module-attribute #

camel module-attribute #

pascal module-attribute #

lower module-attribute #

upper module-attribute #

title module-attribute #

sentence module-attribute #

Boundary dataclass #

match instance-attribute #

start class-attribute instance-attribute #

length class-attribute instance-attribute #

from_delimiter staticmethod #

Case dataclass #

delimiter class-attribute instance-attribute #

transform class-attribute instance-attribute #

match #

__call__ #

UNDERSCORE `module-attribute` #

HYPHEN `module-attribute` #

SPACE `module-attribute` #

INTERPUNCT `module-attribute` #

LOWER_UPPER `module-attribute` #

UPPER_LOWER `module-attribute` #

ACRONYM `module-attribute` #

LOWER_DIGIT `module-attribute` #

UPPER_DIGIT `module-attribute` #

DIGIT_LOWER `module-attribute` #

DIGIT_UPPER `module-attribute` #

snake `module-attribute` #

constant `module-attribute` #

kebab `module-attribute` #

middot `module-attribute` #

camel `module-attribute` #

pascal `module-attribute` #

lower `module-attribute` #

upper `module-attribute` #

title `module-attribute` #

sentence `module-attribute` #

Boundary `dataclass` #

match `instance-attribute` #

start `class-attribute` `instance-attribute` #

length `class-attribute` `instance-attribute` #

from_delimiter `staticmethod` #

Case `dataclass` #

delimiter `class-attribute` `instance-attribute` #

transform `class-attribute` `instance-attribute` #

call #