Use ruff instead of flake8 and isort #3213

Lingepumpe · 2023-04-21T13:32:28Z

/!\ This PR contains automatically generated code changes (the run of ruff --fix). Please review the commits individually, to be able to pay more attention to the parts I did manually /!\

Ruff (https://github.com/charliermarsh/ruff) is a very active and very fast tool that can replace flake8 and isort. It includes a lot of advanced rules that flake8 only has if you install additional packages. In addition to formatting includes, ruff can also auto-fix some linter errors automatically. In this PR we remove flake8 and isort, and replace the functionality (and more) with ruff.
Also, a full run of ruff --fix . is run on the code base, which fixes-up a lot of small things throughout. For a impressive example for the ruff automatic fixes, check out this automatic change in embeddings/base.py:

# Before ruff --fix
def _everything_embedded(self, data_points: Sequence[DT]) -> bool:
    for data_point in data_points:
        if self.name not in data_point._embeddings.keys():
            return False
    return True

# After ruff --fix
def _everything_embedded(self, data_points: Sequence[DT]) -> bool:
    return all(self.name in data_point._embeddings for data_point in data_points)

In addition to the automatic fixes, I enabled all rules that seemed sensible and where I was able to manually fix any issues it found - so this PR also contains quite a few manual code quality fixes that ruff suggested.

Lingepumpe · 2023-04-25T08:37:19Z

Rebased on current master.

helpmefindaname

´Thank you for this PR, it looks very exciting to see so many small improvements on the code :-)
There a just a few comments.

requirements-dev.txt

tests/embedding_test_utils.py

pyproject.toml

helpmefindaname

The changes look good to me, there is still a rebase on the main required, but then I would consider this MR mergeable

Lingepumpe · 2023-04-27T19:04:12Z

Rebased on updated master

alanakbik · 2023-04-28T08:35:20Z

@Lingepumpe thanks a lot for adding this! Looking forward to trying this out!

alanakbik · 2023-04-28T11:59:36Z

flair/data.py

+        if bioes_tag[:2] in {"B-", "S-"} or (
+            in_span and previous_tag[2:] != bioes_tag[2:] and (bioes_tag[:2] == "I-" or previous_tag[2:] == "S-")
+        ):
+            # B- and S- always start new spans
+            # if the predicted class changes, I- starts a new span
+            # if the predicted class changes and S- was previous tag, start a new span


Debatable if this improves clarity ;) but many of the other changes are really good!

Yes this one is debatable, I considered putting a "#noqa: XYZ" there and keeping the condition as is, but on the other hand this version also has its merits (not duplicate the body in the if - so I just did it the way that ruff was also happy with it :D

alanakbik · 2023-04-28T12:01:19Z

flair/datasets/relation_extraction.py

@@ -77,7 +74,7 @@ def __init__(
                augment_train=augment_train,
            )

-        super(RE_ENGLISH_SEMEVAL2010, self).__init__(
+        super().__init__(


These automatic super() call changes are somewhat surprising. I wonder what is the motivation for this?

super() is sequivalent to super(TheNameOfTheCurrentClass, self) - so if you really are passing the current class name and self, then the shorter way to write it is chosen by ruff - it also makes it clear at first view that nothing extra ordinary with regards to super is going on. If you really want to e.g. skip one hierarchy on inheritance by doing super(TheNameOfASuperClass, self) [which would skip the TheNameOfASuperClass and search "above" it in the class hiarchy], then you would still pass the parameters to super [ruff would not remove them], and a reviewer would see that you are really doing something special with super here.

Ah ok, thanks for the info!

Lingepumpe force-pushed the use_ruff_instead_of_flake8_and_isort branch 8 times, most recently from 2255755 to 9e19531 Compare April 25, 2023 08:33

helpmefindaname requested changes Apr 25, 2023

View reviewed changes

requirements-dev.txt Outdated Show resolved Hide resolved

tests/embedding_test_utils.py Outdated Show resolved Hide resolved

pyproject.toml Show resolved Hide resolved

pyproject.toml Outdated Show resolved Hide resolved

Lingepumpe requested a review from helpmefindaname April 25, 2023 21:14

helpmefindaname approved these changes Apr 27, 2023

View reviewed changes

Lingepumpe added 7 commits April 27, 2023 20:48

Use ruff instead of flake8+isort

173e154

Execute ruff --fix .

c495399

Reduce enabled ruff rules and manually fix issues

006b21b

Fix uncovered mypy errors (due to adding type information to functions)

b31e835

Integrate PR feedback

a96a31c

Better google docstring docstring formatting

d812dbb

Relock for changed dependencies (ruff, pytest-ruff, pytest-black-ng)

7a57952

Lingepumpe force-pushed the use_ruff_instead_of_flake8_and_isort branch from 27d08b6 to 7a57952 Compare April 27, 2023 18:58

Make sure virtualenvs.in-project is true when poetry install is cached

1b90fd1

alanakbik merged commit 4435a31 into flairNLP:master Apr 28, 2023

alanakbik reviewed Apr 28, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use ruff instead of flake8 and isort #3213

Use ruff instead of flake8 and isort #3213

Lingepumpe commented Apr 21, 2023 •

edited

Loading

Lingepumpe commented Apr 25, 2023

helpmefindaname left a comment

helpmefindaname left a comment

Lingepumpe commented Apr 27, 2023

alanakbik commented Apr 28, 2023

alanakbik Apr 28, 2023

Lingepumpe Apr 28, 2023

alanakbik Apr 28, 2023

Lingepumpe Apr 28, 2023 •

edited

Loading

alanakbik Apr 29, 2023

Use ruff instead of flake8 and isort #3213

Use ruff instead of flake8 and isort #3213

Conversation

Lingepumpe commented Apr 21, 2023 • edited Loading

Lingepumpe commented Apr 25, 2023

helpmefindaname left a comment

Choose a reason for hiding this comment

helpmefindaname left a comment

Choose a reason for hiding this comment

Lingepumpe commented Apr 27, 2023

alanakbik commented Apr 28, 2023

alanakbik Apr 28, 2023

Choose a reason for hiding this comment

Lingepumpe Apr 28, 2023

Choose a reason for hiding this comment

alanakbik Apr 28, 2023

Choose a reason for hiding this comment

Lingepumpe Apr 28, 2023 • edited Loading

Choose a reason for hiding this comment

alanakbik Apr 29, 2023

Choose a reason for hiding this comment

Lingepumpe commented Apr 21, 2023 •

edited

Loading

Lingepumpe Apr 28, 2023 •

edited

Loading