
[Inference Client] Add task parameters and a maintenance script of these parameters #2561

Open
wants to merge 14 commits into base: main

Conversation


@hanouticelina hanouticelina commented Sep 24, 2024

Fixes #2557.

This PR adds support for passing additional parameters to task methods in the inference client, aligning these task methods with their corresponding parameter specs.

Key changes

  • Add support for additional parameters in the following task methods: audio-classification, document-question-answering, fill-mask, image-classification, image-segmentation, object-detection, question-answering, summarization, text-classification, token-classification, translation and zero-shot-image-classification.
  • Add a semi-automatic script to maintain consistency between task methods and their parameter specs; see #2557 for the related discussion.
  • Update utils/generate_inference_types.py to add task-specific prefixes to shared type aliases:
    • example: rename ClassificationOutputTransform to TextClassificationOutputTransform for text-classification task
    • this prevents naming conflicts when importing types from different tasks
    • drawbacks of this solution: when you need to make a change that should apply to all tasks, you have to update multiple places
  • Some code refactoring in utils/.
  • Update (automatically) the text-to-speech task parameters (following #2556).

Note: automatic-speech-recognition and image-to-text tasks are not included in this update due to an existing parameter naming discrepancy (see issue here).
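To make the alias-prefixing idea concrete, here is a rough sketch. The alias names mirror the example above, but the helper and its regex-based rename are hypothetical illustrations, not the actual `utils/generate_inference_types.py` implementation:

```python
import re


def prefix_shared_aliases(source: str, renames: dict) -> str:
    # Hypothetical sketch: rewrite shared type-alias names into
    # task-prefixed ones inside a generated source file.
    for old, new in renames.items():
        source = re.sub(rf"\b{old}\b", new, source)
    return source


generated = "output_transform: Optional[ClassificationOutputTransform] = None"
renamed = prefix_shared_aliases(
    generated,
    {"ClassificationOutputTransform": "TextClassificationOutputTransform"},
)
# "renamed" now references the task-specific alias, so importing types from
# several tasks at once no longer collides on the shared name.
```

The trade-off mentioned above applies: a change to the shared alias now has to be propagated to each task-prefixed copy.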

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@Wauplin commented Sep 24, 2024

This PR will complete some tasks from #2063 :)

@hanouticelina commented Sep 25, 2024

The goal of the script utils/generate_task_parameters.py is:

  1. Based on the schemas defined in ./src/huggingface_hub/inference/_generated/types/, check whether any parameters are missing from the InferenceClient task methods.
  2. Update the InferenceClient code by adding the missing parameters to the method signatures, updating the Args section of the docstrings, and adding the necessary imports.
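A minimal illustration of step 1, using hypothetical stand-ins for a generated parameters dataclass and a client method (the real script analyzes the source rather than importing it, but the comparison is the same idea):

```python
import dataclasses
import inspect
from typing import Optional


@dataclasses.dataclass
class AudioClassificationParameters:
    # Stand-in for a generated spec in _generated/types/
    function_to_apply: Optional[str] = None
    top_k: Optional[int] = None


def audio_classification(audio, *, model=None, top_k=None):
    # Stand-in for an InferenceClient task method
    ...


spec_params = {f.name for f in dataclasses.fields(AudioClassificationParameters)}
method_params = set(inspect.signature(audio_classification).parameters)
missing = spec_params - method_params
# missing == {"function_to_apply"}: the method lacks that spec parameter
```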

A first implementation used ast, but ast drops comments and some whitespace since it represents only the structure of the code. Instead, we use libCST (github, doc), which is better suited to our use case:

  • lossless round-tripping: it can parse, modify, and regenerate code without losing information (formatting and comments are preserved).
  • it provides convenient abstractions for identifying and working with different parts of the code.

libCST operates on a tree-like structure that represents the entire source code, including whitespace and comments. Here, we leverage two main concepts:

  • Visitors: classes that "visit" each node in the tree. The script uses them to collect information about existing parameters and imports.
  • Transformers: similar to visitors, but they can modify the tree. We use transformer classes to add new parameters, update docstrings, and insert import statements.

The script is still experimental and there are known limitations (non-exhaustive):

  • adding new attributes to a docstring depends heavily on the formatting of that method's docstring.
  • some tasks are excluded because they don't follow the inputs/parameters pattern (i.e. no Parameters dataclass is defined). We should standardize this across all tasks to enable a reliable parameter check.
  • it could benefit from more robust error handling.

@hanouticelina hanouticelina marked this pull request as ready for review October 1, 2024 09:57
@Wauplin left a comment

Looks good!

Thanks for the explanations about libCST. It seems like the right tool for our case, and since it's a dev dependency, it's fine to add it. To be honest, I did not review the full utils/generate_task_parameters.py script, but I'm pretty confident it does the job. Let's hope it won't become a nightmare to maintain (same for all utils/ scripts ^^). If it does, we can always reassess; updating utils/ is less stressful since we don't have to worry about backward compatibility.

Apart from the comments below, I think it's pretty much ready to merge :)

Makefile (outdated; resolved)
) -> List[AudioClassificationOutputElement]:
"""
Perform audio classification on the provided audio content.
For more details about the input parameters, see the [pipeline documentation](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.AudioClassificationPipeline).

I find it a bit confusing to redirect from Inference docs to the transformers pipeline docs. I feel that we should better document the parameters if that's what's missing here.

Because here, for example, https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.AudioClassificationPipeline.__call__ does not provide any information about function_to_apply, meaning we would have to maintain both docs. Better not to have this additional "docs dependency".

@hanouticelina commented Oct 4, 2024

I added this because, for almost all task parameters, the default values are only documented in the pipeline docs (e.g. for text-classification here).
But you're right, it is still an additional doc dependency; it would nevertheless be nice to have access to the default values in the inference doc.


Yes indeed, we should better document the specs themselves, but that will be done in separate PRs 😕

Comment on lines +1503 to +1516
parameters = {
"threshold": threshold,
}
if all(parameter is None for parameter in parameters.values()):
# if no parameters are provided, the image can be raw bytes, an image file, or URL to an online image
data = image
payload: Optional[Dict[str, Any]] = None
else:
# if parameters are provided, the image needs to be a base64-encoded string
data = None
payload = {"inputs": _b64_encode(image)}
for key, value in parameters.items():
if value is not None:
payload.setdefault("parameters", {})[key] = value

(Out of scope for this PR) I feel we should factor out the logic that handles inputs + parameters, i.e. rules like "base64-encode only if at least one parameter is set", "include parameters only if at least one is set", and "send only non-None parameters".


agree!
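For the record, a minimal sketch of what such a shared helper could look like, encoding the three rules from the quoted snippet (the name and signature are hypothetical, not part of the client today):

```python
import base64
from typing import Any, Dict, Optional, Tuple


def _prepare_image_payload(
    image: bytes, parameters: Dict[str, Any]
) -> Tuple[Optional[bytes], Optional[Dict[str, Any]]]:
    # Hypothetical helper: return (raw_data, json_payload).
    # No parameter set -> send the raw bytes as-is, no JSON payload.
    if all(value is None for value in parameters.values()):
        return image, None
    # At least one parameter set -> base64-encode the image and
    # include only the non-None parameters.
    payload: Dict[str, Any] = {"inputs": base64.b64encode(image).decode()}
    payload["parameters"] = {k: v for k, v in parameters.items() if v is not None}
    return None, payload


# No parameter set: raw bytes pass through unchanged.
data, payload = _prepare_image_payload(b"\x89PNG", {"threshold": None})
# A parameter set: encoded inputs plus filtered parameters.
data2, payload2 = _prepare_image_payload(b"\x89PNG", {"threshold": 0.5})
```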

utils/generate_task_parameters.py (outdated; resolved)
utils/generate_task_parameters.py (resolved)
@hanouticelina

@Wauplin do you have an idea why the build of the PR documentation is failing? My guess: private message


Successfully merging this pull request may close these issues.

Some InferenceClient tasks missing parameters argument, inconsistent with task specifications
3 participants