ModelInfo bug #2186

Closed · Narsil opened this issue Apr 2, 2024 · 18 comments · Fixed by #2190

Labels: bug (Something isn't working)

Comments

@Narsil (Contributor) commented Apr 2, 2024

### Describe the bug

Cannot load model information for some repos.

### Reproduction

```python
from huggingface_hub import HfApi

api = HfApi()
api.model_info("CohereForAI/c4ai-command-r-v01")
```

### Logs

```shell
TypeError: SafeTensorsInfo.__init__() got an unexpected keyword argument 'sharded'
```
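For context, a minimal sketch of the failure mode (using a simplified stand-in class, not the actual `hf_api.py` definition): a dataclass raises a `TypeError` for any keyword argument it does not declare, so a new field in the server payload breaks older clients.

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class SafeTensorsInfo:  # simplified stand-in for the real class
    parameters: List[Dict[str, int]]
    total: int

# The Hub started returning an extra "sharded" key in its payload;
# forwarding it to the constructor reproduces the same error:
SafeTensorsInfo(parameters=[], total=0, sharded=True)
# TypeError: SafeTensorsInfo.__init__() got an unexpected keyword argument 'sharded'
```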


### System info

```shell
- huggingface_hub version: 0.22.2
- Platform: Linux-5.15.0-1048-aws-x86_64-with-glibc2.31
- Python version: 3.11.6
- Running in iPython ?: No
- Running in notebook ?: No
- Running in Google Colab ?: No
- Token path ?: /data/token
- Has saved token ?: True
- Who am I ?: Narsil
- Configured git credential helpers:
- FastAI: N/A
- Tensorflow: 2.16.1
- Torch: 2.2.1
- Jinja2: 3.1.2
- Graphviz: N/A
- keras: 3.1.1
- Pydot: N/A
- Pillow: 10.2.0
- hf_transfer: 0.1.5
- gradio: 4.16.0
- tensorboard: N/A
- numpy: 1.26.4
- pydantic: 2.6.4
- aiohttp: 3.8.5
- ENDPOINT: https://huggingface.co
- HF_HUB_CACHE: /data/hub
- HF_ASSETS_CACHE: /data/assets
- HF_TOKEN_PATH: /data/token
- HF_HUB_OFFLINE: False
- HF_HUB_DISABLE_TELEMETRY: False
- HF_HUB_DISABLE_PROGRESS_BARS: None
- HF_HUB_DISABLE_SYMLINKS_WARNING: False
- HF_HUB_DISABLE_EXPERIMENTAL_WARNING: False
- HF_HUB_DISABLE_IMPLICIT_TOKEN: False
- HF_HUB_ENABLE_HF_TRANSFER: False
- HF_HUB_ETAG_TIMEOUT: 10
- HF_HUB_DOWNLOAD_TIMEOUT: 10
```

@Narsil added the bug (Something isn't working) label on Apr 2, 2024
@kresimirfijacko commented Apr 2, 2024

It happened to me as well while I was working (no changes in dependencies/code, etc.). I guess it is something related to the HF API?

@Wauplin (Contributor) commented Apr 2, 2024

Thanks for reporting @Narsil @kresimirfijacko! Will check if this is a breaking change server-side and open a PR to fix it client-side anyway.

@kresimirfijacko

> Thanks for reporting @Narsil @kresimirfijacko! Will check if this is a breaking change server-side and open a PR to fix it client-side anyway.

Yeah, it's probably something server-side. I experienced it in vLLM, which uses huggingface-hub under the hood:

```shell
python -u -m vllm.entrypoints.openai.api_server \
    --model Qwen/Qwen1.5-72B-Chat-GPTQ-Int4 \
    ...
```

It all of a sudden stopped working. When I changed --model to an absolute path on disk, it worked OK.

@kubs0ne commented Apr 2, 2024

Hey, I have the same issue; it started happening today around 3 PM. I cannot use the vLLM server or the huggingface-cli to download the model. Everything returns the same error:

```shell
TypeError: SafeTensorsInfo.__init__() got an unexpected keyword argument 'sharded'
```

@Supermax197

Workaround: add a `sharded` field to `SafeTensorsInfo` in `hf_api.py`, like this:

```python
from dataclasses import asdict, dataclass
from typing import Dict, List, Optional

@dataclass
class SafeTensorsInfo(dict):
    parameters: List[Dict[str, int]]
    total: int
    sharded: Optional[bool] = None

    def __post_init__(self):  # hack to keep SafeTensorsInfo backward compatible
        self.update(asdict(self))
```
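With this patch applied, the reproduction above should return a `ModelInfo` object instead of raising the `TypeError`. Note this is a local stopgap only; see the proper fixes further down the thread.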

@0-hero commented Apr 2, 2024

+1

@Whylickspittle

> Workaround: add a `sharded` field to `SafeTensorsInfo` in `hf_api.py` (see the snippet above).

It works, bro!

@KevinNaidoo commented Apr 2, 2024

Just hit the same issue. It was working earlier today.

```shell
docker run --runtime nvidia --gpus all \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HUGGING_FACE_HUB_TOKEN=<secret>" \
    -p 8000:8000 \
    --ipc=host \
    vllm/vllm-openai:latest \
    --model TheBloke/Mixtral-8x7B-Instruct-v0.1-AWQ --quantization awq --tensor-parallel-size 4
```

@jyotsnar commented Apr 2, 2024

Same issue here too. Started today.

@Supermax197

I think the server side is fixed, so this workaround is deprecated.

> Workaround: add a `sharded` field to `SafeTensorsInfo` in `hf_api.py` (see the snippet above).

@Wauplin (Contributor) commented Apr 2, 2024

Hey everyone, thanks for quickly reporting issues and suggesting a workaround. The failure is indeed due to a server-side change, and we are discussing solutions to mitigate it. In the meantime, I opened #2190 to fix the issue client-side (which will make the class future-proof). To get an immediate fix, please install from this branch:

```shell
pip install git+https://github.com/huggingface/huggingface_hub@2186-fix-safetensors-info
```

EDIT: no need to install a new version of huggingface_hub. A server-side fix has been deployed, making the fix above optional. See #2186 (comment).
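For the curious, a minimal sketch of one way to make such a dataclass tolerate unknown server fields (an illustration of the idea only, not necessarily the exact approach taken in #2190):

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Optional

@dataclass
class SafeTensorsInfo:
    parameters: List[Dict[str, int]]
    total: int
    sharded: Optional[bool] = None

    @classmethod
    def from_api_response(cls, data: Dict[str, Any]) -> "SafeTensorsInfo":
        # Keep only keys the dataclass declares and silently drop anything
        # new the server may add, so future fields cannot break the client.
        known = cls.__dataclass_fields__.keys()
        return cls(**{k: v for k, v in data.items() if k in known})
```

`from_api_response` here is a hypothetical helper name; the point is simply to filter the payload before it reaches `__init__`.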

@binhnq94 commented Apr 2, 2024

How can I re-upload my model after a 24h training process died because of this bug?
I still have the model folder locally.
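For reference, re-uploading an existing local folder can be done with `HfApi.upload_folder`; a minimal sketch (the repo id and path below are placeholders, not values from this thread):

```python
from huggingface_hub import HfApi

api = HfApi()
# Placeholders: substitute your own repo id and local model directory.
api.upload_folder(
    repo_id="your-username/your-model",
    folder_path="/path/to/local/model",
    repo_type="model",
)
```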

@martina-zxy

+1

@momo-exaion commented Apr 2, 2024

Thank you @Wauplin! As a reminder, you can use this syntax to include the optional dependencies:

```shell
pip install "huggingface_hub[cli,hf_transfer] @ git+https://github.com/huggingface/huggingface_hub@2186-fix-safetensors-info"
```

EDIT: no need to install a new version of huggingface_hub. A server-side fix has been deployed, making the fix above optional. See #2186 (comment).

@youkaichao

> I think the server side is fixed, so this workaround is deprecated.

Is this fixed? I still get this error :(

@Wauplin (Contributor) commented Apr 2, 2024

A fix was deployed a few minutes ago. This should be fixed for everyone without updating any dependencies. Sorry again for the inconvenience, and thanks everyone for your reactivity on this 🤗
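For completeness, the original reproduction should now succeed on any released version without changes; a quick check (the printed attributes are just illustrative fields of the returned `ModelInfo`):

```python
from huggingface_hub import HfApi

info = HfApi().model_info("CohereForAI/c4ai-command-r-v01")
print(info.sha)          # commit hash of the repo's main revision
print(info.safetensors)  # the SafeTensorsInfo that previously failed to parse
```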

@binhnq94 commented Apr 4, 2024

> Hey everyone, thanks for quickly reporting issues and suggesting a workaround. [...] To get an immediate fix, please install from this branch:
>
> pip install git+https://github.com/huggingface/huggingface_hub@2186-fix-safetensors-info

I got an error: `git checkout -q 2186-fix-safetensors-info` did not run successfully.
Now that this is fixed, which huggingface_hub version should I use?

@Wauplin (Contributor) commented Apr 4, 2024

> I got an error: `git checkout -q 2186-fix-safetensors-info` did not run successfully.
> Now that this is fixed, which huggingface_hub version should I use?

Yes, sorry, that PR has been merged and is now on main. But another fix has been deployed server-side, meaning you don't even need to update your dependencies. Any huggingface_hub version from PyPI will work correctly.
