
Download model.onnx_data #343


Merged
merged 1 commit into huggingface:main from kozistr:fix/download-onnx-files on Jul 15, 2024

Conversation

kozistr
Contributor

@kozistr kozistr commented Jul 13, 2024

What does this PR do?

Fixes #341

  • Log a warning message instead of throwing an error or panicking when downloading the model.onnx_data file fails, since I guess the file is optional (see the sketch after the logs below).

  • The error below is off-topic: downloading model.onnx_data failed due to an SSL issue in my environment (Windows 10, WSL2). It may be a problem with my environment or with cdn-lfs.huggingface.co, and may not occur in other environments.

2024-07-13T11:13:34.821780Z  INFO download_artifacts: text_embeddings_backend: backends/src/lib.rs:384: Downloading `onnx/model.onnx_data`
2024-07-13T11:14:36.250951Z  WARN download_artifacts: text_embeddings_backend: backends/src/lib.rs:388: Could not download `onnx/model.onnx_data`: request error: request or response body error: error reading a body from connection: error:0A000119:SSL routines:ssl3_get_record:decryption failed or bad record mac:../ssl/record/ssl3_record.c:613:
  • When I download the file manually and then run the server, it works:
2024-07-13T11:48:51.692055Z  INFO text_embeddings_router: router/src/main.rs:175: Args { model_id: "./mul*********-**-**rge", revision: None, tokenization_workers: None, dtype: None, pooling: None, max_concurrent_requests: 512, max_batch_tokens: 16384, max_batch_requests: None, max_client_batch_size: 32, auto_truncate: false, default_prompt_name: None, default_prompt: None, hf_api_token: None, hostname: "0.0.0.0", port: 8080, uds_path: "/tmp/text-embeddings-inference-server", huggingface_hub_cache: None, payload_limit: 2000000, api_key: None, json_output: false, otlp_endpoint: None, otlp_service_name: "text-embeddings-inference.server", cors_allow_origin: None }
2024-07-13T11:48:52.190830Z  INFO text_embeddings_router: router/src/lib.rs:199: Maximum number of tokens per request: 512
2024-07-13T11:48:52.193407Z  INFO text_embeddings_core::tokenization: core/src/tokenization.rs:28: Starting 4 tokenization workers
2024-07-13T11:48:53.218580Z  INFO text_embeddings_router: router/src/lib.rs:241: Starting model backend
2024-07-13T11:49:21.728148Z  WARN text_embeddings_router: router/src/lib.rs:267: Backend does not support a batch size > 8
2024-07-13T11:49:21.728179Z  WARN text_embeddings_router: router/src/lib.rs:268: forcing `max_batch_requests=8`
2024-07-13T11:49:21.739675Z  INFO text_embeddings_router::http::server: router/src/http/server.rs:1778: Starting HTTP server: 0.0.0.0:8080
2024-07-13T11:49:21.739704Z  INFO text_embeddings_router::http::server: router/src/http/server.rs:1779: Ready
2024-07-13T11:49:35.807165Z  INFO embed{total_time="77.204302ms" tokenization_time="193.7µs" queue_time="333.2µs" inference_time="76.557102ms"}: text_embeddings_router::http::server: router/src/http/server.rs:706: Success
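
For reference, here is a minimal sketch of the new behavior, assuming the hf-hub tokio API; the function name `download_onnx_data` is hypothetical, and the actual diff in backends/src/lib.rs may differ:

```rust
use hf_hub::api::tokio::ApiRepo;

// Hypothetical sketch of the change: treat `onnx/model.onnx_data` as
// optional and log a warning on a failed download instead of returning
// an error or panicking.
async fn download_onnx_data(api_repo: &ApiRepo) {
    tracing::info!("Downloading `onnx/model.onnx_data`");
    if let Err(err) = api_repo.get("onnx/model.onnx_data").await {
        // `model.onnx_data` holds external weights (used when an ONNX
        // graph exceeds protobuf's 2 GB limit), so many models simply
        // do not ship it and a failed download is not necessarily fatal.
        tracing::warn!("Could not download `onnx/model.onnx_data`: {err}");
    }
}
```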

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@OlivierDehaene OR @Narsil

@kozistr changed the title from "Download model.onnx_data too" to "Download model.onnx_data" on Jul 13, 2024
Contributor

@OlivierDehaene OlivierDehaene left a comment


Thanks!

@OlivierDehaene OlivierDehaene merged commit ce1edf4 into huggingface:main Jul 15, 2024
@freinold

Thanks for the quick fix @kozistr!

@kozistr kozistr deleted the fix/download-onnx-files branch July 15, 2024 22:20
MasakiMu319 pushed a commit to MasakiMu319/text-embeddings-inference that referenced this pull request Nov 27, 2024
aagnone3 pushed a commit to StratisLLC/hf-text-embeddings-inference that referenced this pull request Dec 11, 2024
Development

Successfully merging this pull request may close these issues.

cpu-1.5.0: TEI doesn't download all needed ONNX Files