Retour aux articles
IAHugging Face Blog

Accelerating over 130,000 Hugging Face models with ONNX Runtime

Back to Articles Accelerating over 130,000 Hugging Face models with ONNX Runtime Published October 4, 2023 Update on GitHub Upvote 1 Sophie Schoenmeyer sschoenmeyer Follow guest Morgan Funtowicz mfuntowic...

Le flux RSS ne fournissait qu'un extrait. FlowMarket a récupéré le contenu public disponible depuis la page originale, sans contourner les contenus réservés.

Accelerating over 130,000 Hugging Face models with ONNX Runtime

Accelerating over 130,000 Hugging Face models with ONNX Runtime

Sophie Schoenmeyer
Morgan Funtowicz

What is ONNX Runtime?

ONNX Runtime is a cross-platform machine learning tool that can be used to accelerate a wide variety of models, particularly those with ONNX support.

Hugging Face ONNX Runtime Support

There are over 130,000 ONNX-supported models on Hugging Face, an open source community that allows users to build, train, and deploy hundreds of thousands of publicly available machine learning models. These ONNX-supported models, which include many increasingly popular large language models (LLMs) and cloud models, can leverage ONNX Runtime to improve performance, along with other benefits. For example, using ONNX Runtime to accelerate the whisper-tiny model can improve average latency per inference, with an up to 74.30% gain over PyTorch. ONNX Runtime works closely with Hugging Face to ensure that the most popular models on the site are supported. In total, over 90 Hugging Face model architectures are supported by ONNX Runtime, including the 11 most popular architectures (where popularity is determined by the corresponding number of models uploaded to the Hugging Face Hub):

Learn More

To learn more about accelerating Hugging Face models with ONNX Runtime, check out our recent post on the Microsoft Open Source Blog .

More Articles from our Blog

Safetensors is Joining the PyTorch Foundation

Mixture of Experts (MoEs) in Transformers

  • +3

Community

· Sign up or log in to comment

Besoin d'un workflow n8n ou d'aide pour l'installer ?

Après la veille, passez à l'action : trouvez un template n8n ou un créateur capable de l'adapter à vos outils.

Source

Hugging Face Blog - huggingface.co

Voir la publication originale