qaihm-bot committed on
Commit f78d9c2 · verified · 1 Parent(s): 423fb5e

See https://github.com/quic/ai-hub-models/releases/v0.35.0 for changelog.

.gitattributes CHANGED
@@ -40,3 +40,5 @@ WhisperDecoderInf.so filter=lfs diff=lfs merge=lfs -text
  Whisper-Tiny-En_WhisperDecoderInf.dlc filter=lfs diff=lfs merge=lfs -text
  Whisper-Tiny-En_WhisperEncoderInf.dlc filter=lfs diff=lfs merge=lfs -text
  DEPLOYMENT_MODEL_LICENSE.pdf filter=lfs diff=lfs merge=lfs -text
+ Whisper-Tiny-En_HfWhisperDecoder.dlc filter=lfs diff=lfs merge=lfs -text
+ Whisper-Tiny-En_HfWhisperEncoder.dlc filter=lfs diff=lfs merge=lfs -text
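
The two added `.gitattributes` entries route the renamed `.dlc` files through Git LFS. As a rough illustration of what such an entry means, the sketch below checks whether a filename is covered by a `filter=lfs` line. The helper name `is_lfs_tracked` is hypothetical, and matching with `fnmatch` is a simplifying assumption: real gitattributes matching follows gitignore-style rules where later lines can override earlier ones.

```python
from fnmatch import fnmatch


def is_lfs_tracked(filename: str, gitattributes_text: str) -> bool:
    """Return True if any line assigns filter=lfs to a pattern matching filename.

    Simplified on purpose: ignores directory-aware patterns and the
    last-match-wins precedence that Git itself applies.
    """
    for line in gitattributes_text.splitlines():
        parts = line.split()
        # Skip blank lines, comments, and lines without attributes.
        if len(parts) < 2 or parts[0].startswith("#"):
            continue
        pattern, attrs = parts[0], parts[1:]
        if "filter=lfs" in attrs and fnmatch(filename, pattern):
            return True
    return False


# Entries taken from the hunk above.
ATTRS = """\
DEPLOYMENT_MODEL_LICENSE.pdf filter=lfs diff=lfs merge=lfs -text
Whisper-Tiny-En_HfWhisperDecoder.dlc filter=lfs diff=lfs merge=lfs -text
Whisper-Tiny-En_HfWhisperEncoder.dlc filter=lfs diff=lfs merge=lfs -text
"""

print(is_lfs_tracked("Whisper-Tiny-En_HfWhisperEncoder.dlc", ATTRS))
print(is_lfs_tracked("README.md", ATTRS))
```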
LICENSE CHANGED
@@ -1,2 +1,2 @@
- The license of the original trained model can be found at https://github.com/openai/whisper/blob/main/LICENSE.
+ The license of the original trained model can be found at https://github.com/huggingface/transformers/blob/v4.42.3/LICENSE.
  The license for the deployable model files (.tflite, .onnx, .dlc, .bin, etc.) can be found in DEPLOYMENT_MODEL_LICENSE.pdf.
README.md CHANGED
@@ -11,12 +11,12 @@ pipeline_tag: automatic-speech-recognition
11
  ![](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/whisper_tiny_en/web-assets/model_demo.png)
12
 
13
  # Whisper-Tiny-En: Optimized for Mobile Deployment
14
- ## Automatic speech recognition (ASR) model for English transcription as well as translation
15
 
16
 
17
- OpenAI’s Whisper ASR (Automatic Speech Recognition) model is a state-of-the-art system designed for transcribing spoken language into written text. It exhibits robust performance in realistic, noisy environments, making it highly reliable for real-world applications. Specifically, it excels in long-form transcription, capable of accurately transcribing audio clips up to 30 seconds long. Time to the first token is the encoder's latency, while time to each additional token is decoder's latency, where we assume a mean decoded length specified below.
18
 
19
- This model is an implementation of Whisper-Tiny-En found [here](https://github.com/openai/whisper/tree/main).
20
 
21
 
22
  This repository provides scripts to run Whisper-Tiny-En on Qualcomm® devices.
@@ -29,72 +29,74 @@ More details on model performance across various devices, can be found
29
 
30
  - **Model Type:** Model_use_case.speech_recognition
31
  - **Model Stats:**
32
- - Model checkpoint: tiny.en
33
  - Input resolution: 80x3000 (30 seconds audio)
34
- - Mean decoded sequence length: 112 tokens
35
- - Number of parameters (WhisperEncoderInf): 9.39M
36
- - Model size (WhisperEncoderInf) (float): 35.9 MB
37
- - Number of parameters (WhisperDecoderInf): 28.3M
38
- - Model size (WhisperDecoderInf) (float): 108 MB
39
 
40
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
41
  |---|---|---|---|---|---|---|---|---|
42
- | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 403.879 ms | 19 - 41 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
43
- | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 104.864 ms | 0 - 248 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
44
- | WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 134.744 ms | 20 - 63 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
45
- | WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 101.25 ms | 1 - 233 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
46
- | WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 102.578 ms | 20 - 69 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
47
- | WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 53.275 ms | 0 - 83 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
48
- | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 168.689 ms | 20 - 44 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
49
- | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 53.369 ms | 0 - 245 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
50
- | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 403.879 ms | 19 - 41 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
51
- | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 104.864 ms | 0 - 248 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
52
- | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 99.301 ms | 20 - 67 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
53
- | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 50.547 ms | 0 - 87 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
54
- | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 104.275 ms | 20 - 49 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
55
- | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 80.552 ms | 1 - 236 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
56
- | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 102.517 ms | 20 - 68 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
57
- | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 50.435 ms | 0 - 83 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
58
- | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 168.689 ms | 20 - 44 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
59
- | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 53.369 ms | 0 - 245 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
60
- | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 100.617 ms | 20 - 68 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
61
- | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 53.194 ms | 0 - 86 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
62
- | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 65.442 ms | 35 - 168 MB | NPU | [Whisper-Tiny-En.onnx](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx) |
63
- | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 78.74 ms | 18 - 54 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
64
- | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 42.372 ms | 1 - 249 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
65
- | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 52.212 ms | 50 - 450 MB | NPU | [Whisper-Tiny-En.onnx](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx) |
66
- | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 34.517 ms | 0 - 242 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
67
- | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 44.148 ms | 49 - 416 MB | NPU | [Whisper-Tiny-En.onnx](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx) |
68
- | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 51.084 ms | 51 - 51 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
69
- | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 65.846 ms | 67 - 67 MB | NPU | [Whisper-Tiny-En.onnx](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx) |
70
- | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 6.39 ms | 3 - 96 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
71
- | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 4.048 ms | 10 - 60 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
72
- | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 4.145 ms | 3 - 91 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
73
- | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 2.782 ms | 9 - 64 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
74
- | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 3.568 ms | 3 - 40 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
75
- | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 2.172 ms | 1 - 17 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
76
- | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 4.599 ms | 3 - 96 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
77
- | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 2.881 ms | 10 - 59 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
78
- | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 6.39 ms | 3 - 96 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
79
- | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 4.048 ms | 10 - 60 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
80
- | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 3.649 ms | 3 - 44 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
81
- | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 2.108 ms | 10 - 30 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
82
- | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 4.735 ms | 3 - 88 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
83
- | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 3.07 ms | 1 - 47 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
84
- | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 3.56 ms | 3 - 39 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
85
- | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 2.217 ms | 10 - 30 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
86
- | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 4.599 ms | 3 - 96 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
87
- | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 2.881 ms | 10 - 59 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
88
- | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 3.654 ms | 3 - 35 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
89
- | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 2.119 ms | 1 - 21 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
90
- | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 3.972 ms | 0 - 91 MB | NPU | [Whisper-Tiny-En.onnx](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx) |
91
- | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 2.769 ms | 0 - 102 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
92
- | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 1.685 ms | 0 - 62 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
93
- | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 3.018 ms | 26 - 109 MB | NPU | [Whisper-Tiny-En.onnx](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx) |
94
- | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 1.35 ms | 1 - 54 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
95
- | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 2.708 ms | 2 - 193 MB | NPU | [Whisper-Tiny-En.onnx](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx) |
96
- | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 2.112 ms | 166 - 166 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
97
- | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 3.469 ms | 74 - 74 MB | NPU | [Whisper-Tiny-En.onnx](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx) |
 
 
98
 
99
 
100
 
@@ -255,14 +257,14 @@ Explore all available models on [Qualcomm® AI Hub](https://aihub.qualcomm.com/)
255
 
256
  ## License
257
  * The license for the original implementation of Whisper-Tiny-En can be found
258
- [here](https://github.com/openai/whisper/blob/main/LICENSE).
259
  * The license for the compiled assets for on-device deployment can be found [here](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/Qualcomm+AI+Hub+Proprietary+License.pdf)
260
 
261
 
262
 
263
  ## References
264
  * [Robust Speech Recognition via Large-Scale Weak Supervision](https://cdn.openai.com/papers/whisper.pdf)
265
- * [Source Model Implementation](https://github.com/openai/whisper/tree/main)
266
 
267
 
268
 
 
11
  ![](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/whisper_tiny_en/web-assets/model_demo.png)
12
 
13
  # Whisper-Tiny-En: Optimized for Mobile Deployment
14
+ ## Transformer-based automatic speech recognition (ASR) model for multilingual transcription and translation available on HuggingFace
15
 
16
 
17
+ HuggingFace Whisper-Small ASR (Automatic Speech Recognition) model is a state-of-the-art system designed for transcribing spoken language into written text. This model is based on the transformer architecture and has been optimized for edge inference by replacing Multi-Head Attention (MHA) with Single-Head Attention (SHA) and linear layers with convolutional (conv) layers. It exhibits robust performance in realistic, noisy environments, making it highly reliable for real-world applications. Specifically, it excels in long-form transcription, capable of accurately transcribing audio clips up to 30 seconds long. Time to the first token is the encoder's latency, while time to each additional token is decoder's latency, where we assume a max decoded length specified below.
18
 
19
+ This model is an implementation of Whisper-Tiny-En found [here](https://github.com/huggingface/transformers/tree/v4.42.3/src/transformers/models/whisper).
20
 
21
 
22
  This repository provides scripts to run Whisper-Tiny-En on Qualcomm® devices.
 
29
 
30
  - **Model Type:** Model_use_case.speech_recognition
31
  - **Model Stats:**
32
+ - Model checkpoint: openai/whisper-tiny
33
  - Input resolution: 80x3000 (30 seconds audio)
34
+ - Max decoded sequence length: 200 tokens
35
+ - Number of parameters (HfWhisperEncoder): 9.39M
36
+ - Model size (HfWhisperEncoder) (float): 35.9 MB
37
+ - Number of parameters (HfWhisperDecoder): 28.4M
38
+ - Model size (HfWhisperDecoder) (float): 109 MB
39
 
40
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
41
  |---|---|---|---|---|---|---|---|---|
42
+ | HfWhisperEncoder | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 405.15 ms | 19 - 43 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
43
+ | HfWhisperEncoder | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 61.635 ms | 1 - 61 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
44
+ | HfWhisperEncoder | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 139.055 ms | 20 - 67 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
45
+ | HfWhisperEncoder | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 54.112 ms | 1 - 68 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
46
+ | HfWhisperEncoder | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 108.657 ms | 9 - 63 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
47
+ | HfWhisperEncoder | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 20.259 ms | 0 - 18 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
48
+ | HfWhisperEncoder | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 190.523 ms | 19 - 43 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
49
+ | HfWhisperEncoder | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 23.489 ms | 0 - 61 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
50
+ | HfWhisperEncoder | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 405.15 ms | 19 - 43 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
51
+ | HfWhisperEncoder | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 61.635 ms | 1 - 61 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
52
+ | HfWhisperEncoder | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 116.015 ms | 13 - 56 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
53
+ | HfWhisperEncoder | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 20.124 ms | 0 - 20 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
54
+ | HfWhisperEncoder | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 116.144 ms | 19 - 51 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
55
+ | HfWhisperEncoder | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 50.493 ms | 0 - 64 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
56
+ | HfWhisperEncoder | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 111.141 ms | 10 - 67 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
57
+ | HfWhisperEncoder | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 20.279 ms | 0 - 14 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
58
+ | HfWhisperEncoder | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 190.523 ms | 19 - 43 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
59
+ | HfWhisperEncoder | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 23.489 ms | 0 - 61 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
60
+ | HfWhisperEncoder | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 114.057 ms | 12 - 65 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
61
+ | HfWhisperEncoder | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 20.071 ms | 0 - 15 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
62
+ | HfWhisperEncoder | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 56.155 ms | 11 - 107 MB | NPU | [Whisper-Tiny-En.onnx.zip](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx.zip) |
63
+ | HfWhisperEncoder | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 85.868 ms | 20 - 65 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
64
+ | HfWhisperEncoder | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 15.68 ms | 0 - 70 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
65
+ | HfWhisperEncoder | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 42.132 ms | 38 - 659 MB | NPU | [Whisper-Tiny-En.onnx.zip](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx.zip) |
66
+ | HfWhisperEncoder | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 84.386 ms | 20 - 48 MB | GPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
67
+ | HfWhisperEncoder | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 12.541 ms | 0 - 68 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
68
+ | HfWhisperEncoder | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 35.963 ms | 7 - 415 MB | NPU | [Whisper-Tiny-En.onnx.zip](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx.zip) |
69
+ | HfWhisperEncoder | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 19.564 ms | 2 - 2 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
70
+ | HfWhisperEncoder | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 59.836 ms | 66 - 66 MB | NPU | [Whisper-Tiny-En.onnx.zip](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx.zip) |
71
+ | HfWhisperDecoder | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 5.211 ms | 3 - 97 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
72
+ | HfWhisperDecoder | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 3.626 ms | 0 - 46 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
73
+ | HfWhisperDecoder | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 3.568 ms | 3 - 115 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
74
+ | HfWhisperDecoder | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 2.837 ms | 10 - 44 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
75
+ | HfWhisperDecoder | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 3.169 ms | 1 - 307 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
76
+ | HfWhisperDecoder | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 2.155 ms | 0 - 29 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
77
+ | HfWhisperDecoder | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 3.899 ms | 3 - 98 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
78
+ | HfWhisperDecoder | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 2.719 ms | 6 - 42 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
79
+ | HfWhisperDecoder | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 5.211 ms | 3 - 97 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
80
+ | HfWhisperDecoder | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 3.626 ms | 0 - 46 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
81
+ | HfWhisperDecoder | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 3.108 ms | 0 - 314 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
82
+ | HfWhisperDecoder | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 2.174 ms | 0 - 54 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
83
+ | HfWhisperDecoder | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 4.158 ms | 3 - 101 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
84
+ | HfWhisperDecoder | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 2.967 ms | 0 - 45 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
85
+ | HfWhisperDecoder | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 3.195 ms | 1 - 262 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
86
+ | HfWhisperDecoder | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 2.146 ms | 0 - 31 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
87
+ | HfWhisperDecoder | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 3.899 ms | 3 - 98 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
88
+ | HfWhisperDecoder | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 2.719 ms | 6 - 42 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
89
+ | HfWhisperDecoder | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 3.061 ms | 0 - 302 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
90
+ | HfWhisperDecoder | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 2.152 ms | 3 - 30 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
91
+ | HfWhisperDecoder | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 5.004 ms | 19 - 37 MB | NPU | [Whisper-Tiny-En.onnx.zip](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx.zip) |
92
+ | HfWhisperDecoder | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 2.328 ms | 0 - 111 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
93
+ | HfWhisperDecoder | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 1.62 ms | 0 - 43 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
94
+ | HfWhisperDecoder | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 4.514 ms | 23 - 417 MB | NPU | [Whisper-Tiny-En.onnx.zip](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx.zip) |
95
+ | HfWhisperDecoder | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 2.058 ms | 0 - 99 MB | NPU | [Whisper-Tiny-En.tflite](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.tflite) |
96
+ | HfWhisperDecoder | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 1.35 ms | 1 - 32 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
97
+ | HfWhisperDecoder | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 3.411 ms | 24 - 176 MB | NPU | [Whisper-Tiny-En.onnx.zip](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx.zip) |
98
+ | HfWhisperDecoder | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 1.917 ms | 101 - 101 MB | NPU | [Whisper-Tiny-En.dlc](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.dlc) |
99
+ | HfWhisperDecoder | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 4.214 ms | 75 - 75 MB | NPU | [Whisper-Tiny-En.onnx.zip](https://huggingface.co/qualcomm/Whisper-Tiny-En/blob/main/Whisper-Tiny-En.onnx.zip) |
100
 
101
 
102
 
 
257
 
258
  ## License
  * The license for the original implementation of Whisper-Tiny-En can be found
+ [here](https://github.com/huggingface/transformers/blob/v4.42.3/LICENSE).
  * The license for the compiled assets for on-device deployment can be found [here](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/Qualcomm+AI+Hub+Proprietary+License.pdf)

  ## References
  * [Robust Speech Recognition via Large-Scale Weak Supervision](https://cdn.openai.com/papers/whisper.pdf)
+ * [Source Model Implementation](https://github.com/huggingface/transformers/tree/v4.42.3/src/transformers/models/whisper)
 
Whisper-Tiny-En_WhisperDecoderInf.dlc → Whisper-Tiny-En_HfWhisperDecoder.dlc RENAMED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8df2c0133d8c62a6ac0594dd7a89e0f05d1eaf22220886877c19e453338f6b74
- size 96795673
+ oid sha256:3d38de3ecc8535d9e791a8e076a7bacf57b96c218c900cd401d7fe37b32cb86f
+ size 97117892
Whisper-Tiny-En_WhisperDecoderInf.onnx.zip → Whisper-Tiny-En_HfWhisperDecoder.onnx.zip RENAMED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:5e6199b1549a03a946d76619bb4721a0b84e7bb2112a04e66ce833ca0a8fb59c
- size 120022750
+ oid sha256:612d6532fb6d5fd0efca659e44d93f2e9febb1c972de90c2d4adaea577fcf24c
+ size 120336357
Whisper-Tiny-En_WhisperDecoderInf.tflite → Whisper-Tiny-En_HfWhisperDecoder.tflite RENAMED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8f2e94746b483b48cb02424ad34d83e949c1f5e1cd900fcd24052eca240514e7
- size 113437748
+ oid sha256:8315aaddb046ceecb27b76f595bb8a63bd7a13fa30c8bf566e9c4d5a3df4c976
+ size 113817292
Whisper-Tiny-En_WhisperEncoderInf.onnx.zip → Whisper-Tiny-En_HfWhisperEncoder.dlc RENAMED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b200418cebc248f45c2059482333bdaef37f5c32470c10b4fb6cd05159c0eea0
- size 23620488
+ oid sha256:2f571e8f730a98bda9ddb6ceec23c606e2f922566410bbbb8dbaa10dcd899bfe
+ size 19145812
Whisper-Tiny-En_WhisperEncoderInf.tflite → Whisper-Tiny-En_HfWhisperEncoder.onnx.zip RENAMED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:802d5e08a6ab7e8794f9826c939239d5063012e6144a931be5a65bdf392b21d7
- size 37612748
+ oid sha256:0ddd344d424e5c76fba3931e9ea6b3fda86dfa263bd35f3da986599345ffb27b
+ size 23537294
Whisper-Tiny-En_WhisperEncoderInf.dlc → Whisper-Tiny-En_HfWhisperEncoder.tflite RENAMED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f2f7a43dcf6a291d57739e50d845c72653a235c3aaf8d1fb9591ff6c8f9366cf
- size 19003655
+ oid sha256:882ef43964f7ea37f5d14da6e04c1e406ddaa5872ab216b90b6c6af399eeffe5
+ size 37644668
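
Each renamed asset above is stored as a Git LFS pointer: a `version` line, an `oid` line of the form `algorithm:digest`, and a `size` line in bytes. The sketch below shows one way a downloaded asset could be checked against such a pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of this repository or of Git LFS itself; the sample pointer text is copied from the `Whisper-Tiny-En_HfWhisperDecoder.dlc` entry.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line looks like "sha256:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    if path.stat().st_size != pointer["size"]:
        return False  # cheap size check before hashing
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in chunks so large model files are not read into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text for Whisper-Tiny-En_HfWhisperDecoder.dlc, as listed in this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:3d38de3ecc8535d9e791a8e076a7bacf57b96c218c900cd401d7fe37b32cb86f
size 97117892
"""
pointer = parse_lfs_pointer(POINTER)
```

With a local copy of the asset, `matches_pointer(Path("Whisper-Tiny-En_HfWhisperDecoder.dlc"), pointer)` would report whether the download matches the committed pointer; the local path is an assumption.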
precompiled/qualcomm-snapdragon-x-elite/Whisper-Tiny-En_HfWhisperDecoder.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:feb5c7e6383d0ae11fbb594f06942aada96e1b1987981193209ec4946f9bc416
+ size 97153024
precompiled/qualcomm-snapdragon-x-elite/Whisper-Tiny-En_HfWhisperDecoder.onnx.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b466dccaa1552c1d4e46b1f7c1a4f00720bc9d92a055cb0a12e798c447666dfe
+ size 88673991
precompiled/qualcomm-snapdragon-x-elite/Whisper-Tiny-En_HfWhisperEncoder.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:78687761a168dd50311f79621eedc78cc79dc89f34963d9768dc65b321a889d0
+ size 29900800
precompiled/qualcomm-snapdragon-x-elite/Whisper-Tiny-En_HfWhisperEncoder.onnx.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:ed77350cd8b947c8240049f18de9e34c3a1c9f142d8dd9e1eb683faff5b19407
+ size 19175298
precompiled/qualcomm-snapdragon-x-elite/Whisper-Tiny-En_WhisperDecoderInf.bin DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:839d592de2cac0407fdbc3771a4bf0828997814875b51291e8fee07a4c72b3cd
- size 97450056
precompiled/qualcomm-snapdragon-x-elite/Whisper-Tiny-En_WhisperDecoderInf.onnx.zip DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:2cae000221d2fc5aca74b6bae58f5521bbfc51da110f16f7281c16b5f422cbd5
- size 89869723
precompiled/qualcomm-snapdragon-x-elite/Whisper-Tiny-En_WhisperEncoderInf.bin DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:98797356ec3fc25edaea77a5e87d5a8bc27f35f567d8c76d299e95fa086d9545
- size 32293824
precompiled/qualcomm-snapdragon-x-elite/Whisper-Tiny-En_WhisperEncoderInf.onnx.zip DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:9cdcfd68823fb3c34d1eb8376ee5788da82ea71e22a3784cf79cd372c4797bee
- size 19987941
precompiled/qualcomm-snapdragon-x-elite/sdk_versions.yml CHANGED
@@ -1,5 +1,5 @@
  sdk_versions:
    qnn_context_binary:
-     qairt: 2.34.2.250528164111_119506
+     qairt: 2.37.0.250724175447_124859
    precompiled_qnn_onnx:
      qairt: 2.33.2.250410134701_117956
sdk_versions.yml ADDED
@@ -0,0 +1,8 @@
+ sdk_versions:
+   tflite:
+     tflite: 2.17.0
+   qnn_dlc:
+     qairt: 2.37.0.250724175447_124859
+   onnx:
+     qairt: 2.33.2.250410134701_117956
+     onnx_runtime: 1.22.0
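
The new `sdk_versions.yml` records which toolchain produced each runtime's assets, and the QNN_DLC and ONNX entries list different `qairt` builds. A minimal sketch of comparing them follows. It assumes the nesting shown above (runtime, then tool, then version string) and assumes the first three dot-separated fields of a `qairt` string form the release number, with the remainder being a build identifier; neither assumption is confirmed by the commit. The dictionary mirrors the file by hand, and the function names are hypothetical.

```python
# Hand-copied mirror of sdk_versions.yml; the nesting is an assumption
# inferred from the key order in the diff.
SDK_VERSIONS = {
    "tflite": {"tflite": "2.17.0"},
    "qnn_dlc": {"qairt": "2.37.0.250724175447_124859"},
    "onnx": {"qairt": "2.33.2.250410134701_117956", "onnx_runtime": "1.22.0"},
}


def release_tuple(version: str) -> tuple:
    """Return the first three dot-separated fields as integers (assumed release number)."""
    return tuple(int(part) for part in version.split(".")[:3])


def qairt_releases(sdk_versions: dict) -> dict:
    """Map each runtime that lists a qairt entry to its assumed release tuple."""
    return {
        runtime: release_tuple(tools["qairt"])
        for runtime, tools in sdk_versions.items()
        if "qairt" in tools
    }


releases = qairt_releases(SDK_VERSIONS)
# True when the DLC assets were built with a newer QAIRT release than the ONNX assets.
dlc_is_newer = releases["qnn_dlc"] > releases["onnx"]
```

Comparing tuples rather than raw strings avoids lexicographic surprises such as `"2.9"` sorting after `"2.37"`.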