
Uses MPS (Mac acceleration) by default when available by dwarkeshsp · Pull Request #382 · openai/whisper

 1 year ago
source link: https://github.com/openai/whisper/pull/382

Uses MPS (Mac acceleration) by default when available #382

Conversation

Currently, Whisper defaults to using the CPU on macOS devices, despite the fact that PyTorch's nightly releases have introduced support for Apple's Metal Performance Shaders (MPS) framework on Apple devices (more info).

With my changes to __init__.py, torch checks whether MPS is available when no torch.device has been specified. If MPS is available and CUDA is not, Whisper defaults to MPS.

This way, Mac users can experience speedups from their GPU by default.
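The device-selection logic described above can be sketched as a pure function (a simplified sketch only; the actual PR modifies Whisper's __init__.py and would query torch directly, e.g. via torch.cuda.is_available() and torch.backends.mps.is_available()):

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Sketch of the PR's default-device selection when no device is
    explicitly specified: prefer CUDA, then MPS, then fall back to CPU."""
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"

# In real code the flags would come from torch, e.g.:
#   pick_device(torch.cuda.is_available(), torch.backends.mps.is_available())
```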


@dwarkeshsp have you measured any speedups compared to using the CPU?

Doesn't this also require switching FP16 off?

DiegoGiovany commented Nov 9, 2022


I'm getting this error when trying to use MPS:

/Users/diego/.pyenv/versions/3.10.6/lib/python3.10/site-packages/whisper-1.0-py3.10.egg/whisper/decoding.py:629: UserWarning: The operator 'aten::repeat_interleave.self_int' is not currently supported on the MPS backend and will fall back to run on the CPU. This may have performance implications. (Triggered internally at /Users/diego/Projects/pytorch/aten/src/ATen/mps/MPSFallback.mm:11.)
audio_features = audio_features.repeat_interleave(self.n_group, dim=0)
/AppleInternal/Library/BuildRoots/2d9b4df9-4b93-11ed-b0fc-2e32217d8374/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:794: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 23200 bytes
'
Abort trap: 6
/Users/diego/.pyenv/versions/3.10.6/lib/python3.10/multiprocessing/resource_tracker.py:224: UserWarning: resource_tracker: There appear to be 1 leaked semaphore objects to clean up at shutdown

any clues?


@DiegoGiovany Not an expert on this, but it looks like PyTorch itself is missing some operators for MPS. See for example
pytorch/pytorch#77764 (comment)
(which refers to repeat_interleave)

and
pytorch/pytorch#87219
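For reference, the unsupported operator simply repeats each element of a tensor along a dimension. In pure Python (an illustration only, not the PyTorch implementation), its behavior on a 1-D sequence looks like:

```python
def repeat_interleave(items, n):
    # Mimics torch.Tensor.repeat_interleave(n, dim=0) on a 1-D sequence:
    # each element is repeated n times, preserving order.
    return [x for x in items for _ in range(n)]
```

For example, repeat_interleave([1, 2], 3) gives [1, 1, 1, 2, 2, 2] — this is the call that falls back to the CPU in whisper/decoding.py.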


gltanaka commented Nov 17, 2022


Thanks for your work. I just tried this. Unfortunately, it didn't work for me on my M1 Max with 32 GB.
Here is what I did:
pip install git+https://github.com/openai/whisper.git@refs/pull/382/head

No errors on install and it works fine when run without mps: whisper audiofile_name --model medium

When I run: whisper audiofile_name --model medium --device mps

Here is the error I get:
Detecting language using up to the first 30 seconds. Use --language to specify the language
loc("mps_multiply"("(mpsFileLoc): /AppleInternal/Library/BuildRoots/810eba08-405a-11ed-86e9-6af958a02716/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShadersGraph/mpsgraph/MetalPerformanceShadersGraph/Core/Files/MPSGraphUtilities.mm":228:0)): error: input types 'tensor<1x1024x3000xf16>' and 'tensor<1xf32>' are not broadcast compatible
LLVM ERROR: Failed to infer result type(s).

When I run: whisper audiofile_name --model medium --device mps --fp16 False

Here is the error I get:
Detecting language using up to the first 30 seconds. Use --language to specify the language
Detected language: English
/anaconda3/lib/python3.9/site-packages/whisper/decoding.py:633: UserWarning: The operator 'aten::repeat_interleave.self_int' is not currently supported on the MPS backend and will fall back to run on the CPU. This may have performance implications. (Triggered internally at /Users/runner/work/pytorch/pytorch/pytorch/aten/src/ATen/mps/MPSFallback.mm:11.)
audio_features = audio_features.repeat_interleave(self.n_group, dim=0)
/AppleInternal/Library/BuildRoots/f0468ab4-4115-11ed-8edc-7ef33c48bc85/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:794: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 1007280 bytes

Basically the same error as @DiegoGiovany.

Any ideas on how to fix?
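One thing worth trying (hedged: this is PyTorch's experimental CPU fallback for unsupported MPS operators, and it may not resolve the buffer-size assertion above) is enabling the fallback environment variable before torch is imported:

```python
import os

# Experimental PyTorch flag: operators not yet implemented on the MPS
# backend fall back to running on the CPU instead of raising an error.
# It must be set before `import torch` to take effect.
os.environ["PYTORCH_ENABLE_MPS_FALLBACK"] = "1"

# import torch  # imported only after the flag is set
```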

megeek commented Nov 28, 2022

+1 for me! I'm actually using an Intel Mac with Radeon Pro 560X 4 GB...

PhDLuffy commented Dec 8, 2022


@dwarkeshsp

Doesn't work for me on a 2015 MacBook Pro with PyTorch 1.3 stable, an RX 580 eGPU, and macOS 12.3.

I changed the code to match yours.

When I use --device mps it shows an error, so maybe something else still needs to be changed or modified.

With --device cpu, it works.

MPS works with other PyTorch-Metal projects.


8 participants
