WaaS - Whisper as a Service

Backend Flask application for the speech-to-text service.

This service is powered by OpenAI Whisper.

API Documentation

POST /v1/transcribe

Add a new transcription job to the queue. The job is processed asynchronously by the worker.

The response is a JSON object with a job_id that can be used to check the status of the job.

Query parameters:

  • REQUIRED: email_callback: string
  • OPTIONAL: language: string (default: automatic detection)
  • OPTIONAL: model: string (default: tiny)
  • OPTIONAL: task: string (default: transcribe)
    • transcribe: Transcribe the audio to text
    • translate: Transcribe the audio and translate the text to English
  • OPTIONAL: filename: string (default: untitled-transcription)

Body:

  • REQUIRED: binary data: Raw data with the audio content to transcribe
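
A minimal sketch of submitting a job with Python's requests library. The host, file path, email address, and language value are placeholders, and the exact shape of the JSON response beyond the documented job_id key is an assumption.

import requests

# Read the audio file as raw bytes (placeholder path).
with open("/path/to/audio.mp3", "rb") as f:
    audio = f.read()

response = requests.post(
    "http://localhost:5000/v1/transcribe",
    params={
        "email_callback": "user@example.com",  # required
        "language": "no",                      # optional, defaults to automatic detection
        "model": "tiny",                       # optional, defaults to tiny
        "task": "transcribe",                  # optional: transcribe or translate
        "filename": "my-recording",            # optional
    },
    headers={"Content-Type": "audio/mpeg"},
    data=audio,                                # raw audio content in the request body
)

job_id = response.json()["job_id"]             # documented key in the response
print(job_id)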

OPTIONS /v1/transcribe

Get the available options for the transcribe route.

POST /v1/detect

Detect the language of the audio file.

Query parameters:

  • OPTIONAL: model: string (default: tiny)

Body:

  • REQUIRED: binary data: Raw data with the audio content to detect the language for
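
A minimal sketch of calling the detect route with Python's requests library, assuming the service runs on localhost:5000; the file path is a placeholder, and since the shape of the response body is not documented here, it is simply printed.

import requests

with open("/path/to/audio.mp3", "rb") as f:
    response = requests.post(
        "http://localhost:5000/v1/detect",
        params={"model": "tiny"},              # optional, defaults to tiny
        headers={"Content-Type": "audio/mpeg"},
        data=f.read(),                         # raw audio content in the request body
    )

print(response.json())                         # detected-language result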

OPTIONS /v1/detect

Get the available options for the detect route.

GET /v1/download/<job_id>

Retrieve the finished job result in the requested output format.

Query parameters:

  • OPTIONAL: output: string (default: srt)
    • json: JSON response of the model output
    • timecode_txt: Plain text file with timecodes (SRT-style)
    • txt: Plain text file of the detected text
    • vtt: WebVTT file with the detected text
    • srt: SubRip (SRT) file with the detected text
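
A minimal sketch of downloading a finished result as WebVTT, assuming the service runs on localhost:5000; the job_id value is a placeholder.

import requests

job_id = "REPLACE_WITH_JOB_ID"
response = requests.get(
    f"http://localhost:5000/v1/download/{job_id}",
    params={"output": "vtt"},  # one of json, timecode_txt, txt, vtt, srt (default: srt)
)

with open("transcript.vtt", "wb") as f:
    f.write(response.content)  # write the returned subtitle file to disk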

OPTIONS /v1/download/<job_id>

Get the available options for the download route.

GET /v1/jobs/<job_id>

Get the status and metadata of the provided job.
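
A minimal sketch of checking a job, assuming the service runs on localhost:5000; the job_id value is a placeholder, and since the exact status and metadata fields are not documented here, the response is printed as-is.

import requests

job_id = "REPLACE_WITH_JOB_ID"
response = requests.get(f"http://localhost:5000/v1/jobs/{job_id}")
print(response.json())  # status and metadata of the job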

GET /v1/queue

Get the current length of the queue as a JSON object with the key length.
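
A minimal sketch of reading the queue length, assuming the service runs on localhost:5000 and using the documented length key.

import requests

queue = requests.get("http://localhost:5000/v1/queue").json()
print(queue["length"])  # number of jobs currently in the queue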

Contributing

Requirements

The required amount of VRAM depends on the model used. The smallest model is tiny, which requires about 1 GB of VRAM.

The full list of models, with information about the required VRAM, is available in the OpenAI Whisper repository.

The codebase is expected to be compatible with Python 3.8-3.10, the same as the OpenAI Whisper requirements.

Installation

python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Running full setup using docker-compose

First create a .envrc file with the following content:

export BASE_URL=https://example.com
export [email protected]
export EMAIL_SENDER_PASSWORD=example
export EMAIL_SENDER_HOST=smtp.example.com

export DISCLAIMER='This is a <a href="example.com">disclaimer</a>'

Then run the following command:

docker-compose --env-file .envrc up

This will start three Docker containers:

  • redis
  • api running Flask from src
  • worker running rq from src

Running full setup using devcontainers

Install the Remote Development (Containers) extension in VS Code. Then run Dev Containers: Open Folder in Container from the command palette. You will then be inside the api container and can work from there.

To upload a Norwegian-language file called audunspodssounds.mp3 from your Downloads directory:

curl --location --request POST 'localhost:5000/v1/transcribe?output=vtt' \
  --header 'Content-Type: audio/mpeg' \
  --data-binary '@/Users/<user>/Downloads/audunspodssounds.mp3'

Running tests

$ pytest

How to fix [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate?

$ /Applications/Python\ 3.7/Install\ Certificates.command
