brew install ffmpeg
sudo apt-get install ffmpeg
Parameter | Type | Default | Description |
---|---|---|---|
base_dir | Path | Path.cwd() | Base directory for audio files |
read_files_in_base_dir | bool | True | Whether to register the read_files function |
path_or_hf_repo | str | "mlx-community/whisper-large-v3-turbo" | Path or HuggingFace repo for the model |
verbose | bool | None | Enable verbose output |
temperature | float or Tuple[float, ...] | None | Temperature for sampling |
compression_ratio_threshold | float | None | Compression ratio threshold |
logprob_threshold | float | None | Log probability threshold |
no_speech_threshold | float | None | No speech threshold |
condition_on_previous_text | bool | None | Whether to condition on previous text |
initial_prompt | str | None | Initial prompt for transcription |
word_timestamps | bool | None | Enable word-level timestamps |
prepend_punctuations | str | None | Punctuations to prepend |
append_punctuations | str | None | Punctuations to append |
clip_timestamps | str or List[float] | None | Clip timestamps |
hallucination_silence_threshold | float | None | Hallucination silence threshold |
decode_options | dict | None | Additional decoding options |
Function | Description |
---|---|
transcribe | Transcribes an audio file using MLX Whisper |
read_files | Lists all audio files in the base directory |