Skip to content

notaTeapot/whisper-annotation-tools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Text+

Code Release zum Vortrag "Automatische Transkription von Podcastfolgen für Korpuslinguistische Untersuchungen."

Slides: https://zenodo.org/doi/10.5281/zenodo.12584114

Skripte für die Whisper Automatisierung auf HPC-Sytemen: https://github.com/notaTeapot/whisper-hpc-automation

Whisper Annotation Tools

Whisper Annotation Tools (WAT) is a set of utilities transforming whisper ASR and pyannote.speaker-diarization into transcript formats ready for further linguistic analysis.

Currently supporting:

Installation

pip install git+https://github.com/notaTeapot/whisper-annotation-tools.git

Usage

usage: whisper-annotation-tools.exe [-h] --file-name FILE_NAME --output-name OUTPUT_NAME [--prepend-time PREPEND_TIME] [--metadata-file METADATA_FILE] [--audio-file AUDIO_FILE]

Whisper Annotation Tools

options:
  -h, --help            show this help message and exit
  --file-name FILE_NAME
                        Path to the whisper output json
  --output-name OUTPUT_NAME
                        Path to the desired output file, Mode defined by file extension (.flk-->FOLKER Compatible, .xml-->Corpus Tools, .json)
  --prepend-time PREPEND_TIME
                        cut off time from beginning of whisper output
  --metadata-file METADATA_FILE
                        Path to json file containing episode.
  --audio-file AUDIO_FILE
                        Path to episode audio file, used for .flk.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages