FuzzyAudioSearch

Retrieve Audio Segment in Large Text Corpus

Installation

git clone https://github.com/davoodwadi/FuzzyAudioSearch.git
cd FuzzyAudioSearch
python -m venv venv

# if on Windows
source venv/Scripts/activate
# if on MacOS or Linux
source venv/bin/activate

pip install -r requirements.txt

Usage

python FuzzyAudioSearch.py -a audio_file -t text_file

where audio_file is the path to the audio file you want to use to search.

text_file is the path to the large corpus.

You can optionally pass -c to set the chunk of the start and end of audio to find matches.

python FuzzyAudioSearch.py -a audio_file -t text_file -c 100

While the default model size, tiny, is sufficient for many texts, for multilingual texts (e.g. books by Nietzsche, which contain English and German text) it helps to use larger whisper models.

python FuzzyAudioSearch.py -a audio_file -t text_file -c 100 -m tiny

model options:

tiny
small
medium
large-v3

Credits

Citation

@software{Wadi_Retrieve_Audio_Segment_2023,
author = {Wadi, Davood},
month = jan,
title = {{Retrieve Audio Segment in Large Text Corpus }},
version = {0.0.1},
year = {2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitattributes		.gitattributes
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
audio_in_book.py		audio_in_book.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FuzzyAudioSearch

Installation

Usage

Credits

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

davoodwadi/FuzzyAudioSearch

Folders and files

Latest commit

History

Repository files navigation

FuzzyAudioSearch

Installation

Usage

Credits

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages