
OpenAI Whisper Speaker Diarization - Transcription with Speaker Names

JavaScript Фрилансерская Платформа
Views: 32
Upload date: 29.11.2023 16:03
Duration: 00:10:26
Category: Education

Description

High-level overview of what's happening with OpenAI Whisper Speaker Diarization:

Using OpenAI's Whisper model to separate the audio into segments and generate a transcript for each one.
Then generating a speaker embedding for each segment.
Then using agglomerative clustering on the embeddings to identify the speaker for each segment.
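
The clustering step can be sketched roughly as below. This is a minimal illustration, not the code from the Colab: the segments and 3-dimensional embeddings are made-up toy data standing in for Whisper output and real speaker embeddings, the function names (`agglomerative_cluster`, `label_segments`) are hypothetical, and cosine distance with average linkage is an assumed choice. In practice a library implementation such as scikit-learn's `AgglomerativeClustering` would likely be used instead of this hand-rolled loop.

```python
import math


def cosine_distance(a, b):
    """1 - cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def agglomerative_cluster(embeddings, num_speakers):
    """Average-linkage agglomerative clustering.

    Starts with one cluster per segment and repeatedly merges the two
    closest clusters until only `num_speakers` clusters remain.
    Returns one integer label per embedding.
    """
    clusters = [[i] for i in range(len(embeddings))]
    while len(clusters) > num_speakers:
        best = None  # (distance, index_i, index_j) of the closest pair
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                total = sum(
                    cosine_distance(embeddings[p], embeddings[q])
                    for p in clusters[i]
                    for q in clusters[j]
                )
                dist = total / (len(clusters[i]) * len(clusters[j]))
                if best is None or dist < best[0]:
                    best = (dist, i, j)
        _, i, j = best
        clusters[i].extend(clusters[j])
        del clusters[j]
    # Number clusters by first appearance so labels are stable.
    clusters.sort(key=min)
    labels = [0] * len(embeddings)
    for label, members in enumerate(clusters):
        for idx in members:
            labels[idx] = label
    return labels


def label_segments(segments, embeddings, num_speakers):
    """Attach a speaker name to each transcript segment."""
    labels = agglomerative_cluster(embeddings, num_speakers)
    return [
        {**seg, "speaker": f"SPEAKER {label + 1}"}
        for seg, label in zip(segments, labels)
    ]


# Toy stand-ins for transcript segments and their speaker embeddings.
segments = [
    {"start": 0.0, "end": 2.5, "text": "Welcome to the show."},
    {"start": 2.5, "end": 4.0, "text": "Thanks for having me."},
    {"start": 4.0, "end": 7.0, "text": "Let's get started."},
    {"start": 7.0, "end": 9.0, "text": "Sure."},
]
embeddings = [
    [0.9, 0.1, 0.0],
    [0.1, 0.9, 0.1],
    [0.8, 0.2, 0.1],
    [0.0, 1.0, 0.2],
]

labelled = label_segments(segments, embeddings, num_speakers=2)
for seg in labelled:
    print(f'{seg["speaker"]} [{seg["start"]:.1f}-{seg["end"]:.1f}]: {seg["text"]}')
```

Note that the number of speakers is passed in explicitly here; clustering only groups similar-sounding segments, so the resulting labels are generic ("SPEAKER 1", "SPEAKER 2") until someone maps them to real names.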

Speaker identification (also called speaker labelling) is very important when transcribing podcasts or other multi-speaker conversations, because the transcript needs to show who said what. This code helps you do that.

Dwarkesh Patel's tweet announcement - https://twitter.com/dwarkesh_sp/status/1579672641887408129

Colab - https://colab.research.google.com/drive/1V-Bt5Hm2kjaDb4P1RyMSswsDKyrzc2-3?usp=sharing

Hugging Face Space - https://huggingface.co/spaces/dwarkesh/whisper-speaker-recognition
