Speaker Segmentation Transcription (SST)

Ahmad, Talha; Asif, Shuban; Talha, M.; Supervised by Dr. Shibli Nisar

dc.contributor.author	Ahmad, Talha
dc.contributor.author	Asif, Shuban
dc.contributor.author	Talha, M.
dc.contributor.author	Supervised by Dr. Shibli Nisar
dc.date.accessioned	2025-02-12T12:29:50Z
dc.date.available	2025-02-12T12:29:50Z
dc.date.issued	2023-04
dc.identifier.other	PTC-438
dc.identifier.uri	http://10.250.8.41:8080/xmlui/handle/123456789/49788
dc.description.abstract	Speaker segmentation is an important task in speech processing that involves identifying the boundaries between different speakers in an audio or video recording. The objective of speaker segmentation is to separate the speech of different speakers and assign each segment of speech to the appropriate speaker. Speaker segmentation is a challenging task due to the variability in speech signals caused by different speakers, acoustic conditions, and languages. In this project, we propose a speaker segmentation algorithm based on the clustering technique. The algorithm uses a set of acoustic features extracted from the speech signal to cluster speech segments belonging to the same speaker. We evaluate the proposed algorithm on a dataset of speech recordings and compare its performance with that of other state-of-the-art speaker segmentation algorithms. The results show that the proposed algorithm outperforms the other algorithms in terms of accuracy and robustness. The proposed algorithm has the potential to be used in a wide range of speech processing applications, such as speaker diarization, automatic transcription, and speaker recognition.	en_US
dc.language.iso	en	en_US
dc.publisher	MCS	en_US
dc.title	Speaker Segmentation Transcription (SST)	en_US
dc.type	Project Report	en_US
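
Illustrative sketch (not taken from the report): the abstract describes clustering acoustic feature vectors so that speech segments from the same speaker fall into the same group. This record does not state which features or which clustering method the authors used, so the example below is only a minimal sketch under assumptions: it assumes per-frame feature vectors are already available, uses plain k-means as a stand-in clustering technique, and assumes the number of speakers is known. The names kmeans, labels_to_segments, and hop_s are hypothetical.

```python
from math import dist


def kmeans(points, k, iters=20):
    """Plain k-means (an assumed stand-in for the report's clustering step).

    Centroids start at evenly spaced points so the result is deterministic.
    """
    step = max(1, len(points) // k)
    centroids = [points[i * step] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each frame to its nearest centroid.
        labels = [min(range(k), key=lambda c: dist(p, centroids[c])) for p in points]
        # Recompute each centroid as the mean of its member frames.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels


def labels_to_segments(labels, hop_s):
    """Merge runs of identical frame labels into (start_s, end_s, label) tuples."""
    segments = []
    start = 0
    for i in range(1, len(labels) + 1):
        if i == len(labels) or labels[i] != labels[start]:
            segments.append((start * hop_s, i * hop_s, labels[start]))
            start = i
    return segments


# Synthetic two-dimensional "features": speaker A, then speaker B, then A again.
frames = [(0.0, 0.1)] * 5 + [(10.0, 9.9)] * 5 + [(0.1, 0.0)] * 3
frame_labels = kmeans(frames, k=2)
for segment in labels_to_segments(frame_labels, hop_s=0.5):
    print(segment)
```

The second function is where segmentation happens in this sketch: a speaker boundary is placed wherever the cluster label changes between consecutive frames. A real system would typically replace the synthetic vectors with features computed from audio and would need some way to choose the number of clusters.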