NUST Institutional Repository

Speaker Segmentation Transcription (SST)

Show simple item record

dc.contributor.author Ahmad, Talha
dc.contributor.author Asif, Shuban
dc.contributor.author Talha, M.
dc.contributor.author Supervised by Dr. Shibli Nisar
dc.date.accessioned 2025-02-12T12:29:50Z
dc.date.available 2025-02-12T12:29:50Z
dc.date.issued 2023-04
dc.identifier.other PTC-438
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/49788
dc.description.abstract Speaker segmentation is an important task in speech processing that involves identifying the boundaries between different speakers in an audio or video recording. The objective of speaker segmentation is to separate the speech of different speakers and assign each segment of speech to the appropriate speaker. Speaker segmentation is a challenging task due to the variability in speech signals caused by different speakers, acoustic conditions, and languages. In this project, we propose a speaker segmentation algorithm based on the clustering technique. The algorithm uses a set of acoustic features extracted from the speech signal to cluster speech segments belonging to the same speaker. We evaluate the proposed algorithm on a dataset of speech recordings and compare its performance with that of other state-of-theart speaker segmentation algorithms. The results show that the proposed algorithm outperforms the other algorithms in terms of accuracy and robustness. The proposed algorithm has the potential to be used in a wide range of speech processing applications, such as speaker diarization, automatic transcription, and speaker recognition. en_US
dc.language.iso en en_US
dc.publisher MCS en_US
dc.title Speaker Segmentation Transcription (SST) en_US
dc.type Project Report en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account