Loughborough University
Browse
- No file added yet -

Stuttering detection using atrous convolutional neural networks

Download (350.06 kB)
conference contribution
posted on 2022-08-16, 08:16 authored by Abedal-karim Al-Banna, Eran Edirisinghe, Hui FangHui Fang
Stuttering is a neurodevelopmental speech disorder that affects 70 million people worldwide, approximately 1% of the whole population. People who stutter (PWS) have common speech symptoms such as block, interjection, repetition, and prolongation. The speech-language pathologists (SLPs) commonly observe these four groups of symptoms to evaluate stuttering severity. The evaluation process is tedious and time-consuming for (SLP) and (PWS). Therefore, this paper proposes a new model for stuttering events detection that may help (SLP) to evaluate stuttering severity. Our model is based on a log mel spectrogram and 2D atrous convolutional network designed to learn spectral and temporal features. We rigorously evaluate the performance of our model on two stuttering datasets (UCLASS and FluencyBank) using common speech metrics, i.e. F1-score, recall, and the area under the curve (AUC). Our experimental results indicate that our model outperforms state-of-the-art methods in prolongation with an F1 of 52% and 44.5% on the UCLASS and FluencyBank datasets, respectively. Also, we gain 5% and 3% margins on the UCLASS and FluencyBank datasets for fluent class.

History

School

  • Science

Department

  • Computer Science

Published in

2022 13th International Conference on Information and Communication Systems (ICICS)

Pages

252 - 256

Source

2022 13th International Conference on Information and Communication Systems (ICICS)

Publisher

IEEE

Version

  • AM (Accepted Manuscript)

Rights holder

© IEEE

Publisher statement

© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Publication date

2022-07-04

Copyright date

2022

ISBN

9781665480970

eISSN

2573-3346

Language

  • en

Location

Irbid, Jordan

Event dates

21st June 2022 - 23rd June 2022

Depositor

Dr Hui Fang. Deposit date: 15 August 2022

Usage metrics

    Loughborough Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC