Speaker Differentiation in AAC Data Logging Using Deep Learning

Lalit Sharma

Speaker Differentiation in AAC Data Logging Using Deep Learning (P1-P5)

Authors: Lalit Sharma

Published: 2025/4/28

Abstract

High-technology augmentative and alternative communication (AAC) devices are essential tools for individuals with complex communication needs. Automated data logging in these devices enables researchers and clinicians to analyze user performance. However, existing systems cannot distinguish between users when multiple individuals access the same device, compromising the validity of data logs and complicating performance evaluation. This paper proposes a deep neural network-based visual analysis approach to address this limitation. By processing video recordings of practice sessions, the method detects and identifies different AAC users, ensuring that data logs accurately reflect individual contributions. This solution has the potential to significantly improve the validity of performance data, streamline analysis, and ultimately enhance AAC outcome measures. Through a combination of advanced video processing and neural network techniques, this approach represents a major step forward in AAC research and clinical practice. It addresses a critical gap in current data logging systems and paves the way for more accurate, user-specific performance evaluation.

Keywords

Augmentative and Alternative Communication (AAC), High-technology, AAC devices, Complex communication needs, User performance analysis, Automated data logging systems, Deep neural networks.

References

D. J. Higginbotham and C. R. Engelke, “A primer for doing talk-in-interaction research in augmentative and alternative communication,”Augmentative Altern. Commun., vol. 29, no. 1, pp. 3–19, 2013. T. Kovacs and K. Hill, “Language samples from children who use speech-generating devices: Making sense of small samples and utterance length,” Am. J. Speech Lang. Pathol., vol. 26, no. 3, pp. 939–950, 2017.
S.-H. K. Chen, S. Wadhwa, and E. Nyberg, “Design and analysis of interoperable data logs for augmentative communication practice,” in The 21st Int. ACM SIGACCESS Conf. Comput. Accessibility, Pitts-urgh, PA, USA, 2019, pp. 533–535.
Pawełoszek, I., Kumar, N., & Solanki, U. (2022). Artificial intelligence, digital technologies and the future of law. Futurity Economics & Law, 2(2), 24–33. https://doi.org/10.57125/FEL.2022.06.25.03
Vinay Singh, Alok Agggarwal and Narendra Kumar: “A Rapid Transition from Subversion to Git: Time, Space, Branching, Merging, Offline commits & Offline builds and Repository aspects, Recent Advances in computers Sciences and communications, Recent Advances in Computer Science and Communications, Bentham Science, vol 15 (5) 2022 pp 0-8, (DOI : 10.2174/2666255814666210621121914)June 2021 (SCOPUS/ SCI indexed)
S. C. Sennott, J. C. Light, and D. McNaughton, “AAC modelling intervention research review,” Res. Pract. Persons Severe Disabil-ities, vol. 41, no. 2, pp. 101–115, 2016.
CoughDrop, Inc., “CoughDrop App,” 2019. [Online] Available:https://https://www.assistiveware.com/products/proloquo2go/, Ac- cessed on: Apr. 2, 2021.

Download PDF

How to Cite

Lalit Sharma, (2025/4/28). Speaker Differentiation in AAC Data Logging Using Deep Learning. JANOLI International Journal of Artificial Intelligence and its Applications, Volume hJAEiqNzZqWjtpPaXKzr, Issue 2.

ISSN: 3048-6815

JANOLI International Journal of
Artificial Intelligence and its Applications ( JIJAIA )