RAPNIC Project: Automatic Recognition of Unintelligible Speech in Catalan

Ferran Benito on 5 March 2025

Boy with cerebral palsy, a potential beneficiary of the RAPNIC Project (Automatic Recognition of Unintelligible Speech in Catalan)

RAPNIC is a project focused on developing and training an Artificial Intelligence (AI) model capable of recognizing unintelligible speech in Catalan.

(2025-2026)

Despite technological advancements, voice recognition systems still face significant challenges in identifying and interpreting speech patterns that deviate from standard norms, particularly in the case of so-called “unintelligible speech.” As a result, people with speech disorders, such as dysarthria, often struggle to be understood and are unable to access technological tools that could facilitate their communication and autonomy. Furthermore, the few existing solutions are primarily available in English, creating an additional barrier for speakers of other languages, including Catalan.

The RAPNIC project (Automatic Recognition of Unintelligible Speech in Catalan) aims to address this need by developing an AI-powered solution capable of recognizing and interpreting unintelligible speech in Catalan. This technology will allow individuals with speech disorders to interact effectively with voice assistants, speech-to-text transcription systems, and other digital tools, thereby improving their autonomy and quality of life.

What is RAPNIC?

RAPNIC is a pioneering initiative by the iSocial Foundation that seeks to eliminate a major technological barrier for people with speech disorders. Using Artificial Intelligence (AI), RAPNIC aims to develop a system capable of recognizing and understanding unintelligible speech, which current voice recognition systems fail to comprehend.

To achieve this, the project plans to create a database of unintelligible Catalan speech, built from voice recordings of individuals with speech disorders, with a particular focus on dysarthria, a motor impairment that affects the facial muscles and makes certain phonemes difficult to pronounce. This database will be used to train deep learning models to recognize the sound patterns of these speech variations.

In the project’s initial phase, work will be carried out on a corpus of recordings from individuals with Down syndrome and cerebral palsy, as their speech patterns generally present fewer alterations than other forms of dysarthria, making AI training more manageable. The database will contain at least 100 hours of recorded speech and involve the participation of 120 volunteer speakers with Down syndrome and cerebral palsy, along with social professionals, speech therapists, deep learning specialists, and computational linguists.

The ultimate goal of RAPNIC is to extend this system, first to the 22,000 people with Down syndrome and cerebral palsy within the Catalan linguistic community and, in a later phase, to the entire population of 49,000 individuals with speech disorders in the Catalan-speaking region, including those with more severe conditions. RAPNIC will enable affected individuals to communicate more effectively with those around them, give voice commands to digital assistants, transcribe their speech into text, and interact with other digital services more independently.

CSC Impulsa 2024 Award

The RAPNIC Project has been recognized as the winner of the CSC Impulsa 2024 Awards in the category of “Innovative AI Projects in the Social Sector.”

This recognition, granted by the Catalan Health and Social Consortium (CSC), includes financial support of €20,000 and technical guidance from the consortium, which will be crucial in transforming this initiative into a tangible reality.

Project Goals

RAPNIC aims to achieve the following key objectives:

Develop an open-access dataset of unintelligible Catalan speech, based on voice recordings from individuals with speech disorders, particularly those with Down syndrome and cerebral palsy.
Create an Artificial Intelligence model for the recognition of unintelligible speech, improving accessibility and digital inclusion for people with speech disorders.
Launch an online application that allows individuals with speech disorders to test the model and assess its accuracy in transcribing their speech.
Make the open dataset available to technology companies, enabling existing Catalan speech recognition tools to integrate the recognition of unintelligible speech.
Publish a scientific study detailing the data collection process and research outcomes. This study will be presented at an international Natural Language Processing (NLP) conference, contributing to the dissemination of knowledge and methodologies used in the project.
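
The project description does not say how the online application will measure transcription accuracy. A common metric for speech recognition is word error rate (WER): the number of word substitutions, deletions, and insertions needed to turn the model's transcript into the reference transcript, divided by the number of reference words. The Python sketch below is only an illustration under that assumption; the function name and the example phrases are hypothetical and are not taken from RAPNIC.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative only: RAPNIC has not published its evaluation method.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical example: the model drops one of three reference words.
    print(word_error_rate("obre la porta", "obre porta"))
```

A lower value means a closer match; 0.0 means the transcript matches the reference word for word, and values above 1.0 are possible when the model inserts many extra words.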

Expected Impact and Benefits

RAPNIC has significant potential to improve the lives of people with speech disorders by enhancing their interaction with digital technologies and fostering social inclusion. The main expected benefits include:

Increased autonomy for individuals with speech disorders, especially the 22,000 Catalan speakers with mild speech impairments, who will gain access to voice assistants, speech captioning tools, and automatic transcription services currently unavailable to them.
Improved accessibility and communication for affected individuals, allowing them to use the same digital tools as the rest of the population, with assurance that their specific needs will be addressed.
Reduction of the digital divide affecting people with speech disorders, particularly those with additional motor impairments.
Creation of a speech dataset that will enable researchers to continue training open-source AI models in Catalan and help companies develop speech recognition tools that support unintelligible speech.
Development of an AI model and methodology that could, in the future, be applied to more severe speech disorders (e.g., those caused by stroke or brain injury) and adapted to other languages.

