We aim to create a difference in the Arabic speech resources
We share, support and encourage everyone to contribute to Arabic speech resources
Numerous efforts have been given to produce spoken Arabic data set resources. From CallHome task (1996/97 NIST benchmark) to
the Global Autonomous Language Exploitation (GALE) [2006-2009], many resources have been created.
Here we list some publicly available speech corpora.
Hours of Arabic Speech Data
Arabic Speech Recognition Resources
Dialectal Arabic Code-Switching Dataset: includes the annotated two-hours Egyptian dataset from the ADI-5 development split in the MGB-3 challenge
Arabic Dialect Identification Resources
Planning to contribute to ArabicSpeech community!
Looking to make ArabicSpeech great, full of resources and support open source!
Join our community!