RATS language identification
"Comprised of approximately 5,400 hours of Levantine Arabic, Farsi, Dari, Pashto and Urdu conversational telephone speech with annotation of speech segments. The corpus was created to provide training, development and initial test sets for the Language Identification (LID) task in the DARPA RAT...
Format: | Book |
---|---|
Language: | Arabic Persian Pushto Urdu |
Published: |
[Philadelphia, Pennsylvania] :
Linguistic Data Consortium,
[2017]
|
Subjects: |