First DIHARD challenge development : eight sources /

"First DIHARD Challenge Development - Eight Sources was developed by the Linguistic Data Consortium (LDC) and contains approximately 17 hours of English and Chinese speech data along with corresponding annotations used in support of the First DIHARD Challenge. The First DIHARD Challenge was an...

Full description

Bibliographic Details
Main Author: Ryant, Neville (Creator)
Format: Audio Book
Language:Multiple
Published: [Philadelphia, PA] : Linguistic Data Consortium, 2019
Subjects:
LEADER 03087nim a22005177i 4500
001 0de923a1-a2d1-45f0-96df-7427f6b613b8
005 20240818000000.0
008 190627s2019 paunnn q nn mul d
020 |a 1585638870 
035 |a 14474794 
040 |a CtY  |b eng  |e rda  |c CtY 
090 |a yuldset 
090 |a yuldsetsnd 
090 |a yuldsettxt 
245 0 0 |a First DIHARD challenge development :  |b eight sources /  |c Linguistic Data Consortium 
264 1 |a [Philadelphia, PA] :  |b Linguistic Data Consortium,  |c 2019 
300 |a 1 DVD-ROM ;  |c 4 3/4 in 
336 |a computer dataset  |b cod  |2 rdacontent 
336 |a computer program  |b cop  |2 rdacontent 
336 |a spoken word  |b spw  |2 rdacontent 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a computer disc  |b cd  |2 rdacarrier 
347 |a audio file  |2 rdaft 
347 |a text file  |2 rdaft 
500 |a Applications: speech activity detection, diarization 
500 |a Authors: Neville Ryant, Mark Liberman, James Fiumara, Christopher Cieri 
500 |a Data source: microphone speech, broadcast conversation, meeting speech, web collection 
500 |a Data type: sound, text, software 
500 |a LDC2019S09 
500 |a This Yale-originated record is shareable under Creative Commons license CC0  |5 CTY 
500 |a Tittle from disc label 
506 |a Access restricted by licensing agreement 
520 |a "First DIHARD Challenge Development - Eight Sources was developed by the Linguistic Data Consortium (LDC) and contains approximately 17 hours of English and Chinese speech data along with corresponding annotations used in support of the First DIHARD Challenge. The First DIHARD Challenge was an attempt to reinvigorate work on diarization through a shared task focusing on "hard" diarization; that is, speech diarization for challenging corpora where there was an expectation that existing state-of-the-art systems would fare poorly. As such, it included speech from a wide sampling of domains representing diversity in number of speakers, speaker demographics, interaction style, recording quality, and environmental conditions, including, but not limited to: clinical interviews, extended child language acquisition recordings, YouTube recordings, and conversations collected in restaurants." --LDC online catalog 
538 |a DVD 
546 |a English, Mandarin Chinese 
590 |a Access is available to the Yale community 
650 0 |a Corpora (Linguistics) 
655 7 |a Data sets  |2 lcgft 
655 7 |a Sound recordings  |2 lcgft 
655 7 |a Speech corpora  |2 lcgft 
655 7 |a Text corpora  |2 lcgft 
700 1 |a Ryant, Neville,  |e creator 
710 2 |a Linguistic Data Consortium,  |e issuing body 
999 1 0 |i 0de923a1-a2d1-45f0-96df-7427f6b613b8  |l 14474794  |s US-CTY  |m first_dihard_challenge_developmenteight_sources____________________________2019_______lingui___________________________________________________________________________e 
999 1 1 |l 14474794  |s ISIL:US-CTY  |t REC  |a csssi  |b 39002131552276  |c In process.  |g 0  |v 1 piece  |x lsfc  |y 12291275  |p LOANABLE