First DIHARD challenge development : eight sources /

"First DIHARD Challenge Development - Eight Sources was developed by the Linguistic Data Consortium (LDC) and contains approximately 17 hours of English and Chinese speech data along with corresponding annotations used in support of the First DIHARD Challenge. The First DIHARD Challenge was an...


Bibliographic Details
Main Author:	Ryant, Neville (Creator)
Format:	Audio Book
Language:	Multiple
Published:	[Philadelphia, PA] : Linguistic Data Consortium, 2019
Subjects:	Corpora (Linguistics) Data sets Sound recordings Speech corpora Text corpora


LEADER	03087nim a22005177i 4500
001	0de923a1-a2d1-45f0-96df-7427f6b613b8
005	20240818000000.0
008	190627s2019 paunnn q nn mul d
020			\|a 1585638870
035			\|a 14474794
040			\|a CtY \|b eng \|e rda \|c CtY
090			\|a yuldset
090			\|a yuldsetsnd
090			\|a yuldsettxt
245	0	0	\|a First DIHARD challenge development : \|b eight sources / \|c Linguistic Data Consortium
264		1	\|a [Philadelphia, PA] : \|b Linguistic Data Consortium, \|c 2019
300			\|a 1 DVD-ROM ; \|c 4 3/4 in
336			\|a computer dataset \|b cod \|2 rdacontent
336			\|a computer program \|b cop \|2 rdacontent
336			\|a spoken word \|b spw \|2 rdacontent
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a computer disc \|b cd \|2 rdacarrier
347			\|a audio file \|2 rdaft
347			\|a text file \|2 rdaft
500			\|a Applications: speech activity detection, diarization
500			\|a Authors: Neville Ryant, Mark Liberman, James Fiumara, Christopher Cieri
500			\|a Data source: microphone speech, broadcast conversation, meeting speech, web collection
500			\|a Data type: sound, text, software
500			\|a LDC2019S09
500			\|a This Yale-originated record is shareable under Creative Commons license CC0 \|5 CTY
500			\|a Title from disc label
506			\|a Access restricted by licensing agreement
520			\|a "First DIHARD Challenge Development - Eight Sources was developed by the Linguistic Data Consortium (LDC) and contains approximately 17 hours of English and Chinese speech data along with corresponding annotations used in support of the First DIHARD Challenge. The First DIHARD Challenge was an attempt to reinvigorate work on diarization through a shared task focusing on "hard" diarization; that is, speech diarization for challenging corpora where there was an expectation that existing state-of-the-art systems would fare poorly. As such, it included speech from a wide sampling of domains representing diversity in number of speakers, speaker demographics, interaction style, recording quality, and environmental conditions, including, but not limited to: clinical interviews, extended child language acquisition recordings, YouTube recordings, and conversations collected in restaurants." --LDC online catalog
538			\|a DVD
546			\|a English, Mandarin Chinese
590			\|a Access is available to the Yale community
650		0	\|a Corpora (Linguistics)
655		7	\|a Data sets \|2 lcgft
655		7	\|a Sound recordings \|2 lcgft
655		7	\|a Speech corpora \|2 lcgft
655		7	\|a Text corpora \|2 lcgft
700	1		\|a Ryant, Neville, \|e creator
710	2		\|a Linguistic Data Consortium, \|e issuing body
999	1	0	\|i 0de923a1-a2d1-45f0-96df-7427f6b613b8 \|l 14474794 \|s US-CTY \|m first_dihard_challenge_developmenteight_sources____________________________2019_______lingui___________________________________________________________________________e
999	1	1	\|l 14474794 \|s ISIL:US-CTY \|t REC \|a csssi \|b 39002131552276 \|c In process. \|g 0 \|v 1 piece \|x lsfc \|y 12291275 \|p LOANABLE
