mirage

GALE Phase 2 Chinese Newswire Parallel Text Part 1

WakeSpace Repository


dc.contributor.author Friedman, Lauren
dc.contributor.author Jin, Hubert
dc.contributor.author Song, Zhiyi
dc.contributor.author Krug, Gary
dc.contributor.author Strassel, Stephanie
dc.date.accessioned 2016-11-29T19:14:55Z
dc.date.available 2016-11-29T19:14:55Z
dc.date.issued 2014-07-15
dc.identifier.citation Friedman, Lauren, et al. GALE Phase 2 Chinese Newswire Parallel Text Part 1 LDC2014T15. Web Download. Philadelphia: Linguistic Data Consortium, 2014. en_US
dc.identifier.isbn 1-58563-684-3
dc.identifier.uri http://hdl.handle.net/10339/63130
dc.description.abstract GALE Phase 2 Chinese Newswire Parallel Text Part 1 was developed by the Linguistic Data Consortium (LDC). Along with other corpora, the parallel text in this release comprised training data for Phase 2 of the DARPA GALE (Global Autonomous Language Exploitation) Program. This corpus contains 117,173 tokens of Chinese source text and corresponding English translations selected from newswire data collected by LDC in 2007 and translated by LDC or under its direction. en_US
dc.publisher Linguistic Data Consortium en_US
dc.title GALE Phase 2 Chinese Newswire Parallel Text Part 1 en_US
dc.type Dataset en_US

