Description
The PolyU Business Corpora comprise three comparable corpora of Chinese, English and Japanese business texts. Most of the texts come from the business and finance sections of newspapers written in those languages, and cover news and reports on topics ranging from auditing and accounting to insurance and investments. The newspapers used for the English and Chinese corpora are online editions of publications circulated in Hong Kong in the respective languages, including (for English) the 'South China Morning Post' and the 'Hong Kong Standard', and (for Chinese) 'Ming Pao Daily News', 'Apple Daily', 'Oriental Daily News' and 'Sing Tao'. For the Japanese corpus, only online editions of newspapers published in Japan are used, including 'Asahi' and 'Nikkei'. Besides newspapers, other sources of data include company documents, annual reports, press releases, and online versions of company brochures, leaflets and other promotional texts available in Hong Kong or Japan. The materials reflect the type of language used in the broad business context in Hong Kong and Japan today.
Size
PolyU Business Corpus (Chinese): 1.23 million characters
PolyU Business Corpus (English): 1.16 million words
PolyU Business Corpus (Japanese): 1.32 million words
Compiler
The original corpora were compiled by Dr Li Lan and Professor Grahame Bilbow (HK PolyU) between 1999 and 2000 as part of the PolyU Business Corpus project. The English and Chinese corpora were subsequently used to build a bilingual business lexicon; see http://langbank.engl.polyu.edu.hk/polylexicon/default.htm .