原名「台灣學術線上」
包含TAO期刊庫 + TAO書籍庫 + 論文 + 史料文獻
首頁 | 關於TAO | 瀏覽 | 進階查詢 | 參考工具 | 會員服務 | 已購專書 | RSS服務 | 電子報 | FAQ  
查詢範圍:
   
查詢模式:
熱門查詢詞:
dvdDiy扣件原住民教育困境OTA
   
   
   
     
   
 
項次 書目
1
題名:Noisy Channel Models for Corrupted Chinese Text Restoration and GB-to-Big5 Conversion     (26點)
著者:Chao-Huang Chang
出版地區:台灣
出版城市:台北市
學科:電機資訊
刊名:International Journal of Computational Linguistics & Chinese Language Processing
卷期:3卷2期(1998.8)
頁碼:79-91
語言:英語
摘要: 英文摘要PDF

In this article, we propose a noisy channel/information restoration model for error recovery problems in Chinese natural language processing. A language processing system is considered as an information restoration process executed through a noisy channel. By feeding a large-scale standard corpus C into a simulated noisy channel, we can obtain a noisy version of the corpus N. Using N as the input to the language processing system (i.e., the information restoration process), we can obtain the output results C'. After that, the automatic evaluation module compares the original corpus C and the output results C, and computes the performance index (i.e., accuracy) automatically. The proposed model has been applied to two common and important problems related to Chinese NLP for the Internet: corrupted Chinese text restoration and GB-to-BIG5 conversion. Sinica Corpora version 1.0 and 2.0 are used in the experiment..The results show that the proposed model is useful and practical.


    

本卷期目次
International Journal of Computational Linguistics & Chinese Language Processing 3卷2期 (1998.8)
Noisy Channel Models for Corrupted Chinese Text Restoration and GB-to-Big5 Conversion/ Chao-Huang Chang
Statistical Analysis of Mandarin Acoustic Units and Automatic Extraction of Phonetically Rich Sentences Based upon a Very Large Chinese Text Corpus/ Hsin-Min Wang
An Assessment of Character-based Chinese News Filtering Using Latent Semantic Indexing/ Shih-Hung WuPey-Ching YangVon-Wun Soo
Information Extraction: Beyond Document Retrieval/ Robert GaizauskasYorick Wilks
Senses and Texts/ Yorick Wilks
Senses and Texts/ Yorick Wilks
Information Extraction: Beyond Document Retrieval/ Robert GaizauskasYorick Wilks
An Assessment of Character-based Chinese News Filtering Using Latent Semantic Indexing/ Shih-Hung WuPay-Ching YangVon-Wun Soo
Noisy Channel Models for Corrupted Chinese Text Restoration and GB-to-Big5 Conversion/ Chao-Huang Chang
Statistical Analysis of Mandarin Acoustic Units and Automatic Extraction of Phonetically Rich Sentences Based Upon a Very Large Chinese Text Corpus/ Hsin-Min Wang
 
   
 
   

與TAO合作 | 隱私與版權聲明 | 聯絡方式 | 下載Adobe Reader
地址:台北市中正區(100)北平東路30-12號3樓
電話:(02)2393-6968 傳真:(02)2393-6877
Email: service@wordpedia.com
Wordpedia Family: 學校、企業版入口 | 遠流影音館
Copyright©2011 Wordpedia Co., Ltd. All Rights Reserved.