Title: Design and Development of a Bilingual Reading Comprehension Corpus
Authors: Kui Xu; Helen Meng
Publication region: Taiwan
Publication city: Taipei
Subject: Electrical Engineering and Computer Science
Keywords: bilingual; reading comprehension; corpus
Journal: International Journal of Computational Linguistics & Chinese Language Processing
Volume/Issue: Vol. 10, No. 2 (July 2005)
Pages: 251-275
Language: English
Abstract:

This paper describes our initial attempt to design and develop a bilingual reading comprehension corpus (BRCC). Reading comprehension (RC) is a task that conventionally evaluates the reading ability of an individual. An RC system can automatically analyze a passage of natural language text and generate an answer for each question based on information in the passage. The RC task can be used to drive advancements in the natural language processing (NLP) technologies incorporated in automatic RC systems. Furthermore, an RC system presents a novel paradigm of information search when compared to the predominant paradigm of text retrieval in search engines on the Web. Previous work on automatic RC typically involved English-only language learning materials (Remedia and CBC4Kids) designed for children and students, which included stories, human-authored questions, and answer keys. These corpora are important for supporting empirical evaluation of RC performance. In the present work, we attempted to utilize RC as a driver for NLP techniques in both English and Chinese. We sought parallel English and Chinese learning materials and incorporated annotations deemed relevant to the RC task. We measured the comparative levels of difficulty among the three corpora by means of the baseline bag-of-words (BOW) approach. Our results show that the BOW approach achieves better RC performance on BRCC (67%) than on Remedia (29%) and CBC4Kids (63%). This reveals that BRCC has the highest degree of word overlap between questions and passages among the three corpora, which artificially simplifies the RC task. This result suggests that additional effort should be devoted to authoring questions with various grades of difficulty in order for BRCC to better support RC research across the English and Chinese languages.
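The abstract attributes the BOW scores to word overlap between questions and passages. As a rough illustration only, the following minimal sketch assumes the baseline answers a question by returning the passage sentence that shares the most words with it. The tokenizer, the sentence splitter, the function names, and the toy passage are all assumptions made for this example and are not taken from the paper.

```python
import re

def bag_of_words(text):
    """Lowercase the text and return its set of word tokens (assumed tokenizer)."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def split_sentences(passage):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", passage) if s.strip()]

def answer_sentence(passage, question):
    """Return the passage sentence with the largest word overlap with the question.

    Ties go to the earliest sentence, because max() keeps the first maximum.
    """
    question_words = bag_of_words(question)
    sentences = split_sentences(passage)
    return max(sentences, key=lambda s: len(bag_of_words(s) & question_words))

# Toy data, invented for illustration.
passage = ("Tom went to the market on Monday. "
           "He bought three apples there. "
           "Then he walked home with his dog.")
question = "What did Tom do on Monday?"
print(answer_sentence(passage, question))
```

In this toy case the first sentence is selected because it shares "tom", "on", and "monday" with the question. When questions reuse passage wording this heavily, even such a simple matcher scores well, which is the simplification effect the abstract describes for BRCC.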


    

Contents of this issue
International Journal of Computational Linguistics & Chinese Language Processing, Vol. 10, No. 2 (July 2005)
The Formosan Language Archive: Linguistic Analysis and Language Processing / Elizabeth Zeitoun; Ching-Hua Yu
A Chinese Term Clustering Mechanism for Generating Semantic Concepts of a News Ontology / Chang-Shing Lee; Yau-Hwang Kuo; Chia-Hsin Liao; Zhi-Wei Jian
Automatic Segmentation and Labeling for Mandarin Chinese Speech Corpora for Concatenation-based TTS / Cheng-Yuan Lin; Jyh-Shing Roger Jang; Kuan-Ting Chen
TAICAR-The Collection and Annotation of an In-Car Speech Database Created in Taiwan / Hsien-Chang Wang; Chung-Hsien Yang; Jhing-Fa Wang; Chung-Hsien Wu; Jen-Tzung Chien
Design and Development of a Bilingual Reading Comprehension Corpus / Kui Xu; Helen Meng
Mandarin Topic-oriented Conversations / Shu-Chuan Tseng
MATBN: A Mandarin Chinese Broadcast News Corpus / Hsin-Min Wang; Berlin Chen; Jen-Wei Kuo; Shih-Sian Cheng
 
   
 
   
