A corpus-based model based on 1) repetition of words, 2) importance of words, and 3) collocational semantics for texts is proposed in this paper.
英
美
- 本文根据三个因素:1)词汇的重复,2)词汇的重要性,3)共容语意,提出一个基于真实语料的文件内容分析的模型。
