A Simple and Efficient Model Pruning Method for Conditional Random Fields
Zhao, H. and Kit, C.
{CRF训练后,按参数值去掉大部分特征,性能都不会下降,用事实证明CRF有太多冗余。}
Chinese text segmentation: A hybrid approach using transductive learning and statistical association measures
Tsai, R. T. H.
Expert Systems with Applications
{多种加入各种特征提高CRF性能的方法。}
Bayesian Unsupervised Word Segmentation with Nested Pitman-Yor Language Modeling
Mochihashi, Daichi and Yamada, Takeshi and Ueda, Naonori
Proceedings of the Joint Conference of the 47th Annual Meeting of the {ACL} and the 4th International Joint Conference on Natural Language Processing of the {AFNLP}
{用Pitman-Yor,建立了两层语言模型,一个是词的,一个是} 句子的。
Punctuation as Implicit Annotations for Chinese Word Segmentation
Li, Zhongguo and Sun, Maosong
Computational Linguistics
An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and {POS} Tagging
Kruengkrai, Canasai and Uchimoto, Kiyotaka and Kazama, Jun'ichi and Wang, Yiou and Torisawa, Kentaro and Isahara, Hitoshi
Proc. of {ACL-IJCNLP} 2009
词典词与生词分别对待
Automatic Adaptation of Annotation Standards: Chinese Word Segmentation and {POS} Tagging – A Case Study
Jiang, Wenbin and Huang, Liang and Liu, Qun
Proceedings of the 47th {ACL}
Perceptron,分词与词性标注结合。将一种标注体系下的参数,转移到另一种标注体系中使用。
2008}
The Fourth International Chinese Language Processing Bakeoff: Chinese Word Segmentation, Named Entity Recognition and Chinese {POS} Tagging
Jin, Guangjin and Chen, Xiao
Proceedings of the Sixth {SIGHAN} Workshop on Chinese Language Processing
2008
Unsupervised segmentation helps supervised learning of character tagging for word segmentation and named entity recognition
Zhao, Hai and Kit, Chunyu
The Sixth {SIGHAN} Workshop on Chinese Language Processing
An Empirical Comparison of Goodness Measures for Unsupervised Chinese Word Segmentation with a Unified Framework
Zhao, Hai and Kit, Chunyu
The Third International Joint Conference on Natural Language Processing ({IJCNLP-2008)}, Hyderabad, India
{描述了四种用于无监督中文分词的判别量:Frequency} of Substring with {ReductionDescription} Length Gain ({DLG)Accessor} Variety ({AV)Boundary} Entropy (Branching Entropy, {BE)}
Joint Word Segmentation and {POS} Tagging Using a Single Perceptron
Zhang, Yue and Clark, Stephen
Proceedings of {ACL-08:} {HLT}
Bayesian semi-supervised chinese word segmentation for statistical machine translation
Xu, J. and Gao, J. and Toutanova, K. and Ney, H.
Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1
Statistical Properties of Overlapping Ambiguities in Chinese Word Segmentation and a Strategy for Their Disambiguation
Qiao, W. and Sun, M. and Menzel, W.
Text, Speech and Dialogue
Information retrieval oriented word segmentation based on character associative strength ranking
Liu, Y. and Wang, B. and Ding, F. and Xu, S.
Proceedings of the Conference on Empirical Methods in Natural Language Processing
{用了RankingSVM的方法分词,用于IR}
Word Lattice Reranking for Chinese Word Segmentation and Part-of-Speech Tagging
Jiang, Wenbin and Mi, Haitao and Liu, Qun
Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)
Discriminative pruning of language models for Chinese word segmentation
Li, J. and Wang, H. and Ren, D. and Li, G.
Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
The Third International Chinese Language Processing Bakeoff: Word Segmentation and Named Entity Recognition
Levow, Gina-Anne
Proceedings of the Fifth {SIGHAN} Workshop on Chinese Language Processing
Contextual Dependencies in Unsupervised Word Segmentation
Goldwater, Sharon and Griffiths, Thomas L. and Johnson, Mark
Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
{基于D过程的语言模型与词法模型两个词两个词的Gibbs采样}
2005
现代汉语语料库建设及深加工
靳光瑾 and 肖航 and 富丽 and 章云帆
语言文字应用
国家语委的语料库介绍
A conditional random field word segmenter for sighan bakeoff 2005
Tseng, H. and Chang, P. and Andrew, G. and Jurafsky, D. and Manning, C.
Proceedings of the Fourth {SIGHAN} Workshop on Chinese Language Processing