Research on Smoothing Techniques for Tibetan N-gram Language Models<br>Ren Qing Ji<br>(Key Laboratory of Tibetan Intangible Cultural Heritage, Gansu Normal University for Nationalities, Hezuo, Gansu 747000)<br>[Abstract] In this paper, the SRILM modeling toolkit is set up in a Linux environment and the corpus is split into blocks. SRILM's ngram-count and ngram tools are used to collect counts and build the language model, and several smoothing algorithms are evaluated by their perplexity. Finally, the perplexity values are compared and analyzed to identify the smoothing method best suited to the current corpus and language environment.<br>
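The evaluation loop described in the abstract (count n-grams, smooth the estimates, compare perplexity) can be illustrated with a minimal Python sketch. This is not the SRILM pipeline used in the paper: it assumes a bigram model with simple add-k smoothing in place of the smoothing algorithms the paper tests, and the function names (`train_bigram`, `prob`, `perplexity`) and the placeholder tokens are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Collect history and bigram counts; each sentence is padded with <s> and </s>."""
    history_counts = Counter()
    bigram_counts = Counter()
    vocab = {"</s>"}  # words that can be predicted (assumption: closed vocabulary)
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        vocab.update(tokens)
        for prev, cur in zip(padded, padded[1:]):
            history_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return history_counts, bigram_counts, vocab


def prob(model, prev, cur, k):
    """Add-k smoothed estimate of P(cur | prev); k is an assumed smoothing constant."""
    history_counts, bigram_counts, vocab = model
    return (bigram_counts[(prev, cur)] + k) / (history_counts[prev] + k * len(vocab))


def perplexity(sentences, model, k):
    """Perplexity = exp of the negative average log-probability per predicted token."""
    log_prob = 0.0
    n_tokens = 0
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        for prev, cur in zip(padded, padded[1:]):
            log_prob += math.log(prob(model, prev, cur, k))
            n_tokens += 1
    return math.exp(-log_prob / n_tokens)


if __name__ == "__main__":
    # Placeholder tokens standing in for segmented Tibetan text (illustrative only).
    train = [["a", "b", "c"], ["a", "b", "d"], ["a", "c", "d"]]
    held_out = [["a", "b", "d"], ["a", "c", "c"]]  # contains an unseen bigram (c, c)
    model = train_bigram(train)
    for k in (0.01, 0.1, 0.5, 1.0):
        print(f"k={k}: perplexity={perplexity(held_out, model, k):.3f}")
```

Lower perplexity on held-out text indicates a better fit, which is the criterion the abstract uses to compare smoothing methods. The sketch assumes every held-out word also appears in the training vocabulary; a real setup would need explicit handling of unknown words.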