Research on machine translation based on key technologies of bilingual corpus

Di Lu

Abstract

Research on machine translation based on key technologies of bilingual corpus

Author(s): Di Lu

With the development of the technology of statistical natural language processing, the role of parallel corpus in statistical machine translation and cross-language retrieval cannot be ignored. In this paper, we examines the translation equivalent pairs could be extracted from parallel corpus. An iterative algorithm based on degree of word association is proposed to identify the multiword units for Chinese and English. Then a hypothesis testing approach is used to extract the Chinese English Translation Equivalent Pairs. We present a tree-tree model by mapping between the syntactic tree and the ITG tree, the model limits the reordering of the phrases in the global scope. While in the local scope, the tree-tree model takes the TTG-based local reordering model as one feature, in which the reordering probability of two blocks is decomposed into the product of the reordering probabilities of the child blocks respectively. So the model is able to estimate the reordering of two blocks with arbitrary lengths.

Research on machine translation based on key technologies of bilingual corpus

Table of Contents

Volume: 20

Volume: 19

Volume: 21

Volume: 18

Volume: 17

Volume: 16

Volume: 15

Volume: 14

Volume: 13

Volume: 12

Volume: 11

Volume: 10

Volume: 9

Volume: 7

Volume: 8

Volume: 6

Volume: 5

Volume: 4

Volume: 3

Volume: 2

Volume: 1

Google Scholar citation report

Citations : 875

Indexed In

For Authors

For Librarians

Open Access Journals

BioTechnology: An Indian Journal ISSN (PRINT): 0974-7435

Research on machine translation based on key technologies of bilingual corpus

Table of Contents

Citations : 875

Indexed In

For Authors

For Librarians

Open Access Journals

BioTechnology: An Indian Journal
ISSN (PRINT): 0974-7435