Research Collection School Of Computing and Information Systems

Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation

Publication Type

Journal Article

Publication Date

1-2006

Abstract

Machine transliteration is the automatic generation of phonetic equivalents in a target language for a given source-language term, and it is useful in many cross-language applications. Transliteration between distant languages, e.g. English and Chinese, is challenging because their phonological dissimilarities are significant. Existing techniques are typically rule-based or based on the statistical noisy channel model. Their accuracy is very low because of intrinsic limitations in modeling transcription details. To improve performance, we propose two direct statistical models for transliterating phoneme sequences in English–Chinese name translation. First, we apply Finite State Automata to a direct mapping from English phonemes to a set of rudimentary Chinese phonetic symbols plus mapping units dynamically discovered from training, and we propose an effective algorithm for aligning phoneme chunks. Second, we take the contextual features of each phoneme into consideration by means of the Maximum Entropy formalism, and further refine the model with the precise alignment scheme using phoneme chunks. We compare our approaches with a noisy channel baseline that applies the IBM SMT model and demonstrate their superiority.

Keywords

Machine Transliteration, Finite State Transducer, Noisy Channel, Maximum Entropy, Phoneme, Fertility, Evaluation

Discipline

Databases and Information Systems

Research Areas

Data Science and Engineering

Publication

International Journal of Computer Processing of Oriental Languages

Volume

19

Issue

1

First Page

63

Last Page

88

Identifier

10.1142/S0219427906001396

Citation

GAO, Wei and WONG, Kam-Fai. Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation. (2006). International Journal of Computer Processing of Oriental Languages. 19, (1), 63-88.
Available at: https://ink.library.smu.edu.sg/sis_research/4608

Additional URL

https://doi.org/10.1142/S0219427906001396
