Generally the Latin transliteration will give some indication as to the pronunciation of the Thai word, but the focus is not on an exact phonemic representation. So in the case of a Thai Romanization system, each Thai grapheme will be assigned one Latin counterpart. Strictly speaking, a transliteration system ideally uses a one-to-one character mapping so that the orthographic rules of the original script are kept intact. Transcription for a more detailed discussion of the differences and similarities between these two processes. Transcription does not necessarily involve a change in script (for example, in the case of an IPA transcription of a German text, the Latin script is used for the original and the transcribed text). Broadly speaking, transliteration can be defined as a graphemic conversion from one script to another, and transcription can be defined as a phonemic conversion from one writing system to another. However, it is still useful to understand the differences between the two. In practice, there is some overlap between the concepts of transcription and transliteration, and some systems in use are neither fully one nor fully the other. Transliteration can be between any two scripts, though the term is often used to imply transliteration into the Latin script (which is technically Romanization). ![]() There is often more than one system for transliterating between two given scripts. Transliteration is intended to broadly preserve the sounds of the original script, but the focus is not on providing an accurate phonemic representation as it is in the case of transcription. The primary aim of transliteration is to provide an alternate means of reading text using a different script. $cmd = " $DECODER -search-algorithm 1 -cube-pruning-pop-limit 5000 -s 5000 -threads 16 -drop-unknown -distortion-limit 0 -n-best-list $TMP_DIR/transliteration/ $eval_file.op.nBest 1000 distinct -f $TMP_DIR/transliteration/ $eval_ $TMP_DIR/transliteration/ $eval_file.Transliteration is one of two processes by which text is converted from one script into another. My $drop_stderr = $VERBOSE ? " " : " 2>/dev/null " $cmd = " $MOSES_SRC/scripts/ems/support/substitute-filtered-tables-and-weights.perl $TMP_DIR/transliteration/ $eval_file.filtered/moses.ini $TRANSLIT_MODEL/model/moses.ini $TRANSLIT_MODEL/tuning/ $TMP_DIR/transliteration/ $eval_ " $cmd = " $MOSES_SRC/scripts/training/ $TMP_DIR/transliteration/ $eval_file.filtered $TMP_DIR/transliteration/ $eval_ $TMP_DIR/transliteration/ $eval_file -Binarizer \" $MOSES_SRC/bin/CreateOnDiskPt 1 1 4 100 2 \" " ![]() My $cmd = " $MOSES_SRC/scripts/training/train-model.perl -dont-zip -first-step 9 -external-bin-dir $EXTERNAL_BIN_DIR -f $INPUT_EXTENSION -e $OUTPUT_EXTENSION -phrase-translation-table $TRANSLIT_MODEL/model/phrase-table -config $TMP_DIR/transliteration/ $eval_ -lm 0:3: $TMP_DIR/transliteration/ $eval_:8 " `touch $TMP_DIR/transliteration/ $eval_ ` Print "run_transliteration( $MOSES_SRC, $EXTERNAL_BIN_DIR, $TRANSLIT_MODEL, $eval_file) \n " # Run Transliteration Module to Obtain Transliterations # Open MYFILE, ">:encoding(UTF-8) ", $translitFile or die "Can't open $translitFile: $! \n " # Read the UNK word file and prepare for Transliteration #įoreach ( = split /\Q$_FACTOR_DELIMITER/ &run_decoder( $MOSES_SRC_DIR, $EXTERNAL_BIN_DIR, $LM_FILE) &form_corpus ( $translitFile, $translitFile. &run_transliteration ( $MOSES_SRC_DIR, $EXTERNAL_BIN_DIR, $TRANSLIT_MODEL, $OOV_FILE_NAME) &prepare_for_transliteration ( $OOV_FILE, $translitFile) Print "Preparing for Transliteration \n " ![]() My $translitFile = " $TMP_DIR/transliteration/ $OOV_FILE_NAME " `cp $OOV_FILE $TMP_DIR/transliteration ` $OUTPUT_FILE_NAME = basename ( $OUTPUT_FILE) $LM_FILE = $LM_FILE_WORD if $LM_FILE_WORD ne " " ĭie( "ERROR: could not find Language Model ' $LM_FILE' ")ĭie( "ERROR: could not find Transliteration Model ' $TRANSLIT_MODEL' ")ĭie( "ERROR: could not find OOV file $OOV_FILE' ")ĭie( "ERROR: could not find Input file $INPUT_FILE' ") 'output-extension=s ' => \ $OUTPUT_EXTENSION,ĭie( "ERROR: you need to define -moses-src-dir -external-bin-dir, -transliteration-model-dir, -oov-file, -output-file -input-extension, -output-extension, and -language-model ") 'input-extension=s ' => \ $INPUT_EXTENSION, 'transliteration-model-dir=s ' => \ $TRANSLIT_MODEL, ![]() 'external-bin-dir=s ' => \ $EXTERNAL_BIN_DIR, Unless &GetOptions( 'moses-src-dir=s ' => \ $MOSES_SRC_DIR, My ( $MOSES_SRC_DIR, $TRANSLIT_MODEL, $EVAL_DIR, $OUTPUT_FILE, $OUTPUT_FILE_NAME, $OOV_FILE, $OOV_FILE_NAME, $EXTERNAL_BIN_DIR, $LM_FILE, $INPUT_EXTENSION, $OUTPUT_EXTENSION, $INPUT_FILE, $VERBOSE, $DECODER, $TMP_DIR) ĭie( "ERROR: wrong syntax when invoking postDecodingTransliteration.perl ") # Public License version 2.1 or, at your option, any later version. Its use is licensed under the GNU Lesser General
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |