Translation of Sentence Lampung-Indonesian Languages with Neural Machine Translation Attention Based Approach
DOI:
https://doi.org/10.35450/jip.v6i02.97Keywords:
Decoder, Encoder, Neural machine translation, Out-of-Vocabulary, Recurrent neural networkAbstract
In this research, automatically Lampung language translation into the Indonesian language was using neural machine translation (NMT) attention based approach. NMT, a new approach method in machine translation technology, that has worked by combining the encoder and decoder. The encoder in NMT is a recurrent neural network component that encrypts the source language to several length-stable vectors and the decoder is a recurrent neural networks component that generates translation result comprehensive. NMT Research has begun with creating a pair of 3000 parallel sentences of Lampung language (api dialect) and Indonesian language. Then it continues to decide the NMT parameter model for the data training process. The next step is building NMT model and evaluate it. The testing of this approach has used 25 single sentences without out-of-vocabulary (OOV), 25 single sentences with OOV, 25 plural sentences without OOV, and 25 plural sentences with OOV. The testing translation result using NMT attention shows the bilingual evaluation understudy (BLEU) an average value is 51, 96 %.
Downloads
References
Bhattacharyya, P. (2015). Machine Translation. CRC Press, Mumbai, India, 25-27.
Goodfellow, I. Bengio, Y. dan Courville, (2016): Deep Learning, MIT Press, London, 462-464.
Hermanto, A., Adji, T. B., & Setiawan, N. (2015) . Recurrent Neural Network Language Model for English-Indonesian Machine Translation: Experimental Study. Proceeding of International Conference on Science in Information Technology. Yogyakarta, Indonesia. 27 – 28 October 2015.
Junczys-Dowmunt, M., Dwojak, T., & Hoang, H. (2016). Is Neural Machine Translation Ready for Deployment ? A Case Study on 30 Translation Directions. Retrieved from https://workshop 2016.iwslt.org/downloads/IWSLT _2016_paper_4.pdf
Zhang, J., Ding, Y., Shen, S., Cheng, Y., Sun, M., Luan, H., dan Liu, Y. (2017): THUMT-theano: An Open Source Toolkit for Neural Machine Translation. Retrieved from https://arxiv.org/abs/1706.06 415.
Anonim, 2017. Nestapa Guru Bahasa Lampung. Retrieved from http://www.lampost.co/berita-nestapa-guru-bahasa-lampung.
Anonim, 2016. 139 Bahasa Daerah di Indonesia Terancam Punah. Retrieved from http://nasional.republika.co.id/berita/nasional/umum/16/08/02/ob9t2h383-139-bahasa-daerah-di-indonesia-terancam-punah.
Published
How to Cite
Issue
Section
The Authors submitting a manuscript do so on the understanding that if accepted for publication, copyright of the article shell be assigned to Development Innovation: Jurnal Kelitbangan (JIP) , Balitbangda Provinsi Lampung as publisher of the journal.
Copyright encompasses exclusive rights to reproduce and deliver the article in all form and media, including reprints, photographs, microfilms and any other similar reproductions, as well as translations, the reproduction of any part of this journal, its storeage in databases and its transmission by any form or media. such as electronic, electrostatic and mechanical copies, photocopies, recordings, magnetic media, etc.