A character can be written by combining many characters. For example, in Limon and ABC font families, the word ŠńО е (KNYOM) is assigned to one code. Some commonly used words or syllables are represented in one code. For example, the word НБ (SI) can be represented as the sequence of (SA), Н (SRAK U) and Б (SRAK II) and also theĢ Working Papers sequence of (SA), Б (SRAK II) and Н (SRAK U). Some words can be written in different ways.
The rest of this paper will discuss this module, as it is an important part Problem in Khmer non-unicode writing technique Since there is no standard to assign the character among Khmer non-unicode fonts, many problems occur for the representation. Its main function is to convert a non- Unicode text into Unicode format. Conversion Assembly is the core module of the system. Its tasks are to get the content of the document to be converted from the MS Office suite into conversion assembly, to replace the converted text to MS Office suite, and to keep the format of the converted document the same as the previous one. Automation application: All the applications in this module act as the intermediate between the conversion assembly and MS Office suite. MS Office suite: is the existing MS Office suite (Word, Excel, FrontPage, Power Point, Outlook and Publisher). Figure A.1 of Appendix A illustrates the system. The system will be developed by dividing its functionalities into three main modules: MS Office Suite, Automation Application, and Conversion Assembly. Architecture of the application To ensure the user-friendliness, the embedded applications for MS Office are proposed. The rest of this paper will present a detailed technique for the conversion and the results of the research. Since the non-unicode fonts input method is very flexible, users can combine whatever they want without respecting the meaning and spelling orders, and the task of the conversion is a challenge task. Grammatically, Khmer syllable must respect the spelling order. This leads to the invention of Khmer Unicode and the development of a tool to convert from those non-unicode font documents to Unicode format, which is an indispensable task. As a result, data transfer across fonts is impossible. A character can be represented in two different codes for two different fonts. Consequently, there is no standard for representation. Many fonts are being created to facilitate the office task.
Before the creation of Khmer Unicode, Khmer used Latin code to represents its characters. Khmer has been a unique and independent script for more than 1,400 years. The exact sources have not been determined, but there is a great similarity between the earliest inscriptions in the region and the Pallawa script of the Coromandel Coast of India. It is descended from the Brahmi script of South India, as are Thai, Lao, Myanmar, Old Mon, and others. Introduction According to the Unicode Standard Version 4.0 from the Unicode Consortium Khmer script, called aksaa khmae ( Khmer Letters ), is the official script in Cambodia. Section 4 discusses the findings and implications of the work that will present all the cases that have not been handled yet. Section 3 presents the result of the studies. Section 2 discusses the analysis of the non-unicode font representation and the conversion technique. The document is divided into three main sections. 1 Encoding Conversion Utility for Khmer Chea Sok Huor, Atif Gulzar, Ros Pich Hemy, Neak Longchrea PAN Localization Team, Cambodia Abstract This paper discusses the research on the encoding conversion from Khmer non-unicode fonts such as Limon, KHEK etc.