<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-1150206436721132886</id><updated>2011-11-21T18:24:48.646-08:00</updated><title type='text'>Myanmar NLP Team</title><subtitle type='html'>The abbreviation "NLP" stands for natural language processing.  'Natural language' does not mean any programming languages. In this case, we refer to Myanmar language processing on computers.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Myanmar NLP Team</name><uri>http://www.blogger.com/profile/13423400580653677357</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>15</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-8386717768261154335</id><published>2008-02-29T19:43:00.000-08:00</published><updated>2008-02-29T19:46:16.381-08:00</updated><title type='text'>Syllable Segmentation Software</title><content type='html'>Pls download and try &lt;a 
href="http://wunnakoko.googlepages.com/SyllableSegmentationwithoutDotnet.exe"&gt;Burmese Syllable Segmenter.&lt;/a&gt;&lt;br /&gt;It is free software, but the source may not be open. Once the documentation is complete, we will publish the technology.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-8386717768261154335?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/8386717768261154335/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=8386717768261154335' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/8386717768261154335'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/8386717768261154335'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2008/02/syllable-segmentation-software.html' title='Syllable Segmentation Software'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-7945690430783331660</id><published>2008-02-29T19:23:00.000-08:00</published><updated>2008-02-29T19:51:03.687-08:00</updated><title type='text'>Publications</title><content type='html'>Some of our publications:&lt;br /&gt;&lt;br /&gt;2008&lt;br /&gt;&lt;br /&gt;Wunna Ko Ko &amp;amp; Thin Zar Phyo, 2008 &lt;a style="font-style: italic;" href="http://wunnakoko.googlepages.com/SelectionofXMLTagset.pdf"&gt;Selection 
of XML Tag Set for Myanmar National Corpus&lt;/a&gt; The 6th Workshop on Asian Language Resources (ALR6), IJCNLP08, Hyderabad, India. Slides&lt;br /&gt;&lt;br /&gt;2007&lt;br /&gt;&lt;br /&gt;Wunna Ko Ko, 2007 &lt;a href="http://wunnakoko.googlepages.com/BurmeseLanguageEnabling_MES.pdf"&gt;&lt;span style="font-style: italic;"&gt;&lt;/span&gt;&lt;/a&gt;&lt;a style="font-style: italic;" href="http://wunnakoko.googlepages.com/BurmeseLanguageEnabling_MES.pdf"&gt;Burmese Language Enabling at OpenOffice.org&lt;/a&gt; Annual Conference of Myanmar Engineering Society, Yangon, Myanmar. &lt;a href="http://wunnakoko.googlepages.com/MES_presentation.pdf"&gt;Slides&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;2006&lt;br /&gt;&lt;br /&gt;Wunna Ko Ko, Yoshiki Mikami &lt;a href="http://wunnakoko.googlepages.com/MyanmarinCyberspace.pdf"&gt;&lt;span style="font-style: italic;"&gt;Languages of Myanmar in Cyberspace&lt;/span&gt;&lt;/a&gt; Nagaoka University of Technology Bulletin on Language Science and Humanity, Vol. 19. 
pp.249&lt;span style="font-family:MS Mincho,monospace;"&gt;&lt;span lang="ja-JP"&gt;-&lt;/span&gt;&lt;/span&gt;264 (2005)&lt;br /&gt;&lt;br /&gt;Zavarsky  Pavol, Wunna Ko Ko, Yew Choong Chew, Yoshiki Mikami, Tatsuo  Kobayashi, &lt;a href="http://wunnakoko.googlepages.com/Unicode-29.pdf"&gt;&lt;span style="font-style: italic;"&gt;Unicode Spreading on the Web: A case of Asian &amp;amp;  African Domains&lt;/span&gt;&lt;/a&gt;, &lt;u&gt;Internationalization and Unicode  Conference 2006&lt;/u&gt;, San Francisco, USA, March 2006&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-7945690430783331660?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/7945690430783331660/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=7945690430783331660' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/7945690430783331660'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/7945690430783331660'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2008/02/publications.html' title='Publications'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-6171429751998563188</id><published>2007-11-08T19:28:00.000-08:00</published><updated>2007-11-30T20:26:02.348-08:00</updated><title type='text'>Sample Burmese Sorting 
Program</title><content type='html'>I have uploaded a trial version of the Burmese Sorting Software. It sorts entries in the order used by the Burmese-Burmese dictionary published by the Myanmar Language Commission.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://wunnakoko.googlepages.com/Sorting_M1000_T30.exe"&gt;Burmese Sorting Software Download&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-6171429751998563188?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/6171429751998563188/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=6171429751998563188' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/6171429751998563188'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/6171429751998563188'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/11/sample-burmese-sorting-program.html' title='Sample Burmese Sorting Program'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-7851692512791905209</id><published>2007-10-18T21:09:00.000-07:00</published><updated>2007-11-08T19:27:20.874-08:00</updated><title type='text'>Burmese Sorting</title><content type='html'>I have done a lot of research on Burmese sorting in recent years, and I have now successfully developed an algorithm. 
It can sort any data that complies with the &lt;a href="http://std.dkuug.dk/JTC1/SC2/WG2/"&gt;ISO/IEC JTC1/SC2/WG2&lt;/a&gt; standard and the Unicode Technical Note &lt;a href="http://www.unicode.org/notes/tn11/"&gt;UTN#11&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The fonts that comply with the above-mentioned standard are, as far as I know, Myanmar2, Myanmar3, Parabike and Padauk. There may be many more that comply with the standard. The input method (keyboard) should also comply with the above-mentioned standard.&lt;br /&gt;&lt;br /&gt;It sorts the input in the order used by the dictionaries (Burmese-Burmese Dictionary, Burmese-English Dictionary) released by the Myanmar Language Commission.&lt;br /&gt;&lt;br /&gt;With the help of my colleague, I have developed a sample program and a .dll file for sorting.&lt;br /&gt;&lt;br /&gt;Since I bore all of the costs myself, I need some money to recover the research cost. I will provide the sample software for free, but I will sell the .dll file for a fee. The .dll file can be used by developers who wish to include Burmese sorting in their own programs.&lt;br /&gt;&lt;br /&gt;I will upload the sample software in a few days. (I can't use the internet at the office. I will upload it as soon as I get a chance.)&lt;br /&gt;&lt;br /&gt;Developers, please contact me at wunnakoko at gmail dot com. 
I will provide you with the details of the cost.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-7851692512791905209?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/7851692512791905209/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=7851692512791905209' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/7851692512791905209'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/7851692512791905209'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/10/burmese-sorting.html' title='Burmese Sorting'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-6637097694084264159</id><published>2007-10-18T21:02:00.001-07:00</published><updated>2007-10-23T21:02:17.390-07:00</updated><title type='text'>No Internet Connection at Myanmar Unicode &amp; NLP Research Center</title><content type='html'>Internet access was not available to the public in Myanmar from 28th September 2007, but it was restored to normal by 15th October 2007. However, the authorities at the Myanmar Computer Federation (MCF) have blocked access for the staff of the Myanmar Unicode and NLP Research Center until today.&lt;br /&gt;&lt;br /&gt;I don't know when they will open it again. 
This has slowed the advancement of NLP research considerably.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-6637097694084264159?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/6637097694084264159/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=6637097694084264159' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/6637097694084264159'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/6637097694084264159'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/10/no-internet-connection-at-myanmar.html' title='No Internet Connection at Myanmar Unicode &amp; NLP Research Center'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-3053541491419362845</id><published>2007-09-22T01:34:00.000-07:00</published><updated>2007-09-22T02:56:07.631-07:00</updated><title type='text'>Burmese Language Project at OpenOffice.org</title><content type='html'>I have created the "&lt;a href="http://my.openoffice.org"&gt;Burmese Language Project&lt;/a&gt;" at &lt;a href="http://www.openoffice.org"&gt;OpenOffice.org&lt;/a&gt;. First, we need to enable the Burmese language in the I18N (internationalization) part. I18N provides the support needed to use Burmese with OpenOffice. 
I18N includes:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Word, line and sentence break&lt;/li&gt;&lt;li&gt;Search and replace&lt;/li&gt;&lt;li&gt;Paragraph numbering&lt;/li&gt;&lt;li&gt;Transliteration&lt;/li&gt;&lt;li&gt;Character classification&lt;/li&gt;&lt;li&gt;Number Formats&lt;/li&gt;&lt;li&gt;Calendars&lt;/li&gt;&lt;li&gt;Collation 
&lt;/li&gt;&lt;li&gt;Locale data&lt;/li&gt;&lt;/ul&gt;I have put together a team to work on it. Our team members do not work for money, but they need money to work, so I am looking for a research grant. If anyone would like to contribute, whether with money or with work, please contact me at wunnakoko at gmail dot com.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-3053541491419362845?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/3053541491419362845/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=3053541491419362845' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/3053541491419362845'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/3053541491419362845'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/09/burmese-language-project-at.html' title='Burmese Language Project at OpenOffice.org'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' 
src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-3184811428070443226</id><published>2007-09-15T03:49:00.000-07:00</published><updated>2007-09-15T03:57:35.681-07:00</updated><title type='text'>Syllable Segmentation &amp; Line Breaking Software</title><content type='html'>I hope we can publish the beta version of the Syllable Segmentation Software in the near future. Although it is a beta version, we have already tested it for some time. The software will do the following:&lt;br /&gt;1. Phonological segmentation of Burmese syllables&lt;br /&gt;2. Orthographic segmentation of Burmese syllables&lt;br /&gt;3. Line breaking or word wrapping&lt;br /&gt;&lt;br /&gt;It can handle three types of documents:&lt;br /&gt;1. Text documents (.txt files)&lt;br /&gt;2. XML documents (.xml files)&lt;br /&gt;3. MS Word documents (.doc files)&lt;br /&gt;&lt;br /&gt;Because it handles XML documents, we hope it will also be useful for segmenting other types of documents, such as spreadsheet and database files.&lt;br /&gt;&lt;br /&gt;I hope all of our friends and the Burmese community will help us test it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-3184811428070443226?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/3184811428070443226/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=3184811428070443226' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/3184811428070443226'/><link rel='self' type='application/atom+xml' 
href='http://www.blogger.com/feeds/1150206436721132886/posts/default/3184811428070443226'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/09/syllable-segmentation-line-breaking.html' title='Syllable Segmentation &amp; Line Breaking Software'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-6650925677112241464</id><published>2007-09-12T03:32:00.000-07:00</published><updated>2007-09-13T01:27:31.385-07:00</updated><title type='text'>Syllable Segmentation</title><content type='html'>&lt;p style="margin-bottom: 0in;"&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;Syllable segmentation is the process of determining syllable boundaries in a piece of text. Since Burmese is a tonal and analytic language and the Burmese writing system is syllabic, the fundamental building blocks of the language are its syllables. Syllable boundaries in Burmese script can be of two types: 1) the phonological boundary of a syllable, and 2) the orthographic boundary of a syllable. Since Burmese script is a phonetic script, phonological segmentation is the basic segmentation. The phonological boundary of a syllable is defined, as the name suggests, by pronunciation, whereas the orthographic boundary is defined by the orthography. An orthographic syllable need not correspond exactly to a phonological syllable; it is a combination of phonological syllables joined under the non-breaking rules. 
Example: the word &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;&lt;span lang="th-TH"&gt;မႏၱေလး&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;has 3 phonological syllables, &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;&lt;span lang="th-TH"&gt;မန္&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;, &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;&lt;span lang="th-TH"&gt;တ&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;and &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;&lt;span lang="th-TH"&gt;ေလး&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;&lt;span lang="th-TH"&gt;. 
&lt;/span&gt;Orthographically, however, it has just 2 syllables, &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;&lt;span lang="th-TH"&gt;မႏၱ&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;and &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;&lt;span lang="th-TH"&gt;ေလး&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Zawgyi-One, sans-serif;"&gt;&lt;span style="font-size:100%;"&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt; &lt;span style=";font-family:Myanmar3;font-size:12;"  &gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-6650925677112241464?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/6650925677112241464/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=6650925677112241464' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/6650925677112241464'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/6650925677112241464'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/09/syllable-segmentation.html' title='Syllable Segmentation'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' 
src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-2952252160377479524</id><published>2007-09-12T02:53:00.000-07:00</published><updated>2007-09-12T02:56:13.637-07:00</updated><title type='text'>References</title><content type='html'>So that all of you have a reference for future NLP work, I have uploaded some of my academic publications.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://wunnakoko.googlepages.com/Thesis.pdf"&gt;Master's Thesis at Nagaoka University of Technology&lt;/a&gt;&lt;a href="http://wunnakoko.googlepages.com/Thesis.pdf"&gt;&lt;br /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a href="http://wunnakoko.googlepages.com/MyanmarinCyberspace.pdf"&gt;Languages of Myanmar In Cyberspace&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-2952252160377479524?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/2952252160377479524/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=2952252160377479524' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/2952252160377479524'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/2952252160377479524'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/09/references.html' title='References'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' 
src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-6842739779307373581</id><published>2007-09-06T03:51:00.000-07:00</published><updated>2007-09-06T04:07:32.491-07:00</updated><title type='text'>Word Break</title><content type='html'>&lt;p class="MsoNormal" style="text-indent: 0.5in;"&gt;&lt;span style="" lang="EN"&gt;A &lt;b&gt;word&lt;/b&gt; is a unit of language that carries meaning and consists of one or more morphemes. Although English uses spaces to separate words, Myanmar is written without spaces between words. Typically, a word consists of a root or stem and zero or more affixes. Words can be combined to create phrases, clauses and sentences (Wikipedia).&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal" style="text-indent: 0.5in;"&gt;&lt;span style="" lang="EN"&gt;Word boundaries must therefore be detected for the Myanmar language. &lt;/span&gt;Myanmar is a tonal and analytic language, and the Myanmar writing system is syllabic, so the fundamental building blocks of the language are its syllables. Syllable boundaries can be determined with a rule-based approach.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;A word can be formed from one or more syllables. 
&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style=""&gt;            &lt;/span&gt;Syllable breaking, which can be used for sorting, searching, text-to-speech and transliteration, can also be used in word-breaking methods.&lt;/p&gt;  &lt;span style=""&gt;&lt;span style=""&gt;            &lt;/span&gt;&lt;span style="font-size:100%;"&gt;Word breaking can be used for spell checking, grammar checking, translation, line breaking, etc.&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-6842739779307373581?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/6842739779307373581/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=6842739779307373581' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/6842739779307373581'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/6842739779307373581'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/09/word-break.html' title='Word Break'/><author><name>The The Aye</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-3965969959683903365</id><published>2007-08-31T22:37:00.000-07:00</published><updated>2007-08-31T22:39:49.037-07:00</updated><title type='text'>Our Discussion group</title><content type='html'>We have created a Google group for discussing Myanmar NLP development. 
I would like to invite everyone who is interested to join and post at myanmarnlpteam@googlegroups.com.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-3965969959683903365?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/3965969959683903365/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=3965969959683903365' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/3965969959683903365'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/3965969959683903365'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/08/our-discussion-group.html' title='Our Discussion group'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-3573977701204101473</id><published>2007-08-31T22:22:00.000-07:00</published><updated>2007-09-01T04:28:23.354-07:00</updated><title type='text'>What is Font?</title><content type='html'>Some people have posted messages about "FONT", so I would like to contribute some information on "FONT". 
Those who are interested can learn about "FONT" at the following sites:&lt;br /&gt;W3C (World Wide Web Consortium): &lt;a href="http://www.w3.org/TR/REC-CSS2/fonts.html"&gt;http://www.w3.org/TR/REC-CSS2/fonts.html&lt;/a&gt;&lt;br /&gt;Wikipedia (the free encyclopedia): &lt;a href="http://en.wikipedia.org/wiki/Font"&gt;http://en.wikipedia.org/wiki/Font&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I would like people to know that a font is not what sorts or breaks text. The underlying encoding has to be used for syllable breaking, sorting, word breaking, etc.&lt;br /&gt;&lt;br /&gt;In Myanmar, there are many magicians. They use technology to show magic to people, so people think it is real.&lt;br /&gt;&lt;br /&gt;I would like to give a general piece of logic here.&lt;br /&gt;1 ft = 12 x 1 in&lt;br /&gt;So 12 times 1 in (from Myanmar) has to be the same as 1 ft (from the United States). Am I right?&lt;br /&gt;&lt;br /&gt;This is called a "STANDARD". If "SORTING" works, all Unicode-encoded text can be sorted. Why should it work only for a specific "FONT" or product? 
That is like saying 12 times 1 in (from Myanmar) is not equal to 1 ft (from the United States).&lt;br /&gt;&lt;br /&gt;The same applies to "BREAK" and other processes.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-3573977701204101473?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/3573977701204101473/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=3573977701204101473' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/3573977701204101473'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/3573977701204101473'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/08/what-is-font.html' title='What is Font?'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-2461230247430443578</id><published>2007-08-16T03:37:00.001-07:00</published><updated>2007-08-16T03:56:03.697-07:00</updated><title type='text'>Unicode Font (OR) Unicode Compliant Font</title><content type='html'>The &lt;a href="http://www.unicode.org"&gt;Unicode&lt;/a&gt; Consortium does not develop any fonts. Font developers develop fonts. There can be Unicode-compliant fonts and Unicode non-compliant fonts. 
To be sure whether a font is Unicode compliant or not, check it against the Unicode system.&lt;br /&gt;    Myanmar 3, the latest release of our NLP research team, is a Unicode-compliant font. The main reason Myanmar 3 is reliable and Unicode compliant is that the Unicode system for Myanmar was created with us.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-2461230247430443578?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/2461230247430443578/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=2461230247430443578' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/2461230247430443578'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/2461230247430443578'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/08/unicode-font-or-unicode-compliant-font.html' title='Unicode Font (OR) Unicode Compliant Font'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-7161254295722810909</id><published>2007-08-16T03:28:00.000-07:00</published><updated>2007-11-21T21:17:34.268-08:00</updated><title type='text'>Download Links</title><content type='html'>We provide some downloads that are products of our team. 
These products (software, fonts, publications, documents, etc.) were created by one or all of the members of the Myanmar NLP research center.&lt;br /&gt;(Point the cursor at one of the following links, right-click, and choose "Save target as".)&lt;br /&gt;  &lt;a href="http://myanmarnlpteam.googlegroups.com/web/Myanmar3_Release.rar?gda=N32q-kUAAADGv5lcFFSfoA3V0JRRlR6REy7zTLQ8a7ChZuoe8CSqqWG1qiJ7UbTIup-M2XPURDQQUVQXHVDg_Kko_xlc2eTtKonjrgomGbyQY5s4Sy3Eaw"&gt;Myanmar 3 Font and Keyboard&lt;/a&gt;&lt;br /&gt;&lt;a href="http://groups.google.com/group/myanmarnlpteam/web/Myanmar3_ship.ttf"&gt;Myanmar 3 Font Only&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-7161254295722810909?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/7161254295722810909/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=7161254295722810909' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/7161254295722810909'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/7161254295722810909'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/08/download-links.html' title='Download Links'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' 
src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1150206436721132886.post-5712936885634579524</id><published>2007-08-13T00:37:00.000-07:00</published><updated>2007-08-13T00:55:20.756-07:00</updated><title type='text'>Who are we?</title><content type='html'>Basically, anyone interested in Myanmar Unicode can blog here. The core bloggers will be from the Myanmar NLP Team, who are working on Myanmar Natural Language Processing.&lt;br /&gt;   Natural Language Processing is a kind of computer processing specialized for human languages such as Burmese, Karen, English, and Japanese. In computing circles these are referred to as natural languages to differentiate them from computer programming languages such as C, C++, C#, Java, and VB.&lt;br /&gt;  This blog is for discussing the latest events in Myanmar NLP and for people's points of view on Myanmar Unicode and other encodings.&lt;br /&gt; We welcome all participants' positive and negative points of view and discussions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1150206436721132886-5712936885634579524?l=myanmarnlpteam.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://myanmarnlpteam.blogspot.com/feeds/5712936885634579524/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1150206436721132886&amp;postID=5712936885634579524' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/5712936885634579524'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1150206436721132886/posts/default/5712936885634579524'/><link rel='alternate' type='text/html' href='http://myanmarnlpteam.blogspot.com/2007/08/who-are-we.html' title='Who are 
we?'/><author><name>Wunna Ko Ko</name><uri>http://www.blogger.com/profile/11703674113047905790</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
