請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31411
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 楊佳玲(Chia-Lin Yang) | |
dc.contributor.author | Yi-Hsuan Hsin | en |
dc.contributor.author | 辛逸軒 | zh_TW |
dc.date.accessioned | 2021-06-13T03:12:43Z | - |
dc.date.available | 2016-09-15 | |
dc.date.copyright | 2006-10-23 | |
dc.date.issued | 2006 | |
dc.date.submitted | 2006-09-19 | |
dc.identifier.citation | [1] Libi18n. http://www.mozilla.org/docs/refList/i18n/libi18n-desc.html.
[2] Unicode character database. http://www.unicode.org/Public/UNIDATA/. [3] Unihan database. http://www.unicode.org/charts/unihan.html. [4] Utf-8 on wikipedia. http://en.wikipedia.org/wiki/UTF-8. [5] Cy Cedar, David Veintimilla, Michel Suignard, and Asmus Freytag. Report from the trenches: Microsoft publisher goes unicode. In Proceedings of the Eleventh International Unicode Conference, 1997. [6] The Unicode Consortium. The Unicode Standard, Version 5.0. Addison-Wesley Professional, fifth edition, 2006. http://www.unicode.org/versions/Unicode5.0.0/. [7] Microsoft Coporation. The mlang library. http://msdn.microsoft.com/workshop/misc/mlang/mlang.asp. [8] Mark Davis. Unicode standard annex #29, text boundaries, 2005. http://www.unicode.org/reports/tr29/. [9] Asmus Freytag. Unicode standard annex #11, east asian width, 2005. http://www.unicode.org/reports/tr11/. [10] Asmus Freytag. Unicode standard annex #14, line breaking properties, 2005. http://www.unicode.org/reports/tr14/. [11] Donald E. Knuth. Digital Typography. Center for the Study of Language and Information - Lecture Notes; Reissue edition, 1998. [12] Donald E. Knuth and Michael F. Plass. Digital Typography, chapter Breaking Paragraphs into Lines. CSLI, 1978. [13] Markus Gunther Kuhn. UTF-8 and Unicode FAQ for Unix/Linux. http://www.cl.cam.ac.uk/%7Emgk25/unicode.html. [14] Ken Lunde. CJKV Information Processing. O’Reilly Media, second edition, 1998. [15] 辛逸軒. Irssi cjk and line breaking patches. PR#: ports/34305, ports/34343, ports/45912, ports/50374, ports/101126, ports/102777 in FreeBSD Problem Report Database http://www.freebsd.org/cgi/querypr.cgi. [16] 辛逸軒. Mutt line breaking patches. PR#: ports/34610 in FreeBSD Problem Report Database http://www.freebsd.org/cgi/query-pr.cgi. [17] 辛逸軒. PTT BBS DBCS Patches. https://opensvn.csie.org/traccgi/pttbbs/changeset/2781. [18] 辛逸軒. Screen cjk-width patch. https://savannah.gnu.org/bugs/?func=detailitem&item%5Fid=16666. [19] 曾士熊. 認識中文碼. 則團法人中文數位化技術推廣基金會. http://www.cmex.org.tw/cmex/word/00.htm. [20] 曾士熊. Unicode 與 ISO10646. 中央研究院計算中心通訊, 第16卷(第10, 11期), 民國89年. http://www.ascc.sinica.edu.tw/nl/. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31411 | - |
dc.description.abstract | 斷行(line breaking)指的是將文字字串分隔為適當長度,以符合顯示區域寬度的動作。而隨著處理多國語言的需要,各種作業系統及應用程式開始採用萬國碼(Unicode)作為標準內碼,因此能適當地對萬國碼字串斷行成為支援萬國碼的重要工作。本文實作了萬國碼標準附錄14(UAX#14)所建議的斷行演算法,並提供了中、日、韓、越文(CJKV)環境下的客製化選項。 | zh_TW |
dc.description.abstract | Line breaking is the process to divide long string into shorter lines to fit in display width. With the vast requirement of processing multilingual texts, many operating systems and applications have adopted Unicode as default character set. Therefore, breaking Unicode strings properly is an important part of supporting Unicode. In this thesis, we implement the algorithm proposed in Unicode Standard Annex 14(UAX#14), and provide some customization options for Chinese, Japanese, Korean, Vietnamese(CJKV) context. | en |
dc.description.provenance | Made available in DSpace on 2021-06-13T03:12:43Z (GMT). No. of bitstreams: 1 ntu-95-R91922040-1.pdf: 1040036 bytes, checksum: 3ef2b51c3295112bbc7c9b26ef249532 (MD5) Previous issue date: 2006 | en |
dc.description.tableofcontents | 1 緒論 1
1.1 研究動機 . . . . . . . . . . . . . . . 1 1.2 本文架構 . . . . . . . . . . . . . . . 2 2 背景 3 2.1 斷行 . . . . . . . . . . . . . . . . . 3 2.1.1 定義 . . . . . . . . . . . . . . . . 3 2.1.2 換行機會 . . . . . . . . . . . . . . 4 2.1.3 選擇換行機會 . . . . . . . . . . . . 4 2.2 萬國碼(Unicode) . . . . . . . . . . . 5 2.2.1 簡史 . . . . . . . . . . . . . . . . 5 2.2.2 編碼結構 . . . . . . . . . . . . . . 6 2.2.3 UTF-8 . .. . . . . . . . . . . . . . 6 2.2.4 Unicode Character Database . . . . . 7 2.3 相關研究 . . . . . . . . . . . . . . . 8 3 斷行演算法 9 3.1 斷行屬性 . . . . . . . . . . . . . . . 9 3.2 斷行規則 . . . . . . . . . . . . . . 13 3.3 表格化斷行規則 . . . . . . . . . . . 13 3.4 選擇換行機會 . . . . . . . . . . . . 14 4 實作與結果 16 4.1 介面 . . . . . . . . . . . . . . . . 16 4.2 結果 . . . . . . . . . . . . . . . . 17 5 結論 20 參考文獻 21 | |
dc.language.iso | zh-TW | |
dc.title | 符合萬國碼標準之斷行程式庫 | zh_TW |
dc.title | A Unicode Standard Compliant Line Breaking Library | en |
dc.type | Thesis | |
dc.date.schoolyear | 94-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 周承復(Cheng-Fu Chou),逄愛君(Ai-Chun Pang) | |
dc.subject.keyword | 萬國碼,斷行,中日韓越, | zh_TW |
dc.subject.keyword | Unicode,Line Breaking,CJKV, | en |
dc.relation.page | 22 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2006-09-21 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 資訊工程學研究所 | zh_TW |
顯示於系所單位: | 資訊工程學系 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-95-1.pdf 目前未授權公開取用 | 1.02 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。