{"id":23349,"date":"2023-03-30T03:05:16","date_gmt":"2023-03-30T03:05:16","guid":{"rendered":"https:\/\/www.booksofall.com\/ja\/?post_type=product&#038;p=23349"},"modified":"2023-03-30T03:05:16","modified_gmt":"2023-03-30T03:05:16","slug":"programming-with-unicode-documentation","status":"publish","type":"product","link":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/","title":{"rendered":"Programming with Unicode Documentation"},"content":{"rendered":"<h2>Chapter 3 &#8211;\u00a0Definitions<\/h2>\n<p><b>3.1 Character\u00a0<\/b><\/p>\n<p>Generic term for a <a href=\"https:\/\/en.wikipedia.org\/wiki\/List_of_logic_symbols\">semantic symbol<\/a>. Many possible interpretations exist in the context of encoding. In computing, the most important aspect is that characters can be letters, spaces or control characters which represent the end of a file or can be used to trigger a sound.<\/p>\n<p><b>3.2 Glyph\u00a0<\/b><\/p>\n<p>One or more shapes that may be combined into a grapheme. In <a href=\"https:\/\/en.wikipedia.org\/wiki\/Latin\">Latin<\/a>, a glyph often has 2 variants like \u2018A\u2019 and \u2018a\u2019 and <a href=\"https:\/\/en.wikipedia.org\/wiki\/Arabic\">Arabic<\/a> often has four. This term is context dependent and different styles or formats can be considered different glyphs. Most relevant in programming is that diacritic marks (e.g. accents like \u2018 and ^) are also glyphs, which are sometimes represented with another at one point, like the \u00e0 in ISO 8859-1 or as two separate glyphs, so an a and the combining \u2018 (U+0300 and U+0061 combined as U+00E0).<\/p>\n<p><b>3.3 Code point\u00a0<\/b><\/p>\n<p>A <a href=\"https:\/\/en.wikipedia.org\/wiki\/Code_point\">code point<\/a> is an<a href=\"https:\/\/www.ibm.com\/docs\/en\/aix\/7.2?topic=types-signed-unsigned-integers\"> unsigned integer<\/a>. The smallest code point is zero. Code points are usually written as hexadecimal, e.g. \u201c0x20AC\u201d (8,364 in decimal).<\/p>\n<p><b>3.4 Character set (charset)\u00a0<\/b><\/p>\n<p>A character set, abbreviated charset, is a mapping between code points and characters. The mapping has a fixed size. For example, most 7 bits encodings have 128 entries, and most 8 bits encodings have 256 entries. The biggest charset is the Unicode Character Set 6.0 with 1,114,112 entries.<\/p>\n<p>In some charsets, code points are not all contiguous. For example, the cp1252 charset maps code points from 0 though 255, but it has only 251 entries: 0x81, 0x8D, 0x8F, 0x90 and 0x9D code points are not assigned. Examples of the ASCII charset: the digit five (\u201c5\u201d, U+0035) is assigned to the code point 0x35 (53 in decimal), and the uppercase letter \u201cA\u201d (U+0041) to the code point 0x41 (65). The biggest code point depends on the size of the charset. For example, the biggest code point of the ASCII charset is 127 (27 \u2212 1)<\/p>\n<p>Charset examples:<\/p>\n<p id=\"dSSpuzn\"><img loading=\"lazy\" decoding=\"async\" width=\"360\" height=\"112\" class=\"alignnone size-full wp-image-23354 \" src=\"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/img_6424fa22a6850.png\" alt=\"\" \/><\/p>\n<p><b>3.5 Character string\u00a0<\/b><\/p>\n<p>A <a href=\"https:\/\/www.ibm.com\/docs\/en\/db2-big-sql\/7.0?topic=list-character-strings\">character string<\/a>, or \u201cUnicode string\u201d, is a string where each unit is a character. Depending on the implementation, each character can be any Unicode character, or only characters in the range U+0000\u2014U+FFFF, range called the Basic Multilingual Plane (BMP). There are 3 different implementations of character strings:<\/p>\n<ul>\n<li>array of 32 bits unsigned integers (the UCS-4 encoding): full <a href=\"https:\/\/home.unicode.org\/\">Unicode<\/a> range<\/li>\n<li>array of 16 bits unsigned integers (UCS-2): BMP only<\/li>\n<li>array of 16 bits unsigned integers with surrogate pairs (UTF-16): full Unicode range<\/li>\n<\/ul>\n<p>UCS-4 uses twice as much memory than UCS-2, but it supports all Unicode characters. UTF-16 is a compromise between UCS-2 and UCS-4: characters in the BMP range use one UTF-16 unit (16 bits), characters outside this range use two UTF-16 units (a surrogate pair, 32 bits). This advantage is also the main disadvantage of this kind of character string. The length of a character string implemented using UTF-16 is the number of UTF-16 units, and not the number of characters, which is confusing. For example, the U+10FFFF character is encoded as two UTF-16 units: {U+DBFF, U+DFFF}. If the character string only contains characters of the BMP range, the length is the number of charac- ters. Getting the nth character or the length in characters using UTF-16 has a complexity of ?(?), whereas it has a complexity of ?(1) for UCS-2 and UCS-4 strings. The Java language, the Qt library and Windows 2000 implement character strings with UTF-16. The C and Python languages use UTF-16 or UCS-4 depending on: the size of the wchar_t type (16 or 32 bits) for C, and the compilation mode (narrow or wide) for Python. Windows 95 uses UCS-2 strings.<\/p>\n<p>See also:<\/p>\n<p><em>UCS-2, UCS-4 and UTF-16 encodings, and surrogate pairs.<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p><iframe style=\"width: 100%; height: 750px; border: none;\" src=\"https:\/\/online.visual-paradigm.com\/share\/book\/programming-with-unicode-1axl46bc5m?p=1\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n","protected":false},"featured_media":23353,"template":"","meta":{"_yoast_wpseo_title":"","_yoast_wpseo_metadesc":"Unicode is assigns unique numerical values to characters and symbols from a vast range of writing systems. Learn more about programming with it here."},"product_brand":[],"product_cat":[374],"product_tag":[],"class_list":{"0":"post-23349","1":"product","2":"type-product","3":"status-publish","4":"has-post-thumbnail","6":"product_cat-misc","8":"first","9":"instock","10":"shipping-taxable","11":"product-type-simple"},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.1.1 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Programming with Unicode Documentation - BooksOfAll Japanese<\/title>\n<meta name=\"description\" content=\"Unicode is assigns unique numerical values to characters and symbols from a vast range of writing systems. Learn more about programming with it here.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/\" \/>\n<meta property=\"og:locale\" content=\"ja_JP\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Programming with Unicode Documentation - BooksOfAll Japanese\" \/>\n<meta property=\"og:description\" content=\"Unicode is assigns unique numerical values to characters and symbols from a vast range of writing systems. Learn more about programming with it here.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/\" \/>\n<meta property=\"og:site_name\" content=\"BooksOfAll Japanese\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:image\" content=\"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg\" \/>\n<meta name=\"twitter:label1\" content=\"\u63a8\u5b9a\u8aad\u307f\u53d6\u308a\u6642\u9593\" \/>\n\t<meta name=\"twitter:data1\" content=\"3\u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/\",\"url\":\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/\",\"name\":\"Programming with Unicode Documentation - BooksOfAll Japanese\",\"isPartOf\":{\"@id\":\"https:\/\/www.booksofall.com\/ja\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg\",\"datePublished\":\"2023-03-30T03:05:16+00:00\",\"description\":\"Unicode is assigns unique numerical values to characters and symbols from a vast range of writing systems. Learn more about programming with it here.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#breadcrumb\"},\"inLanguage\":\"ja\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#primaryimage\",\"url\":\"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg\",\"contentUrl\":\"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg\",\"width\":\"827\",\"height\":\"1169\",\"caption\":\"Programming with Unicode Documentation\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.booksofall.com\/ja\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Categories\",\"item\":\"https:\/\/www.booksofall.com\/ja\/categories\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Programming with Unicode Documentation\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.booksofall.com\/ja\/#website\",\"url\":\"https:\/\/www.booksofall.com\/ja\/\",\"name\":\"BooksOfAll Japanese\",\"description\":\"Biggest IT eBooks library and learning resources - Free eBooks for programming, computing, artificial intelligence and more.\",\"publisher\":{\"@id\":\"https:\/\/www.booksofall.com\/ja\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.booksofall.com\/ja\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ja\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.booksofall.com\/ja\/#organization\",\"name\":\"BooksOfAll Japanese\",\"url\":\"https:\/\/www.booksofall.com\/ja\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/www.booksofall.com\/ja\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2022\/06\/booksofall-logo-2.png\",\"contentUrl\":\"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2022\/06\/booksofall-logo-2.png\",\"width\":166,\"height\":30,\"caption\":\"BooksOfAll Japanese\"},\"image\":{\"@id\":\"https:\/\/www.booksofall.com\/ja\/#\/schema\/logo\/image\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Programming with Unicode Documentation - BooksOfAll Japanese","description":"Unicode is assigns unique numerical values to characters and symbols from a vast range of writing systems. Learn more about programming with it here.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/","og_locale":"ja_JP","og_type":"article","og_title":"Programming with Unicode Documentation - BooksOfAll Japanese","og_description":"Unicode is assigns unique numerical values to characters and symbols from a vast range of writing systems. Learn more about programming with it here.","og_url":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/","og_site_name":"BooksOfAll Japanese","og_image":[{"url":"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg","type":"","width":"","height":""}],"twitter_card":"summary_large_image","twitter_image":"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg","twitter_misc":{"\u63a8\u5b9a\u8aad\u307f\u53d6\u308a\u6642\u9593":"3\u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/","url":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/","name":"Programming with Unicode Documentation - BooksOfAll Japanese","isPartOf":{"@id":"https:\/\/www.booksofall.com\/ja\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#primaryimage"},"image":{"@id":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#primaryimage"},"thumbnailUrl":"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg","datePublished":"2023-03-30T03:05:16+00:00","description":"Unicode is assigns unique numerical values to characters and symbols from a vast range of writing systems. Learn more about programming with it here.","breadcrumb":{"@id":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#breadcrumb"},"inLanguage":"ja","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/"]}]},{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#primaryimage","url":"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg","contentUrl":"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2023\/03\/Programming-with-Unicode-Documentation.jpg","width":"827","height":"1169","caption":"Programming with Unicode Documentation"},{"@type":"BreadcrumbList","@id":"https:\/\/www.booksofall.com\/ja\/programming-with-unicode-documentation\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.booksofall.com\/ja\/"},{"@type":"ListItem","position":2,"name":"Categories","item":"https:\/\/www.booksofall.com\/ja\/categories\/"},{"@type":"ListItem","position":3,"name":"Programming with Unicode Documentation"}]},{"@type":"WebSite","@id":"https:\/\/www.booksofall.com\/ja\/#website","url":"https:\/\/www.booksofall.com\/ja\/","name":"BooksOfAll Japanese","description":"Biggest IT eBooks library and learning resources - Free eBooks for programming, computing, artificial intelligence and more.","publisher":{"@id":"https:\/\/www.booksofall.com\/ja\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.booksofall.com\/ja\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ja"},{"@type":"Organization","@id":"https:\/\/www.booksofall.com\/ja\/#organization","name":"BooksOfAll Japanese","url":"https:\/\/www.booksofall.com\/ja\/","logo":{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/www.booksofall.com\/ja\/#\/schema\/logo\/image\/","url":"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2022\/06\/booksofall-logo-2.png","contentUrl":"https:\/\/www.booksofall.com\/ja\/wp-content\/uploads\/sites\/4\/2022\/06\/booksofall-logo-2.png","width":166,"height":30,"caption":"BooksOfAll Japanese"},"image":{"@id":"https:\/\/www.booksofall.com\/ja\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/www.booksofall.com\/ja\/wp-json\/wp\/v2\/product\/23349","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.booksofall.com\/ja\/wp-json\/wp\/v2\/product"}],"about":[{"href":"https:\/\/www.booksofall.com\/ja\/wp-json\/wp\/v2\/types\/product"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.booksofall.com\/ja\/wp-json\/wp\/v2\/media\/23353"}],"wp:attachment":[{"href":"https:\/\/www.booksofall.com\/ja\/wp-json\/wp\/v2\/media?parent=23349"}],"wp:term":[{"taxonomy":"product_brand","embeddable":true,"href":"https:\/\/www.booksofall.com\/ja\/wp-json\/wp\/v2\/product_brand?post=23349"},{"taxonomy":"product_cat","embeddable":true,"href":"https:\/\/www.booksofall.com\/ja\/wp-json\/wp\/v2\/product_cat?post=23349"},{"taxonomy":"product_tag","embeddable":true,"href":"https:\/\/www.booksofall.com\/ja\/wp-json\/wp\/v2\/product_tag?post=23349"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}