X-Git-Url: http://git.shiar.net/unicode-sampler.git/blobdiff_plain/6fcd62bdc5cbf7210bc928021af99161a77894fd..d1f81a3feba94014b37d41667c8da988b1bf45ba:/unicode.txt?ds=sidebyside
diff --git a/unicode.txt b/unicode.txt
index 770e629..b9f2cba 100644
--- a/unicode.txt
+++ b/unicode.txt
@@ -1,9 +1,9 @@
Unicode sampler
â¾â¾â¾â¾â¾â¾â¾â¾â¾â¾â¾â¾â¾â¾â¾
-Test support of various text encoded with Unicode up to version 8.0 (2015).
+Test support of various text encoded with Unicode up to version 10.0 (2017).
Based on file by Markus Kuhn
Hash[ :nbsp => 0O2_40 ].each {|name, cp| puts "#{name} is '#{cp.chr}'" } - while ((c = *l++) != '\0') { m->stat[2] = IO | (~OK & X_8); } + while ((c = *l++) != '\0') { m->stat[2] = IO | (~OK & X_8); } /* C */ perl -pe's/\w/$^ =~ $& > chop($^ = $& . $^) ? "@-" : $&/ge' + fix$(<$>)<$>(:)<*>((<$>((:[{- hs -}])<$>))(=<<)<$>(*)<$>(>>=)(+)($))$1 + â1 âµâ¨.â§3 4=+/,¯1 0 1â.â¯1 0 1â½Â¨ââµ â game of life Mathematics and sciences: @@ -40,7 +50,7 @@ Mathematics and sciences: Proper typography: - ⢠Lookalikes: 1lI|, 0OD, 8B, 2Z, 5S$ + ⢠Lookalikes: 1lI|, 0ODÃ, 8B, 2Z, 5S$, AÐÎê®áªð ð½ ⢠âItâs âquotedââ, âdeutsche âGänsefüÃchenââ, «â¯guillemets â¹comme ciâºâ¯Â» ⢠u + ¨ + ´ = Ç, o + ~ + ¯ = È, e + ^ + ` = á», e + ¸ + Ë = Ḡ⢠1 + 2 â 3 à 4 ÷ 5 â 0â1â° â 0° 21â² 36â³ @@ -50,16 +60,16 @@ Proper typography: English panphone (traditional, IPA, Shavian, Braille): Just as the French queen looked for it, she heard that symphony again. - A beige hue on the waters of the loch impressed all, including young Arthur. + A beige hue of the loch water impressed all, including young Arthur. ʤÊst æz Ã°É fɹÉnʧ kÊ°Êin lÊkÌt fo ɪÌt | Êi ɦÉd ðat ËsɪɱfÉni ÉËgÉn - É beÊ Ã§Ê Én Ã°É ËwÉtÌ Éz Év Ã°É lÉÏ ÉªmËpʰɹÉst ÊÉËÉ« | ɪÅËkludɨŠjÉÅ ËÉÉ¹Î¸É + É beÊ Ã§Ê Év Ã°É lÉÏ ËwÉtÌ É ÉªmËpʰɹÉst ÊÉËÉ« | ɪÅËkludɨŠjÉÅ ËÉÉ¹Î¸É ð¡ð³ðð ð¨ð ð ðð®ð§ð¯ð ðð¢ð°ð¯ ð¤ð«ðð ð ð¦ð, ðð° ð£ð½ð ðð¨ð ðð¦ð¥ðð©ð¯ð° ð©ðð§ð¯. - ð© ðð±ð ð£ð¿ ðªð¯ ð ð¢ð·ðð¼ð ð ð ð¤ðªð ð¦ð¥ðð®ð§ðð ð·ð¤, ð¦ððð¤ðµðð¦ð ðð³ð ·ð¸ðð». + ð© ðð±ð ð£ð¿ ð ð ð¤ðªð ð¢ð·ðð¼ ð¦ð¥ðð®ð§ðð ð·ð¤, ð¦ððð¤ðµðð¦ð ðð³ð ·ð¸ðð». â â â â µâ â ®â â â â ⠢⠡â â â ¥â â ¢â â â â â â «â â ¿â â â â â ©â â â â â â â â â â â ½â â â â â â ½â â â â ² - â â â â â â â â â â â ¥â â â â â â ®â â ºâ â â »â â â ·â â ®â â â â ¡â â â â â â â â â «â â â â â â â â â â ¥â â ¬â â â ½â â â ⠹⠥â â ² + â â â â â â â â â â â ¥â â â ·â â ®â â â â ¡â â ºâ â â »â â â â â â â â â «â â â â â â â â â â ¥â â ¬â â â ½â â â ⠹⠥â â ² Ãnglisc Åhthere & Ç·ulfstÄn: @@ -72,7 +82,7 @@ Precomposed and combining diacritics: Muļķa hipiji mÄÄ£ina nogarÅ¡ot žÅaudzÄjÄÅ«sku. Trâu cháºm uá»ng nÆ°á»c Äục. Mul̦k̦a hipiji meÌgÌina nogarsÌot zÌn̦audzeÌjcÌuÌsku. TraÌu chaÌ£Ìm uoÌÌng nuÌoÌÌc ÄuÌ£c. - STARGÉ ÌTE ⢠a = vÌ = rÌ, aâ ⥠bâ ⢠1̴·2ââ¯Â·3̶Ì̮·4ÌỊ̤̀·5âÍÍÌ̹ + STARGÉ ÌTE ⢠a = vÌ = rÌ, aâ ⥠bâ ⢠1̴·2ââ¯Â·3̶Ì̮·4ÌỊ̤̀·5âÍÍÌ̹ ⢠ZÌ´ÌÍaÍÌÍÌÍ̤̲ÌÌ̼Í̷̯lÍÍÌ¿ÌÌÍÍÍ¡ÌÍ̮̲ÍÍÍ̸̪gÌÌÍÌÌÌÌÍ̬Ì̸̧oÌÍÌÍÍÍÌÍÌ Ì¨Į̤̫́Į̩̀Ì̸!ÍÌÌÍ ÌÍÌÌÌÍÍÌ Í̬̪ Pangrams: @@ -86,7 +96,7 @@ Pangrams: is: Sævör grét áðan þvà úlpan var ónýt. lt: Ä®linkdama fechtuotojo Å¡paga sublykÄiojusi pragrÄÅ¾Ä apvalų arbÅ«zÄ . lv: GlÄžšķūÅa rÅ«Ä·Ä«Å¡i dzÄrumÄ Äiepj Baha koncertflÄ«Ä£eļu vÄkus. - naq: ÇKam ÇÅ©i-aob gye Çẽib di gÅ«na Çhomi Çna gye ÇÅ©i hã i. + nmn: ÇqháaÌ° kÅ« Çnûm Çɢˤûlitê Çè dtxóÊlu Çnà e Çʼá sˤà aÌ°. nl: Wijf lokt u cq 'r pa dmv 'n zg sexy bh. (af: én Å kwêvoëltjie) ro: MuzicologÄ Ã®n bej vând whisky Èi tequila, preÈ fix. se: Vuol Ruoŧa geÄggiid leat máÅga luosa ja Äuovžža. @@ -100,9 +110,13 @@ German with presentational ligatures: Im ï¬nï¬ eren JagdÅ¿chloà am oï¬enen FelsquellwaÅ¿Å¿er patzte der aï¬gâï¬atterhafte kauzigâhöfâliche Bäcker über Å¿einem verÅ¿ifften kniï¬igen CâXylophon. -Common homographs: +Taa/!xóõ: - AÎÐáªê®ð ð½ OÎÐÕâ²×¡ßᲿê³ððð«ðáðá ê¢ð°ð«©âµð© 0ð¾âãê¨ + uÊ°Çei ÇgÊm sa ce te buÇei ba ÇÊÉnʼse Ça qaisa i ÇgoÌÊ°oÌÊ° ce tÊÌ°ÊmÌ© kaÌ ÇÊ°uÌ ceÇe + beÅkele Çi ei ʼÇÅaÌ°an ce. xabeka ÇaÇi ÇÊÌ°ÊnÌ© i teÌʼeÌ eÊ°ÇʼaÌoÌku ci dzaÌ°ai ce ÊaÉe + i kaneka ÇaÊ°eÊ° ku ÇaÌ°alute te iʼe ÊaÉe eÊ°kaÌ° ba ʼao ʼahnÌ© i ba sa tsʼÉnci + ÇuÊa Êiqatʲe BoroÇxao ÇgÊm ce xabeka ÇaÌ°asa ÇÊÌ°Ên i teÌʼeÌ n̩ʼnÌ© ce ÇxaÌ°a + kuÇÅÊmʼu n̩ʼnÌ© ÇgÊma tÊ°ani. Modern Greek ÎÎ¼Î½Î¿Ï ÎµÎ¹Ï Ïην ÎÎ»ÎµÏ Î¸ÎµÏίαν: @@ -160,6 +174,15 @@ Kazakh equivalents: دÛÙÙÛÚ¯Û ÙÛÙÛدÙ. ادا٠دارعا اÙÙÙ-پاراسات, ار-Ùجدا٠بÛرÙÙÚ¯ÛÙ, سÙÙدÙÙتا٠ÙÙار ءبÙر-بÙرÙÙ Û٠تÛÙستÙÙ, باÛÙر٠اÙدÙÙ ÙارÙÙ -ÙاتÙÙاس جاساÛÙار٠ءتÙÙس. +Arabic: + + ﻧﺺ ﺣï»ï»´ï»¢ ï»ï»ª ﺳﺮ ï»ïºï»ï» ï»ïº«ï» ﺷïºï»¥ ï»ï»ï»´ï»¢ ﻣï»ïºï»®ïº ï»ï» ï»° ïºï»®ïº ïºïº§ï»ïº® ï»ï»£ï»ï» ï» ïºïº ï» ïºª ïºïº¯ïºï». + Ùص ØÙÙÙ Ù٠سر Ùاطع Ùذ٠شأ٠عظÙÙ Ù ÙتÙب عÙÙ Ø«Ùب أخضر Ù٠غÙ٠بجÙد أزرÙ. + ÙÙصÙÙ ØÙÙÙÙÙ Ù ÙÙÙ٠سÙرÙÙ ÙÙاطÙع٠ÙØ°ÙÙ Ø´ÙØ£Ù٠عÙظÙÙÙÙ Ù Ù ÙتÙب٠عÙÙÙÙ Ø«ÙÙÙب٠أخÙضÙر٠ÙÙ ÙغÙÙÙ٠بÙجÙÙÙد٠أزÙرÙÙ. + + Naá¹£un ḥakymun lahu syrun qÄá¹iÊ¿un wa á¸u Å¡Änin Ê¿áºymin + maktubun Ê¿ala ṯubin aáºá¸ra wa muÄ¡alafun biǧildin azraq. + Hebrew: ××רש×× ×עת ××× ×¡ Unicode ×××× ××××× ×עש×ר×, ש×××¢×¨× ××× ×ת×ר×××× 12Ö¾10 ××רץ @@ -173,11 +196,8 @@ Zarka Table (Torah cantillation): ×ַרְקָ×Ö® סְ××Ö¹×ְתָּ×Ö ××Ö¼× Ö·×Ö¾×Ö°×ַרְ×ÖµÖ£×Ö¼× ××Ö¼× Ö·Ö£× ×¨Ö°×Ö´Ö××¢Ö· פָּ×ֵר־קָ×Ö¸Ö¡× ×ªÖ°Ö¼×Ö´×ש×Ö¸×Ö¾×Ö°Ö ××Ö¹×Ö¸× ×ªÖ°Ö¼×Ö´×ש×Ö¸×־קְ×Ö·× Ö¸×Ö© ×Ö·×Ö°×Ö¸Ö¨× ×Ö¼Ö¶Ö×¨Ö¶×©× ×Ö°×ֻפָּ֤×Ö° פַּשְ××Ö¸×Ö ×ָקֵף־קָ×Ö¸Ö× ×ִפְ×Ö¸Ö× ×Ö·×ªÖ°× Ö¸Ö× ×ַּרְ×Ö¸Ö¼Ö§× ×ªÖ°Ö¼×Ö´Ö×ר ×ִפְ×Ö¸Ö× ×ֵרְ×Ö¸Ö¥× ×¡Ö´×Ö¼Ö½×Ö¼×§× -Zalgo text: - - T̫̺̳oÌ¬Ì Ã¬Ì¬Í̲ÌnvÌÌ̻̣̹ÌoÍÌÌ Ì̤kÍÍ̹Í̼e̦Ì̪ÍÌªÍ Ì¬Í tÌhÌ ÍÌ®ÍÍe̱ÌÌÍÌ Ì¥ÍÌ«Í̪ÍÌ£Íḥi̼̦Í̼vÒÌ©ÌÍÌÍeÍÌÌ»Í̦̤-mÌ·ÌÌ̱ÃÍÌ̦̳nÌ̲̯ÌÌ®Íd̴̺̦ÍÌ« ÌÌÌÍÍrÌÍÌÌÍÍÌ«Í¢epÍrÌ̯ÌÍÍÍ̺eÌ´sÌ¥e̵Ì̳ÍÍÌ©ÌnÌ¢Í̪ÍÌÌ°Ì Ì¦t̺ÌÌ°iÍnÒ̮̦ÌÌgÌ®Í̱̻ÍÌ̳ ̳cÌÌ®ÌÌ£Ì°Ì Ì©hÌ·ÌÍÌÍÌÍÍa̧Í̯̹̲̺̫óÌÌỊ̯̀Ís̶̤̮̩Ì.̨̻̪ÌÍ Ì³Ì̦ÌÌ̦ÌÌIÌ ÍÌ®nÍ̹̪̬vÌ´ÍÌÌÌo̸kÒ̬̤ÍÍÌ ÍiÍnÌ̩̹ÍÌ̹gÍ Ì Ì¥Í tÌ°ÍÍh̫̼̪eÌÌ©Ì ÌÌ Ì²Ì«Ífe̤ÍÌ̱eÍÌ®Ì Ì¹ÌÍÍlÍ̲ÌÍÌ ÌªiÌ¢ÌÍÌ®Ì̯ÍÌ©n̸̰gÌ̱ÌÌÍÌ¬Í ÍoÍÍ̩̮͢fÌÍ̦̥ ÌÍc̵̫̱ÌÍÍ̦hÍaÌÍÍ̳̣ÍÍoÍÌs̤Ì.ÌÌÌÌ£Ì³Ì¼Í - Ethiopic (Amharic, Blin, Sebatbeit): + á©áá®áµ áá¥á«áá³áá± áá°áᣠá©áá®áµ ááá°ááµáá¡ á©áá®áµ á¥áá á¤ááµ áá°áᤠááááá ááááµ á®áááá°á á¢ááᣠá£á» á®ááá©á°ááá á£á½áá¡ ááá á¤ááµ á®ááá°á á¢á¸áᤠááááá ááááµ áá®áá«á á¢ááᣠá£á» áá®áá«ááá á£á½áá¡ ááá á¤ááµ áá¾áºáá á¢á¸áᤠ@@ -290,6 +310,8 @@ Japanese Iroha: æµ ã夢è¦ã ãããããã¿ã ã¢ãµãã¦ã¡ã㸠アサキï¾ï¾ï¾ï½¼ï¾ é¿ä½ä¼å©å¥³ç¾ä¹ é ã²ããã ãã²ããããã ã±ãã¢ã»ãºã³ã ウェï¾ï¾ï½¾ï½½ï¾ï¾ æµæ¯æ¯å¢é + hentaigana å¤ä½ä»®å: ððð¦ððð¶ð» ð¦ð¶ðð¸ð ððð«ððð ð©ðððð + Chinese: ⣠Most common characters: @@ -297,7 +319,7 @@ Chinese: è¦å°±åºä¼å¯ä¹ä½ 对çè½èåé£å¾äºçä¸èªä¹å¹´è¿ååä½é ⣠Extension blocks: - Aã¡ã¬ã§äµ Bð££ð¤¶ ðªð¦ C𪢨ðªªð«ºð«´ Dð«ð«ð«»ð« Eð« «ð¬ð¬³ð¬º¡ + Aã¡ã¬ã§äµ Bð££ð¤¶ ðªð¦ C𪢨ðªªð«ºð«´ Dð«ð«ð«»ð« Eð« «ð¬ð¬³ð¬º¡ F𬺰ðð¢ð¯´ ⣠QiÄn zì wén ååæ by Xing Si Zhou: 天å°çé»å®å®æ´ªè æ¥æçæ辰宿åå¼µ å¯ä¾æå¾ç§æ¶å¬è @@ -310,6 +332,22 @@ Chinese: å¾ ã ⢠ãã©Ë lÇ /ly˥˩/ ⢠leotⶠ/løtÌ˨/ é©¢/é©´ ⢠ãã©Ë lÇ /ly˧˥/ ⢠leoiâ´ /løy˨˩/ + ⣠shinjitai differences: + å ©æ¡å®å´å³å¹å ååå£çååå賣廣絲覽è±éé½å»³è¾¯ + 两æ¶åä¸¥ä¼ ä»·å¿åå³åè¥å¢å¾å´å广ä¸è§ä¸°å ³é½¿å 辩 + 両æªåå³ä¼ä¾¡å åå´å§å¶å£å³å²å£²åºç³¸è¦§è±é¢æ¯åºå¼ + +Alternate English (Deseret): + + ðð²ð ð» ð°ð ð ððð¯ðð½ ð¿ð¶ð¨ð ðð³ð¿ð¼ ðð«ð ð®ð», ðð¨ ð¸ð²ðð¼ ðð°ð» ð ð®ððð²ðð¨ ð°ðð¯ð. + ð ðºð©ð ð¸ð ð±ð ð ððªð¿ ð¶ð«ð»ð²ð ð®ðð¹ðð¯ð ð¼ ð«ð, ð®ðð¿ððð¼ð®ð ð·ð²ð ðððð. + +Enclosed/mathematical letters: + Fá´ÊTÊá´Wɪɴ ᶠáµÊ³áµÊ°áµáµâ±â¿ ᵩᷫâͦᵣͬâÍâͪâͤʬᵢͥâá· uá´ÊÇÉ¥ê±É¹oâ² + ð ð¨ð«ðð¡ððð¢ð§ ð¹ðððâðððð ðððð»ððð¾ðð â±â´ðð¯ð½â¯ð²ð¾ð ðð¸ð»ð£ð±ð®ð¦ð²ð· ðð¬ð¯ðð¥ð¢ðð¦ð« ð½ð ð£ðððððð + ð±ððð¿ððððð ð¥ððð³ðð¾ð¶ðð ðð¼ð¿ð§ðµð²ðªð¶ð» ðð°ð³ðð©ð¦ððªð¯ ðð¤ð§ðððððð£ ðµðððððððð ðâªâð£â£â ð¦â¤â© + â»ââ¡ââââââ ðµð ð ð ð £ð ¦ ð µðð + Box drawing alignment tests: ââ¬ââââ¥â ââââ¤âââ ââââ³âââ ââââ â»â· âââââââ â âââââ âââââââââ