Misplaced Pages

Template:ISO 15924 script codes and related Unicode data: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 01:13, 4 February 2021 editDePiep (talk | contribs)Extended confirmed users294,285 editsNo edit summary← Previous edit Revision as of 01:16, 4 February 2021 edit undoDePiep (talk | contribs)Extended confirmed users294,285 editsNo edit summaryNext edit →
Line 44: Line 44:
{{ISO 15924 script codes and related Unicode data/row|1=Elym|version=12.0 |chars=23 |note=Ancient/historic}} {{ISO 15924 script codes and related Unicode data/row|1=Elym|version=12.0 |chars=23 |note=Ancient/historic}}
{{ISO 15924 script codes and related Unicode data/row|1=Ethi|version=3.0 |chars=495 |note=}} {{ISO 15924 script codes and related Unicode data/row|1=Ethi|version=3.0 |chars=495 |note=}}
{{ISO 15924 script codes and related Unicode data/row|1=Geok|version= |chars= |note=Unicode groups ({{slink||Geok}}) and ({{slink||Geor}}) together as "Georgian"}} {{ISO 15924 script codes and related Unicode data/row|1=Geok|version= |chars= |note=Unicode groups "Khutsori", "Asomtavruli" and "Nuskhuri" into 'Georgian' ({{slink||Geok}}) and ({{slink||Geor}}) together as "Georgian"}}
{{ISO 15924 script codes and related Unicode data/row|1=Geor|version=1.0 |chars=173 |note=In Unicode, also includes Geok (Nuskhuri)}} {{ISO 15924 script codes and related Unicode data/row|1=Geor|version=1.0 |chars=173 |note=In Unicode, also includes Geok (Nuskhuri)}}
{{ISO 15924 script codes and related Unicode data/row|1=Glag|version=4.1 |chars=132 |note=Ancient/historic}} {{ISO 15924 script codes and related Unicode data/row|1=Glag|version=4.1 |chars=132 |note=Ancient/historic}}

Revision as of 01:16, 4 February 2021

Scripts in ISO 15924 and in Unicode
ISO 15924 Script in Unicode
Code ISO number ISO formal name Directionality Unicode Alias Version Characters Notes Description
] ZZ—
] ZZ— Not in Unicode, proposal is explored
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— Typographic variant of Arabic (§ Arab)
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— Not in Unicode, proposal is explored
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ—
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic, Disunified from Greek in 4.1
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic
] ZZ— ; Includes typographic variant Old Church Slavonic (§ Cyrs)
] ZZ— Typographic variant of Cyrillic (§ Cyrl); Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— Not in Unicode
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Unicode groups "Khutsori", "Asomtavruli" and "Nuskhuri" into 'Georgian' (§ Geok) and (§ Geor) together as "Georgian"
] ZZ— ; In Unicode, also includes Geok (Nuskhuri)
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— ; Directionality sometimes as boustrophedon
] ZZ—
] ZZ—
] ZZ— See § Hani, § Bopo
] ZZ— ; Hangul syllables relocated in 2.0
] ZZ—
] ZZ—
] ZZ— Subset of Han (Hanzi, Kanji, Hanja) (§ Hani)
] ZZ— Subset of § Hani
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— ; See § Hira, § Kana
] ZZ— ; Ancient/historic
] ZZ— Not in Unicode, proposal is explored
] ZZ— ; Ancient/historic
] ZZ— Subset of § Hang
] ZZ—
] ZZ— See § Hani, § Hira and § Kana
] ZZ— Not in Unicode
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— See § Hani, § Hang
] ZZ— Not in Unicode, proposal is explored
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— Typographic variant of Latin (§ Latn)
] ZZ— Typographic variant of Latin (§ Latn)
] ZZ— ; See also : Latin script in Unicode
] ZZ— Not in Unicode
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— Not in Unicode, proposal is explored
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— Not in Unicode
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— ; Mong includes Clear and Manchu scripts
] ZZ— Not in Unicode, proposal is explored
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— Not in Unicode
] ZZ— Not in Unicode, proposal is explored
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ—
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— Not in Unicode
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic
] ZZ— Rejected for inclusion in Unicode
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— Not in Unicode
] ZZ— Not in Unicode
] ZZ— Not in Unicode
] ZZ—
] ZZ—
] ZZ— Not in Unicode, proposal is explored
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ—
] ZZ—
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ— ; Includes variants Estrangelo (§ Syre), Western (§ Syrj), and Eastern (§ Syrn)
] ZZ— Typographic variant of Syriac (§ Syrc)
] ZZ— Typographic variant of Syriac (§ Syrc)
] ZZ— Typographic variant of Syriac (§ Syrc)
] ZZ—
] ZZ—
] ZZ—
] ZZ—
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ—
] ZZ— Not in Unicode
] ZZ—
] ZZ—
] ZZ—
] ZZ—
] ZZ— ; Added in 1.0, removed in 1.1 and reintroduced in 2.0
] ZZ—
] ZZ— Not in Unicode
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— Not in Unicode
] ZZ—
] ZZ—
] ZZ— Not in Unicode, proposal is explored
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— ; Ancient/historic
] ZZ—
] ZZ— Not a 'script' in Unicode
] ZZ— Not a 'script' in Unicode
] ZZ— Not a 'script' in Unicode
] ZZ— Not a 'script' in Unicode
] ZZ—
] ZZ— ; In Unicode: All other code points
Notes
  1. ISO 15924 publications As of 25 January 2021
  2. ISO 15924 Normative text file As of 25 January 2021
  3. ISO 15924 Changes (including Aliases for Unicode; as of 25 January 2021)
  4. Unicode version 13.0
  5. Unicode charts
  6. Unicode uses the "Property Value Alias" (Alias) as the script-name. These Alias names are part of Unicode and are published informatively next to ISO 15924. An alias script name may be used in a character name: Palm, Palmyrene → U+10860 𐡠 PALMYRENE LETTER ALEPH.
  7. In Unicode, the Phoenician script is intended for the representation of text in Paleo-Hebrew, Archaic Phoenician, Phoenician, Early Aramaic, Late Phoenician cursive, Phoenician papyri, Siloam Hebrew, Hebrew seals, Ammonite, Moabite, and Punic.
Template documentation[view] [edit] [history] [purge]

This documentation is shared between templates {{Unicode blocks}} and {{ISO 15924 script codes and related Unicode data}}.

Usage

The template can be used as usual. It is not a navigation box, so it can be everywhere in an article. The notes are contained within the template, and will not appear in the main References part.

Note: when resolving red links or wrong links, edit {{ISO 15924/wp-article}}. That is where the connection between ISO code and a Misplaced Pages article is made.

ISO 15924 templates

ISO 15924Unicode – Wikidata – enwiki: Overview templates & properties
Item In template /subs Content Example Publisher Usage TPU Note
Code (ISO) {{ISO 15924 code}} /subp ID Arab ISO 15924 Everywhere Alpha-4, enwiki central ISO script id list
Alias (Unicode) {{ISO 15924 alias}} /subp ID Arabic Unicode
Article (enwiki) {{ISO 15924/wp-article}} /subp ID ] enwiki
QID (wikidata) {{ISO 15924/qid}} /subp ID Q790681 Wikidata
Number; range 000–999 {{ISO 15924 number}} /subp ID 234 ISO 15924 rarely ISO number not used as ID in enwiki; see Code
Scripts (sub)merged into main scripts {{ISO 15924 alias/unicode-merged-into-script}} /subp Merged scripts Latf → Latn Unicode Script descriptions, re U+ In mainspace: 10× hardcoded (e.g.); 2× Qxxx depr
Name {{ISO 15924 name}} /subp data Deseret (Mormon) ISO 15924
Unicode chapter {{ISO 15924/unicode-chapter}} /subp data Ch 18.1 Unicode pdf does not open at .n subchapter
Script example
character
{{ISO 15924/script-example-character}} /subp data ع‎ enwiki User boxes
In Mainspace
Overview
  • ISO
  • U
  • enwiki
{{ISO 15924 script codes and related Unicode data}} /subp list enwiki ISO 15924 Mainspace: ISO 15924, Script (Unicode), Unicode character property
Blocks ⇄ Scripts {{Unicode blocks}} /subp list enwiki some script articles Mainspace; related
graphs {{ISO 15924/unicode-script-illustration}} /subp fonts&files Mainspace, Scripts in Unicode
Overviews
Overview: templates {{ISO 15924/overview-templates}} /subp list Misplaced Pages
WP-category {{ISO 15924/wp-category}} /subp data Category:Arabic script enwiki Not checked for mainspace watered down concept for minor scripts
Also (doc, userbox, technical, ...)
Documentation {{ISO 15924 templates/doc}} /subp prime documentation Latin script in Unicode (~) Reused in multiple templates
Redirect {{R from ISO 15924 code}} /subp template enwiki Redirects
userbox {{User iso15924}} /subp Userboxes
Related Changes {{Recent changes in Unicode}} /subp pages Unicode, ISO 15924 WP:RELC Related Changes enwiki WikiProject 900+700 P x T
Unicode versions {{Unicode version}} /subp Version number as of Unicode version 13.0 enwiki (new Sep2022)
Wikidata properties
Directionality script directionality (P1406) P1406 {{Infobox}}, ...
Unicode ranges Unicode range (P5949) P5949 {{Infobox}}, ...
ISO English name name (P2561) P2561 Crosscheck only
Modules
Data module module:Unicode data /subp § Functions overview
HTML named entities module:Numcr2namecr /subp
More templates


Template data

This is the TemplateData for this template used by TemplateWizard, VisualEditor and other tools. See a monthly parameter usage report for Template:ISO 15924 script codes and related Unicode data in articles based on its TemplateData.

TemplateData for ISO 15924 script codes and related Unicode data

No description.

Template parameters

ParameterDescriptionTypeStatus
11

no description

Unknownoptional

Background: How is this table composed

Note that a script is not a language. A single script, like the Latin alphabet, is used in many languages. Unicode is only about scripts, not about languages that use that script. Still there may be nuances, like the English versus Polish language in using accents on letters.

Step 1: ISO defines a script

ISO defines and publishes a script in the ISO 15924 list. It defines the Alpha-4 code (Aaaa-Zzzz), the Numeric code (000-999), and the formal Name for each accepted script. Currently there are some 160 scripts defined in this list. Included are scripts like "Mathematical notation (Zmth)" and "Code for undetermined script (a.k.a. Common, Zyyy)". The list is formally maintained and published by ISO, and practically by the Unicode Consortium office. It is published on the Unicode website. Technically, the list is file iso15924.txt.

Step 2: Unicode attaches an Alias name

Then, Unicode (not ISO) maintains a list of Alias script names right next to the ISO-defined scripts, for each script Unicode has encoded. The Alias name is an English name for that script.

So the ISO alpha-4 code gets a unique Alias name by Unicode: Mymr:ISO Name=Myanmar (Burmese), Alias=Myanmar. These Alias names are also present in the definition file iso15924.txt.

Step 3: Usage by Unicode

From that list, Unicode can translate any alpha4-code into the Alias name of the script, and reverse. Unicode does not use the formal ISO name.

A script name is used in the Unicode Name of a character: "U+05BF ֿ HEBREW POINT RAFE".

Per character

In the Unicode database, Unicode adds one single appropriate alpha-4 code to every individual script character. So every letter, punctuation, number and so of a script get that code. Characters used by multiple scripts, such as the period (.), have script code "Zyyy" (Common). The "script" codes for Mathematical and Symbol are not used by Unicode; symbols and mathematical characters have the property script="Unknown".

Then, in the file Scripts.txt, Unicode publishes the Alias script name per character (possibly by a range of characters). A part of that file looks like:

...
0591..05BD    ; Hebrew # Mn   HEBREW ACCENT ETNAHTA..HEBREW POINT METEG
05BE          ; Hebrew # Pd       HEBREW PUNCTUATION MAQAF
05BF          ; Hebrew # Mn       HEBREW POINT RAFE
05C0          ; Hebrew # Po       HEBREW PUNCTUATION PASEQ
05C1..05C2    ; Hebrew # Mn    HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
05C3          ; Hebrew # Po       HEBREW PUNCTUATION SOF PASUQ
...

This datafile defines which scripts are present in Unicode, and what script is at a certain code point.

In a block

Given a block range of code points, then which scripts are present in that block? See {{Unicode blocks}}: this table is constructed by signaling every script that is present as a block (once).

There is no secure relation between a script name and a block name. Some scripts are in a single block, but other scripts are spread amongst several blocks.

See also

ISO 15924 templates
General
ISO-defined
Unicode
Misplaced Pages (enwiki)
Wikidata
Userboxes
Technical
Unicode templates
General
Inline
Character
properties
Code points
Scripts
CJK-specific
Misplaced Pages related
The above documentation is transcluded from Template:ISO 15924 script codes and related Unicode data/doc. (edit | history)
Editors can experiment in this template's sandbox (edit | diff) and testcases (create) pages.
Add categories to the /doc subpage. Subpages of this template.
  1. ^ "Proposed New Scripts". Unicode Consortium. 2018-05-25. Retrieved 2019-09-12.
  2. Michael Everson (1997-09-18). "Proposal to encode Klingon in Plane 1 of ISO/IEC 10646-2".
  3. The Unicode Consortium (2001-08-14). "Approved Minutes of the UTC 87 / L2 184 Joint Meeting".
  4. "Middle East-II, Ancient Scripts" (PDF). 13.0.0. The Unicode Consortiumtitle=Middle-East scripts II. Retrieved 2021-01-28.
  5. "UTR #24: Unicode Script Property". Unicode Consortium. 2023-08-14.
Categories: