![]() OGHAM REVERSED FEATHER MARK (U+169C, Pe): ᚜ TIBETAN MARK ANG KHANG GYAS (U+0F3D, Pe): ༽ TIBETAN MARK ANG KHANG GYON (U+0F3C, Ps): ༼ TIBETAN MARK GUG RTAGS GYAS (U+0F3B, Pe): ༻ TIBETAN MARK GUG RTAGS GYON (U+0F3A, Ps): ༺ RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK (U+00BB, Pf): » LEFT-POINTING DOUBLE ANGLE QUOTATION MARK (U+00AB, Pi): « ![]() (note: on some Bash implementations UTF-8 printing has a bug that causes it to print U+00AB "«" and U+00BB "»" as "?", and some terminals don't have the ability to render all characters correctly.) while IFS=' ' read number name category rest Here's a quick Bash script to get this information, and its output. Those you are going to have to find by hand there is no pre-determined listing of those. And some character, like, are used as brackets in some contexts (such as HTML/XML), while they are considered math symbols ( Sm) in UnicodeData.txt. are indicated with Pi and Pf (initial and final punctuation), so you might want to include those as well. Note that not all characters that you consider brackets may be listed for instance, quotation marks (including "«»"). Look for those character, and you'll find what you're looking for. Open and close punctuation characters are denoted with Ps (punctuation start) and Pe (punctuation end) in the General_Category field (the third field, delimited by ). The primary information is contained in UnicodeData.txt. There is a plain-text database of information about every Unicode character available from the Unicode Consortium the format is described in Unicode Annex #44.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |