GNU.WIKI: The GNU/Linux Knowledge Base

  [HOME] [PHP Manual] [HowTo] [ABS] [MAN1] [MAN2] [MAN3] [MAN4] [MAN5] [MAN6] [MAN7] [MAN8] [MAN9]

  [0-9] [Aa] [Bb] [Cc] [Dd] [Ee] [Ff] [Gg] [Hh] [Ii] [Jj] [Kk] [Ll] [Mm] [Nn] [Oo] [Pp] [Qq] [Rr] [Ss] [Tt] [Uu] [Vv] [Ww] [Xx] [Yy] [Zz]


       unicode - command line unicode database query tool


       unicode [options] string


       This manual page documents the unicode command.

       unicode is a command line unicode database query tool.


       -h     --help

              Show help and exit.

       -x     --hexadecimal

              Assume string to be a hexadecimal number

       -d     --decimal

              Assume string to be a decimal number

       -o     --octal

              Assume string to be an octal number

       -b     --binary

              Assume string to be a binary number

       -r     --regexp

              Assume string to be a regular expression

       -s     --string

              Assume string to be a sequence of characters

       -a     --auto

              Try to guess type of string from one of the above (default)


              Maximal  number of codepoints to display, default: 20; use 0 for


              I/O character set. For maximal pleasure, run  unicode  on  UTF-8
              capable  terminal  and  specify  IOCHARSET  to be UTF-8. unicode
              tries to guess this value from your locale, so with properly set
              up locale, you should not need to specify it.


              Convert  numerical  arguments  from  this  encoding, default: no
              conversion.  Multibyte encodings are supported. This is  ignored
              for non-numerical arguments.


              Show  hexadecimal reprezentation of displayed characters in this
              additional charset.


              USE_COLOUR is one of on off auto

              --colour=on will use ANSI colour codes to colourise the output

              --colour=off won't use colours.

              --colour=auto will test if standard output is  a  tty,  and  use
              colours only when it is.

              --color is a synonym of --colour

       -v     --verbose

              Be  more verbose about displayed characters, e.g. display Unihan
              information, if available.

       -w     --wikipedia

              Spawn browser pointing to Wikipedia entry about the character.


              List (approximately) all known encodings.


       unicode tries to guess the type of an argument. In particular,  if  the
       arguments  looks  like  a valid hexadecimal representation of a Unicode
       codepoint, it will be considered to be such. Using

       unicode face

       will display information about U+FACE CJK COMPATIBILITY IDEOGRAPH-FACE,
       and  it  will not search for 'face' in character descriptions - for the
       latter, use:

       unicode -r face

       For example, you can use any of the following  to  display  information
       about  U+00E1 LATIN SMALL LETTER A WITH ACUTE (á):

       unicode 00E1

       unicode U+00E1

       unicode á

       unicode 'latin small letter a with acute'

       You  can  specify  a range of characters as argumets, unicode will show
       these  characters  in  nice  tabular  format,   aligned   to   256-byte
       boundaries.  Use two dots ".." to indicate the range, e.g.

       unicode 0450..0520

       will  display  the  whole  cyrillic  and hebrew blocks (characters from
       U+0400 to U+05FF)

       unicode 0400..

       will display just characters from U+0400 up to U+04FF

       Use --fromcp to query codepoints from other encodings:

       unicode --fromcp cp1250 -d 200

       Multibyte encodings are supported: unicode --fromcp big5 -x aff3

       and multi-char strings are supported, too:

       unicode --fromcp utf-8 -x c599c3adc5a5


       Tabular format does not deal well with full-width,  combining,  control
       and RTL characters.




       Radovan Garabík <garabik @>

                                  2003-01-31                        UNICODE(1)

  All copyrights belong to their respective owners. Other content (c) 2014-2018, GNU.WIKI. Please report site errors to
Page load time: 0.097 seconds. Last modified: November 04 2018 12:49:43.