
NAME

       frog  -  Dutch  morpho-syntactic  analyzer,  IOB chunker and dependency
       parser

SYNOPSIS

       frog [options]

       frog -t test-file

DESCRIPTION

       frog is an integration of memory-based natural language processing
       (NLP) modules developed for Dutch. The current version tokenizes,
       tags, lemmatizes, and morphologically segments word tokens in Dutch
       text files, adds IOB chunks, and assigns a dependency graph to each
       sentence.
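
       The processing chain above can be driven from a script. The following
       is a minimal Python sketch, assuming a frog binary is available on
       PATH. The helper names build_frog_command and run_frog are
       hypothetical and not part of frog; only the -t, -o and --skip flags
       documented under OPTIONS are used.

```python
import shutil
import subprocess

# Letters accepted by --skip according to the OPTIONS section.
SKIP_LETTERS = set("aclmnpt")


def build_frog_command(input_file, output_file=None, skip=""):
    """Return an argv list for frog (hypothetical helper, not part of frog)."""
    bad = sorted(set(skip) - SKIP_LETTERS)
    if bad:
        raise ValueError("unknown --skip letters: " + "".join(bad))
    argv = ["frog", "-t", input_file]
    if output_file is not None:
        argv += ["-o", output_file]
    if skip:
        argv.append("--skip=" + skip)
    return argv


def run_frog(input_file, output_file=None, skip=""):
    """Run frog if it is installed; return None when no binary is found."""
    if shutil.which("frog") is None:
        return None
    argv = build_frog_command(input_file, output_file, skip)
    return subprocess.run(argv, check=True)
```

       Building the argument list separately from running it keeps the flag
       handling easy to check without frog installed.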

OPTIONS

       -c <configfile>
              set the configuration using 'configfile'

       -d <level>
              set debug level.

       -e <encoding>
              set input encoding. (default UTF8)

       -h
              give some help

       --keep-parser-files=[yes|no]
              keep the intermediate files from the parser. Last sentence only!

       -n
              assume the input file holds one sentence per line

       -o <file>
              send output to 'file' instead of stdout. Defaults to the name of
              the inputfile with '.out' appended.

       --outputdir <dir>
              send  all  output  to 'dir' instead of stdout. Creates filenames
              from the inputfilename(s) with '.out' appended.

       --skip=[aclmnpt]
              skip parts of the process: Tokenizer (t), Chunker (c), Lemmatizer
              (l), Morphological Analyzer (a), Multi-Word unit (m),
              Named-Entity recognizer (n) or Parser (p)

       -Q
              Enable quote detection in the tokenizer. May wreak havoc!

       -S <port>
              Run a server on 'port'

       -t <file>
              process 'file'

       -x <xmlfile>
              process 'xmlfile', which is expected to be in FoLiA format. If
              'xmlfile' is empty and --testdir=<dir> is provided, all '.xml'
              files in 'dir' will be processed as FoLiA XML.

       --textclass=<cls>
              When  -x  is  given,  use  'cls'  to  find  text  in  the  FoLiA
              document(s).

       --testdir=<dir>
              process all files in 'dir'. When the input mode is XML, only
              '.xml' files are taken from 'dir'. See also --outputdir.

       --tmpdir=<dir>
              location to store intermediate files. Default /tmp.

       --threads=<n>
              use a maximum of 'n' threads. The default is to take whatever is
              needed. In server mode, frog always runs on 1 thread.

       -V or --version
              show version info

       --xmldir=<dir>
              generate  FoLiA  XML  output  and  send  it  to  'dir'.  Creates
              filenames from the inputfilename with '.xml'  appended.  (Except
              when it already ends with '.xml')

       -X <file>
              generate FoLiA XML output and send it to 'file'. Defaults to the
              name of the inputfile(s) with '.xml' appended. (Except  when  it
              already ends with '.xml')

       --id=<id>
              When -X is given, use 'id' as the ID of the FoLiA document.
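
       The default file names described for -o, --outputdir, -X and --xmldir
       follow a simple rule that can be sketched in Python. This mirrors the
       wording of this page only, not frog's source code. The function names
       are hypothetical, and placing just the base name of the input file
       inside 'dir' is an assumption.

```python
import os


def default_out_name(input_name, output_dir=None):
    """Input name with '.out' appended, optionally placed in output_dir."""
    name = input_name + ".out"
    if output_dir is not None:
        # Assumption: only the base name is reused inside 'dir'.
        name = os.path.join(output_dir, os.path.basename(name))
    return name


def default_xml_name(input_name, xml_dir=None):
    """Input name with '.xml' appended unless it already ends with '.xml'."""
    if input_name.endswith(".xml"):
        name = input_name
    else:
        name = input_name + ".xml"
    if xml_dir is not None:
        # Assumption: only the base name is reused inside 'dir'.
        name = os.path.join(xml_dir, os.path.basename(name))
    return name
```

       For example, default_out_name("text.txt") gives "text.txt.out", while
       default_xml_name("doc.xml") leaves "doc.xml" unchanged.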

BUGS

       likely

AUTHORS

       Maarten van Gompel proycon@anaproy.nl

       Ko van der Sloot Timbl@uvt.nl

       Antal van den Bosch Timbl@uvt.nl

SEE ALSO

       ucto(1)

                                2012 January 31                        frog(1)


