Table of Contents
nlpcd - utility for starting all nlpc tools
nlpcd [-dhmv] [-a path] [-f path] [-p path] [-r path] [LONG_OPTS]
The nlpcd utility starts all tools within the nlpcrawl(1)
suite. It’s
not suitable for highly-customised environments (yet), but suffices to
begin a generic scan sequence. The arguments are as follow:
- -a path
- The archive file path. See the nlpcrawl(1)
FILES section.
- -d
- Enable debugging. Use multiple times for more verbosity.
- -h
- Print a help message and exit.
- -m
- Enable monitoring mode in nlpc-fetch(1)
. This disables all
other debugging.
- -v
- Print version information and exit.
- -f path
- The cache file path. See the nlpcrawl(1)
FILES section.
- -p path
- The database environment path. See the nlpcrawl(1)
FILES section.
- -r path
- The REP entity path. See the nlpcrawl(1)
FILES section.
In addition, the following long arguments may be used:
- --filter-auth string
-
Accept only pages with the given URI scheme and authority.
- --filter-charset string
-
Accept only pages with the given character-set encodings (may be
a space-separated list).
- --filter-dict string
-
Accept only pages with at least a 10% hit rate in a dictionary.
The dictionary must be UTF-8 encoded and have a Unix-style endof-line.
It must have one word per line.
- --filter-lang string
-
Accept only pages with the given language.
- --filter-scheme string
-
Accept only pages with the given URI scheme.
nlpcrawl(1)
Table of Contents