pyani classify

The classify subcommand identifies cliques (k-complete) graphs of genomes from a larger set. This is helpful for circumscribing potentially meaningful groups of genomes that can not be described using traditional taxonomy.

usage: pyani.py classify [-h] [-l LOGFILE] [-v] [--debug] [--disable_tqdm]
                     [--version] [--citation] -o OUTDIR --run_id RUN_ID
                     [--dbpath DBPATH] [--cov_min COV_MIN] [--id_min ID_MIN]
                     [--min_id MIN_ID] [--max_id MAX_ID]
                     [--resolution RESOLUTION] [--show_all]

Flagged arguments

--cov_min COV_MIN
Minimum percent coverage required to draw an edge between two nodes (genomes). (default: 0.5)
--dbpath DBPATH
Path to the location of the local pyani database to be used. Default: .pyani/pyanidb
--disable_tqdm
Disable the tqdm progress bar while the download process runs. This is useful when testing to avoid aesthetic problems with test output. (default: False)
--formats FORMATS
Graphics output format(s); more than one can be specified. Valid options are: (pdf/png/svg/jpg). (default: png)
--id_min ID_MIN
Minimum percent identity required to draw an edge between two nodes (genomes). (default: 0.8)
-l LOGFILE, --logfile LOGFILE
Provide the location LOGFILE to which a logfile of the download process will be written.
--max_id MAX_ID
Maximum identity threshold to test. (default: None)
--method {seaborn,mpl,plotly}
Graphics method to use for plotting. (default: seaborn)
--min_id MIN_ID
Minimum identity threshold to test. (default: None)
-o OUTDIR, --outdir OUTDIR
Path to a directory where comparison output files will be written.
--resolution RESOLUTION
Number of identity thresholds to test. (default: 0.0001)
--run_id RUN_ID
Unique database ID of the run to be plotted.
--show_all
Report all intervals in log. (default: only intervals where all subgraphs are k-complete) (default: False)
-v, --verbose
Provide verbose output to STDOUT.