pyani.scripts.subcommands.subcmd_download module¶
Provides the download subcommand for pyani.
-
class
pyani.scripts.subcommands.subcmd_download.Skipped[source]¶ Bases:
tupleConvenience struct for holding information about skipped genomes.
-
accession¶ Alias for field number 1
-
dltype¶ Alias for field number 5
-
organism¶ Alias for field number 2
-
strain¶ Alias for field number 3
-
taxon_id¶ Alias for field number 0
-
url¶ Alias for field number 4
-
-
pyani.scripts.subcommands.subcmd_download.configure_entrez(args: argparse.Namespace) → Optional[str][source]¶ Configure Entrez email, return API key.
Parameters: args – Namespace, command-line arguments Returns None if no API key found
-
pyani.scripts.subcommands.subcmd_download.dl_info_to_str(esummary, uid_class) → str[source]¶ Return descriptive string for passed download data.
Parameters: - esummary –
- uid_class –
-
pyani.scripts.subcommands.subcmd_download.download_data(args: argparse.Namespace, api_key: Optional[str], asm_dict: Dict[str, List[T]]) → Tuple[List[T], List[T], List[T]][source]¶ Download the accessions indicated in the passed dictionary.
Parameters: - args – Namespace of command-line arguments
- api_key – str, API key for NCBI downloads
- asm_dict – dictionary of assembly UIDs to download, keyed by taxID
Returns lists of information about downloaded genome classes and labels, and a list of skipped downloads (as Skipped objects).
-
pyani.scripts.subcommands.subcmd_download.download_genome(args: argparse.Namespace, filestem: str, tid: str, uid: str, uid_class)[source]¶ Download single genome data to output directory.
Parameters: - args – Namespace, command-line arguments
- filestem – str, output filestem
- tid – str, taxonID
- uid – str, assembly UID
- uid_class –
-
pyani.scripts.subcommands.subcmd_download.extract_genomes(args: argparse.Namespace, dlstatus: pyani.download.DLStatus, esummary) → None[source]¶ Extract genome files in passed dlstatus.
Parameters: - args – Namespace of command-line arguments
- dlstatus –
- esummary –
-
pyani.scripts.subcommands.subcmd_download.get_tax_asm_dict(args: argparse.Namespace) → Dict[str, List[T]][source]¶ Return dictionary of assembly UIDs to download, keyed by taxID.
Parameters: args – Namespace of command-line arguments
-
pyani.scripts.subcommands.subcmd_download.hash_genomes(args: argparse.Namespace, dlstatus: pyani.download.DLStatus, filestem: str, uid_class) → Tuple[str, str][source]¶ Hash genome files in passed dlstatus.
Parameters: - args – Namespace of command-line arguments
- dlstatus –
- filestem – str, filestem for output
- uid_class –