cellxmer

Convert a Malva Index into a cell-by-chunk AnnData object that can be used to cluster cells directly from sequence composition

usage: malva cellxmer [-h] [--index-in INDEX_IN] --h5ad-out H5AD_OUT [--save-kmer] [--kmer-min KMER_MIN] [--kmer-max KMER_MAX] [--bin-size BIN_SIZE] [--w-size W_SIZE] [--num-buckets NUM_BUCKETS]
                      [--chunk-size CHUNK_SIZE] [--tmp-dir TMP_DIR]

Named Arguments

--index-in
Valid directory where the malva index (and metadata) is located.

The directory must contain the file malva_index.h5. Otherwise, an exception will be thrown.

--h5ad-out

Directory where the cell-by-mer object will be stored.

--save-kmer

When specified, also saves the cell-by-kmer matrix together with the cell-by-bucket matrix

Default: False

--kmer-min

k-mers occurring less than –kmer-min times are ignored

Default: 10

--kmer-max

k-mers occurring more than –kmer-max times are ignored

Default: 10000

--bin-size

Aggregates spatial units at the final AnnData file to reduce dimensions of final cell-by-kmer matrix

Default: 0

--w-size

W-mer size for filtering (default: 16, must be <= k-mer size)

Default: 16

--num-buckets

Number of buckets for clustering k-mers (default: 100,000)

Default: 100000

--chunk-size

Number of k-mers to process in each chunk (default: 1,000,000)

Default: 1000000

--tmp-dir

Directory for temporary files (default: system temp directory)