Select matches within an LPI score range. High-scoring matches are found in close phylogenetic relatives and are unlikely to have been horizontally transferred. Low-scoring matches are potential HGT candidates.
Note: searches involving > 50 genomes may require several minutes to complete.
Restrict search to matches containing one or more annotation keywords. This search operates in simple "OR" mode, selecting all matches whose protein descriptions contain any of the keywords entered.
Enter or paste keyword(s):
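The "OR" mode described above can be illustrated with a minimal sketch. It is not the site's actual implementation: the function name `matches_any_keyword` is hypothetical, and case-insensitive substring matching is an assumption made for the example.

```python
def matches_any_keyword(description, keywords):
    """Return True if the description contains at least one keyword.

    Assumption: matching is a case-insensitive substring test.
    """
    text = description.lower()
    # Ignore blank entries so an empty input line never matches everything.
    cleaned = [kw.strip().lower() for kw in keywords if kw.strip()]
    return any(kw in text for kw in cleaned)


# Hypothetical protein descriptions, for illustration only.
descriptions = [
    "ABC transporter permease",
    "hypothetical protein",
    "DNA polymerase III subunit alpha",
]
keywords = ["transporter", "polymerase"]

# A match is kept if ANY keyword occurs in its description.
selected = [d for d in descriptions if matches_any_keyword(d, keywords)]
```

Here `selected` keeps the transporter and polymerase entries and drops "hypothetical protein", because adding keywords can only widen the result set in "OR" mode.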
Restrict search to sequences matching one or more amino acid reference sequences. Enter sequences below in FASTA format. This function is currently limited to 10 reference sequences per search.
E-value cutoff: 1e-5 1e-10 1e-50 1e-100 1e-250
Enter or paste sequence(s):
Upload a file:
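Before submitting, it can help to check that pasted or uploaded input stays within the 10-sequence limit. The sketch below shows one way to count FASTA records locally; `parse_fasta` and `check_reference_limit` are hypothetical helper names, not functions provided by this site.

```python
MAX_REFERENCE_SEQUENCES = 10  # limit stated in the form text above


def parse_fasta(text):
    """Parse FASTA text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:].strip(), []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def check_reference_limit(records, limit=MAX_REFERENCE_SEQUENCES):
    """Raise ValueError if more than `limit` reference sequences are given."""
    if len(records) > limit:
        raise ValueError(
            f"{len(records)} sequences supplied; the limit is {limit} per search"
        )
    return records


# Made-up two-record example input.
example = """>ref1 example protein
MKTAYIAKQR
QISFVKSHFS
>ref2 another example
MSDNELKQLL
"""
records = check_reference_limit(parse_fasta(example))
```

Sequence lines that wrap across several rows are joined into one string per record, so the count reflects the number of `>` headers rather than the number of lines.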
Restrict search to matches where potential donor sequences are known to be typical within their own taxa, to ensure sufficient database sequence representation is available to build well-supported phylogenetic trees. Excluding reciprocal LPI scores < 0.75 typically eliminates HGT candidates that cannot be easily corroborated using phylogenetic trees.
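As a rough illustration of the reciprocal LPI cutoff described above, the sketch below drops matches whose reciprocal score falls under 0.75. The record layout and the field name `reciprocal_lpi` are assumptions made for the example and do not reflect the site's internal data model.

```python
def filter_by_reciprocal_lpi(matches, min_reciprocal=0.75):
    """Keep matches whose reciprocal LPI score is at least `min_reciprocal`.

    Assumption: each match is a dict with a numeric "reciprocal_lpi" field.
    """
    return [m for m in matches if m["reciprocal_lpi"] >= min_reciprocal]


# Made-up records, for illustration only.
matches = [
    {"query": "geneA", "reciprocal_lpi": 0.92},
    {"query": "geneB", "reciprocal_lpi": 0.40},
    {"query": "geneC", "reciprocal_lpi": 0.75},
]

kept = filter_by_reciprocal_lpi(matches)
kept_queries = [m["query"] for m in kept]
```

In this example `geneB` is excluded, while `geneC` sits exactly on the cutoff and is retained because only scores below 0.75 are removed.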
Restrict search to query sequences having a particular DNA composition. Percent GC is sometimes used as a simple marker of foreign DNA within a genome, although there may be a wide disparity between individual genes due to other factors. Z-scores have been calculated for each query protein by dividing the absolute difference from the coding sequence mean by the coding sequence standard deviation for the parent genome. Larger z-scores indicate more atypical GC composition.
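The z-score calculation described above can be sketched as follows. This is an illustration under stated assumptions, not the pipeline used to build the database: the function names are hypothetical, and the use of the population standard deviation (`statistics.pstdev`) is an assumption, since the text does not say which estimator was used.

```python
from statistics import mean, pstdev


def gc_percent(seq):
    """Percent G+C of a nucleotide sequence (0.0 for an empty sequence)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def gc_z_scores(gc_values):
    """Absolute z-score of each coding sequence's GC content within one genome.

    z = |gc - genome mean| / genome standard deviation
    Assumption: population standard deviation over all coding sequences.
    """
    mu = mean(gc_values)
    sd = pstdev(gc_values)
    if sd == 0:
        # All coding sequences have identical GC content; nothing is atypical.
        return [0.0 for _ in gc_values]
    return [abs(v - mu) / sd for v in gc_values]


# Made-up GC percentages for three coding sequences of one genome.
z = gc_z_scores([40.0, 50.0, 60.0])
```

Because the absolute difference is used, the scores are never negative: a gene that is unusually GC-poor and one that is unusually GC-rich by the same margin receive the same z-score.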