Select matches within an LPI score range. High scoring matches are found in
close phylogenetic relatives, and unlikely to be horizontally transferred. Low
scoring matches are potential HGT candidates.
Note: searches involving > 50 genomes may require several minutes to complete.
Restrict search to matches containing one or more annotation keywords. This search
operates in simple "OR" mode, selecting all matches whose protein descriptions contain
any of the keywords entered.
Enter or paste keywords(s):
Restrict search to sequences matching one or more amino acid reference sequences.
Enter sequences below in FASTA format. This function is currently limited to
10 reference sequences per search.
Enter or paste sequence(s):
Upload a file:
Restrict search to matches where potential donor sequences are known to be typical within
their own taxa, to ensure sufficient database sequence representation is available to build
well-supported phylogenetic trees. Excluding reciprocal LPI scores < 0.75 typically
eliminates HGT candidates that cannot be easily corroborated using phylogenetic trees.
Restrict search to query sequences a having a particular DNA composition.
Percent GC is sometimes used as a simple marker of foreign DNA within a genome,
although there may be a wide disparity between individual genes due to other factors.
Z-scores have been calculated for each query protein by dividing absolute difference
from the coding sequence mean score by coding sequence standard deviation for the
parent genome. Larger z-scores indicate more atypical GC composition.