BLAST (Basic Local Alignment Search Tool) is an algorithm and suite of programs used to perform sequence similarity searches between a query sequence and sequences in a selected database, such as GenBank. It is commonly used to compare nucleotide and protein sequences by identifying regions of local similarity. BLAST operates by using
heuristic methods, taking shortcuts in order to reach an answer faster. BLAST has to read in an user-inputted query sequence then the sequence from the selected database and search parameters. It then analyzes the query sequence for
low complexity and repeats within this sequence in order to produce a set of characters that’ll be used to compare the query sequence to sequences within the database. First, the database searches for short matching sequences using the character set, which are then extended, not allowing gaps to find initial alignments. If these gap free alignments reach a certain score threshold, they are used to begin a gapped extension that estimates the alignment’s score and length without yet calculating insertions or deletions, which are more computationally expensive. In order to avoid a computationally expensive process, only gapped alignments that meet a specified score are kept, with lower scored alignments being discarded. Finally, BLAST performs traceback, meaning the retained alignments are fully extended again, this time including insertions and deletions as well as more sensitive parameters to produce the final, detailed alignments. Input sequences are typically provided in FASTA or GenBank format, and results can be returned in multiple formats including HTML, XML, and plain text. The default output from the webpage presents results in table format, along with graphs of matched sequences in a separate tab, statistical significance values, and detailed alignments between the query and subject sequences.
BLAST programs There are multiple BLAST programs available through the NCBI BLAST homepage, which differ in the types of sequences they analyze. The Image below displays one such program, blastp, displaying the interface's general layout. These Programs include: •
blastn: Compares nucleotide sequences against nucleotide sequences in a selected database, often used for identifying more distant relationships. •
blastp: Compares protein sequences against protein sequences in a selected database; its algorithm forms the basis for other BLAST programs such as blastx and tblastn. •
blastx: Compares translated nucleotide sequences against protein sequences in a selected database. •
tblastn: Compares protein sequences against translated nucleotide sequences in a selected database. ==Entrez==