Primer Design
Contents
Selecting primers for PCR
*Designing regular PCR primers using Primer3
The RefSeq entry NM_079400 contains the sequence of the D. melanogaster mRNA coding for tap, the target of Poxn. Tap encodes a bHLH protein expressed in larval chemosensory organs and involved in the response to sugar and salt. We wish to amplify by PCR the region coding for the HLH motif.
Search NCBIs RefSeq database to find out the location of the coding sequence in the sequence. |
---|
|
Search NCBIs Gene database to find the location of the Helix-loop-helix domain. |
---|
In the RefSeq record and click on the Gene link.
|
This location applies to the protein sequence so it's expressed in amino acids.
Calculate the corresponding positions in the nucleotide sequence. |
---|
In the nucleotide sequence of the RefSeq mRNA record, the domain is located between position +577 and +745. |
The Primer3 program can help to select suitable primers.
Go to Primer3 [1] or use the nicer interface at Primer3Plus [2] (in the solution below Primer3Plus was used).
Set the location of the primers so that they amplify the HLH motif of tap. |
---|
|
Set the Tm difference of the primers to a maximum of 2 degrees. |
---|
Go to the General Settings tab and set Maximum Tm Difference to 2.0 |
Note the Product Size Ranges box: given these settings Primer3 will first try to find primer pairs that amplify a product with length between 500 and 600 and if this is not possible it will go for a larger product etc...
This is, however, not what we want: we want to amplify the hlh region only so we aim for a much smaller product.
Set the minimum size of the product to the length of the hlh region and the maximum to 300. |
---|
Since the hlh region is 168 bp long we set this as the lower limit for the product size range.
|
Click Pick Primers
In the results page you can see the following:
The program has selected 5 primer pairs, with the best at the top.
For each primer you can find:
- the location on the sequence where the first base of the primer binds (for the forward/left primer this is on the complementary strand)
- its length
- its melting temperature
- its %GC
- the score of an alignment of the primer with the other primer and with itself
The program first considers all possible primers and retains those that satisfy a series of criteria, then considers all possible pairs of these primers and retains those who satisfy a series of criteria. At the bottom of the page you can see information about how many primers were considered and rejected/retained.
Send the first set of primers to PrimerManager. |
---|
Click the Send to Primer3Manager button (red) for the first primer pair.
|
This leads to a form for further evaluating the primers.
Send the primers to BLAST. |
---|
Click the BLAST! link to check the specificity of the primers. |
The primers are sent to a blastn form allowing you to iniate a BLAST search to find potential targets of the primers. By default the nr database is selected. Of course you are only interested in targets in the organism you are working with. Hits in other organisms will not influence your PCR.
Check the specificity of the primers. |
---|
|
Look at the 9 highest scoring hits for the forward primer.
Is the forward primer really identical to 9 different locations of the Drosophila genome ? |
---|
No what you see here are 9 records that all contain the tap sequence (mRNA, chromosome, synthetic constructs...) Remarkably some of them are annotated as the biparous TF, but if you take a closer look you will see that tap and biparous are the same gene. This is the problem when working with the nr database which in contrast to whats it's name lets you believe is not non-redundant at all. |
So it's always a better idea to Blast against Refseq sequences unless they are not available for the organism you work with or you have reasons to believe that they are not complete (i.e. they do not represent the full genome). However, for model organisms, full chromosome sequences are available so you can use Refseq for BLAST.
Repeat the Blast on RefSeq |
---|
Now you see that the redundancy has disappeared. |
To go to the results of the reverse primer change Results for at the top of the page
When evaluating the specificity of the primers, it is especially important to check that the primers are specific at the 3' end because that's the site where the polymerase will attach nucleotides.
So generally, it is recommended to not use primers that contain long identical stretches (>15 nt for primers of 20 nt long) to other regions in the genome apart from the one you want to amplify/measure, and certainly not if these stretches comprise the last nucleotide at the 3' end of the primer.
*Designing regular PCR primers using PrimerBLAST
It's a lot of work to design primers using Primer3/CLC Main Workbench especially if you discover during the BLAST that the suggested primers are not specific and you have to start all over again.
Therefore, we will design primers using the PrimerBLAST tool [3]. PrimerBLAST uses the Primer3 algorithm to find primers allowing you to set all the parameters related to effciency of the primers (e.g. secondary structure and dimer formation, Tm, difference in Tm...). On top of this, PrimerBLAST uses the BLAST algorithm to find specific primers, that will only bind to the intended target. It is the only tool that incorporates BLAST in the primer search.
For some applications, it is vital to have specific primers. This is for instance the case when you use PCR for quantification (qPCR) or when you want to amplify a commonly occurring sequence from the genome.
As an example, we will design primers to amplify a region of RefSeq entry NM_079400, which represents the D. melanogaster mRNA coding for tap. Tap encodes a bHLH protein expressed in larval chemosensory organs and involved in the response to sugar and salt. We wish to amplify the region encoding the Helix-loop-helix domain. In the sequence of the RefSeq record, the domain is located between position +577 and +745.
So, let's go to the PrimerBLAST page.
Find primers to amplify the hlh region, with maximum Tm difference of 2° and maximum product size of 300 bp. |
---|
Fill in the first two sections of the BLAST form as follows:
Note that you do not have to paste the sequence, you can directly enter the RefSeq ID (red). To amplify the helix-loop-helix domain you need a primer at both sides of the domain. To accomplish this you need to define the regions in the sequence that you want to use for selecting primers in by defining a range for both primers as shown in green. Set the product size (blue) and the maximum difference in anneal temperatures of the primers of a pair (purple) to reasonable values (e.g. take the same values as in Primer3). |
Remember that PrimerBLAST also allows you to set the parameters related to specificity of the primers.
Set database to search in and specify a minimum of 4 mismatches, the last 2 nucleotides of the primers have to be specific |
---|
Fill in the first two sections of the BLAST form as follows:
Compare the primers to the Drosophila (green) genome (red). You see that you can easily define the specifity that you want to obtain (blue): you can set
|
Only primers that fulfill the specificity criteria that you have set will be shown.
Expand the Advanced parameters section:
This shows a set of parameters concerning primer specificity. I normally keep these at their default settings. If you want a more stringent search you can decrease the Blast word size. This is the minimum number of consecutive nucleotides that have to exactly match between the primer and a sequence from the database to consider it a hit. The lower you set it, the faster BLAST will consider a sequence from the database a hit.
After that follow the Primer parameters. At the top you have a set of general parameters that are self-explanatory.
Then parameters concerning the likelihood that the primers will form hairpins and dimers. By default Primer-BLAST uses secondary structure alignment to predict these characteristics but you can choose more advanced and accurate thermodynamic models. Calculations will be slow when you choose these models. Basically, instead of looking at individual nucleotide matches you use the stability of nucleotide matches, e.g. the stability of CT hybridized to GA is different from that of CA hybridized to GT.
The scores for self and pair complementarity in the default secondary structure model are local alignment scores: the scoring system uses 1.00 for complementary bases, -1.00 for a mismatch, and -2.00 for a gap. So a score of 8 corresponds to 8 consecutive matches or 9 aligned nucleotides with 8 matches and 1 mismatch...
Perform the search using default settings for the advanced parameters. |
---|
Click the Get Primers button (purple) to start the search. |
The tool will first ask you to indicate the intended target. Thereafter it will return 5 specific high quality primer pairs:
Knowing that PrimerBLAST uses the same algorithm to pick primers as Primer3 [4], PrimerBLAST will save you a lot of work and time when you have to design primers.
Designing primers for amplifying microbial genomes for sequencing using Primer Blast
In Ugene get 20 complete norovirus genomes from NCBI
Tip:
- try searching the “nucleotide” database, using norovirus in the Organism field AND “complete genome” in All Fields
- select all 20 results and Download in fasta format
- select all sequences (the shift key is your best friend), right click and export as norovirus.fa
In Ugene make a MSA of these sequences (they're long so ClustalO might be a good choice) and save as norovirus.aln
select the consensus, right click and Copy consensus with gaps
Go to Primer-Blast and paste the consensus in the PCR template box. We want to see 10 primer combinations that generate a product of 1000-2000 bp.
Search in the Genomes (chromosomes from all organisms) database and make sure the primers do not amplify human DNA
Designing primers for qPCR
Exercises on qPCR primer design can be found on our primer design wiki page.
Analyzing primer characteristics using OligoAnalyzer
OligoAnalyzer is a tool implemented by ID&T (who sell primers) to check the characteristics of your primers.
How to check the Tm of a primer ? |
---|
Copy the sequence of the primer in the Sequence box, adjust the concentrations to these that are used by Primer3 (see slides) and click Analyze:
|
Predicted melting temperature might be slightly different depending on the tool used. There are many methods to predict Tm and each method will give a different result. Concentrations of primers and ions have an enormous impact on the Tm prediction. Don't worry about the differences: the only way to determine Tm values is by doing actual PCR. Just make sure the difference in Tm between the two primers is small.
How to check the self-complementarity of a primer in OligoAnalyzer ? |
---|
Click Self-Dimer:
It will align the primer to itself looking for the highest scoring alignment and counting the number of matches in that alignment, this is the score that is returned. |
Complementarity will only cause problems when located at the 3' end of the primer. Since polymerases add bases at the 3’ end of the primer, the primer dimer can only be extended when it occurs at the 3'end of the primer giving rise to aspecific products. ID&T recommends to avoid complementary stretches of more than 2 bp at the 3’ end.
However, even if the primer dimer cannot be extended, it could interfere when its formation competes with the annealing of primer and target. This is only the case when the stability of the dimer is similar to the stability of a perfectly matched primer-target duplex. The stability of the perfectly matched duplex is shown as a Maximum Delta G at the top of the results. So non-extendable dimer structures that are much shorter than the intended duplex are not going to disrupt the PCR reaction.
It is advised to review all possible interactions between primers so both Self-Dimer (primers binding to themselves) and Hetero-Dimer (primers binding to each other) interactions between primers are examined.
How to check the likelihood that the primers will bind to each other ? |
---|
Click Hetero-Dimer:
This opens a text box to enter the second primer. Click Analyze. |
Again, it's only problematic when you have a stretch of 3 matching nucleotides at the 3'end of one of the primers.
If you have complementarity at the 3'end of your primers you might consider taking a look at the other pairs of primers that Primer3 or PrimerBLAST suggests.
Obtaining the reverse complement of a sequence
Use this tool to obtain the reverse complement of a sequence.
In silico PCR in the UCSC Browser
You can visualize a PCR product (and additional annotation) in the UCSC Genome Browser using the In Silico PCR tool.
Select the most recent version of the human genome and paste the sequences of forward and reverse primers in their respective boxes. Click submit
This returns the location and the sequence of the PCR product:
Clicking the link of the location visualizes the PCR product in the UCSC genome browser. If necessary you can emove unnecessary tracks by right clicking the tack's name and selecting hide.
if necessary, add tracks showing relevant annotations like positions of SNPs by scrolling down to the Variation section. Select the tracks you want to visualize and click refresh.
Setting the SNPs track from hide to full or dense shows the SNPs in the browser. Zoom in on the primer in the same way as in the Ensembl Genome Browser.
You will often see that a primer matches SNPs but as long as they are not located near the 3'end of the primer it does not pose a problem.
References: