[BBF Standards] Tom Knight RFC (plain text)

Drew Endy endy at MIT.EDU
Tue Jul 8 16:15:50 EDT 2008


Just a quick resend of Tom Knight's RFC as plain text.

******

Request for comments

Biobrick assembly standard modifications

8 July 2008

Tom Knight


Background:

Over the past several years the original Biobrick assembly standard  
has proven to be a useful DNA assembly technique.  Despite this, a  
significant flaw has been the composition of the mixed base scar, T  
ACTAGA G.  Since this scar is 8 bp long, it makes protein fusions,  
aligned on three base codon boundaries, quite difficult.  Ira Phillips  
at the Silver Lab worked around this problem by ignoring the flanking  
T and G sites (inserted for protection against methylation issues) and  
using the mixed site ACTAGA.  This resulted in the amino acid sequence  
Thr-Arg inserted into fusion protein designs, two amino acids with  
significant chemical difficulties in many contexts.

Chris Anderson at Berkeley worked around this problem in a different  
way, by adopting a  new restriction enzyme set, BglII (prefix, AGATCT  
site) and BamHI (suffix, GGATCC site).  These enzymes are insensitive  
to methylation, and produce a scar GGATCT (Gly-Ser).  The Gly-Ser  
amino acids are near ideal for most protein fusion work, and the  
enzymes are cheap and effective.  Unfortunately, neither of these  
enzymes can be heat inactivated, making automated assembly with them  
substantially more difficult.

Another difficulty with these enzymes is the frequency of the BamHI  
and BglII sites in many natural DNA sequences.  For example, in the E.  
coli genome the BamHI average fragment length is 9,000, while the  
average fragment length of XbaI fragment is 120,000.  This reflects  
the relative rarity of the CTAG sequence in E. coli genomic DNA (for  
reasons poorly understood).  The high frequency of sites causes two  
problems.  First, making new Biobricks from existing genomic DNA  
becomes substantially more difficult.  Second, the frequent occurrence  
of these sites in contaminating genomic DNA in minipreps results in  
short fragments which can replace desirable parts in assembly  
reactions, yielding incorrect products.

Proposal:

Two additional restriction enzymes exist with a CTAG overhang: AvrII  
(CCTAG site) and NheI (GCTAGC site).  AvrII cannot be heat killed, and  
produces poorer codon choices than NheI.  I propose that we  
restructure the cloning site and flanking sites of Biobrick parts with  
the following structure:

……<EcoRI>……<SpeI> Part <NheI>…..<PstI>…..

The part would be flanked by bare SpeI and NheI sites.  The mixed site  
formed by assembly of these fragments, using standard approaches,  
would be GCTAGT, coding for Ala-Ser.  The Ala-Ser amino acids are  
almost as fusion-friendly as the Gly-Ser of the Anderson fusion  
technique.

The NheI enzyme can be heat killed, and thus is more amenable to  
automated assembly processes.

The rarity of the NheI site in E. coli genomic DNA means that many  
fewer fragments accidentally cut from genomic DNA contamination of  
minipreps will clone in place of the desired part.

Transition issues:

We would need to construct new cloning vectors with the new cloning  
site.  Parts would need to be recloned into the new vectors, probably  
using PCR with new primers.  Manual assembly of parts mixed between  
old and new formats would likely be possible in many cases as an  
interim solution, since the parts retain a common CTAG overhang.

We should rethink the use of the EcoRI enzyme for the prefix outside  
cutter.  There are likely more robust enzymes usable.

We should rethink the need/desirability of the NotI sites between the  
outside and inside restriction enzyme sites.  Some DNA fragment is  
necessary there, but it need not be that sequence, and the two  
sequences need not be identical.

Plan:

1)    Circulate this document for comments and blunder stopping

2)    Analyze the frequency of NheI sites in existing registry parts

3)    Test the efficiency of NheI and any other recommended enzymes

4)    Test for the ability to heat kill the enzymes

5)    Design automated programs to assist in the primer design for  
transition

6)    Choose which parts are worth transitioning

7)    Design desirable part collections for protein fusion work


  


More information about the Standards mailing list