[BBF Standards] Sequences and Formats and Devices, oh my! [Re: data exchange issue 1: Abstraction]
Ralph Santos
rasantos at lbl.gov
Tue Feb 26 09:35:00 EST 2008
Ah, now I see what you mean by format, and I see I misunderstood what
you meant by a device the first time around as well...
I think the way the community talks about and thinks about biobricks
works against basing a biobrick definition in its sequence. Look at the
parts registry itself. You don't look for a biobrick by pasting a
sequence into a search box like on a BLAST web page, you navigate it by
having a particular function in mind and searching on that. The whole
concept is about packaging something a particular function or behavior
in an easily handled format.
But one has to consider how one defines a biobrick and how one handles
biobrick ID's as two separate questions.
I think it's easiest to think of a Biobrick as a transformation of some
sort which takes a pre-existing functional unit of DNA and packages it,
so one can think about the 'raw' and 'packaged' sequence of a biobrick.
Another way to talk about it is one can borrow from email terminology
and think of the format as the 'envelope' of the biobrick and the
functional part can be thought of as the body.
Actually, there's a possibly useful analogy there. In email systems,
the programs are divided into MTA's and MUA's, message transport agents
and message user agents. The message envelope (the headers) are free to
be handled and manipulated by the mail system, but the responsibility of
MTA's is to keep the message body unchanged. MUA's are free to
manipulate the message body for reformatting, printing, etc.
Having said all that, there's a complicating real-world factor, which is
the physical existence of biobricks in old formats in freezers. They'll
be associated with the current ID. It seems there are several options
to resolve the matter:
(1) Associate the ID numbers with the precise sequences
* pro: ID's of classic biobricks keep their identity
* con: same function registered under multiple ID's for repackagings in
different formats
* con: forces folks who've repackaged existing bricks in non-classic
formats to reidentify their samples
(2) Biobrick ID's associated with function
* pro: functional classifications of biobricks remain the same
* con: will lead to mixups between people because formats might not match
I think (1) is the most pragmatic alternative because although we think
about biobricks functionally, one must still think about pre-existing
biobrick ID's written on tubes in freezers and buried in lab notebooks.
One could salvage or recapture (2) by coming up with a new "function
code" to ascribe an ID purely to a biobrick's function or behavior, but
at least at this point the whole thing is new enough and complicated
enough that we might be better off waiting a while before we try to
precisely codify biobrick functions.
---ralf
More information about the Standards
mailing list