Home | Business News | Browse by Publication | X | XML Journal

An introduction to BSML: enabling the bioinformatics revolution. (XML Labs).

Publication: XML Journal
Publication Date: 01-MAR-03
Format: Online - approximately 3113 words
Delivery: Immediate Online Access
Full Article Title: An introduction to BSML: enabling the bioinformatics revolution. (XML Labs).(Bioinformatic Sequence Markup Language)

Article Excerpt
Over the past two decades, the life sciences industry I has taken a dramatic leap into an online, collaborative world. Tasks and activities that are commonplace today were either extremely difficult or outright impossible just 20 years ago.

One key reason for this shift was the abundance of genomic sequence data, the sequence of base pairs that make up an organism's DNA. A vast amount of this data was made possible by advances in genomic sequencing techniques, such as the shotgun and clone contig techniques made famous to the general public during the Human Genome Project. In order to provide access to all of this publicly available data, the National Institute of Health created GenBank, an online repository of genomic information. When GenBank first opened in 1982 there were just 606 sequences available. Today, there are nearly 15 million sequences available--the number has doubled nearly every year since 1992!

With this vast amount of publicly available information came the need to share, organize, and analyze it. Bioinformatics was the term coined to describe this science. While bioinformatics is not an extremely well-defined term, in general it refers to three broad areas: the collection of new genomic data, the analysis and interpretation of existing data, and the development of new algorithms for analysis.

While the Interact has always been an important tool for researchers, it was a key catalyst for the bioinformatics revolution. Through the Interact, scientists we/e able to publish and analyze data in a truly collaborative fashion. One group of researchers could analyze a specific genome in a particular way to gain insight as to its function. They could then describe their new understanding of the genome sequence by annotating it and sharing that annotation. Research groups could then leverage the ongoing work of other groups through public data repositories. However, as with many areas on the Internet, a common format was required in order to efficiently share information.

Entrez XML

Early on, flat file formats were the de facto standard for information exchange. However, the larger the variety of information, the more difficult it became to capture it all. No longer was information sharing limited to the familiar raw genomic strings of A, T, C, and G. Broader annotations, research protocols, published research results, metabolic pathways, and more now needed to be shared and cross-referenced.

From its inception GenBank standardized on a format known as ASN.1. In fact, the ASN.1 format is still used by several large government organizations, including GenBank, as a default format. It is an object-oriented, text-based language that in many ways resembles XML. However, with the advent of XML it became apparent that XML would become the backbone of a future standard. Not only could XML represent arbitrarily complex information,...



More articles from XML Journal
XBRL for business reporting: positioned to become the global standard...., March 01, 2003
Microsoft BizTalk Server 2002 part 1: an out-of-box integration soluti..., March 01, 2003
Tamino XML server provides extended support. (XML News).(Brief Article..., March 01, 2003
IBM announces Multimodal beta. (XML News).(Brief Article), March 01, 2003
XAware announces eight new industry partnerships. (XML News).(Brief Ar..., March 01, 2003

Looking for additional articles?
Search our database of over 3 million articles.

Looking for more in-depth information on this industry?
Search our complete database of Industry & Market reports by text, subject, publication name or publication date.

About Goliath
Whether you're looking for sales prospects, competitive information, company analysis or best practices in managing your organization, Goliath can help you meet your business needs.

Our extensive business information databases empower business professionals with both the breadth and depth of credible, authoritative information they need to support their business goals. Whether it be strategic planning, sales prospecting, company research or defining management best practices - Goliath is your leading source for accurate information.