The EMBL Nucleotide Sequence Database -- Kanz et al. 33 (Supplement 1): D29 -- Nucleic Acids Research

The EMBL Nucleotide Sequence Database

Carola Kanz^*, Philippe Aldebert, Nicola Althorpe, Wendy Baker, Alastair Baldwin, Kirsty Bates, Paul Browne, Alexandra van den Broek, Matias Castro, Guy Cochrane, Karyn Duggan, Ruth Eberhardt, Nadeem Faruque, John Gamble, Federico Garcia Diez, Nicola Harte, Tamara Kulikova, Quan Lin, Vincent Lombard, Rodrigo Lopez, Renato Mancuso, Michelle McHale, Francesco Nardone, Ville Silventoinen, Siamak Sobhany, Peter Stoehr, Mary Ann Tuli, Katerina Tzouvara, Robert Vaughan, Dan Wu, Weimin Zhu and Rolf Apweiler

EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK

^* To whom correspondence should be addressed. Tel: +44 1223 494453; Fax: +44 1223 494468; Email: ckanz{at}ebi.ac.uk

Received September 14, 2004; Revised October 6, 2004; Accepted October 14, 2004

ABSTRACT

TOP
ABSTRACT
INTRODUCTION
SUBMISSIONS TO THE EMBL...
DATA IN THE EMBL...
ACCESSING THE EMBL NUCLEOTIDE...
NEW DEVELOPMENTS
CITING THE EMBL NUCLEOTIDE...
CONTACTING THE EMBL DATABASE
REFERENCES

The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl),maintained at the European Bioinformatics Institute (EBI) nearCambridge, UK, is a comprehensive collection of nucleotide sequencesand annotation from available public sources. The database ispart of an international collaboration with DDBJ (Japan) andGenBank (USA). Data are exchanged daily between the collaboratinginstitutes to achieve swift synchrony. Webin is the preferredtool for individual submissions of nucleotide sequences, includingThird Party Annotation (TPA) and alignments. Automated proceduresare provided for submissions from large-scale sequencing projectsand data from the European Patent Office. New and updated datarecords are distributed daily and the whole EMBL NucleotideSequence Database is released four times a year. Access to thesequence data is provided via ftp and several WWW interfaces.With the web-based Sequence Retrieval System (SRS) it is alsopossible to link nucleotide data to other specialist molecularbiology databases maintained at the EBI. Other tools are availablefor sequence similarity searching (e.g. FASTA and BLAST). Changesover the past year include the removal of the sequence lengthlimit, the launch of the EMBLCDSs dataset, extension of theSequence Version Archive functionality and the revision of qualityrules for TPA data.

	ABSTRACT

INTRODUCTION

TOP
ABSTRACT
INTRODUCTION
SUBMISSIONS TO THE EMBL...
DATA IN THE EMBL...
ACCESSING THE EMBL NUCLEOTIDE...
NEW DEVELOPMENTS
CITING THE EMBL NUCLEOTIDE...
CONTACTING THE EMBL DATABASE
REFERENCES

The European Bioinformatics Institute (EBI) is an outstationof the European Molecular Biology Laboratory (EMBL) in Heidelberg,Germany. It is located on the Wellcome Trust Genome Campus nearCambridge, UK.

	INTRODUCTION

The mission of the Service Programme at the EBI is the building,maintenance and provision of biological databases and otherinformation services to support data deposition and free accessby the scientific community (1).

The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/)is Europe's primary nucleotide sequence resource. This databaseis the European part of an international collaboration withDDBJ (Japan) (2) and GenBank (USA) (3) (INSDC, InternationalNucleotide Sequence Database Collaboration). Data are exchangedon a daily basis between the collaborating institutes. The datain the EMBL Nucleotide Sequence Database originates from a combinationof large-scale genome sequencing projects, direct submissionsfrom individual scientists and the European Patent Office. Thereis a quarterly release of the whole database and new and updatedrecords are distributed daily.

Over the last year, the size of EMBL Nucleotide Sequence Databasehas increased from 27.2 million entries in Release 76, September2003 to 42.3 million entries in Release 80, September 2004,of which 4.4 million entries are WGS (Whole Genome Shotgun)data. There are now over 185 000 organisms represented in thedatabase.

In 2004, the limit on sequence length has been dropped, theEMBLCDSs dataset containing all coding sequences annotated inthe EMBL Nucleotide Sequence Database was launched, the datacollection rules for Third Party Anotation (TPA) data were revisedand the functionality of the Sequence Version Archive was extendedfurther.

Other databases provided by the EBI include the protein resourceUniProt (4), InterPro, a database of protein families, domainsand functional sites (5), the Macromolecular Structure DatabaseE-MSD (6), the automatic genome annotation database Ensembl(7), Genome Reviews, curated versions of complete Genomes fromthe EMBL Database, the Enzyme database IntEnz (8) and the databasefor protein interaction data, IntAct (9).

SUBMISSIONS TO THE EMBL NUCLEOTIDE SEQUENCE DATABASE

TOP
ABSTRACT
INTRODUCTION
SUBMISSIONS TO THE EMBL...
DATA IN THE EMBL...
ACCESSING THE EMBL NUCLEOTIDE...
NEW DEVELOPMENTS
CITING THE EMBL NUCLEOTIDE...
CONTACTING THE EMBL DATABASE
REFERENCES

Why is it essential to submit new sequence?
Printing sequence data as part of a publication is neither sensiblenor manageable, hence journals prefer to cite only the accessionnumber assigned by the INSD Collaboration. Most journals havea mandatory submission procedure such that papers will onlybe accepted if they have an accession number. The nucleotidesequence is considered part of the publication and thereforealmost all nucleotide sequences are publicly available. Havingyour sequence in the database means it is readily availableto the scientific user community. A repository of primary nucleotidesequence data that is freely accessible is essential for computationalanalysis and genome research.

	SUBMISSIONS TO THE EMBL NUCLEOTIDE SEQUENCE DATABASE

How to submit new sequences to the EMBL Nucleotide Sequence Database?
The primary tool for submission of nucleotide sequence datais Webin. For alignment data, it is Webin-Align. Projects withlarge-scale submissions can open a project account allowingdirect updates.

Information for submitters can be found here: http://www.ebi.ac.uk/embl/Documentation/information_for_submitters.html.For submission guidelines please see http://www.ebi.ac.uk/embl/Submission/.

Webin
Webin is the preferred submission tool for nucleotide sequencesand biological information. It should also be used for TPA submissions.Webin allows fast submissions of single, multiple and very largenumbers of sequences (bulk submissions) and is available athttp://www.ebi.ac.uk/embl/Submission/webin.html.

Genome project submissions
Large-scale sequencing projects can open a project account todeposit and update data directly using email or ftp. Groupsproducing large volumes of sequence data are advised to contactthe database at datasubs{at}ebi.ac.uk. More information is availableat http://www.ebi.ac.uk/embl/Submission/genomes.html.

Alignment submissions
Webin-Align (10) is the dedicated submission tool for multiplenucleotide and protein alignments. It accepts all common alignmentformats and is available at http://www.ebi.ac.uk/embl/Submission/align_top.html.

WGS submissions
WGS data submission is not a continuous process—WGS datasetsare normally not updated more often than once every few months.Therefore email or ftp accounts are not opened for the submissionof WGS data, but submissions are dealt with on a one-by-onebasis. Potential submitters are advised to contact the EMBLdatabase at datasubs{at}ebi.ac.uk.

How to update entries in the EMBL Nucleotide Sequence Database?
The editorial rights to an entry in the EMBL Nucleotide SequenceDatabase remain with the original submitter(s). The EBI teamadds value to entries, e.g. via cross-references, but the dataitself is archival and is not updated by the EBI. Submittersare advised to update their own entries via the update form(http://www.ebi.ac.uk/embl/webin/update.html).

DATA IN THE EMBL NUCLEOTIDE SEQUENCE DATABASE

TOP
ABSTRACT
INTRODUCTION
SUBMISSIONS TO THE EMBL...
DATA IN THE EMBL...
ACCESSING THE EMBL NUCLEOTIDE...
NEW DEVELOPMENTS
CITING THE EMBL NUCLEOTIDE...
CONTACTING THE EMBL DATABASE
REFERENCES

Data in the EMBL Nucleotide Sequence Database are grouped intodivisions, according to either the methodology used in theirgeneration (e.g. EST and HTG divisions) or taxonomic originof the sequence source (e.g. HUM and PRO divisions). There arealso some specialized entry types.

	DATA IN THE EMBL NUCLEOTIDE SEQUENCE DATABASE

Whole Genome Shotgun (WGS) data
Methods using WGS data are used to gain a large amount of genomecoverage for an organism. The sequences of all contigs originatingfrom one experiment are grouped in a set. WGS entries have thestandard EMBL format, with accession numbers clearly distinctfrom those of non-WGS entries. The accession numbers of allentries in each WGS set share the same prefix.

Third Party Annotation (TPA) data
The Third Party Annotation data set was launched in responseto requests from the research community to submit entries thatinclude either re-annotation of existing data, or combinationsof novel sequence, existing primary sequence, trace archiveand WGS data.

To distinguish TPA entries from primary data, the abbreviation‘TPA’ appears at the beginning of each description(DE) line and in the keyword list. The link to the primary datainformation is given in the linetypes AH and AS that have beencreated for TPA entries. The following flatfile extract is takenfrom entry BN000024 [GenBank] :

View this table:
[in this window]
[in a new window]

Constructed (CON) entries and expanded CONs
CON entries do not contain a sequence but an assembly of contigs,i.e. the sequence is to be constructed from segments of smallersequences.

The format of a CON entry is similar to that of a standard entry,with the additional CO linetype to accommodate the assemblyinformation. A CON entry does not have any annotation apartfrom source features.

The following example of an assembly is taken from entry BX470249 [GenBank] :

CO join(BX640423 [GenBank] [GenBank] .1:1..348251,BX640424.1:51. .349146,BX640425.1:51..348257,

CO BX640426 [GenBank] [GenBank] .1:51..348866,BX640427.1:51..348997,BX640428.1:51..348525,

CO BX640429 [GenBank] [GenBank] .1:51..344321,BX640430.1:51..348014,BX640431.1:51..347894,

CO BX640432 [GenBank] [GenBank] .1:51..346301,BX640433.1:51..349305,BX640434.1:51..344805,

CO BX640435 [GenBank] [GenBank] .1:51..346259,BX640436.1:51..255260)

Recently,the expanded forms of CON entries (CONFF) have been made availablevia SRS and ftp. In this format, the sequence defined by theassembly and the annotation of the segments are imposed ontothe constructed sequence.

EMBLCDSs dataset
Following requests from database users, a new subset of EMBLdata, the EMBLCDSs database, has been created during the lastyear. Every CDS (coding sequence) feature annotated in EMBLentries is displayed as a single entry.

More details are provided in the New Developments section below.

ACCESSING THE EMBL NUCLEOTIDE SEQUENCE DATABASE

TOP
ABSTRACT
INTRODUCTION
SUBMISSIONS TO THE EMBL...
DATA IN THE EMBL...
ACCESSING THE EMBL NUCLEOTIDE...
NEW DEVELOPMENTS
CITING THE EMBL NUCLEOTIDE...
CONTACTING THE EMBL DATABASE
REFERENCES

The EMBL Nucleotide Sequence Database is available from theEBI via various WWW interfaces, ftp and email (for more informationsee http://www.ebi.ac.uk/embl/Access).

	ACCESSING THE EMBL NUCLEOTIDE SEQUENCE DATABASE

Sequence Retrieval System (SRS)
The EMBL Nucleotide Sequence Database can be accessed via theEBI SRS server (11,12) at http://srs.ebi.ac.uk/. In SRS, thedata are available in the libraries shown in Table 1.

View this table:
[in this window]
[in a new window]

Table 1. SRS data libraries

WGS data are not represented in a separate library any more,but is part of EMBL (Release) and EMBL (Updates). WGS entriescan be identified via the keyword ‘WGS’.

SRS also links to other databases, with cross-references toUniProt and publications available online, for example.

FTP Server
Release data, daily updates and cumulative files of all datatypes can be freely obtained from the ftp server at ftp://ftp.ebi.ac.uk/pub/databases/embl/.Please see the README file for further information.

To create and maintain a local copy of the cumulative file,the syncron tool (ftp://ftp.ebi.ac.uk/pub/software/unix/listtools/)can be used to download automatically newly available incrementaldata files from the ftp site and to merge them locally.

Dbfetch
Dbfetch (database fetch) is a tool for simple sequence retrievalvia http. It can be used to retrieve up to 50 entries from variousdatabases. Dbfetch can be found at http://www.ebi.ac.uk/cgi-bin/dbfetch.

Wsdbfetch provides programmatic access to the Dbfetch functionality.The service is described using Web Services Description Language(WSDL) and uses the Simple Object Access Protocol (SOAP) tocommunicate with other systems. For further information on Wsdbfetchplease see http://www.ebi.ac.uk/Tools/webservices/WSDbfetch.html.

EMBL Sequence Version Archive
The EMBL Sequence Version Archive (SVA) (13) is a repositoryof all versions of any entry that have been distributed to thepublic from the EMBL Nucleotide Sequence Database. An interactiveweb-based interface to the SVA can be accessed at http://www.ebi.ac.uk/cgi-bin/sva/sva.pl.

Entries from the SVA can also be retrieved using dbfetch.

Completed genome sequences
Direct access to completely sequenced genomic components isavailable via the EBI Genomes server at http://www.ebi.ac.uk/genomes/.At the time of writing (September 2004) there are 162 completedgenomes of bacteria, 19 archaea, 36 eukaryota, 540 organelles,136 phages, 204 plasmids, 903 viruses and 36 viroids available.

Sequence searching
A comprehensive set of sequence analysis and database searchalgorithms is available at http://www.ebi.ac.uk/Tools/. Themost commonly used algorithms available are FASTA (14) and WU-BLAST(15), permitting comparisons between query sequences and thenucleotide, translated nucleotide and protein databases.

Sequence similarity searches are available interactively overthe WWW as well as by email. Instructions for email searchescan be obtained by sending a message with the word HELP in itsbody to gpfasta{at}ebi.ac.uk.

Access via email
Data can also be retrieved by email using netserv (netserv{at}ebi.ac.uk).To get started send an email to netserv{at}ebi.ac.uk with ‘HELP’in the message body.

NEW DEVELOPMENTS

TOP
ABSTRACT
INTRODUCTION
SUBMISSIONS TO THE EMBL...
DATA IN THE EMBL...
ACCESSING THE EMBL NUCLEOTIDE...
NEW DEVELOPMENTS
CITING THE EMBL NUCLEOTIDE...
CONTACTING THE EMBL DATABASE
REFERENCES

Sequence length limit
In the past, the sequence length of a database record was limitedto 350 000 bp. In June 2004, this restriction was lifted andentries of any length are now permitted in the database. Completegenomic units such as entire chromosomes can now be representedin a single entry. To represent unsequenced gaps, the new ‘gap’feature is used. Some genomes that were split in the past inorder to comply with the 350 000 bp limit have now been updatedinto single entries, e.g. AE000516 [GenBank] .

	NEW DEVELOPMENTS

Third Party Annotations—new rules
Following a decision taken at the 2004 Collaborative Meeting,the INSD Collaboration has increased the stringency for acceptanceof data into the TPA dataset. The aim is to ensure that theTPA dataset includes the highest quality sequence and biologicalannotation.

To achieve this aim, the similarity between the TPA sequenceand the contributing primary sequences is checked at the timeof submission. We aim to achieve a similarity of at least 90%.In addition, there can be no more than 50 bp of the TPA sequencethat does not correspond to primary entry(ies). All TPA recordsare manually curated and checked prior to public release.

To be released into the public TPA dataset, entries must alsomeet the following requirements:

The study must have been publishedin a peer-reviewed journal.
The study must be supported bybiological experimental evidence.

Further details may be found at: http://www.ebi.ac.uk/embl/Documentation/third_party_annotation_dataset.html.and http://www.ebi.ac.uk/webin/webin_help.html.

EMBL Sequence Version Archive—extended functionality
In February 2004, a new ‘batch retrieval’ functionalityhas been added to the SVA. Multiple entries can now be retrievedby supplying a list of accession numbers with either entry versionnumber, sequence version number (user-indicated in the interface)or no version details for the most recent entry.

By the end of 2004, expanded CON entries will be included inthe SVA.

A warning has been added to report the suppression date forentries that have been suppressed in the database.

EMBLCDSs dataset
Following requests from database users, a new subset of EMBLdata, EMBLCDSs database, has been created during the year. EveryCDS (coding sequence) feature annotated in EMBL entries is displayedas a single entry.

Entries are presented in an EMBL-like flatfile format, withaddition of new line types (Figure 1).

View larger version (25K):
[in this window]
[in a new window]

Figure 1. A sample entry from the EMBLCDSs dataset.

The primary identifier of the entry given in the ID line isthe protein_id of the CDS feature, the IV (identifier version)line gives protein_id and version. The accession number andsequence version of the parent EMBL entry can be found in thePA line. The DE line is created automatically and comprisesthe organism and product names. The taxonomic information istaken from the parent entry. The CDS annotation itself containsall qualifiers that belong to the feature, nucleotide locationsbeing given in relation to the parent entry(ies). The nucleotidesequence of the feature is shown last in the entry.

The EMBLCDSs dataset is available via SRS [library: EMBL (CodingSequences)] and ftp (ftp://ftp.ebi.ac.uk/pub/databases/embl/cds).

Finishing whole genome shotgun sets
Data from the WGS projects where the sequencing and assemblingprocess is finished are moved into the main section of the database.At the time of writing only 5 out of 120 relatively small projectshave been finished (example: Nanoarchaeum equitans Kin4-M, WGSproject prefix: AACL, newly created entry in the main section:AE017199 [GenBank] ). In all cases, accession numbers of the WGS entriesare added as secondary accession numbers to newly created entriesin the main section to help track the data.

XML format
The International Nucleotide Sequence Database CollaborationINSDC has adopted a first draft for a common XML format fornucleotide data. The DTD can be found at http://www.ebi.ac.uk/embl/Documentation/DTD/INSDSeq_v1.dtd.txt.

CITING THE EMBL NUCLEOTIDE SEQUENCE DATABASE

TOP
ABSTRACT
INTRODUCTION
SUBMISSIONS TO THE EMBL...
DATA IN THE EMBL...
ACCESSING THE EMBL NUCLEOTIDE...
NEW DEVELOPMENTS
CITING THE EMBL NUCLEOTIDE...
CONTACTING THE EMBL DATABASE
REFERENCES

The preferred form for citation of the EMBL Nucleotide SequenceDatabase is: Kanz,C. et al. (2005) The EMBL Nucleotide SequenceDatabase. Nucleic Acids Res., 33, D29–D33.

	CITING THE EMBL NUCLEOTIDE SEQUENCE DATABASE

CONTACTING THE EMBL DATABASE

TOP
ABSTRACT
INTRODUCTION
SUBMISSIONS TO THE EMBL...
DATA IN THE EMBL...
ACCESSING THE EMBL NUCLEOTIDE...
NEW DEVELOPMENTS
CITING THE EMBL NUCLEOTIDE...
CONTACTING THE EMBL DATABASE
REFERENCES

Contact by email: data submissions: datasubs{at}ebi.ac.uk; otherenquiries: datalib{at}ebi.ac.uk; data updates/publication notifications:update{at}ebi.ac.uk.

	CONTACTING THE EMBL DATABASE

Postal address: EMBL Nucleotide Sequence Database, EuropeanBioinformatics Institute, Wellcome Trust Genome Campus, Hinxton,Cambridge CB10 1SD, UK.

Telephone: data submissions, +44 1223 494499; general, +44 1223494444.

Fax: general, +44 1223 494468.

Notes

The online version of this article has been published underan open access model. Users are entitled to use, reproduce,disseminate, or display the open access version of this articlefor non-commercial purposes provided that: the original authorshipis properly and fully attributed; the Journal and Oxford UniversityPress are attributed as the original place of publication withthe correct citation details given; if an article is subsequentlyreproduced or disseminated not in its entirety but only in partor as a derivative work this must be clearly indicated. Forcommercial re-use permissions, please contact journals.permissions{at}oupjournals.org.

	Notes

REFERENCES

TOP
ABSTRACT
INTRODUCTION
SUBMISSIONS TO THE EMBL...
DATA IN THE EMBL...
ACCESSING THE EMBL NUCLEOTIDE...
NEW DEVELOPMENTS
CITING THE EMBL NUCLEOTIDE...
CONTACTING THE EMBL DATABASE
REFERENCES

	REFERENCES

Brooksbank,C., Camon,E., Harris,M.A., Magrane,M., Martin,M., Mulder,N., O'Donovan,C., Parkinson,H., Tuli,M., Apweiler,R. et al. ( (2003) ) The European Bioinformatics Institute's data resources. Nucleic Acids Res., , 31, , 43–50.[Abstract/Free Full Text] .
Miyazaki,S., Sugawara,H., Ikeo,K., Gojobori,T. and Tateno,Y. ( (2004) ) DDBJ in the stream of various biological data. Nucleic Acids Res., , 32, , D31–D34.[Abstract/Free Full Text] .
Benson,D.A., Karsch-Mizrachi,I., Lipman,D.J., Ostell,J. and Wheeler,D.L. ( (2004) ) GenBank: update. Nucleic Acids Res., , 32, , D23–D26.[Abstract/Free Full Text] .
Bairoch,A., Apweiler,R., Wu,C.H., Barker,W.C., Boeckmann,B., Ferro,S., Gasteiger,E., Huang,H., Lopez,R., Magrane,M. et al. ( (2005) ) The Universal Protein Resource (UniProt). Nucleic Acids Res., , 33, , D154–D159.[Abstract/Free Full Text] .
Mulder,N.J., Apweiler,R., Attwood,T.K., Bairoch,A., Barrell,D., Bateman,A., Binns,D., Biswas,M., Bradley,P., Bork,P. et al. ( (2003) ) The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res., , 31, , 315–318.[Abstract/Free Full Text] .
Golovin,A., Oldfield,T.J., Tate,J.G., Velankar,S., Barton,G.J., Boutselakis,H., Dimitropoulos,D., Fillon,J., Hussain,A., Henrick,K. et al. ( (2004) ) E-MSD: an integrated data resource for bioinformatics. Nucleic Acids Res., , 32, , D211–D216.[Abstract/Free Full Text] .
Clamp,M., Andrews,D., Barker,D., Bevan,P., Cameron,G., Chen,Y., Clark,L., Cox,T., Cuff,J., Curwen,V. et al. ( (2003) ) Ensembl 2002: accommodating comparative genomics. Nucleic Acids Res., , 31, , 38–42.[Abstract/Free Full Text] .
Fleischmann,A., Darsow,M., Degtyarenko,K., Fleischmann,W., Boyce,S., Axelsen,K.B., Bairoch,A., Schomburg,D., Tipton,K.F. and Apweiler,R. ( (2004) ) IntEnz, the integrated relational enzyme database. Nucleic Acids Res., , 32, , D434–D437.[Abstract/Free Full Text] .
Hermjakob,H., Montecchi-Palazzi,L., Lewington,C., Mudai,S., Kerrien,S., Orchard,S., Vingron,M., Roechert,B., Roepstorff,P. and Apweiler,R. ( (2004) ) IntAct: an open source molecular interaction database. Nucleic Acids Res., , 32, , D452–D455.[Abstract/Free Full Text] .
Lombard,V., Camon,E.B., Parkinson,H.E., Hingamp,P., Stoesser,G. and Redaschi,N. ( (2002) ) EMBL-Align: a new public nucleotide and amino acid multiple sequence alignment database. Bioinformatics, , 18, , 763–764.[Abstract/Free Full Text] .
Zdobnov,E.M., Lopez,R., Apweiler,R. and Etzold,T. ( (2002) ) The EBI SRS server—new features. Bioinformatics, , 18, , 1149–1150.[Abstract/Free Full Text] .
Zdobnov,E.M., Lopez,R., Apweiler,R. and Etzold,T. ( (2002) ) The EBI SRS server—recent developments. Bioinformatics, , 18, , 368–373.[Abstract/Free Full Text] .
Leinonen,R., Nardone,F., Oyewole,O., Redaschi,N. and Stoehr,P. ( (2003) ) The EMBL sequence version archive. Bioinformatics, , 19, , 1861–1862.[Abstract/Free Full Text] .
Pearson,W.R. ( (1994) ) Using the FASTA program to search protein and DNA sequence databases. Methods Mol. Biol., , 24, , 307–331.[Medline] .
Lopez,R., Silventoinen,V., Robinson,S., Kibria,A. and Gish,W. ( (2003) ) WU-Blast2 server at the European Bioinformatics Institute. Nucleic Acids Res., , 31, , 3795–3798.[Abstract/Free Full Text] .

This article has been cited by other articles:

J. Robinson, M. J. Waller, P. Stoehr, and S. G. E. Marsh
IPD--the Immuno Polymorphism Database
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D523 - D526.
[Abstract] [Full Text] [PDF]

C. Brooksbank, G. Cameron, and J. Thornton
The European Bioinformatics Institute's data resources: towards systems biology
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D46 - D53.
[Abstract] [Full Text] [PDF]

				C. Brooksbank, G. Cameron, and J. Thornton The European Bioinformatics Institute's data resources: towards systems biology Nucleic Acids Res., January 1, 2005; 33(suppl_1): D46 - D53. [Abstract] [Full Text] [PDF]