BLAST executables for local use are provided for
Solaris, LINUX, Windows, and MacOSX systems. See the README file in the ftp
directory for more information. Pre-formatted databases for BLAST nucleotide,
protein, and translated searches also are available for downloading under the
db subdirectory.
Sequence databases for use with the stand-alone BLAST
programs. The files in this directory are pre-formatted databases that are
ready to use with BLAST.
Sequence databases in FASTA format for use with the
stand-alone BLAST programs. These databases must be formatted using formatdb
before they can be used with BLAST.
This site contains files for all sequence records in
GenBank in the default flat file format. The files are organized by GenBank
division, and the full contents are described in the README.genbank file.
This site contains all nucleotide and protein sequence
records in the Reference Sequence (RefSeq) collection. The
""release"" directory contains the most current release of
the complete collection, while data for selected organisms (such as human,
mouse and rat) are available in separate directories. Data are available in
FASTA and flat file formats. See the README file for details.
This site contains next-generation sequencing data
organized by the submitted sequencing project.
This site contains the trace chromatogram data
organized by species. Data include chromatogram, quality scores, FASTA
sequences from automatic base calls, and other ancillary information in
tab-delimited text as well as XML formats. See the README file for details.
This site contains individual directories for each
organism with data in UniGene. The data for each species includes the unique
sequence for each UniGene cluster, all sequences in each cluster in FASTA
format and library information for the cluster. See the README file for further
details.
This site contains the UniVec and UniVec_Core
databases in FASTA format. See the README.uv file for details.
This site contains whole genome shotgun sequence data
organized by the 4-digit project code. Data include GenBank and GenPept flat
files, quality scores and summary statistics. See the README.genbank.wgs file
for more information.
Submissions
An online form that provides an interface for
researchers, consortia and organizations to register their BioProjects. This
serves as the starting point for the submission of genomic and genetic data for
the study. The data does not need to be submitted at the time of BioProject registration.
A web-based sequence submission tool for one or a few
submissions to the GenBank database, designed to make the submission process
quick and easy.
Tool for submission to the GenBank database of Barcode
short nucleotide sequences from a standard genetic locus for use in species
identification.
A stand-alone software tool developed by the NCBI for
submitting and updating entries to public sequence databases (GenBank, EMBL, or
DDBJ). It is capable of handling simple submissions that contain a single short
mRNA sequence, complex submissions containing long sequences, multiple
annotations, segmented sets of DNA, as well as sequences from phylogenetic and
population studies with alignments. For simple submission, use the online
submission tool BankIt instead.
A command-line program that automates the creation of
sequence records for submission to GenBank using many of the same functions as
Sequin. It is used primarily for submission of complete genomes and large
batches of sequences.
This link describes how submitters of SRA data can
obtain a secure NCBI FTP site for their data, and also describes the allowed
data formats and directory structures.
A single entry point for submitters to link to and
find information about all of the data submission processes at NCBI. Currently,
this serves as an interface for the registration of BioProjects and BioSamples
and submission of data for WGS and GTR. Future additions to this site are
planned.
This link describes how submitters of trace data can
obtain a secure NCBI FTP site for their data, and also describes the allowed
data formats and directory structures.
No comments:
Post a Comment