SGD SGD


This site contains data from the SGD Oracle database, including lists of database IDs, ORF locations, Gene Ontology annotations, phenotype data, SAGE data, genetic mapping results, and literature curation.

A brief description of the data files provided are below. Please see the README files for more details about the files within each directory. Please send questions on the files and their contents to yeast-curator@yeastgenome.org


Data File Directory

Chromosomal Features:
Information on ORFs and non-protein coding genes and gene products annotated at SGD. Includes mapping of ORF names to IDs (eg SwissProt, Genbank), chromosomal coordinates, intron info, changes annotations, etc.

Relational Database Schema
SQL scripts used to create the SGD Oracle database.

Registry of Gene Names
Information about registered S. cerevisiae gene names.

Literature Curation Data
Information from the SGD curation of the literature, including SGD's contributions to the Gene Ontology (GO) project and phenotype data from the Systematic Deletion Consortium.

Protein Information
Protein structure and composition data, includes molecular weight, codon bias, etc.

DNA & Protein Sequences
Contains all S. cerivisiae DNA and protein sequence data

GenBank Sequences
FASTA files of all publicly available S. cerevisiae sequences, both DNA and protein from NCBI.

NCBI formated chromosomal sequences
FASTA, NCBI Sequin and ASN.1 format files that are the source of the information at NCBI on the S. cerevisiae genome (NCBI genome section).

cDNA sequences
Collection of cDNA (EST) sequences from budding yeast.

Genomic Sequences
Root for directories providing a variety of sequence datasets derived from the systematic S. cerevisiae genome sequence.

Genome to genome sequence similarity results

Systematic genome analysis results