PDB - Protein Data Bank Macromolecule structures determined by X-ray crystallography and NMR, both proteins and DNA. The principal source for molecular 3D coordinates.
Genomes:
Genome databases contain partial or full sequences for the chromosomes of organisms. Certain centers (KEGG, Sanger, EMGLib, TIGR, Celera) distribute several genomes whereas others concentrate on a single organism. The first two below (GOLD, KEGG) are good places for finding any genome project.
genomesize.com Database of genome sizes, covering even species which have not been sequenced
Genetic Maps:
Genomic sequencing is usually based on certain markers, which can be used to locate genes. These markers are important also for the "gene hunting", localization of certain genes.
Information related to macromolecule (mainly protein) three dimensional structure and their analyses.
PDB - Protein Data Bank Macromolecule structures determined by X-ray crystallography and NMR, both proteins and DNA. The principal source for molecular 3D coordinates.
MSD - Macromolecular Structure Database at EBI The European project for the collection, management and distribution of data about macromolecular structures, derived in part from the Protein Data Bank (PDB).
MMDB Database of macromolecular 3D structures at NCBI, data taken from PDB but enhanced with consistent taxonomy, consistent secondary structure assignments etc. Searchable with Entrez, can be directly linked to sequence and/or literature searches.
NDB A database specializing in nucleic acid 3D structures and DNA-binding protein structures.
CATH Hierarchical classification of protein domain structures
SCOP Familial and structural protein relationships
ASTRAL Analysis of protein structures and their sequences
Gene3D A database of precalulated structural assignments for genes within whole genomes.
HSSP Structural families and alignments. Homology-derived structures of proteins (secondary and tertiary), similar to Gene3D.
Membrane protein topology database This database contains information of experimentally verified transmembrane helices (172 proteins in Jan 2007)
BioMagResBank A database of NMR-derived protein and nucleic acid 3D structures
BTKbase Mutation registry for X-linked agammaglobulinemia
HIV-RT HIV reverse transcriptase and protease sequence variation. Shows an interesting focus in the interplay of medication, development of resistance and sequence changes.
KinMutBase Disease-causing protein kinase mutations
PAHdb Mutations at the phenylalanine hydroxylase locus. A good example of a disease-oriented database.
BLOCKS Ungapped multiple protein alignments extracted from SwissProt/TrEMBL entries, corresponding to the most highly conserved regions in protein families documented in InterPro
SMART Identification and annotation of genetically mobile domains and the analysis of domain architectures
PROSITE Biologically-significant protein patterns and profiles
iProClass Comprehensive family relationships and structural/functional features of protein
Gene Expression:
The transcription of genes in genomes can be easily analysed e.g. with chip technology. For the distribution and analysis a number of Web sites are available.