This directory contains the Apr. 2005 freeze of the D. simulans genome
(droSim1) produced by the Genome Sequencing Center at Washington 
University School of Medicine in St. Louis.

Files included in this directory:

  - chr*.fa.zip: compressed FASTA sequence of each chromosome.
    Repeats (from RepeatMasker and Tandem Repeat Finder) 
    are in lower case while non-repeating sequence is in upper case.
    RepeatMasker Sep. 2004 open-3-0-5 version with RepBase libraries:
    RepBase Update 9.04, RM database version 20040702

------------------------------------------------------------------
If you plan to download a large file or multiple files from this 
directory, we recommend that you use ftp rather than downloading the 
files via our website. To do so, ftp to hgdownload.cse.ucsc.edu, then 
go to the directory goldenPath/droSim1/chromosomes. To download multiple 
files, use the "mget" command:

    mget <filename1> <filename2> ...
    - or -
    mget -a (to download all the files in the directory)

The D. simulans sequence is made freely available before scientific 
publication by The Genome Sequencing Center, WUSTL School of Medicine with 
the following understanding: 

1. The data may be freely downloaded, used in analyses, and repackaged in
   databases. 
2. Users are free to use the data in scientific papers analyzing particular 
   genes and regions if the providers of these data (Genome Sequencing 
   Center, WUSTL School of Medicine) are properly acknowledged. 
3. The Drosophila simulans analysis group is aiming to publish an initial 
   analysis of the D. simulans genome sequence in 2005 (submitted in 
   2005) that will include descriptions of the assembly, genome landscape, 
   comparative analysis and initial gene content. People who would like to 
   coordinate other genome-wide analysis with this work should contact 
   Richard K. Wilson, Genome Sequencing Center Director, Washington 
   University School of Medicine. We welcome a coordinated approach to 
   describing this community resource. 
4. Any redistribution of the data should carry this notice. 
      Name                    Last modified      Size  Description
Parent Directory - md5sum.txt 2005-04-13 12:14 892 chrYh_random.fa.gz 2005-04-13 12:13 27K chrXh_random.fa.gz 2005-04-13 12:13 23K chrX_random.fa.gz 2005-04-13 12:13 1.3M chrX.fa.gz 2005-04-13 12:13 4.5M chrU.fa.gz 2005-04-13 12:13 3.9M chrM.fa.gz 2005-04-13 12:13 4.7K chr4_random.fa.gz 2005-04-13 12:13 35K chr4.fa.gz 2005-04-13 12:13 265K chr3h_random.fa.gz 2005-04-13 12:13 397K chr3R_random.fa.gz 2005-04-13 12:13 294K chr3R.fa.gz 2005-04-13 12:13 8.0M chr3L_random.fa.gz 2005-04-13 12:13 238K chr3L.fa.gz 2005-04-13 12:13 6.5M chr2h_random.fa.gz 2005-04-13 12:12 886K chr2R_random.fa.gz 2005-04-13 12:12 684K chr2R.fa.gz 2005-04-13 12:12 5.6M chr2L_random.fa.gz 2005-04-13 12:12 210K chr2L.fa.gz 2005-04-13 12:12 6.4M