This directory contains fasta files for human genomic sequence in the ENCODE regions, for the May 2004 (UCSC hg17, NCBI Build 35) build of the human genome. The sequences are in the files: hg17.fa.gz unmasked sequence (all upper case) hg17.msk.fa.gz soft-masked sequence (repeats in lower case) In October 2005, the ENCODE project is transitioning from the initial reference build (Build 34) to this one. For background on the ENCODE project, see: NHGRI: The ENCODE Project: ENCylopedia Of DNA Elements http://www.genome.gov/10005107 For the list of primary and backup regions see: ENCODE Target Regions http://genome.ucsc.edu/ENCODE/regions.html
Name Last modified Size Description
Parent Directory - md5sum.txt 2005-09-26 11:18 94 hg17.msk.fa.gz 2005-09-26 16:36 9.3M hg17.fa.gz 2005-09-26 16:36 8.7M