ENCODE Project at UCSC

NAR Molecular Biology Database Collection entry number 985
Rosenbloom, Kate1; Dreszer, Timothy1; Malladi, Venkat1; Long, Jeffrey1; Sloan, Cricket1; Raney, Brian1; Cline, Melissa1; Karolchik, Donna1; Barber, Galt1; Clawson, Hiram1; Diekhans, Mark1; Fujita, Pauline1; Goldman, Mary1; Gravell, Robert1; Kuhn, Robert1; Harte, Rachel1; Hinrichs, Angie1; Kirkup, Vanessa1; Learned, Katrina1; Maddren, Morgan1; Meyer, Laurence1; Pohl, Andy1,2; Rhead, Brooke1; Wong, Mathew1; Zweig, Ann1; Haussler, David1,3; Kent, Jim1
1Center for Biomolecular Science and Engineering, School of Engineering, University of California, Santa Cruz (UCSC), Santa Cruz, CA 95064, USA
2Centre for Genomic Regulation (CRG), Barcelona, Spain and 3Howard Hughes Medical Institute, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, USA

Database Description

The Encyclopedia of DNA Elements (ENCODE) project is an international consortium of investigators funded to analyze the human genome with the goal of producing a comprehensive catalog of functional elements. Initially a pilot project focused on 1% of the human genome sequence, ENCODE was scaled up in September 2007 to cover the entire genome. The ENCODE Data Coordination Center (DCC) at The University of California, Santa Cruz (UCSC) is the primary repository for experimental results generated by ENCODE investigators. These results are captured in the UCSC Genome Bioinformatics database and download server for visualization and data mining via the UCSC Genome Browser and companion tools (Rhead et al. The UCSC Genome Browser Database: update 2010, in this issue). To support the breadth and volume of ENCODE data, new features for organizing and selecting datasets have been added to the browser. Additionally, metadata describing the experiments is now captured more formally via controlled vocabularies which are associated with browser tracks and download files. The ENCODE web portal at UCSC (http://encodeproject.org or http://genome.ucsc.edu/ENCODE) provides information about the ENCODE data and convenient links for access. As of September 2009, the ENCODE investigators have completed four production-level data freezes, consisting of 341 experiments submitted to the ENCODE DCC, with 207 of these released in a total of 18 tracks on the UCSC Genome Browser. These experiments represent a major methodology shift from microarray to high-throughput sequencing assays, and include a broad range of experiment types (e.g. ChIP-seq, RNA-seq, Methyl-seq) in 7 designated common cell types, and 51 additional human cell lines and tissues.


We would like to thank the systems administration staff at the Center for Biomolecular Science and Engineering: Jorge Garcia, Erich Weiler, Victoria Lin and Gary Moro, for their dedication and support, keeping high-volume ENCODE data flowing to our public site while assuring our servers are reliable and available. Thanks also to members of the ENCODE Consortium for providing these valuable data sets.


1. ENCODE Consortium. (2004) The ENCODE (ENCyclopedia Of DNA Elements) Project. Science 306, 636-640
2. The ENCODE Project Consortium, Birney, E., Stamatoyannopoulos, J., Dutta, A., Guigó, R., Gingeras, T., Margulies, E., Weng, Z., Snyder, M., Dermitzakis, E. et al. (2007) Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799-816
3. Weinstock, G.M. (2007) ENCODE: More genomic empowerment. Genome Res. 17, 667-668
4. Kent, W.J., Sugnet, C.W., Furey, T.S., Roskin, K.M., Pringle, T.H., Zahler, A.M., and Haussler, D. (2002). The human genome browser at UCSC. Genome Res. 12, 996-1006
5. Kuhn, R.M., Karolchik, D., Zweig, A.S., Wang, T., Smith, K., Rosenbloom, K.R., Rhead, B., Raney, B.J., Pohl, A., Pheasant, M. et al. (2009) The UCSC Genome Browser Database: update 2009. Nucleic Acids Res. 37 D755-D761
6. Thomas, D.J., Rosenbloom, K.R., Clawson, H., Hinrichs, A.S., Trumbower, H., Raney, B.J., Karolchik, D., Barber, G.P., Harte, R.A., Hillman-Jackson, J. et al. (2007) The ENCODE Project at UC Santa Cruz. Nucleic Acids Res. 35 D663-667
7. Celniker, S., Dillon, L., Gerstein, M., Gunsalus, K., Henikoff, S., Karpen, G., Kellis, M., Lai, E., Lieb, J., MacAlpine, D. et al. (2009) Unlocking the secrets of the genome. Nature 459 927-930
8. Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G., Durbin, R., and the 1000 Genome Project Data Processing Subgroup (2009) The Sequence Alignment/Map (SAM) Format and SAMtools. Bioinformatics 25 2078-2079

Subcategory: Human ORFs

Go to the article in the NAR Database issue.
Oxford University Press is not responsible for the content of external internet sites