3.8: Human Genome
- Page ID
- 6500
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)All these ACGTs. What are they?
Over three billion of them from a human form the human genome - the human genetic material - all the information needed to encode a human being. It would take about 9.5 years to read out loud - without stopping - the more than three billion pairs of bases in one person's genome.
The Human Genome
What makes each one of us unique? You could argue that the environment plays a role, and it does to some extent. But most would agree that your parents have something to do with your uniqueness. In fact, it is our genes that make each one of us unique – or at least genetically unique. We all have the genes that make us human: the genes for skin and bones, eyes and ears, fingers and toes, and so on. However, we all have different skin colors, different bone sizes, different eye colors and different ear shapes. In fact, even though we have the same genes, the products of these genes work a little differently in most of us. And that is what makes us unique.
The human genome is the genome - all the DNA - of Homo sapiens. Humans have about 3 billion bases of information, divided into roughly 20,000 to 22,000 genes, which are spread among non-coding sequences and distributed among 24 distinct chromosomes (22autosomes plus the X and Y sex chromosomes) (below). The genome is all of the hereditary information encoded in the DNA, including the genes and non-coding sequences.
Human Genome, Chromosomes, and Genes. Each chromosome of the human genome contains many genes as well as noncoding intergenic (between genes) regions. Each pair of chromosomes is shown here in a different color.
Thanks to the Human Genome Project, scientists now know the DNA sequence of the entire human genome. The Human Genome Project is an international project that includes scientists from around the world. It began in 1990, and by 2003, scientists had sequenced all 3 billion base pairs of human DNA. Now they are trying to identify all the genes in the sequence. The Human Genome Project has produced a reference sequence of the human genome. The human genome consists of protein-coding exons, associated introns and regulatory sequences, genes that encode other RNA molecules, and other DNA sequences (sometimes referred to as "junk" DNA), which are regions in which no function as yet been identified.
You can watch a video about the Human Genome Project and how it cracked the "code of life" at this link: http://www.pbs.org/wgbh/nova/genome/program.html.
Our Molecular Selves video discusses the human genome, and is available athttp://www.genome.gov/25520211 or http://www.youtube.com/watch?v=_EK3g6px7Ik.Genome, Unlocking Life's Code is the Smithsonian's National Museum of Natural History exhibit of the human genome. See http://unlockinglifescode.org to visit the exhibit.
ENCODE: The Encyclopedia of DNA Elements
In September 2012, ENCODE, The Encyclopedia of DNA Elements, was announced. ENCODE was a colossal project, involving over 440 scientists in 32 labs the world-over, whose goal was to understand the human genome. It had been thought that about 80% of the human genome was "junk" DNA. ENCODE has established that this is not true. Now it is thought that about 80% of the genome is active. In fact, much of the human genome is regulatory sequences, on/off switches that tell our genes what to do and when to do it. Dr. Eric Green, director of the National Human Genome Research Institute of the National Institutes of Health which organized this project, states, "It's this incredible choreography going on, of a modest number of genes and an immense number of ... switches that are choreographing how those genes are used."
It is now thought that at least three-quarters of the genome is involved in making RNA, and most of this RNA appears to help regulate gene activity. Scientists have also identified about 4 million sites where proteins bind to DNA and act in a regulatory capacity. These new findings demonstrate that the human genome has remarkable and precise, and complex, controls over the expression of genetic information within a cell.
See ENCODE data describes function of human genome athttp://www.genome.gov/27549810 for additional information.
Summary
- The human genome consists of about 3 billion base pairs of DNA.
- In 2003, the Human Genome Project finished sequencing all 3 billion base pairs.
Explore More
Use this resource to answer the questions that follow.
- http://www.hippocampus.org/Biology → Non-Majors Biology → Search: Human Genome Project
- What were 3 goals of the Human Genome Project?
- How many genes are on chromosome #1?
- How big is the human genome?
- How much genetic variation is there among people?
- How much of the genome is not part of any gene?
Review
- Describe the human genome.
- What has the Human Genome Project achieved?
- Describe the makeup of the human genome.