The genome of the bacterium Neisseria gonorrhoeae consists of one double-stranded DNA molecule that contains 2220 kilobase pairs. If 85% of this DNA molecule is made up of the open reading frames of genes encoding proteins, and the average protein is 300 amino acids long, how many protein-encoding genes does Neisseria have? What kind of genetic information is present in the other 15% of the DNA?
To discuss:
The genome of Neisseria gonorrhoeae is a single double-stranded DNA molecule of 2220 kbp. Given that 85% of this DNA consists of open reading frames of protein-encoding genes and that the average protein is 300 amino acids long, determine how many protein-encoding genes Neisseria has and what kind of genetic information is present in the other 15% of the DNA.
Concept introduction:
An open reading frame (ORF) is a segment of a DNA or RNA molecule that can be translated into a protein sequence. In mRNA, an ORF begins with a start codon (AUG), continues through a stretch of codons, and ends with a stop codon (UAA, UGA, or UAG). An mRNA must contain an ORF in order to be translated.
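The ORF definition above can be illustrated with a minimal Python sketch. The function name `find_first_orf` and the example sequence are hypothetical and chosen only for illustration; the sketch scans a single mRNA strand and ignores details a real gene finder would handle, such as minimum ORF length or the complementary strand.

```python
# Stop codons as listed in the concept introduction.
STOP_CODONS = {"UAA", "UGA", "UAG"}

def find_first_orf(mrna):
    """Return the first ORF (start codon through stop codon), or None.

    Hypothetical helper: looks for AUG, then reads codons in that
    reading frame until a stop codon is reached.
    """
    start = mrna.find("AUG")
    while start != -1:
        # Step through the sequence three nucleotides at a time.
        for i in range(start + 3, len(mrna) - 2, 3):
            if mrna[i:i + 3] in STOP_CODONS:
                return mrna[start:i + 3]
        # No in-frame stop codon found; try the next AUG.
        start = mrna.find("AUG", start + 1)
    return None

if __name__ == "__main__":
    # Illustrative sequence, not taken from the Neisseria genome.
    print(find_first_orf("GGAUGAAACCCUAAGG"))
```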
Explanation of Solution
The Neisseria gonorrhoeae genome contains 2220 kbp, or 2,220,000 bp.
Each base pair adds 0.34 nm to the length of the double helix.
Therefore, 2,220,000 bp × 0.34 nm per bp = 754,800 nm.
The length of the DNA is 754,800 nm, or 0.07548 cm.
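The length calculation can be checked with a few lines of Python. This is only a sketch of the arithmetic in the text; the variable names are illustrative.

```python
# Values taken from the solution text.
GENOME_BP = 2_220_000      # 2220 kbp
RISE_PER_BP_NM = 0.34      # length contributed by each base pair, in nm
NM_PER_CM = 1e7            # 1 cm = 10,000,000 nm

length_nm = GENOME_BP * RISE_PER_BP_NM
length_cm = length_nm / NM_PER_CM

if __name__ == "__main__":
    print(f"{length_nm:,.0f} nm = {length_cm:.5f} cm")
```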
If 85% of the genome consists of open reading frames, then 0.85 × 2220 kbp = 1887 kbp (1,887,000 bp) of the DNA is open reading frame sequence.
The average protein is 300 amino acids long, and each amino acid is encoded by three nucleotides (one codon). Therefore, an average open reading frame spans 300 × 3 = 900 bp of coding sequence (not counting the stop codon).
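The number of protein-encoding genes is therefore 1,887,000 bp ÷ 900 bp per gene ≈ 2097. The remaining 15% of the DNA does not encode protein: it includes regulatory sequences such as promoters and transcription terminators, which control gene expression, as well as genes for RNAs that are not translated, such as rRNAs and tRNAs.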
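The gene-count estimate can be reproduced with a short Python sketch. The function `estimate_gene_count` and its `include_stop` option are assumptions added for illustration; the solution itself counts only the 300 sense codons, which is the default here.

```python
def estimate_gene_count(genome_bp, coding_fraction, avg_protein_aa,
                        include_stop=False):
    """Estimate the number of protein-encoding genes in a genome.

    include_stop is a hypothetical option (not part of the textbook
    solution) that adds one stop codon to each open reading frame.
    """
    codons = avg_protein_aa + (1 if include_stop else 0)
    orf_bp = codons * 3                      # three nucleotides per codon
    coding_bp = genome_bp * coding_fraction  # bp occupied by ORFs
    return round(coding_bp / orf_bp)

if __name__ == "__main__":
    # Values from the problem: 2220 kbp, 85% coding, 300 amino acids.
    print(estimate_gene_count(2_220_000, 0.85, 300))
```

Counting the stop codon changes the estimate only slightly, so the result is best read as roughly 2100 genes rather than an exact figure.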
Chapter 4 Solutions
Brock Biology of Microorganisms (15th Edition)