Background
A pharmacogenetic-based biotechnology company, Ocard Asthma Inc., has developed a new treatment for acute asthma attacks it calls PulmaKine. This inhaler-delivered compound was discovered using a combinatorial chemistry approach directed against known cytokine receptors expressed on lung epithelial cells. During the clinical trial phase, it was observed that PulmaKine was highly effective in blocking respiratory impairment in 65% of asthmatics who used the Ocard inhaler during an acute asthma attack. However, the other 35% of the test subjects showed no measurable improvement with the drug, and in fact, many of these PulmaKine non-responders required emergency medical treatment to stop the asthma attack.
Bronchial lavage fluid (BLF) had been collected from all of the study subjects. RT-PCR microchip RNA expression profiling was performed using the BLF cell samples. This characterization led to the identification of a cDNA transcript (EST.4591) that was found to be expressed in the lung tissues of PulmaKine Responders, but not expressed in the PulmaKine Non-responders. Genotyping of the EST.4591 genomic locus identified a germline polymorphism in PulmaKine Non-responders that explained the lack of expression in BLF samples. The researchers named the putative EST.4591 gene product AMG (asthma modulating gene).
Basic Strategy
The molecular genetic research group at Ocard intend to use the microchip data as a starting point to investigate the function of the AMG protein, and moreover, to develop a diagnostic test that can be used to positively identify asthma patients who will benefit from PulmaKine therapy. One long term goal of the company is to develop a second generation therapeutic compound that would be used to specifically treat PulmaKine Non-responders that exhibit low levels of AMG expression.
The illustration below shows how data will be collected from PulmaKine patients based on RNA Expression Profiling using human cDNA microarrays. In this image analysis, red spots indicate that the specific transcript is expressed at a higher level in the patient sample than it is in the control RNA, whereas, green spots denote lower expression of a particular gene in the patient sample relative to the control.

BLAST searches were done with EST.4591 to try and establish a functional identity. Initial results suggested that EST.4591 may corresponded to non-coding sequence based on low homology (E values >0.001) to sequences in the "nr" Database.

Sequence homology searches with the AMG-1 sequence (EST.4591) were performed with other GenBank databases to identify overlapping EST sequences that could be used to extend the AMG transcript sequence. The largest human cDNA identified by these "database mining" approaches was named AMG-2 and found to extend within ~1200 base pairs of the predicted 5' end based on Northern blots. No other GenBank sequences were found to extend beyond AMG-2. In order to isolate the missing portion of AMG, the researchers mail-ordered the AMG-2 cDNA from the public domain storehouse (I.M.A.G.E. Consortium) and used it as a reagent in the lab to isolate a third clone called AMG-3. DNA sequence analysis of the combined AMG sequence (AMG contig) identified a single AMG open reading frame of 352 amino acids.

1. (3 pts.) How are the fluorescently labeled Cy3 (control) and Cy5 (experimental) nucleic acids generated for use in expression profiling studies by microarray technology? What type of RNA sample might have been used to generate the Cy3-labeled control nucleic acid in order to produce the observed microarray results?
2. (2 pts.) The GenBank "blastn" search of EST.4591 against the "nr" database did not show any significant matches. Based on the available NCBI databases, list and briefly describe two other GenBank databases that could have led to the identification of AMG-2 as an overlapping, but uncharacterized, AMG expressed sequence tag (EST).
3. (2 pts.) Describe two independent methods that the Ocard researchers could have used in their molecular genetic lab to isolate the AMG-3 cDNA sequence.
4. (3 pts.) Describe a GenBank query that could be used to identify a possible function for the 352 amino acid AMG protein? Assuming you get a significant match to a related gene from a model organism, what other types of databases could you query through the Internet to find out more about the function of AMG?
| Department of Biochemistry & Molecular Biophysics The University of Arizona Professor Roger L. Miesfeld RLM@u.arizona.edu © 2001. All rights reserved. |