Skip Navigation
 

User Guide

Genetic Alterations in Cancer (GAC)


Data Mining

Data to be displayed are selected from the pull-down lists provided for each field in the Data Selection window (Figure 1)

 

Data Mining Features

Figure 1. Data Mining Features


Within Data Mining analysis type, a selection is required in each of the first three fields Data Category, Species, and Topography before a display option and chart area can be selected (see Figure 2). The remaining fields Morphology, Agent, Tumor Origin, and Disorder (human studies) or Strain (rodent studies) are used to refine the query. As selections are made in each data field, options in subsequent fields are restricted to those for which data are available.

 

Display Options

Data can be displayed graphically or in summary tables in any one of the four chart areas. Choose the GRAPH or TABLE display option and a Chart number in which to display query results (Figure 2). Click the [Search] button to start the query. Each chart is independent, and the results are retained until the CLEAR feature on the chart tool bar is selected or a new query is defined for that chart area. Select the Data Mining option on the main tool bar to clear and reset all four charts.

 

Display Option Selection

Figure 2. Display Option Selection.


 

The GRAPH feature generates a histogram that shows the incidence of mutation for each gene that has been evaluated. The TABLE option generates a tabular summary (see Table 1 below) that shows the total number of tumors that have been evaluated, the number of tumors in which mutations were reported, and the incidence (percentage) of tumors with mutations for each gene. Values are calculated based on data from all of the studies represented in the users query.

 Back to top Back to top

 

The sample profile plotted in Figure 3 is from a general query for gene mutations in any type of human skin tumor. The results are displayed in Chart 1 and the query selections are shown below the graph. The data include point mutations, frameshifts, and inframe insertions and deletions.

 

Sample Profile 1 (General Search)

Figure 3. Sample Profile 1 (General Search).


 

The second sample profile, shown in Figure 4, is from a refined query in which UV Radiation was selected in the agent data field, Basal cell carcinomas for the morphology, and "All" for tumor origin. The profile shows data for all basal cell carcinomas in human skin associated with exposure to UV-radiation. Point to any bar to see the gene symbol and percentage of tumors with mutations.

 

Sample Profile 2 (Refined Search)

Figure 4. Sample Profile 2 (Refined Search).


The TABLE option on the Chart tool bar generates a summary table like that below in Table 1 of the data represented in the graph. The TABLE and GRAPH options are used to toggle between the two views.

 

Summary Data Table for Sample Profile 2

Table 1. Summary Data Table for Sample Profile 2.




Click any gene symbol in a summary table or bar in a graphic profile to view results for individual studies represented in the display. For example, clicking the bar in Figure 4 for the PTCH gene generates the graph shown in Figure 5. Each study is identified by a Reference ID number. Toggle to the data table (Table 2 below) using the TABLE option on the chart tool bar to see the number of tumors evaluated and the number of tumors that had PTCH mutations in each study. A summary table for each study are also generated by clicking on a gene symbol in Table 1.

 

Sample Graph of Individual Studies

Figure 5. Sample Graph of Individual Studies.


 

 Back to top Back to top

A sample data table is shown in Table 2 for the PTCH data. Clicking the CGAP symbol in Table 1 next to any gene displays gene information from the Cancer Genome Anatomy Project (CGAP (http://cgap.nci.nih.gov/)) web site a new window for human or mouse data genes. The symbol RGD beside rat gene symbols links to information on the Rat Genome Database (RGD (http://rgd.mcw.edu/) Exit NIEHS) web site for the corresponding gene.

 

Sample Data Table for Individual Studies

Table 2. Sample Data Table for Individual Studies.



Select a study number in the Data Table or a bar in the profile graph to generate a detailed data list for that study. Select the [All Data] button to display data lists for all of the studies shown in the Data Table. A sample data list is shown in Table 3 for the PTCH gene mutation data reported for basal cell carcinomas in reference CG0006. Each entry includes a unique Subject ID number, based on subject identities reported in the reference when available; the subjects gender and age; a field for identifying any known genetic disorder (e.g., xeroderma pigmentosum); an alteration description (Alt. Desc.); the Exon and Codon number for the altered sequence; the codon wild type sequence (WT Seq) and altered sequence (Alt Seq); and the wild type amino acid (WT AA) and altered amino acid (Alt AA) predicted from the altered sequence.

 

>
CHART 1 BACK Study - CG0006 Gene - PTCH
New Window | PTCH Mutation Spectrum | More Detail
Subject IDSexAgeDisorderAlt. Desc.ExonCodon AWT SeqAlt SeqWT AAAlt AA
ZPL BCC 10F28NONEGC>AT13616AGCAACSerAsn
ZPL BCC 13F28NONEGC>AT14668TACTATTyrTyr
ZPL BCC 04F28NONEGC>AT14688CAGTAGGlnSTOP
ZPL BCC 03F29NONEGC>AT15770CGATGAArgSTOP
ZPL BCC 09F26NONEDeletion15795TTCNAPheNA
ZPL BCC 08F25NONEInsertion17903ACTNAThrNA
ZPL BCC 06F29NONEGC>AT181018TGGTGATrpSTOP
ZPL BCC 12F18NONEGC>TA191066GAGTAGGluSTOP
ZPL BCC 05F25NONEGC>AT211163GGCAGCGlySer
ZPL BCC 05F25NONEGC>AT5236TGGTAGTrpSTOP
ZPL BCC 11F25NONEGC>AT5237GAAAAAGluLys
ZPL BCC 01F28NONEGC>AT8365CAGTAGGlnSTOP
ZPL BCC 02M26NONEGC>AT9417CAATAAGlnSTOP
ZPL BCC 04F28NONEAT>TA9431CTGCAGLeuGln
ZPL BCC 07M27NONEGC>TA-NANANANANA
Table 3. Sample Detailed Data List for Results from Study CG0006.

 

Note that a subject may have more than one mutation reported for a given tumor (see ZPL BCC 05) . If mutations are reported in two or more tumors from the same subject a suffix a, b, c, etc. is added to the Subject ID. Data lists for studies in mice or rats show the strain that was used in place of the Disorder field.

 

The More Detail option on the Data Table tool bar opens a new window and displays detailed study information, including additional agent information, dose and treatment times, identity of the region of the gene that was evaluated (exon numbers), an indication of the analytical technique, and notes for clarification or additional details if needed. A sample display of More Detail is shown in Figure 6 for study number CG0006.

 

More Detail Display

Figure 6. More Detail Display.


 

The Gene Mutation Spectrum option on the Chart tool bar generates a pie chart (mutation spectrum) that shows the percentage of each mutation as a portion of the total number of mutations. A sample mutation spectrum is shown in Figure 7 for the PTCH data. The number of tumors with mutations and the total number of mutations reported are shown in the mutation spectrum heading.

 

Sample Mutation Spectrum

Figure 7. Sample Mutation Spectrum.


 

Ref IDCitation (authors, year, title, journal, volume, pages)Source
CG0006Zhang, H., Ping, X.L., Lee, P.K., Wu, X.L., Yao, Y.J., Zhang, M.J., Silvers, D.N., Ratner, D., Malhotra, R., Peacocke, M., Tsou, H.C. (2001 ) RAle of PTCH and p53 genes in early-onset basal cell carcinoma. Am J Pathol, 158, 381-5.PubMed: 11159175 
CG0038Kim, M.Y., Park, H.J., Baek, S.C., Byun, D.G., Houh, D. (2002 ) Mutations of the p53 and PTCH gene in basal cell carcinomas: UV mutation signature and strand bias. J Dermatol Sci, 29, 1-9.PubMed: 12007715 
IL0040Bodak, N., Queille, S., Avril, M.F., Bouadjar, B., Drougard, C., Sarasin, A., Daya-Grosjean, L. (1999 ) High levels of patched gene mutations in basal-cell carcinomas from patients with xeroderma pigmentosum. Proc Natl Acad Sci U S A, 96, 5117-22.PubMed: 10220428 
IL0046D'Errico, M., Calcagnile, A., Canzona, F., Didona, B., Posteraro, P., Cavalieri, R., Corona, R., Vorechovsky, I., Nardo, T., Stefanini, M., Dogliotti, E. (2000 ) UV mutation signature in tumor suppressor genes involved in skin carcinogenesis in xeroderma pigmentosum patients. Oncogene, 19, 463-7.PubMed: 10656695 
IL0086Ratner, D., Peacocke, M., Zhang, H., Ping, X.L., Tsou, H.C. (2001 ) UV-specific p53 and PTCH mutations in sporadic basal cell carcinoma of sun-exposed skin. J Am Acad Dermatol, 44, 293-7.PubMed: 11174390 
IL0108Aszterbaum, M., Rothman, A., Johnson, R.L., Fisher, M., Xie, J., Bonifas, J.M., Zhang, X., Scott, M.P., Epstein, E.H. Jr (1998 ) Identification of mutations in the human PATCHED gene in sporadic basal cell carcinomas and in patients with the basal cell nevus syndrome. J Invest Dermatol, 110, 885-8.PubMed: 9620294 
Table 4. Sample Reference List.

 

The New Window option on the Chart tool bar opens a new, independent, interactive window containing the results displayed in the current chart. Selecting CLEAR on the original chart bar will clear the chart area without clearing the new window. The Ref List option generates a complete bibliographic list of references represented in the active display chart. A sample reference list is shown in Table 4 for the PTCH gene data. Each Reference ID is a unique number that is assigned to a study to identify the data source. The PubMed number, when available, is hyperlinked to its corresponding PubMed abstract.

Mutation Distribution

Codon distribution analyses are viewed by using the pull down list for Analysis Type and selecting [Mutation Distribution]. These displays show how base pair substitutions are distributed by codon along the length of different genes.

 

 Back to top Back to top

Data Selection Options

Data to be displayed are selected from the pull down lists. For codon distribution analysis, the only fields required before the graph and table areas can be displayed are species and gene highlighted in Figure 8. Other fields can be selected as needed to further refine the query. All selections will remain until either changed or all fields are Reset. As selections are made in each data field, options in subsequent fields are restricted to those for which data is available.

 

Data Mining Features

Figure 1. Codon Distribution Features.


 

Display Options

 

Table/Graph

 

Data can be displayed in graphical and/or tabular formats. These are shown when the Display button is selected. The table displays the number of single base substitutions that occur in each exon (Table 5). The codons numbers defining each exon are given by Codon Range, the number of tumors with mutations by Tumors Mutated, and the numbers of tumors analyzed for each exon are also given. The incidence (percent) of mutations that occur in analyzed tumors is also given as % Mutated Tumor Incidence. In addition to this is the option to display the results graphically; this is achieved by selecting the Include at the Display Options, Graph prompt. This feature generates a bar chart that shows the number of single base substitutions that occur at each codon across the length of a gene (Figure 9). Point to any bar to see the codon number and number of base substitutions occurring. Large genes may be divided into separate but contiguous graphs. Values are calculated based on point mutation data from all studies represented by users selections, but excludes tandem and complex substitutions.

 

Codon Distribution Display Table

Table 5. Codon Distribution Display Table.



Codon Distribution Display Table
Figure 9. Codon Distribution Graph Using Counts Per Codon


 

Graph Display Type

 

The type of graphic display can be modified by choosing to either display the number of mutations per codon (default), or the incidence of mutations at each codon per exon. This is achieved by choosing the appropriate selection under Display Options-> Graph Display Type. (See Figure 8).

 

Summary Table

 

A summary table describing, in detail, the results presented graphically can be selected at the Display Options -> Summary Table option. This summarizes the codon numbers at which mutations occurred and the number of mutations that were present at each.

 

Detailed Information

From the table feature more Detailed Information (see Table 5) about individual exons can be selected using the radio buttons for an individual exon. An expanded graphical display of the distribution of mutations for the selected exon and the types of base pair substitutions occurring is generated. A sample profile of the mutations that occur in Human, Skin, TP53, exon 7 is shown in Figure 10 and Table 6.

 

The bar chart in Figure 10 displays the number of single base substitutions that occur at each codon; Point to any bar to see the codon number and number of base substitutions present.

 

Mutation Profile for a Single Exon

Figure 10. Mutation Profile for a Single Exon.



Base Substitution Type

Table 6. Base Substitution Type.



 

Tandem Mutations

The displayed data can be further modified to include tandem mutation information with the Data Selection options (Figure 11).

 

Tandem Substitution Data Selection Options

 

Table 7. Single Base and Tandem Substitutions.

 

When Include Tandem Mutation(s) and Display are selected both the graph and the table are modified to include all tandem point mutations. In the table these are listed separately as CC>TT, CC>TT CpG, and Tandem. The latter describes any two consecutive base substitutions occurring in 1 codon that are not CC>TT changes including those occurring at CpG sites. The number of non-CC>TT tandem mutations occurring at CpG sites is given as a footnote (Table 7).

 

Single Base and Tandem Substitutions

Table 7. Single Base and Tandem Substitutions.


 

The types of substitutions that occur as tandem mutations can be shown by selecting Include Tandem Mutation(s) Table and Display. This produces a table (Table 8) listing the tandem base substitutions that occur by codon number.

Back to Top