| User Manual | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Five input types are provided, including 1. gene name(s) for DBTSS, 2. NCBI nucleotide accession#(s), 3. fasta sequence(s) paste, 4. primer pair information for PCR product and 5. fasta file input. They are introduced as follows: User can input gene name(s) to search transcriptional start sites from DBTSS (http://dbtss.hgc.jp/) and two organisms are selectable, they are human and mouse, respectively. (Fig. 1)
When the text field of ˇ§Gene Namesˇ¨ is inputted correctly and ˇ§Queryˇ¨ button is clicked (Fig. 1) (for instance ˇ§TEKT4ˇ¨), the system will query Gene ID information via NCBI database. (Fig. 2)
And then, the system will search query transcriptional start sites information via DBTSS database. (Fig. 3)
When the query is finished, related information is shown as Fig. 4. This information contains ˇ§Organismˇ¨, ˇ§Gene Nameˇ¨, ˇ§Gene IDˇ¨, ˇ§Official Full Nameˇ¨, and promoter sequence. And now user can set the CpG island parameters and clicked the ˇ§CpG Island Searchˇ¨ button to search CpG islands in the promoter sequence. (Fig. 4)
After the ˇ§CpG Island Searchˇ¨ button is clicked (Fig. 4), the system will search CpG islands by the program ˇ§CpG Island Searcherˇ¨. (Fig. 5)
For a result of CpG island search, all CpG islands in the promoter sequence will be output (this case only one CpG island is found). And then user can select which CpG island to analyze methylation RFLP (Restriction Fragment Length Polymorphism). (Fig. 6)
There are three options in every CpGIsland #, they are ˇ§CpG Mappingˇ¨, ˇ§CpG Distributionˇ¨, and ˇ§CpG Island Seqˇ¨, respectively. And the CpG island information is described including start position, end position, %GC, ObsCpG/ExpCpG, and length. (Fig. 6) CpG Mappingˇ¨ shows the CpG island and CpG position in the promoter sequence with color sequence and color bar. Black color font and
ˇ§CpG Distributionˇ¨ shows the CpG island and CpG position in the promoter sequence with color bar.
ˇ§CpG Island Seqˇ¨ shows the CpG island and CpG position in the promoter sequence with color sequence. Black color font represents nucleotide bases, green color font represents CpG island, and pink color font represents CpG position. (Fig. 9)
Here, we selected CpG island 1 to analyze methylation RFLP. Click the ˇ§Methy-RFLP analysisˇ¨ button of CpG island 1, and then wait the process of methylation RFLP analysis. (Fig. 10)
Finally, available RFLP for methylation enzymes of selecting CpG island are provided and shown as Fig. 11 (partly enzymes are shown). Enzymes are classed with commercial and non-commercial, and the recognition sequences contain two types that are IUPAC and Non-IUPAC. At the right top, four options are available for displaying additional information; they are ˇ§Sequenceˇ¨, and ˇ§Visualizationˇ¨, ˇ§CTCFBS Searchˇ¨, ˇ§Bisulfite T Stretchˇ¨, respectively. (Fig. 11) The bisulfite primers had integrated in Methyl-Typing that shown in Fig. 11 "Primer information". The bisulfite sequencing primers (BSP) were designed in Methyl-Typing according to the criteria of Methyl Primer Express R Software v1.0 user manual (www.appliedbiosystems.com/methylprimerexpress). Using the BSP primers for PCR amplification, it is able to be digested with the mined COBRA enzymes to perform COBRA analysis.
Result of clicking the ˇ§sequenceˇ¨ box shows the original orbisulfite-treated/methylated sequenceswith CpG islands and mined restriction enzymes are colored in green and red, respectively. The bisulfite-treated/un-methylated sequence is also provided for comparison . All the recognition sequences are designed in CpG island. (Fig. 12)
A convenient function is employed to find enzyme recognition sequences position. Cclick a recognition sequence of enzyme and all the corresponding enzyme recognition sequences will be marked as red color font and the selected enzyme line will also be marked as shallow yellow color (Fig. 13).
ˇ§Visualizationˇ¨ shows the CpG island and enzyme recognition sequences with color bar. The red color bar represents enzyme recognition sequences, green color bar represents CpG island, and blue color font represents nucleotide bases. Two visualization are shown, one is visualization for all enzymes in CpG island and the other is visualization for a selected enzymes in CpG island. (Fig. 14)
When user selects a recognition sequence, all the corresponding recognition sequences will be marked by red bar and the selected line will also be marked as shallow yellow color (Fig. 15).
ˇ§CTCFBS Searchˇ¨ uses CTCFBSDB (http://insulatordb.utmem.edu/help.php) to search insulators for the input sequence. When user click the ˇ§CTCFBS Searchˇ¨ button (Fig. 16), and a new page will be popped up. The input sequence will be loaded automatically and E-value is defaulted by 10.0. (Fig. 16)
User can clicked the ˇ§CTCFBS Searchˇ¨ button and the result will be output at an original page. (Fig. 17)
ˇ§Bisulfite T Stretchˇ¨ provides a search for repeat ˇĄTˇ¦ bases after bisulfite process. The corresponding sequence will be marked by red color and the default length is 10 bps. And user can adjust by the option where select a value is 9, 8, 7, 6, or 5. (Fig. 18)
Result for clicking the ˇ§BSP (Bisulfite sequencing primer)ˇ¨ box in the followign Fig. 19. The primers F (forward) and R (reverse) are visualized and their sequences were provided. The positions for the primer location and COBRA enzymes were shown as well as the product size for PCR digestion with COBRA enzymes.
2. NCBI nucleotide accession#(s) NCBI nucleotide accession# input is availability in the system for providing various sequences. (Fig. 20)
When the text field of ˇ§Nucleotide Accession#ˇ¨ is input correctly and ˇ§Queryˇ¨ button is clicked (Fig. 20), the system will query nucleotide information via NCBI database. (Fig. 21)
When the query is finished, related information is shown as Fig. 22. This information contains ˇ§LOCUSˇ¨, ˇ§DEFINITIONˇ¨, ˇ§ACCESSIONˇ¨, ˇ§VERSIONˇ¨, and nucleotide sequence. And now user can set the CpG island parameters and clicked the ˇ§CpG Island Searchˇ¨ button to search CpG islands in the nucleotide sequence (Fig. 22). And then the other steps are the same as above description.
The following steps are the same as above description after the ˇ§CpG Island Searchˇ¨ button are clicked.
A fasta format sequence can be type or paste to the sequence input box, and multiple fasta format sequences are available. GpG Island Searcher parameters are defaulted as %GC=55%, ObsCpG/ExpCpG=0.60, and Length=200bp for search the input sequence CpG islands. (Fig. 23)
When the ˇ§Analysisˇ¨ button is clicked (Fig. 23), and then the sequences in input box will be processed and shown as fasta format. Every fasta sequence follows a ˇ§CpG Island Searchˇ¨ button and user can select they desired sequence to search CpG island (Fig. 24). And then the other steps are the same as above description.
The following steps are the same as above description after the ˇ§CpG Island Searchˇ¨ button are clicked.
4. primer pair information for PCR product A primer pair can be type or paste to the ˇ§Forward Primerˇ¨ and ˇ§Reverse Primerˇ¨ to find PCR product. The database and available parameters is reference to UCSC In-Silico PCR (http://genome.ucsc.edu/cgi-bin/hgPcr). GpG Island Searcher parameters are defaulted as %GC=55%, ObsCpG/ExpCpG=0.60, and Length=200bp for search the input sequence CpG islands. (Fig. 25)
When the primer pair is inputted correctly and related parameters is set, click the ˇ§Queryˇ¨ button (Fig. 25), the system will query PCR product information via UCSC In-Silico PCR. (Fig. 26)
And then, the PCR product sequence will output and display as a fasta sequence. (Fig. 27)
The following steps are the same as above description after the ˇ§CpG Island Searchˇ¨ button are clicked.
5. fasta file input A fasta format sequence or multiple fasta format sequences can be saved to a file in local computer. And user can by the ˇ§browseˇKˇ¨ botton to select its absolute path. GpG Island Searcher parameters are defaulted as %GC=55%, ObsCpG/ExpCpG=0.60, and Length=200bp for search the input sequence CpG islands (Fig. 28). When ˇ§Put Inˇ¨ button is clicked, the input file will be processed and shown as fasta format (same as Fig. 24). And then the other steps are the same as above description after the ˇ§CpG Island Searchˇ¨ button is clicked.
The following steps are the same as above description after the ˇ§CpG Island Searchˇ¨ button are clicked.
|