<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Panida L&#039;s Blog</title>
	<atom:link href="http://ppersica.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://ppersica.wordpress.com</link>
	<description>Just another WordPress.com weblog</description>
	<lastBuildDate>Wed, 20 Jan 2010 09:44:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='ppersica.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Panida L&#039;s Blog</title>
		<link>http://ppersica.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://ppersica.wordpress.com/osd.xml" title="Panida L&#039;s Blog" />
	<atom:link rel='hub' href='http://ppersica.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Assignment 6</title>
		<link>http://ppersica.wordpress.com/2010/01/20/assignment-6/</link>
		<comments>http://ppersica.wordpress.com/2010/01/20/assignment-6/#comments</comments>
		<pubDate>Wed, 20 Jan 2010 05:53:17 +0000</pubDate>
		<dc:creator>ppersica</dc:creator>
				<category><![CDATA[Hypercourse on Bioinformatics]]></category>

		<guid isPermaLink="false">http://ppersica.wordpress.com/?p=180</guid>
		<description><![CDATA[Select one of your interesting sequences from the database (sequence should be longer than 300 base pair) to do the BLAST search and answer the following questions: a. What are the different between 6 BLASTs(blastn, blastp, blastx, tblastn, tblastx, PSI-BLAST)? blastn: Search a nucleotide database using a nucleotide query blastp: Search protein database simply compares [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=180&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://ppersica.files.wordpress.com/2010/01/blast9.png"></a>Select one of your interesting sequences from the database (sequence should be longer than 300 base pair) to do the BLAST search and answer the following questions:</p>
<p><strong><em>a</em>.</strong> What are the different between 6 BLASTs(blastn, blastp, blastx, tblastn, tblastx, PSI-BLAST)?</p>
<p>blastn: Search a <strong>nucleotide</strong> database using a <strong>nucleotide</strong> query</p>
<p>blastp: Search <strong>protein</strong> database simply compares a protein query to a protein database using a <strong>protein</strong> query</p>
<p>blastx: Search <strong>protein</strong> database using a <strong>translated nucleotide</strong> query</p>
<p>tblastn: Search <strong>translated nucleotide</strong> database using a <strong>protein</strong> query</p>
<p>tblastx: Search <strong>translated nucleotide</strong> database using a <strong>translated nucleotide</strong> query</p>
<p>PSI-BLAST (protein-specific iterated BLAST): Search <strong>protein</strong> database using a <strong>protein</strong> query, allowing the user to build a PSSM (position-specific scoring matrix) using the results of the first BlastP run.) <br />
<strong><em>b</em></strong>. Use your sequence to do 3 out of 6 BLASTs and discuss &#8220;What’s the strength and weakness of BLAST you have selected?&#8221;</p>
<p>Human hexose-6-phosphate dehydrogenase is chosen. Retrieve the nucleotide sequence from GenBank (<a href="http://www.ncbi.nlm.nih.gov/">http://www.ncbi.nlm.nih.gov/</a>).</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast1.png"><img class="alignnone size-medium wp-image-185" title="blast1" src="http://ppersica.files.wordpress.com/2010/01/blast1.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>Search nucleotide for  Homo sapiens hexose-6-phosphate dehydrogenase (glucose 1-dehydrogenase) (H6PD). NCBI accession number NM_004285.2</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast2.png"><img class="alignnone size-medium wp-image-184" title="blast2" src="http://ppersica.files.wordpress.com/2010/01/blast2.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>Click on &#8220;FASTA&#8221; and save the sequence as a text file. Then, blast the nucleotide sequence with BLAST program available on <a href="http://blast.ncbi.nlm.nih.gov/Blast.cgi">http://blast.ncbi.nlm.nih.gov/Blast.cgi</a>.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast3.png"><img class="alignnone size-medium wp-image-186" title="blast3" src="http://ppersica.files.wordpress.com/2010/01/blast3.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>The BLAST programs chosen are blastn, blastx, and tblastx.</p>
<p>Click on &#8220;<strong>blastn</strong>&#8221; from this page. Paste the nucleotide sequence in FASTA format.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast4.png"><img class="alignnone size-medium wp-image-189" title="blast4" src="http://ppersica.files.wordpress.com/2010/01/blast4.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>Under &#8220;Choose Search Set&#8221;, choose &#8220;others&#8221; checkbox to choose nucleotide database including every organism.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast5.png"><img class="alignnone size-medium wp-image-188" title="blast5" src="http://ppersica.files.wordpress.com/2010/01/blast5.png?w=300&#038;h=208" alt="" width="300" height="208" /></a></p>
<p>Under &#8220;Program Selection&#8221; section, choose &#8220;somewhat similar sequence (blastn)&#8221; checkbox. Then, click on &#8220;BLAST&#8221; button. The result page will show up as following:</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast6.png"><img class="alignnone size-medium wp-image-192" title="blast6" src="http://ppersica.files.wordpress.com/2010/01/blast6.png?w=300&#038;h=216" alt="" width="300" height="216" /></a> </p>
<p>The BLAST result will be shown</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast7.png"><img class="alignnone size-medium wp-image-191" title="blast7" src="http://ppersica.files.wordpress.com/2010/01/blast7.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p><strong>blastx</strong> is done in a similar way to blastn.</p>
<p>Paste the nucleotide sequence</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast9.png"><img class="alignnone size-medium wp-image-198" title="blast9" src="http://ppersica.files.wordpress.com/2010/01/blast9.png?w=300&#038;h=217" alt="" width="300" height="217" /></a></p>
<p>Choose database as &#8220;nr&#8221;</p>
<p>Then, click on &#8220;BLAST&#8221; button.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast10.png"><img class="alignnone size-medium wp-image-199" title="blast10" src="http://ppersica.files.wordpress.com/2010/01/blast10.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>The result is shown</p>
<p> <a href="http://ppersica.files.wordpress.com/2010/01/blast11.png"><img class="alignnone size-medium wp-image-200" title="blast11" src="http://ppersica.files.wordpress.com/2010/01/blast11.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>tblastx: paste the nucleotide sequence and choose database as &#8220;nr/nt&#8221; </p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast121.png"><img class="alignnone size-medium wp-image-206" title="blast12" src="http://ppersica.files.wordpress.com/2010/01/blast121.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>The error occurs. The program cannot operated within the time allowed because the search is too large. No result is received.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/blast13.png"><img class="alignnone size-medium wp-image-205" title="blast13" src="http://ppersica.files.wordpress.com/2010/01/blast13.png?w=300&#038;h=208" alt="" width="300" height="208" /></a></p>
<p>The strength and weakness of the BLASTs chosen are:</p>
<p>blastn: it searches nucleotide query in nucleotide database. Consequently, it does not require much of time to operate since it aligns nucleotide query to nucleotide database. The nucleotide of query and result must be exact to be scored. Consequently, it is rather specific but if there is a polymorphism of nucleotide(s), that position is not scored. As a result, the total score is less than it should be since that position might not be significantly different as they give the same amino acid.   </p>
<p>blastx: it seaches translated nucleotide in protein database. It takes sometimes to process the translation of the nucleotide query in all reading frame. However, the same amino acid may result from different codons. Translatinging nucleotide sequence into amino acid sequence is probably increasing the chance to identify a protein that their nucleotide sequences may differ due to genetic variation of codons. Moreover, the reading frame of translation might not be corrected as all reading frame are employed. It can be distinguish whether which reading frame is corresponded to the real reading frame of that gene.</p>
<p>tblastx: it searches translated nucleotide query in translated nucleotide database. Hence, it takes a plenty of time to process as well as much of CPU usage. This program essentially increases the chance of finding possible result as all reading frame of translated nucleotide in database and the nucleotide query are aligned. Incorrect reading frame may result but it provides all the possibility of the result that could be. </p>
<p><strong><em>c.</em></strong> Show us the first hit on each BLAST with their identity or/and similarity scores.</p>
<p>blastn: NM_004285.3 Homo sapiens hexose-6-phosphate dehydrogenase, E-value 0.0, Maximum identity 100%</p>
<p>blastx: NP_004276.2 hexose-6-phosphate dehydrogenase precursor, E-value 0.0</p>
<p>tblastx: no result is obtained.<br />
<strong><em>d.</em></strong> Summarize the result from 3 BLASTs you select.</p>
<p>blastn and blastx gave out the same result, which is hexose-6-phosphate dehydrogenase of human, with E-value = 0.0. Zero E-value means that the sequence of query is identical to that of the result, giving its reliability. tblastx could not operate the request as it requires too much CPU usage to translate a long nucleotide sequence and locally aligns them to the translated nucleotide database. blastn could be a potential tool since it is fast and accurate.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ppersica.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ppersica.wordpress.com/180/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ppersica.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ppersica.wordpress.com/180/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ppersica.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ppersica.wordpress.com/180/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ppersica.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ppersica.wordpress.com/180/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ppersica.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ppersica.wordpress.com/180/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ppersica.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ppersica.wordpress.com/180/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ppersica.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ppersica.wordpress.com/180/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=180&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ppersica.wordpress.com/2010/01/20/assignment-6/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b57a607c18872a75205d06e880a553af?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppersica</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast1.png?w=300" medium="image">
			<media:title type="html">blast1</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast2.png?w=300" medium="image">
			<media:title type="html">blast2</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast3.png?w=300" medium="image">
			<media:title type="html">blast3</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast4.png?w=300" medium="image">
			<media:title type="html">blast4</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast5.png?w=300" medium="image">
			<media:title type="html">blast5</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast6.png?w=300" medium="image">
			<media:title type="html">blast6</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast7.png?w=300" medium="image">
			<media:title type="html">blast7</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast9.png?w=300" medium="image">
			<media:title type="html">blast9</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast10.png?w=300" medium="image">
			<media:title type="html">blast10</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast11.png?w=300" medium="image">
			<media:title type="html">blast11</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast121.png?w=300" medium="image">
			<media:title type="html">blast12</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/blast13.png?w=300" medium="image">
			<media:title type="html">blast13</media:title>
		</media:content>
	</item>
		<item>
		<title>Assignment 5</title>
		<link>http://ppersica.wordpress.com/2010/01/15/assignment-5/</link>
		<comments>http://ppersica.wordpress.com/2010/01/15/assignment-5/#comments</comments>
		<pubDate>Fri, 15 Jan 2010 10:42:37 +0000</pubDate>
		<dc:creator>ppersica</dc:creator>
				<category><![CDATA[Hypercourse on Bioinformatics]]></category>

		<guid isPermaLink="false">http://ppersica.wordpress.com/?p=144</guid>
		<description><![CDATA[Please use the bioinformatics tools to design these following items; 1. The real-time PCR primer and probe set(s) which can be used to distinguish between 2009 Swine-Origin Influenza A (H1N1)from other influenza subtypes. Please also describe what are gene(s)/region(s) that you choose? And give us the reason why? To distinguish 2009 swine-originated influenza A (H1N1) [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=144&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://ppersica.files.wordpress.com/2010/01/pri19.png"></a>Please use the bioinformatics tools to design these following items;</p>
<p>1. The real-time PCR primer and probe set(s) which can be used to distinguish between 2009 Swine-Origin Influenza A (H1N1)from other influenza subtypes.<br />
Please also describe what are gene(s)/region(s) that you choose? And give us the reason why?</p>
<p>To distinguish 2009 swine-originated influenza A (H1N1) from other subtypes, real time PCR is a promising approach if a specific region is chosen. The virus is characterised by haemagglutinin 1 and neuraminidase 1 that present on the envelope of the virus. The region(s) of 2009 swine-originated that differs from other subtypes is determined by means of alignment in order to separate it from other subtypes.</p>
<p>Retrieve the hemagglutinin (HA) mRNA and amino acid sequence of swine influenza from GenBank (<a href="http://www.ncbi.nlm.nih.gov">http://www.ncbi.nlm.nih.gov</a>). Search nucleotide for H1N1 AND HA. Amino acid sequence is included after &#8220;translation&#8221; heading. The nucleotide sequences of HA gene are available from many countries. HA nucleotide sequences of H1N1 influenza that is not a swine-orginated are retrieved to compare with HA of swine-originated influenza.  </p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri2.png"><img class="alignnone size-medium wp-image-166" title="pri2" src="http://ppersica.files.wordpress.com/2010/01/pri2.png?w=300&#038;h=217" alt="" width="300" height="217" /></a></p>
<p>GenBank accession number of HA:<br />
NWS: U08903.1<br />
Alberta: U47310.1<br />
Ws: U08904.1<br />
Swine influenza from Rio de Janeiro: CY054281.1<br />
Swine influenza from Nebraska: S67220.1</p>
<p>Amino acid alignment of these 5 strains of H1N1 is done on ClustalW program (<a href="http://www.ebi.ac.uk/clustalw">http://www.ebi.ac.uk/clustalw</a>). Paste the sequences in FASTA format and run the program.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri15.png"><img class="alignnone size-medium wp-image-168" title="pri15" src="http://ppersica.files.wordpress.com/2010/01/pri15.png?w=300&#038;h=210" alt="" width="300" height="210" /></a></p>
<p>Asterisk represents the amino acid that is found in all strains. The region with asterisk will be ignored, while the region of amino acid sequences that the 2 swine influenza are similar but different from the rest are considered.  This region will be subsequently used for identification of swine-originated influenza. The nucleotide alignment is also done to find the chosen region from amino acid alignment, which is the position 201-254, that corresponds to nucleotide sequence.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri16.png"><img class="alignnone size-medium wp-image-169" title="pri16" src="http://ppersica.files.wordpress.com/2010/01/pri16.png?w=300&#038;h=209" alt="" width="300" height="209" /></a></p>
<p> The chosen region of nucleotide is 623-761. This region is used for real time PCR primer and probe design in Primer3 program (<a href="http://frodo.wi.mit.edu/primer3">http://frodo.wi.mit.edu/primer3</a>). Paste the sequence of this region onto the program. The checkbox of Pick left primer, right primer and hybidization probe are chosen to design forward and reverse primers and the single probe.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri17.png"><img class="alignnone size-medium wp-image-170" title="pri17" src="http://ppersica.files.wordpress.com/2010/01/pri17.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p> The parameters of primers are set as following:</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri17.png"><img class="alignnone size-medium wp-image-170" title="pri17" src="http://ppersica.files.wordpress.com/2010/01/pri17.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>Product size ranges: 50-90, 80-120 bp</p>
<p>Primer size: min 19 bp, opt 20 bp, max 23 bp</p>
<p>Primer Tm: min 60 oC, opt 64 oC, 68 oC</p>
<p>Primer CG%: min 35, max 65</p>
<p>The parameters of probe (Hyb Oligo) is as following:</p>
<p>Hyb Oligo excluded region: 47,4 98,8 (these regions will not be considered for the probe since they are conserved region in all strains)</p>
<p>Hyb Oligo size: min 20 bp, opt 23 bp, max 26 bp</p>
<p>Hyb Oligo Tm: min 68 oC, opt 70 oC, max 70 oC</p>
<p>Hyb Oligo GC%: min 20, opt 60, max 80</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri18.png"><img class="alignnone size-medium wp-image-171" title="pri18" src="http://ppersica.files.wordpress.com/2010/01/pri18.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>Then, click on &#8220;Pick primers&#8221;</p>
<p>Forward primer: TCAACAAGCTCTCTACCAGAACG, Tm 60.95 oC, %GC 47.83 <br />
Reverse primer: TCGTTGCTATTTCTGGCTTGAAC, Tm 62.75 oC, %GC 43.48 </p>
<p>Probe: TGCCTATGTTTTTGTGGGGTCATCA, Tm 68.29 oC, %GC 44.00</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri19.png"><img class="alignnone size-medium wp-image-167" title="pri19" src="http://ppersica.files.wordpress.com/2010/01/pri19.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>The start position is 651 and ends at position 762 (corresponding to the position of the HA gene). The probe binds from position 678-703. The product size is 89 bp.<br />
 2. The conventional PCR and sequencing primer set which can be used to identify oseltamivir resistance associated NA gene mutations: N1: H274Y</p>
<p><strong>Sequencing of NA gene to identify mutation that leads to oseltamivir resistance in H1N1 virus</strong></p>
<p>The nucleotide sequence of neuraminidase is retrieved from GenBank. The accession number of swine-originated influenza neuraminidase gene is GU371257.1, while that of oseltamivir resistance is GU371269.1. These sequences are aligned in order to determined the mutate region by using ClustalW program (<a href="http://www.ebi.ac.uk/Tools/clustalw2/index.html">http://www.ebi.ac.uk/Tools/clustalw2/index.html</a>).</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri6.png"><img class="alignnone size-medium wp-image-148" title="pri6" src="http://ppersica.files.wordpress.com/2010/01/pri6.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>The nucleotide sequences are given in FASTA format. Click on &#8220;Run&#8221;. The alignment will appear.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri7.png"><img class="alignnone size-medium wp-image-149" title="pri7" src="http://ppersica.files.wordpress.com/2010/01/pri7.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>Neuraminidase 1 protein has a nucleotide transversion of cytosine at position 827 in segment 6 to thyrimidine that leads to amino acid substitution of histidine to tyrosine. To identify oseltamivir resistance, this position should be determined.</p>
<p>Sequencing of this mutation can be achieved by using primers covering around this region. Nucleotide sequence of wild-type NA gene is applied to Primer3 program (<a href="http://frodo.wi.mit.edu/primer3">http://frodo.wi.mit.edu/primer3</a>).</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri9.png"><img class="alignnone size-medium wp-image-152" title="pri9" src="http://ppersica.files.wordpress.com/2010/01/pri9.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>On checkbox, choose &#8220;Pick left primer&#8221; and &#8220;Pick right primer&#8221;. The parameters are set as following:</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri10.png"><img class="alignnone size-medium wp-image-154" title="pri10" src="http://ppersica.files.wordpress.com/2010/01/pri10.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>- Targets (to indicate the nucleotides that we want to include in the product, in this case the mutation is at position 827): 825,5 (starts to include at position 825 for 5 bases)</p>
<p>- Product size ranges: 150-250, 100-300 bp</p>
<p>- General primer picking conditions:<br />
 Primer size: min 20 bp, opt 23, max 25<br />
 Primer Tm: min 55 oC, opt 60 oC, max 75 oC<br />
 Primer GC%: min 40, opt 50, max 60</p>
<p>Then, click on &#8220;Pick primers&#8221;. The result will show up</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri11.png"><img class="alignnone size-medium wp-image-155" title="pri11" src="http://ppersica.files.wordpress.com/2010/01/pri11.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>Sequencing primers are obtain as following:</p>
<p>The forward primer: CAGGCCTCATACAAGATCTTCAG, Tm 60.27 oC, %GC   47.83 <br />
The reverse primer: CCAGATTCTGATTGAAAGACACC, Tm 59.99 oC, %GC   43.48</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri12.png"><img class="alignnone size-medium wp-image-153" title="pri12" src="http://ppersica.files.wordpress.com/2010/01/pri12.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p> The alignment of primers and the sequence show that the start position is 753 and end is 936. The product size is 184 bp. Asterisks show the included nucleotide that the mutate nucleotide is at position 827 or position 74 in the sequencing.</p>
<p><strong>Conventional PCR of NA 1 gene</strong></p>
<p>The parameters of conventional PCR primers are different from sequencing primers.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri13.png"><img class="alignnone size-medium wp-image-163" title="pri13" src="http://ppersica.files.wordpress.com/2010/01/pri13.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>- Product size ranges: 250-300 300-400 400-500 bp</p>
<p>- General primer picking conditions:<br />
 Primer size: min 18 bp, opt 20 bp, max 22  bp<br />
 Primer Tm: min 52 oC, opt 55 oC, max 58 oC<br />
 Primer GC%: min 40, opt 50, max 60</p>
<p>Forward primer: GCTTTACTGTAATGACCGATG, 21mers, Tm 55.02 oC, %GC 42.86</p>
<p>Reverse primer: TGCCTGTCTTATCATTAGGG, 20 mers, Tm 55.35 oC, %GC   45.00 </p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pri14.png"><img class="alignnone size-medium wp-image-162" title="pri14" src="http://ppersica.files.wordpress.com/2010/01/pri14.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>The product starts from position 718 - 1005 of NA gene. Product size: 288 bp</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ppersica.wordpress.com/144/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ppersica.wordpress.com/144/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ppersica.wordpress.com/144/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ppersica.wordpress.com/144/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ppersica.wordpress.com/144/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ppersica.wordpress.com/144/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ppersica.wordpress.com/144/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ppersica.wordpress.com/144/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ppersica.wordpress.com/144/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ppersica.wordpress.com/144/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ppersica.wordpress.com/144/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ppersica.wordpress.com/144/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ppersica.wordpress.com/144/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ppersica.wordpress.com/144/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=144&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ppersica.wordpress.com/2010/01/15/assignment-5/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b57a607c18872a75205d06e880a553af?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppersica</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri2.png?w=300" medium="image">
			<media:title type="html">pri2</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri15.png?w=300" medium="image">
			<media:title type="html">pri15</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri16.png?w=300" medium="image">
			<media:title type="html">pri16</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri17.png?w=300" medium="image">
			<media:title type="html">pri17</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri17.png?w=300" medium="image">
			<media:title type="html">pri17</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri18.png?w=300" medium="image">
			<media:title type="html">pri18</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri19.png?w=300" medium="image">
			<media:title type="html">pri19</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri6.png?w=300" medium="image">
			<media:title type="html">pri6</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri7.png?w=300" medium="image">
			<media:title type="html">pri7</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri9.png?w=300" medium="image">
			<media:title type="html">pri9</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri10.png?w=300" medium="image">
			<media:title type="html">pri10</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri11.png?w=300" medium="image">
			<media:title type="html">pri11</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri12.png?w=300" medium="image">
			<media:title type="html">pri12</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri13.png?w=300" medium="image">
			<media:title type="html">pri13</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pri14.png?w=300" medium="image">
			<media:title type="html">pri14</media:title>
		</media:content>
	</item>
		<item>
		<title>Assignment 4</title>
		<link>http://ppersica.wordpress.com/2010/01/05/assignment-4/</link>
		<comments>http://ppersica.wordpress.com/2010/01/05/assignment-4/#comments</comments>
		<pubDate>Tue, 05 Jan 2010 07:05:26 +0000</pubDate>
		<dc:creator>ppersica</dc:creator>
				<category><![CDATA[Hypercourse on Bioinformatics]]></category>

		<guid isPermaLink="false">http://ppersica.wordpress.com/?p=119</guid>
		<description><![CDATA[Structural bioinformatics Function of a protein can be predicted based on its 3D structure as a particular domain may serve a particular function. Therefore, many structures of proteins have been solved by means of X-ray crystallography or NMR. Still, it has not been practicable to obtain structure of every protein through these approaches. Consequenlty, 3D structure [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=119&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Structural bioinformatics</p>
<p>Function of a protein can be predicted based on its 3D structure as a particular domain may serve a particular function. Therefore, many structures of proteins have been solved by means of X-ray crystallography or NMR. Still, it has not been practicable to obtain structure of every protein through these approaches. Consequenlty, 3D structure of a protein can be modelled based on a protein with similar amino acid sequence in which the 3D structure is available.  </p>
<p>1. BLAST the nucleotide sequence with &#8220;blastx&#8221; program that is available in <a href="http://blast.ncbi.nlm.nih.gov/Blast.cgi">http://blast.ncbi.nlm.nih.gov/Blast.cgi</a></p>
<p>This BLAST program will translate the nucleotide query to amino acid sequences in all reading frames and search in protein database.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pro1.png"><img class="alignnone size-medium wp-image-122" title="pro1" src="http://ppersica.files.wordpress.com/2010/01/pro1.png?w=300&#038;h=210" alt="" width="300" height="210" /></a></p>
<p>Paste the nucleotide sequence in FASTA format. Then, choose database as &#8220;Non-redundant protein sequences (nr)&#8221;.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pro2.png"><img class="alignnone size-medium wp-image-123" title="pro2" src="http://ppersica.files.wordpress.com/2010/01/pro2.png?w=300&#038;h=208" alt="" width="300" height="208" /></a></p>
<p>Then, Click on &#8220;BLAST&#8221; button on the left. The program may take a few minutes to finish. The result will be shown as following: </p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pro4.png"><img class="alignnone size-medium wp-image-125" title="pro4" src="http://ppersica.files.wordpress.com/2010/01/pro4.png?w=300&#038;h=208" alt="" width="300" height="208" /></a></p>
<p>2. Identify the unknown nucleotide sequence </p>
<p>Scroll down to the description box. The best matching protein to the query will be on the top of the list based on the lowest E-value.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pro5.png"><img class="alignnone size-medium wp-image-126" title="pro5" src="http://ppersica.files.wordpress.com/2010/01/pro5.png?w=300&#038;h=208" alt="" width="300" height="208" /></a></p>
<p>The unknown nucleotide sequence is corresponded to BRC1 protein after it had been translated into amino acid sequence.</p>
<p>3. 3D structure of BRC1</p>
<p>Click on the accession number of the top-list protein. The information page of this protein will show up. Now we know what kind of protein this nucleotide sequence is. Then, we can look for the structure of this protein by scrolling down the information page and go to &#8220;LinkOut&#8221; under the &#8220;All links for this record&#8221; box on the right.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pro6.png"><img class="alignnone size-medium wp-image-127" title="pro6" src="http://ppersica.files.wordpress.com/2010/01/pro6.png?w=300&#038;h=209" alt="" width="300" height="209" /></a></p>
<p>The following page will appear. Click on &#8220;MODBASE&#8221; to go to the comparative modeling of this protein.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pro7.png"><img class="alignnone size-medium wp-image-128" title="pro7" src="http://ppersica.files.wordpress.com/2010/01/pro7.png?w=300&#038;h=209" alt="" width="300" height="209" /></a></p>
<p>The 3D structure of BRC1 protein is generated based on comparative modeling.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pro8.png"><img class="alignnone size-medium wp-image-129" title="pro8" src="http://ppersica.files.wordpress.com/2010/01/pro8.png?w=300&#038;h=210" alt="" width="300" height="210" /></a></p>
<p>The template is a crystal structure of BRCA1/BARD1 ring heterodimer in which the structure can be assessed through PDB code 1JM7A on RCSB database.</p>
<p><a href="http://ppersica.files.wordpress.com/2010/01/pro10.png"><img class="alignnone size-medium wp-image-121" title="pro10" src="http://ppersica.files.wordpress.com/2010/01/pro10.png?w=300&#038;h=208" alt="" width="300" height="208" /></a></p>
<p>The 3D structure of BRC1 protein allows us to understand its function regarding to its template, BRCA1, that consists of domains in the structure such as the DNA binding domain. Simulation of the structure can be further investigated in silico to observe how it interacts with other compounds, etc.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ppersica.wordpress.com/119/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ppersica.wordpress.com/119/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ppersica.wordpress.com/119/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ppersica.wordpress.com/119/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ppersica.wordpress.com/119/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ppersica.wordpress.com/119/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ppersica.wordpress.com/119/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ppersica.wordpress.com/119/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ppersica.wordpress.com/119/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ppersica.wordpress.com/119/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ppersica.wordpress.com/119/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ppersica.wordpress.com/119/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ppersica.wordpress.com/119/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ppersica.wordpress.com/119/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=119&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ppersica.wordpress.com/2010/01/05/assignment-4/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b57a607c18872a75205d06e880a553af?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppersica</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pro1.png?w=300" medium="image">
			<media:title type="html">pro1</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pro2.png?w=300" medium="image">
			<media:title type="html">pro2</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pro4.png?w=300" medium="image">
			<media:title type="html">pro4</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pro5.png?w=300" medium="image">
			<media:title type="html">pro5</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pro6.png?w=300" medium="image">
			<media:title type="html">pro6</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pro7.png?w=300" medium="image">
			<media:title type="html">pro7</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pro8.png?w=300" medium="image">
			<media:title type="html">pro8</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2010/01/pro10.png?w=300" medium="image">
			<media:title type="html">pro10</media:title>
		</media:content>
	</item>
		<item>
		<title>Assignment-3</title>
		<link>http://ppersica.wordpress.com/2009/12/22/assignment-3/</link>
		<comments>http://ppersica.wordpress.com/2009/12/22/assignment-3/#comments</comments>
		<pubDate>Tue, 22 Dec 2009 15:45:33 +0000</pubDate>
		<dc:creator>ppersica</dc:creator>
				<category><![CDATA[Hypercourse on Bioinformatics]]></category>

		<guid isPermaLink="false">http://ppersica.wordpress.com/?p=81</guid>
		<description><![CDATA[Phylogenetic tree construction by using BioEdit program Common ancestors of organisms can be investigated by means of phylogenetic analysis. Mitochondrial cytochrome b nucleotide sequence showing below is found to be of a dinosaur that lived 80 million years ago: cccttctattattcattctcattctattcgttattcttgtactccacacatccaaacaac aaagcataatattccacccattgagtccattcctatcctgattcttagtccccgaacctt ttacactcacatg Phylogenetic tree of the dinosaur cytochrome b is constructed to find which organism(s) [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=81&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Phylogenetic tree construction by using BioEdit program</p>
<p>Common ancestors of organisms can be investigated by means of phylogenetic analysis. Mitochondrial cytochrome b nucleotide sequence showing below is found to be of a dinosaur that lived 80 million years ago:</p>
<p>cccttctattattcattctcattctattcgttattcttgtactccacacatccaaacaac aaagcataatattccacccattgagtccattcctatcctgattcttagtccccgaacctt<br />
ttacactcacatg</p>
<p>Phylogenetic tree of the dinosaur cytochrome b is constructed to find which organism(s) that share common ancestors with it.</p>
<p>1. Retrieve cytochrome b nucleotide sequences from Entrez Nucleotide database <a href="http://www.ncbi.nlm.nih.gov/sites/entrez">http://www.ncbi.nlm.nih.gov/sites/entrez</a>. For each organism, choose nucleotide and add the organism scientific name along with &#8220;AND cytochrome b&#8221;&#8211;e.g. Homo sapiens AND cytochrome b. Then, the result page will show up and click on RefSeq tab as following:</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo.png"><img class="alignnone size-medium wp-image-83" title="phylo" src="http://ppersica.files.wordpress.com/2009/12/phylo.png?w=300&#038;h=169" alt="" width="300" height="169" /></a></p>
<p>Click on CYTB, which is an abbrevation of cytochrome b, under search in Gene section. The following page will appear.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo1.png"><img class="alignnone size-medium wp-image-84" title="phylo1" src="http://ppersica.files.wordpress.com/2009/12/phylo1.png?w=300&#038;h=203" alt="" width="300" height="203" /></a></p>
<p>Scroll down to &#8220;Genomic regions, transcripts, and products&#8221;. Click on the NCBI reference sequence number, NC_012920.1, and choose &#8220;FASTA&#8221; from Nucleotide link. The nucleotide sequence in FASTA format will appear and save it as text file. The organisms used in phylogenetic tree construction are shown with their NCBI accession number:</p>
<p>Human: Homo sapiens, NC_012920.1<br />
Dog: Canis lupus, NC_002008.1<br />
Rabbit: Oryctolagus cuniculus, NC_001913.1<br />
Rhinoceros: Rhinoceros unicornis, NC_001779.1<br />
Dugong: Dugong dugon, NC_003314.1<br />
Mouse: Mus musculus, NC_010339.1<br />
Whale: Balaenoptera edeni, NC_007938.1<br />
Bovine: Bos taurus, NC_006853.1<br />
Sicklebill: Epimachus fastuosus, GQ334244.1<br />
Chicken: Gallus gallus, NC_001323.1<br />
Magpie: Pica hudsonia, AY030114.1<br />
Frog: Rana plancyi, NC_009264.1</p>
<p>The nucleotide search step can be shortened by using &#8220;CYTB&#8221; as a keyword instead of the full name &#8220;cytochrome b&#8221;, which will give several results as it is not a specific name&#8211;e.i. there can be cytochrome b reductase, etc. Then, go to RefSeq tab and the target sequence will be listed.</p>
<p>All of the nucleotide sequences of these organisms in FASTA format are ordered in the same text file as shown below. Nucleotide sequence of dinosaur cytochrome b is also added in this text file.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo15.png"><img class="alignnone size-medium wp-image-111" title="phylo15" src="http://ppersica.files.wordpress.com/2009/12/phylo15.png?w=300&#038;h=209" alt="" width="300" height="209" /></a></p>
<p>2. Open BioEdit, which can be downloaded from <a href="http://www.mbio.ncsu.edu/BioEdit/bioedit.html">http://www.mbio.ncsu.edu/BioEdit/bioedit.html</a>. Start alignment by going to the menu bar and choose &#8221;File &gt; New Alignment&#8221;</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo3.png"><img class="alignnone size-medium wp-image-85" title="phylo3" src="http://ppersica.files.wordpress.com/2009/12/phylo3.png?w=300&#038;h=210" alt="" width="300" height="210" /></a></p>
<p>A new alignment page will appear.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo4.png"><img class="alignnone size-medium wp-image-86" title="phylo4" src="http://ppersica.files.wordpress.com/2009/12/phylo4.png?w=300&#038;h=208" alt="" width="300" height="208" /></a></p>
<p>3. Import nucleotide sequences in FASTA format by &#8220;File&gt;Import&gt;Sequence alignment file&#8221; on the menu bar.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo5.png"><img class="alignnone size-medium wp-image-96" title="phylo5" src="http://ppersica.files.wordpress.com/2009/12/phylo5.png?w=300&#038;h=218" alt="" width="300" height="218" /></a></p>
<p>The nucleotide sequences will appear on the right panel and the names of the sequence will appear on the left panel.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo6.png"><img class="alignnone size-medium wp-image-97" title="phylo6" src="http://ppersica.files.wordpress.com/2009/12/phylo6.png?w=300&#038;h=217" alt="" width="300" height="217" /></a></p>
<p>4. Align the sequences by highlighting all the names of the sequences. Then, choose &#8220;Accessory Application &gt; ClustalW Multiple Alignment&#8221;</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo7.png"><img class="alignnone size-medium wp-image-98" title="phylo7" src="http://ppersica.files.wordpress.com/2009/12/phylo7.png?w=300&#038;h=206" alt="" width="300" height="206" /></a></p>
<p>A popup window will show up and click on &#8220;RunClustalW&#8221; button.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo8.png"><img class="alignnone size-medium wp-image-99" title="phylo8" src="http://ppersica.files.wordpress.com/2009/12/phylo8.png?w=300&#038;h=210" alt="" width="300" height="210" /></a></p>
<p>The program will run in DOS window.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo9.png"><img class="alignnone size-medium wp-image-100" title="phylo9" src="http://ppersica.files.wordpress.com/2009/12/phylo9.png?w=300&#038;h=219" alt="" width="300" height="219" /></a></p>
<p>The alignment will be shown in a new window.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo10.png"><img class="alignnone size-medium wp-image-101" title="phylo10" src="http://ppersica.files.wordpress.com/2009/12/phylo10.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p>5. Calculate distance among these nucleotide sequences by selecting all sequences. Then, choose &#8220;Accessory Application&gt;DNAmlk DNA Maximum Likelihood program with molecular clock&#8221;</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo11.png"><img class="alignnone size-medium wp-image-102" title="phylo11" src="http://ppersica.files.wordpress.com/2009/12/phylo11.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>A small window of the program will appear. Click on &#8220;Run Application&#8221;.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo12.png"><img class="alignnone size-medium wp-image-103" title="phylo12" src="http://ppersica.files.wordpress.com/2009/12/phylo12.png?w=300&#038;h=209" alt="" width="300" height="209" /></a></p>
<p>The distance among these organisms will appear</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo13.png"><img class="alignnone size-medium wp-image-104" title="phylo13" src="http://ppersica.files.wordpress.com/2009/12/phylo13.png?w=300&#038;h=209" alt="" width="300" height="209" /></a></p>
<p>Save it as text file. The tree is shown as following</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo14.png"><img class="alignnone size-medium wp-image-95" title="phylo14" src="http://ppersica.files.wordpress.com/2009/12/phylo14.png?w=300&#038;h=209" alt="" width="300" height="209" /></a></p>
<p>6. View the phylogram by using TreeViewX, which can be downloaded from <a href="http://darwin.zoology.gla.ac.uk/~rpage/treeviewx/download.html">http://darwin.zoology.gla.ac.uk/~rpage/treeviewx/download.html</a></p>
<p>The program will display the graphic tree, and it can be set to be view as slanted cladogram, rectangular cladogram, or phylogram.</p>
<p>Open the program, go to &#8220;File&gt;New&#8230;&#8221; on the menu bar.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo16.png"><img class="alignnone size-medium wp-image-113" title="phylo16" src="http://ppersica.files.wordpress.com/2009/12/phylo16.png?w=300&#038;h=218" alt="" width="300" height="218" /></a></p>
<p>Open the text file of the tree from BioEdit by &#8220;File&gt;Open&#8230;&#8221; and the graphic view of the tree will appear as following. The type of tree view can be chosen on &#8220;Trees&#8221; on the menu bar.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/12/phylo17.png"><img class="alignnone size-medium wp-image-114" title="phylo17" src="http://ppersica.files.wordpress.com/2009/12/phylo17.png?w=300&#038;h=217" alt="" width="300" height="217" /></a></p>
<p>From this phylogram, there are 2 major groups of animals as there are 2 clades beginning on the left. Magpie and Sicklebill are closely related since both of them are passerines, while Chicken is categorized in this group as all of them are birds (Class Aves). They share ancestor with Frog, which is in another class of amphibian. The length of the clade line represents how closed they are, and Frog is very far from these birds. On the next group, Human and dinosaur are categorized in the same group even though the distance between them is rather far. Based on the phylogram, the dinosaur is closely related to human. Whale is closed to bovine and rhinoceros. Dog and mouse are closely related.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ppersica.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ppersica.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ppersica.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ppersica.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ppersica.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ppersica.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ppersica.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ppersica.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ppersica.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ppersica.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ppersica.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ppersica.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ppersica.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ppersica.wordpress.com/81/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=81&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ppersica.wordpress.com/2009/12/22/assignment-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b57a607c18872a75205d06e880a553af?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppersica</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo.png?w=300" medium="image">
			<media:title type="html">phylo</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo1.png?w=300" medium="image">
			<media:title type="html">phylo1</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo15.png?w=300" medium="image">
			<media:title type="html">phylo15</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo3.png?w=300" medium="image">
			<media:title type="html">phylo3</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo4.png?w=300" medium="image">
			<media:title type="html">phylo4</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo5.png?w=300" medium="image">
			<media:title type="html">phylo5</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo6.png?w=300" medium="image">
			<media:title type="html">phylo6</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo7.png?w=300" medium="image">
			<media:title type="html">phylo7</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo8.png?w=300" medium="image">
			<media:title type="html">phylo8</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo9.png?w=300" medium="image">
			<media:title type="html">phylo9</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo10.png?w=300" medium="image">
			<media:title type="html">phylo10</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo11.png?w=300" medium="image">
			<media:title type="html">phylo11</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo12.png?w=300" medium="image">
			<media:title type="html">phylo12</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo13.png?w=300" medium="image">
			<media:title type="html">phylo13</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo14.png?w=300" medium="image">
			<media:title type="html">phylo14</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo16.png?w=300" medium="image">
			<media:title type="html">phylo16</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/12/phylo17.png?w=300" medium="image">
			<media:title type="html">phylo17</media:title>
		</media:content>
	</item>
		<item>
		<title>Assignment 2</title>
		<link>http://ppersica.wordpress.com/2009/11/26/assignment-2/</link>
		<comments>http://ppersica.wordpress.com/2009/11/26/assignment-2/#comments</comments>
		<pubDate>Thu, 26 Nov 2009 03:49:28 +0000</pubDate>
		<dc:creator>ppersica</dc:creator>
				<category><![CDATA[Hypercourse on Bioinformatics]]></category>

		<guid isPermaLink="false">http://ppersica.wordpress.com/?p=40</guid>
		<description><![CDATA[1. What is the name of haploview format to use in this analysis? In Haploview, which can be downloaded from http://www.broadinstitute.org/haploview/haploview-downloads, the input file formats accepted by Haploview are available in several formats such as  linkage format, completely or pharsed haplotypes, HapMap project data dumps, PHASE format, and PLINK. In this analysis, the haploview format is in HapMap format. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=40&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://ppersica.files.wordpress.com/2009/11/ld5.png"></a>1. What is the name of haploview format to use in this analysis?</p>
<p>In Haploview, which can be downloaded from <a href="http://www.broadinstitute.org/haploview/haploview-downloads">http://www.broadinstitute.org/haploview/haploview-downloads</a>, the input file formats accepted by Haploview are available in several formats such as  linkage format, completely or pharsed haplotypes, HapMap project data dumps, PHASE format, and PLINK. In this analysis, the haploview format is in HapMap format. This type of input file format can be opened by choosing &#8220;HapMat Format&#8221; on the left and browse the text file containing haplotypes on chromosome X.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/ld1.png"><img class="alignnone size-medium wp-image-42" title="ld1" src="http://ppersica.files.wordpress.com/2009/11/ld1.png?w=300&#038;h=191" alt="" width="300" height="191" /></a></p>
<p>The haplotypes will be loaded as shown:</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/ld5.png"><img class="alignnone size-medium wp-image-52" title="ld5" src="http://ppersica.files.wordpress.com/2009/11/ld5.png?w=300&#038;h=221" alt="" width="300" height="221" /></a></p>
<p>2. Please show us the marker and individual quality control of the genotype data use in the analysis?</p>
<p>- The marker and individual quality control of the genotype data are shown under &#8220;Check Markers&#8221; tab. The quality is assessed through following:</p>
<p>         &#8211; ObsHET is the marker&#8217;s observed heterozygosity</p>
<p>         &#8211; PredHET is the marker&#8217;s predicted heterozygosity</p>
<p>         &#8211; HWpval is the Hardy-Weinberg equilibrium p value, which is the probability that its deviation from H-W equilibrium could be explained by chance</p>
<p>         &#8211; %Geno is the percentage of non-missing genotypes for this marker</p>
<p>         &#8211; FamTrio is the number of fully genotyped family trios for this marker</p>
<p>         - MenErr is the number of observed Mendelian inheritance errors</p>
<p>         &#8211; MAF is the minor allele frequency for this marker</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/ld5.png"><img class="alignnone size-medium wp-image-52" title="ld5" src="http://ppersica.files.wordpress.com/2009/11/ld5.png?w=300&#038;h=221" alt="" width="300" height="221" /></a><br />
3. Please show us the LD map then explain what do you get from the LD map?</p>
<p>- The LD map is displayed under &#8220;LD plot&#8221; tab. The LD scores is calculated from pairwise of each marker. If the score is high, this pair of markers is said to be strong linkage disequilibrium. The scores are clustered together based on their values. The color of the score becomes more reddish as the score increases, and they can be grouped in order to designate a haplotype block. In this LD map, there are 3 regions of grouping red blocks implying 3 haplotype blocks as shown by triangles grouping the regions. The numbers above the map show the marker numbers and names of the alleles.  </p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/ld2.png"></a></p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/ld1.png"></a></p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/ld4.png"><img class="alignnone size-medium wp-image-47" title="ld4" src="http://ppersica.files.wordpress.com/2009/11/ld4.png?w=300&#038;h=130" alt="" width="300" height="130" /></a><br />
4. How many haplotype blocks in this region of Chromosome X, then explain how to interpret them?</p>
<p>- There are 3 blocks of haplotypes in this region of chromosome X based on 95% confidence intervals, illustrated as Block 1, 2, and 3. The tag SNPs can be displayed by clicking on &#8220;Display&#8221; on the menu bar and choose &#8220;Show tags in blocks&#8221;. The tag SNPs are indicated by ticks under the marker numbers. There are 2 tag SNPs found in each haplotype blocks. Crossing regions show the likelihood of recombination between the 2 blocks. The thicker the crossing line, the stronger the recombination.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/ld3.png"><img class="alignnone size-medium wp-image-49" title="ld3" src="http://ppersica.files.wordpress.com/2009/11/ld3.png?w=300&#038;h=215" alt="" width="300" height="215" /></a><br />
5. Could you find out the tagging SNP in each haplotype block, then explain what the tagging SNPs?</p>
<p>- The tag SNPs are the SNPs that can represent other SNPs because all of them are in linkage disequilibrium.</p>
<p>On &#8221;Haplotypes&#8221; page, the tag SNPs are indicated by ticks beneath the marker numbers. There are 2 tag SNPs presenting in each 3 haplotype blocks. Or they can be shown on &#8220;Tagger&#8221; page by choosing &#8220;Configuration&#8221; tab. Then, select all alleles by clicking on &#8220;Include All&#8221; button and &#8220;pairwise tagging only&#8221;. Every allele is paired and tested. Hit on &#8220;Run Tagger&#8221; button and go to &#8220;Results&#8221; page.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/ld7.png"><img class="alignnone size-medium wp-image-54" title="ld7" src="http://ppersica.files.wordpress.com/2009/11/ld7.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>On &#8220;Alleles captured by Current Selection&#8221; panel, 6 alleles are shown up. They contain tag SNPs.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/ld6.png"><img class="alignnone size-medium wp-image-55" title="ld6" src="http://ppersica.files.wordpress.com/2009/11/ld6.png?w=300&#038;h=217" alt="" width="300" height="217" /></a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ppersica.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ppersica.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ppersica.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ppersica.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ppersica.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ppersica.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ppersica.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ppersica.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ppersica.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ppersica.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ppersica.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ppersica.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ppersica.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ppersica.wordpress.com/40/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=40&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ppersica.wordpress.com/2009/11/26/assignment-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b57a607c18872a75205d06e880a553af?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppersica</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/ld1.png?w=300" medium="image">
			<media:title type="html">ld1</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/ld5.png?w=300" medium="image">
			<media:title type="html">ld5</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/ld5.png?w=300" medium="image">
			<media:title type="html">ld5</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/ld4.png?w=300" medium="image">
			<media:title type="html">ld4</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/ld3.png?w=300" medium="image">
			<media:title type="html">ld3</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/ld7.png?w=300" medium="image">
			<media:title type="html">ld7</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/ld6.png?w=300" medium="image">
			<media:title type="html">ld6</media:title>
		</media:content>
	</item>
		<item>
		<title>Assignment1_2</title>
		<link>http://ppersica.wordpress.com/2009/11/25/assignment1_2/</link>
		<comments>http://ppersica.wordpress.com/2009/11/25/assignment1_2/#comments</comments>
		<pubDate>Wed, 25 Nov 2009 04:32:44 +0000</pubDate>
		<dc:creator>ppersica</dc:creator>
				<category><![CDATA[Hypercourse on Bioinformatics]]></category>

		<guid isPermaLink="false">http://ppersica.wordpress.com/?p=22</guid>
		<description><![CDATA[Taverna Separate the list of FASTA file 1. Open Taverna, and type &#8220;split&#8221; in the search box. Under &#8220;Local Services&#8221;, &#8220;Split sting into string list by regular expression&#8221; is shown in red. The FASTA file from http://www.cs.manchester.ac.uk/~katy/taverna/fastaFile.txt contains some necleotide sequences, and these sequences will be separated individually by using the split service. 2. Right click [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=22&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://ppersica.files.wordpress.com/2009/11/13.png"></a><a href="http://ppersica.files.wordpress.com/2009/11/16.png"></a>Taverna</p>
<p>Separate the list of FASTA file</p>
<p>1. Open Taverna, and type &#8220;split&#8221; in the search box. Under &#8220;Local Services&#8221;, &#8220;Split sting into string list by regular expression&#8221; is shown in red. The FASTA file from <a href="http://www.cs.manchester.ac.uk/~katy/taverna/fastaFile.txt">http://www.cs.manchester.ac.uk/~katy/taverna/fastaFile.txt</a> contains some necleotide sequences, and these sequences will be separated individually by using the split service.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/4.png"><img class="alignnone size-medium wp-image-24" title="4" src="http://ppersica.files.wordpress.com/2009/11/4.png?w=300&#038;h=208" alt="" width="300" height="208" /></a></p>
<p>2. Right click on &#8220;Split sting into string list by regular expression&#8221; and choose &#8220;Add to model&#8221;. Each nucleotide sequence is called a &#8216;string&#8217; and the regular expression is the pattern that separates each string.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/5.png"><img class="alignnone size-medium wp-image-25" title="5" src="http://ppersica.files.wordpress.com/2009/11/5.png?w=300&#038;h=206" alt="" width="300" height="206" /></a></p>
<p>3. This service requires 2 workflow inputs: the string and regex (regular expression). Right click on &#8220;Workflow inputs&#8221; in the low left panel and choose &#8220;Create New Input&#8230;&#8221;</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/6.png"><img class="alignnone size-medium wp-image-26" title="6" src="http://ppersica.files.wordpress.com/2009/11/6.png?w=300&#038;h=210" alt="" width="300" height="210" /></a></p>
<p>4. Add &#8220;FASTA sequence&#8221; in the &#8220;Name for the new workflow input&#8221; box. This input will be the nucleotide sequence file. After that, add another workflow input as &#8220;pattern&#8221; to be assigned later on as a format that separates each nucleotide sequence.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/7.png"><img class="alignnone size-medium wp-image-27" title="7" src="http://ppersica.files.wordpress.com/2009/11/7.png?w=300&#038;h=211" alt="" width="300" height="211" /></a></p>
<p>5. Then, right click on the &#8220;Workflow outputs&#8221; to add the output of the splited FASTA files.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/8.png"><img class="alignnone size-medium wp-image-28" title="8" src="http://ppersica.files.wordpress.com/2009/11/8.png?w=300&#038;h=214" alt="" width="300" height="214" /></a></p>
<p>6. On the right panel, graphical representation of workflow inputs and output are illustrated. These boxes need to be connected together. Right click on &#8220;FASTA sequence&#8221; and choose &#8220;Processors &gt; Split_string&#8230; &gt; string&#8221;. Then, this box will be linked to the processor. The FASTA file will be added to this input later on.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/9.png"><img class="alignnone size-medium wp-image-29" title="9" src="http://ppersica.files.wordpress.com/2009/11/9.png?w=300&#038;h=211" alt="" width="300" height="211" /></a></p>
<p>7. Connect &#8220;pattern&#8221; by right clicking on it and choose &#8220;Processors &gt; Split_string&#8230; &gt; regex&#8221;.  </p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/10.png"><img class="alignnone size-medium wp-image-30" title="10" src="http://ppersica.files.wordpress.com/2009/11/10.png?w=300&#038;h=208" alt="" width="300" height="208" /></a></p>
<p>8. Connect the split processor to the output by right clicking on &#8220;split&#8221; under Processors category.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/11.png"><img class="alignnone size-medium wp-image-31" title="11" src="http://ppersica.files.wordpress.com/2009/11/11.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>9. After that, the workflow is established completely for this process on the right panel.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/12.png"><img class="alignnone size-medium wp-image-32" title="12" src="http://ppersica.files.wordpress.com/2009/11/12.png?w=300&#038;h=213" alt="" width="300" height="213" /></a></p>
<p>10. The workflow can now be run by choosing File &gt; Run workflow&#8230; on the left corner.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/13.png"><img class="alignnone size-medium wp-image-33" title="13" src="http://ppersica.files.wordpress.com/2009/11/13.png?w=300&#038;h=207" alt="" width="300" height="207" /></a></p>
<p>11. A popup window will be appeared and enable adding value for the inputs. Right click on &#8220;FASTA_sequence&#8221; and choose &#8220;New input value&#8221;.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/14.png"><img class="alignnone size-medium wp-image-34" title="14" src="http://ppersica.files.wordpress.com/2009/11/14.png?w=300&#038;h=297" alt="" width="300" height="297" /></a></p>
<p>12. The FASTA file containing nucleotide sequences is added on the right panel.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/15.png"><img class="alignnone size-medium wp-image-35" title="15" src="http://ppersica.files.wordpress.com/2009/11/15.png?w=300&#038;h=297" alt="" width="300" height="297" /></a></p>
<p>13. For pattern, only &#8220;&gt;&#8221; is added since each nucleotide sequence is begun with it. The service can find this symbol and separate each sequence.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/16.png"><img class="alignnone size-medium wp-image-36" title="16" src="http://ppersica.files.wordpress.com/2009/11/16.png?w=300&#038;h=298" alt="" width="300" height="298" /></a></p>
<p>14. Then, click on &#8220;Run workflow&#8221; botton. The result will appear on the main window program under the &#8220;Result&#8221; tab.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/17.png"><img class="alignnone size-medium wp-image-23" title="17" src="http://ppersica.files.wordpress.com/2009/11/17.png?w=300&#038;h=218" alt="" width="300" height="218" /></a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ppersica.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ppersica.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ppersica.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ppersica.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ppersica.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ppersica.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ppersica.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ppersica.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ppersica.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ppersica.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ppersica.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ppersica.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ppersica.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ppersica.wordpress.com/22/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=22&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ppersica.wordpress.com/2009/11/25/assignment1_2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b57a607c18872a75205d06e880a553af?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppersica</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/4.png?w=300" medium="image">
			<media:title type="html">4</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/5.png?w=300" medium="image">
			<media:title type="html">5</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/6.png?w=300" medium="image">
			<media:title type="html">6</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/7.png?w=300" medium="image">
			<media:title type="html">7</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/8.png?w=300" medium="image">
			<media:title type="html">8</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/9.png?w=300" medium="image">
			<media:title type="html">9</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/10.png?w=300" medium="image">
			<media:title type="html">10</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/11.png?w=300" medium="image">
			<media:title type="html">11</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/12.png?w=300" medium="image">
			<media:title type="html">12</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/13.png?w=300" medium="image">
			<media:title type="html">13</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/14.png?w=300" medium="image">
			<media:title type="html">14</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/15.png?w=300" medium="image">
			<media:title type="html">15</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/16.png?w=300" medium="image">
			<media:title type="html">16</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/17.png?w=300" medium="image">
			<media:title type="html">17</media:title>
		</media:content>
	</item>
		<item>
		<title>Assignment 1</title>
		<link>http://ppersica.wordpress.com/2009/11/24/assignment-1/</link>
		<comments>http://ppersica.wordpress.com/2009/11/24/assignment-1/#comments</comments>
		<pubDate>Tue, 24 Nov 2009 07:01:22 +0000</pubDate>
		<dc:creator>ppersica</dc:creator>
				<category><![CDATA[Hypercourse on Bioinformatics]]></category>

		<guid isPermaLink="false">http://ppersica.wordpress.com/?p=3</guid>
		<description><![CDATA[Taverna How to get nucleotide sequence from NCBI database 1. Open Taverna program. Type &#8220;get nucleotide&#8221; on the search box, and the results will appear in red letters. Right click on &#8221;Get Nucleotide FASTA&#8221; under NCBI folder and choose &#8220;Add to model&#8221;. 2. To retrieve a nucleotide sequence, the input for the workflow must be given as the accession [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=3&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Taverna</p>
<p>How to get nucleotide sequence from NCBI database</p>
<p>1. Open Taverna program. Type &#8220;get nucleotide&#8221; on the search box, and the results will appear in red letters. Right click on &#8221;Get Nucleotide FASTA&#8221; under NCBI folder and choose &#8220;Add to model&#8221;.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-1.png"><img class="alignnone size-medium wp-image-4" title="Untitled-1" src="http://ppersica.files.wordpress.com/2009/11/untitled-1.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>2. To retrieve a nucleotide sequence, the input for the workflow must be given as the accession number of the query sequence. Right click on the &#8220;Workflow inputs&#8221; in the low left panel and choose &#8220;Create New Input&#8230;&#8221; to put the name of the sequence in. In this example, the accession number is given &#8220;ACC37599.1&#8243;. On the same way, an output is created under &#8220;Workflow outputs&#8221;. As a result, graphical boxes of the input and output workflow will appear in the right panel.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-5.png"><img class="alignnone size-medium wp-image-13" title="Untitled-5" src="http://ppersica.files.wordpress.com/2009/11/untitled-5.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-3.png"><img class="alignnone size-medium wp-image-7" title="Untitled-3" src="http://ppersica.files.wordpress.com/2009/11/untitled-3.png?w=300&#038;h=216" alt="" width="300" height="216" /></a></p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-5.png"></a></p>
<p>4.  After the input and output of the workflow were created, the process of the workflow is built by connecting the input to the process. Right click on the input name and choose &#8220;Processors&#8221; and &#8220;Get Nucleotide FASTA&#8221; as illustrated.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-4.png"><img class="alignnone size-medium wp-image-8" title="Untitled-4" src="http://ppersica.files.wordpress.com/2009/11/untitled-4.png?w=300&#038;h=212" alt="" width="300" height="212" /></a></p>
<p>5.  The input box is now connected to the processor. Next, the output box should be connected to the processor by right clicking on the &#8220;output&#8221; under &#8220;Get Nucleotide FASTA&#8221; processor.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-71.png"><img class="alignnone size-medium wp-image-14" title="Untitled-71" src="http://ppersica.files.wordpress.com/2009/11/untitled-71.png?w=300&#038;h=215" alt="" width="300" height="215" /></a></p>
<p>6. Then, the workflow has been established and is ready to be run. Click on &#8220;File&#8221; on the left corner and choose &#8220;Run workflow&#8230;&#8221;</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-81.png"><img class="alignnone size-medium wp-image-15" title="Untitled-81" src="http://ppersica.files.wordpress.com/2009/11/untitled-81.png?w=300&#038;h=214" alt="" width="300" height="214" /></a></p>
<p>7. A popup window will appear. The GenBank accession number is filled in the input name &#8220;AAC3799.1&#8243; by right clicking and choose &#8220;New input value&#8221;. The accession number is typed on the right panel.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-7.png"><img class="alignnone size-medium wp-image-10" title="Untitled-7" src="http://ppersica.files.wordpress.com/2009/11/untitled-7.png?w=300&#038;h=297" alt="" width="300" height="297" /></a></p>
<p>8. After the accession number has been given. Click on the &#8220;Run workflow&#8221; button.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-8.png"><img class="alignnone size-medium wp-image-11" title="Untitled-8" src="http://ppersica.files.wordpress.com/2009/11/untitled-8.png?w=300&#038;h=294" alt="" width="300" height="294" /></a></p>
<p>9. The main program window will show the result. On the &#8220;Status&#8221; tab, it reports the process has been complete.</p>
<p> <a href="http://ppersica.files.wordpress.com/2009/11/untitled-10.png"><img class="alignnone size-medium wp-image-5" title="Untitled-10" src="http://ppersica.files.wordpress.com/2009/11/untitled-10.png?w=300&#038;h=210" alt="" width="300" height="210" /></a></p>
<p>10. The nucleotide sequence is shown under the &#8220;Result&#8221; tab in FASTA format.</p>
<p><a href="http://ppersica.files.wordpress.com/2009/11/untitled-9.png"><img class="alignnone size-medium wp-image-12" title="Untitled-9" src="http://ppersica.files.wordpress.com/2009/11/untitled-9.png?w=300&#038;h=206" alt="" width="300" height="206" /></a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ppersica.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ppersica.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ppersica.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ppersica.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ppersica.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ppersica.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ppersica.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ppersica.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ppersica.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ppersica.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ppersica.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ppersica.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ppersica.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ppersica.wordpress.com/3/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ppersica.wordpress.com&amp;blog=10630555&amp;post=3&amp;subd=ppersica&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://ppersica.wordpress.com/2009/11/24/assignment-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b57a607c18872a75205d06e880a553af?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppersica</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-1.png?w=300" medium="image">
			<media:title type="html">Untitled-1</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-5.png?w=300" medium="image">
			<media:title type="html">Untitled-5</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-3.png?w=300" medium="image">
			<media:title type="html">Untitled-3</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-4.png?w=300" medium="image">
			<media:title type="html">Untitled-4</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-71.png?w=300" medium="image">
			<media:title type="html">Untitled-71</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-81.png?w=300" medium="image">
			<media:title type="html">Untitled-81</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-7.png?w=300" medium="image">
			<media:title type="html">Untitled-7</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-8.png?w=300" medium="image">
			<media:title type="html">Untitled-8</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-10.png?w=300" medium="image">
			<media:title type="html">Untitled-10</media:title>
		</media:content>

		<media:content url="http://ppersica.files.wordpress.com/2009/11/untitled-9.png?w=300" medium="image">
			<media:title type="html">Untitled-9</media:title>
		</media:content>
	</item>
	</channel>
</rss>
