1 | <p>Introductory bioinformatics exercises often walk students through the use of computational tools, but often provide little understanding of what a computational tool does "under the hood." A solid understanding of how a bioinformatics computational algorithm functions, including its limitations, is key for interpreting the output in a biologically relevant context. This introductory bioinformatics exercise integrates an introduction to web-based sequence alignment algorithms with models to facilitate student reflection and appreciation for how computational tools provide similarity output data. The exercise concludes with a set of inquiry-based questions in which students may apply computational tools to solve a real biological problem.</p>
| 1 | <p><span style="font-size:12.0pt;font-family:"Cambria",serif;
|
---|
| | 2 | mso-ascii-theme-font:minor-latin;mso-fareast-font-family:Cambria;mso-fareast-theme-font:
|
---|
| | 3 | minor-latin;mso-hansi-theme-font:minor-latin;mso-bidi-font-family:"Times New Roman";
|
---|
| | 4 | mso-bidi-theme-font:minor-bidi;mso-ansi-language:EN-US;mso-fareast-language:
|
---|
| | 5 | EN-US;mso-bidi-language:AR-SA">In this bioinformatics adventure, we are going to continue to look amino acid sequences and compute similarity scores using the <i style="mso-bidi-font-style:normal">BLOSUM62</i> matrix, but we will take a glimpse at some of the tools that are available in the world of <i style="mso-bidi-font-style:normal">Bioconductor</i>. This set of tasks are set after Exercise 3 in the <i style="mso-bidi-font-style:normal">Sequence Similarity</i> materials</span></p>
|
---|
3 | <p>In the module, students first define sequence similarity and then investigate how similarity can be quantitatively compared between two similar length proteins using a Blocks Substitution Matrix (BLOSUM) scoring matrix. Students then look for local regions of similarity between a sequence query and subjects within a large database using Basic Local Alignment Search Tool (BLAST). Lastly, students access text-based FASTA-formatted sequence information via National Center for Biotechnology Information (NCBI) databases as they collect sequences for a multiple sequence alignment using Clustal Omega to generate a phylogram and evaluate evolutionary relationships. The combination of diverse, inquiry-based questions, paper models, and web-based computational resources provides students with a solid basis for more advanced bioinformatics topics and an appreciation for the importance of bioinformatics tools across the discipline of biology.</p>
| 7 | <p><i style="mso-bidi-font-style:normal"><span style="font-size:12.0pt;font-family:"Cambria",serif;mso-ascii-theme-font:minor-latin;
|
---|
| | 8 | mso-fareast-font-family:Cambria;mso-fareast-theme-font:minor-latin;mso-hansi-theme-font:
|
---|
| | 9 | minor-latin;mso-bidi-font-family:"Times New Roman";mso-bidi-theme-font:minor-bidi;
|
---|
| | 10 | mso-ansi-language:EN-US;mso-fareast-language:EN-US;mso-bidi-language:AR-SA">Bioconductor</span></i><span style="font-size:12.0pt;font-family:"Cambria",serif;mso-ascii-theme-font:minor-latin;
|
---|
| | 11 | mso-fareast-font-family:Cambria;mso-fareast-theme-font:minor-latin;mso-hansi-theme-font:
|
---|
| | 12 | minor-latin;mso-bidi-font-family:"Times New Roman";mso-bidi-theme-font:minor-bidi;
|
---|
| | 13 | mso-ansi-language:EN-US;mso-fareast-language:EN-US;mso-bidi-language:AR-SA"> is a collection of </span><span class="VerbatimChar"><span style="font-size:11.0pt;
|
---|
| | 14 | mso-bidi-font-size:12.0pt;mso-fareast-font-family:Cambria;mso-fareast-theme-font:
|
---|
| | 15 | minor-latin;mso-bidi-font-family:"Times New Roman";mso-bidi-theme-font:minor-bidi;
|
---|
| | 16 | color:black;mso-color-alt:windowtext;mso-ansi-language:EN-US;mso-fareast-language:
|
---|
| | 17 | EN-US;mso-bidi-language:AR-SA">R</span></span><span style="font-size:12.0pt;
|
---|
| | 18 | font-family:"Cambria",serif;mso-ascii-theme-font:minor-latin;mso-fareast-font-family:
|
---|
| | 19 | Cambria;mso-fareast-theme-font:minor-latin;mso-hansi-theme-font:minor-latin;
|
---|
| | 20 | mso-bidi-font-family:"Times New Roman";mso-bidi-theme-font:minor-bidi;
|
---|
| | 21 | mso-ansi-language:EN-US;mso-fareast-language:EN-US;mso-bidi-language:AR-SA"> packages that has been built by several bioinformatics researchers to perform common calculations in their field. In their own words, ``<i style="mso-bidi-font-style:
|
---|
| | 22 | normal">Bioconductor</i> provides tools for the analysis and comprehension of high-throughput genomic data. Bioconductor uses the R statistical programming language, and is open source and open development. It has two releases each year, and an active user community.’’ </span></p>
|
---|