Resources: Compare

#56, v5.0 Published:
#2410, v1.0 Published:

Title

Old VersionNew Version
1Sequence Similarity: An inquiry based and "under the hood" approach for incorporating molecular sequence alignment in introductory undergraduate biology courses 1Sequence Similarity: A Quick Introduction to Bioconductor

Authors

Old VersionNew Version
1Sabrina Robertson (North Carolina State University) 1Derek Edward Sollberger ()
2Carlos Christopher Goller (North Carolina State University) 2Derek Edward Sollberger ()
3Steven Roof (Fairmont State University)   
4Benita Brink (Adams State University)   
5Adam Kleinschmit (Adams State University)   
6Hayley Orndorf (University of Pittsburgh)   

Description

Old VersionNew Version
1<p>Introductory bioinformatics exercises often walk students through the use of computational tools, but often provide little understanding of what a computational tool does &quot;under the hood.&quot; A solid understanding of how a bioinformatics computational algorithm functions, including its limitations, is key for interpreting the output in a biologically relevant context. This introductory bioinformatics exercise integrates an introduction to web-based sequence alignment algorithms with models to facilitate student reflection and appreciation for how computational tools provide similarity output data. The exercise concludes with a set of inquiry-based questions in which students may apply computational tools to solve a real biological problem.</p>  1<p><span style="font-size:12.0pt;font-family:&quot;Cambria&quot;,serif;
   2mso-ascii-theme-font:minor-latin;mso-fareast-font-family:Cambria;mso-fareast-theme-font:
   3minor-latin;mso-hansi-theme-font:minor-latin;mso-bidi-font-family:&quot;Times New Roman&quot;;
   4mso-bidi-theme-font:minor-bidi;mso-ansi-language:EN-US;mso-fareast-language:
   5EN-US;mso-bidi-language:AR-SA">In this bioinformatics adventure, we are going to continue to look amino acid sequences and compute similarity scores using the <i style="mso-bidi-font-style:normal">BLOSUM62</i> matrix, but we will take a glimpse at some of the tools that are available in the world of <i style="mso-bidi-font-style:normal">Bioconductor</i>. This set of tasks are set after Exercise 3 in the <i style="mso-bidi-font-style:normal">Sequence Similarity</i> materials</span></p>
2  6  
3<p>In the module, students first define sequence similarity and then investigate how similarity can be quantitatively compared between two similar length proteins using a Blocks Substitution Matrix (BLOSUM) scoring matrix. Students then look for local regions of similarity between a sequence query and subjects within a large database using Basic Local Alignment Search Tool (BLAST). Lastly, students access text-based FASTA-formatted sequence information via National Center for Biotechnology Information (NCBI) databases as they collect sequences for a multiple sequence alignment using Clustal Omega to generate a phylogram and evaluate evolutionary relationships. The combination of diverse, inquiry-based questions, paper models, and web-based computational resources provides students with a solid basis for more advanced bioinformatics topics and an appreciation for the importance of bioinformatics tools across the discipline of biology.</p>  7<p><i style="mso-bidi-font-style:normal"><span style="font-size:12.0pt;font-family:&quot;Cambria&quot;,serif;mso-ascii-theme-font:minor-latin;
   8mso-fareast-font-family:Cambria;mso-fareast-theme-font:minor-latin;mso-hansi-theme-font:
   9minor-latin;mso-bidi-font-family:&quot;Times New Roman&quot;;mso-bidi-theme-font:minor-bidi;
   10mso-ansi-language:EN-US;mso-fareast-language:EN-US;mso-bidi-language:AR-SA">Bioconductor</span></i><span style="font-size:12.0pt;font-family:&quot;Cambria&quot;,serif;mso-ascii-theme-font:minor-latin;
   11mso-fareast-font-family:Cambria;mso-fareast-theme-font:minor-latin;mso-hansi-theme-font:
   12minor-latin;mso-bidi-font-family:&quot;Times New Roman&quot;;mso-bidi-theme-font:minor-bidi;
   13mso-ansi-language:EN-US;mso-fareast-language:EN-US;mso-bidi-language:AR-SA"> is a collection of </span><span class="VerbatimChar"><span style="font-size:11.0pt;
   14mso-bidi-font-size:12.0pt;mso-fareast-font-family:Cambria;mso-fareast-theme-font:
   15minor-latin;mso-bidi-font-family:&quot;Times New Roman&quot;;mso-bidi-theme-font:minor-bidi;
   16color:black;mso-color-alt:windowtext;mso-ansi-language:EN-US;mso-fareast-language:
   17EN-US;mso-bidi-language:AR-SA">R</span></span><span style="font-size:12.0pt;
   18font-family:&quot;Cambria&quot;,serif;mso-ascii-theme-font:minor-latin;mso-fareast-font-family:
   19Cambria;mso-fareast-theme-font:minor-latin;mso-hansi-theme-font:minor-latin;
   20mso-bidi-font-family:&quot;Times New Roman&quot;;mso-bidi-theme-font:minor-bidi;
   21mso-ansi-language:EN-US;mso-fareast-language:EN-US;mso-bidi-language:AR-SA"> packages that has been built by several bioinformatics researchers to perform common calculations in their field. In their own words, ``<i style="mso-bidi-font-style:
   22normal">Bioconductor</i> provides tools for the analysis and comprehension of high-throughput genomic data. Bioconductor uses the R statistical programming language, and is open source and open development. It has two releases each year, and an active user community.&rsquo;&rsquo; </span></p>
4  23  
5<p><strong>CourseSource Citation</strong></p>  5<p>&nbsp;</p>
6    
7<p>Kleinschmit, A., Brink, B., Roof, S., Goller, C., and Robertson, S.D. &nbsp;2019. Sequence Similarity: An inquiry based and &ldquo;under the hood&rdquo; approach for incorporating molecular sequence alignment in introductory undergraduate biology courses. CourseSource. <a href="https://doi.org/10.24918/cs.2019.5">https://doi.org/10.24918/cs.2019.5</a></p>   

Quote

Old VersionNew Version
1 1"If you want to build a ship, don’t drum up the men to gather wood, divide the work and give orders. Instead, teach them to yearn for the vast and endless sea."
   2---Antoine de Saint—Exupery

Attachments

1 link — Sequence Similarity: An inquiry based and &quot;under the hood&quot; approach for incorporating molecular sequence alignment in introductory undergraduate biology courses | CourseSource 1 file — DerekSollberger_-_Module3R/Module3R_-_student.docx
2 file — ./sequence_sim.JPG 2 file — DerekSollberger_-_Module3R/Module3R_-_student.html
3 file — DerekSollberger_-_Module3R/Module3R_-_student.pdf
4 file — DerekSollberger_-_Module3R/Module3R_-_student.rmd
5 file — DerekSollberger_-_Module3R/Module3R_-_teacher.docx
6 file — DerekSollberger_-_Module3R/Module3R_-_teacher.html
7 file — DerekSollberger_-_Module3R/Module3R_-_teacher.pdf
8 file — DerekSollberger_-_Module3R/Module3R_-_teacher.rmd
9 file — DerekSollberger_-_Module3R/Teaching Notes_-_Derek Sollberger.docx
10 link — Sequence Similarity: An inquiry based and &quot;under the hood&quot; approach for incorporating molecular sequence alignment in introductory undergraduate biology courses | CourseSource
11 file — publication_1264_2603/sequence_sim.JPG