---+!! Learn Biocomputing %TOC% ---++ General Resources / Courses *Software Carpentry* %BR% https://software-carpentry.org/lessons/ *Code Academy* %BR% https://www.codecademy.com/ *edX Computer Science Courses* %BR% https://www.edx.org/course/subject/computer-science ---++ Learning R The programming language R was originally designed for statistical analysis, but researchers now use it for a wide variety of data analysis and visualization tasks. %BR% Download R here: https://www.r-project.org %BR% Use the RStudio IDE to write and run R code. Download here: https://rstudio.com *Primers on RStudio Cloud* %BR% https://rstudio.cloud/learn/primers %BR% These are short interactive lessons that let you write R code in your web browser. They use the "tidyverse" style of R coding, which we recommend for all new users. The first 4 lessons: "Basics", "Work with Data", "Visualize Data", and "Tidy your Data" give a strong foundation for writing R code to explore your data. *Primers on data analysis/visualization in R* %BR% https://wilkelab.org/classes/SDS348_spring_2020.html (first half of lectures) *R for Data Science* %BR% https://r4ds.had.co.nz/index.html %BR% Free online textbook that covers how to process data in R with tidyverse. ---++ Learning Python (for biology) Python has become a dominant programming language for asking and answering many different questions in biology. This is because of its relative simplicity yet powerful flexibility, and also because Python has extensive library support for various biological tools. *Python Installation* %BR% There are many different ways to run Python, but one of the easiest choices to get started is [[https://colab.research.google.com/][Google Colabroratory]]. This is essentially an online [[https://jupyter.org/][Jupyter Notebook]], which has many popular packages pre-installed, and removes much of the hassle that can come from a local intallation of python. [[https://jupyter.org/][Jupyter Notebooks]] are a great tool for keeping track of what the user has tried, and also seamlessly integrating spaces where the user can describe their thought process or add other notes. It is conceptually similar to a lab notebook that one might use in a wet lab. If a local installation of Python is preferred, use a flavor of [[https://conda.io][Conda]] such as miniconda to manage your install. Install the [[https://mamba.readthedocs.io][Mamba]] package to speed things up. In addition to Python, Conda/Mamba is exceptionally useful for installing many other command-line bioinformatics programs and their dependencies. Learn about [[https://docs.conda.io/projects/conda/en/latest/user-guide/tasks/manage-environments.html][environments]] and use them! [[https://www.anaconda.com/][Anaconda]] may be a good choice if you want a graphical interface for controlling package installation, but <i>be aware that installing it may cause some future clashes if you want to switch to the flexibility of Conda/Mamba later</i>. Anaconda is a data science platform that installs many useful tools, including Jupyter Notebook, and allows for easier package management. Packages contain libraries of useful code that other people have written making Python easier to use. [[https://biopython.org/][Biopython]] is one such such library, and contains indispensable tools such as FASTA/GBK parsers/writers, and tools for manipulating DNA/protein sequences. *Classes/resources* %BR% [[https://www.codecademy.com/][Codecademy]] offers short lessons that teach the basics of Python [[https://en.wikipedia.org/wiki/Syntax_(programming_languages)][syntax]]. Professor Claus Wilke of The University of Texas at Austin has [[https://wilkelab.org/classes/SDS348_spring_2020.html][a great class posted online]] (second half of lectures) that introduces how to use Python in a biological context. [[https://www.tutorialspoint.com/python/index.htm][TutorialsPoint]] offers a nicely laid-out reference to the basics of Python syntax and common operations. ---++Git / GitHub Git is currently the dominant tool for version control and distribution of open-source code. Learning Git is useful for organizing, versioning, saving, and backing up your own code as you develop it, but it is especially useful for working on code collaboratively as part of a group and making sure that it will be available into the future to others after you are done working on it. It's good to try using the command line a bit when learning git to understand how it works and have a way of doing power-user commands if you need them. There are many GUI clients for Git that you'll probably transition to using day-to-day, like GitHubDesktop or clients built into IDEs like VSCode or XCode. *Interactive Git lesson* %BR% https://github.com/jlord/git-it-electron *Setup Git for Barrick Lab* %BR% * [[https://github.com/barricklab][Barricklab GitHub]] * Ask to be granted access to see private repositories *Common Workflows* %BR% * [[https://gist.github.com/Chaser324/ce0505fbed06b947d962][Fork a Repo and submit a Pull Request]] ---++Best Practices for Creating Bioinformatics Software *Ten recommendations for creating usable bioinformatics command line software* %BR% https://doi.org/10.1186/2047-217X-2-15 ---++ Recommended Books *Practical Computing for Biologists* by Steven Haddock and Casey Dunn %BR% [[https://www.amazon.com/dp/0878933913/ref=cm_sw_r_tw_dp_U_x_5WVPEbFH8JJ6V][Buy on Amazon]] %BR% This book provides an exceptionally clean and well written introduction for biologists to begin working with computational data. The beginning focus on regular expressions and working with text files will help anyone starting out with coding. %BR% %BR% *Bioinformatics Data Skills: Reproducible and Robust Research with Open Source Tools* by Vince Buffalo %BR% [[https://www.amazon.com/dp/0878933913/ref=cm_sw_r_tw_dp_U_x_5WVPEbFH8JJ6V][Buy on Amazon]] %BR% This book is one of the few available "intermediate" level bioinformatic texts. Once you have mastered some basics in UNIX or Python/R programming, this books introduces more advanced concepts like workflow management and dealing with specific biological data.
E
dit
|
A
ttach
|
Watch
|
P
rint version
|
H
istory
: r8
<
r7
<
r6
<
r5
<
r4
|
B
acklinks
|
V
iew topic
|
M
ore topic actions
Barrick Lab
>
ComputationList
>
ProtocolsLearnBiocomputing
Contributors to this topic
JeffreyBarrick, MattMcGuffie, JosephReitman, SeanLeonard
Topic revision: r8 - 2024-07-09 - 21:00:49 - Main.JeffreyBarrick
Barrick Lab
Contact
Research
Publications
Team
Protocols
Reference
Software
UT Austin
Mol Biosciences
ILS
Microbiology
EEB
CSSB
CBRS
The LTEE
iGEM team
SynBioCyc
SynBio course
NGS course
BEACON
Search
Log in
Copyright ©2025 Barrick Lab contributing authors. Ideas, requests, problems?
Send feedback