As a biologist, you’ll often find yourself dealing with data.
Your experiment, be it phenotype measurements, gene expression analysis, genomic comparisons, will produce data.
To understand your results and to explain them to others, you’ll have to be able to make sense of those data.
No need, pick the software that suits your needs, use many of them and feel free to change.
R has some advantages:
Check out the TidyTuesday, it’s a weekly Social project on Twitter.
You can learn a lot just by looking at what other people are doing!
If you also publish TidyTuesdays frequently, you’ll learn how to analyze data in R, with a friendly and welcoming community.
Ready for a shiny Sunday? ☀️
— Sil Aarts (@sil_aarts) March 10, 2019
My son was sleeping at his grandparents, so what to do with all this free time? Made my very first #shiny using this week's #TidyTuesday data.
Code: https://t.co/jYNWu3DbNA
Source: Bureau of Labor pic.twitter.com/i9acX1m5g5
Simple slope graph for #TidyTuesday illustrating the unethical and unnecessary pay gap in engineering. #rstats #r4ds Code: https://t.co/1BhlhPnYyC pic.twitter.com/Pn0WjojXcW
— Jake Kaupp (@jakekaupp) March 6, 2019
Bookdown, has wonderful books on data analysis in R that you can consult openly online.
You might want to start from R for Data Science.
If you want to analyze data in R, first you have to load them.
You can do it with readr and readxl. These two packages cover two very common data formats: text rectangular data (csv, tsv) and excel data.
You can load many kind of data into R. For most of them you can find manuals and howto online.
If you know how to visualize your data in graphs and plots you’re half way there. You can use plots both to explore your data and to communicate your results.
You can use ggplot2, an R package for data visualization. You can find guides on how to use it on its website or in this chapter of r4ds.
Keep in mind that you’ll use plots for at least two reasons:
Explore: You can use plots to explore your data. Explorative plots should be produced quickly. (at expense of details)
Communicate: You can use plots to communicate your results to others. Plots for communication should be detailed and clear to everybody.
Plots and data visualizations are real work of design.
If you combine a technical and aesthetic representation, you make your plots nice and easy to understand.
More new work! 🎉 My #dataviz is in the December issue of @SciAm, revealing that “Normal Body Temperature Is Surprisingly Less Than 98.6” 🤒 https://t.co/FKpHLTlEpt
— Nadieh Bremer (@NadiehBremer) December 24, 2018
It's always such a pleasure to work with @ChristiansenJen ^_^ We're even working on new piece for the March issue! pic.twitter.com/Q3FCZxV4pn
Climate change: where we are in seven charts - and what you can do to help https://t.co/U5SEzICuIJ pic.twitter.com/UNxNXcyRQU
— Clara Guibourg (@cguibourg) December 3, 2018
Sometime you have manipulate your datasets: you might want to filter them or to change, remove, add new columns to your data.
You can manipulate your data with dplyr. Learn how to use dplyr here or with this chapter of r4ds.
You might want to filter your data, summarize them, or mutate them making new columns.
Or much more…
I made this presentation with the R markdown implementation of reveal.js.
The source code is here.