Welcome to the hands-on, practical learning portion of Lesson 2 where you will create a single page R Markdown document. I made this page to help guide you in the process. The material below contains a series of assignments and challenges designed to get you comfortable utilizing YAML metadata, Markdown syntax, and R code chunks.
The page itself is written almost entirely in R Markdown, meaning there is no extensive use of anything fancy like complicated CSS or HTML. I will introduce some simple HTML code you can use to jazz up your document a little. You should be able to solve any problem I present by digging around in the raw code on GitHub, consulting the Resources page, using one of the targeted search strategies described in the Problem Solving post, and/or posting a question to the Slack channel.
Knit often. Whenever you make a change to the YAML header, add a new code chunk, etc., re-knit (or render) your document. This is very important. Regular knitting allows you to a) see the effects of a change and b) track down (troubleshoot) issues more easily.
Simply hit the knit button or use the shortcut keys—on MacOS
Cmd
+Shift
+K
and WindowsCtrl
+Shift
+K
. Learn to love these shortcut keys. They will save you time.
You can choose how your document is previewed using the drop down menu in the document settings. Preview in Window
opens the document in a separate RStudio window and Preview in Viewer Pane
lets you see the document in the main RStudio IDE. These are good for quick looks. You should always double-check the actual HTML file because sometimes things look different in RStudio.
Something to consider while you create your page is readability. This not only applies to the final document but also the raw R Markdown code itself. Think about someone digging through your code to figure out how you did something. Or think about yourself coming back to the code after a few years. The better your document is formatted, the easier it will be to understand. In future lessons we will return to some best practices but for now remember, part of what we are doing here is making your science more transparent and reproducible. If your document is confusing to follow then it serves neither of these purposes.
In the first assignment, you will employ some basic techniques to control the look of your document and how R code chunks are rendered. We will cover YAML metadata modifications, setting global chunk options, and writing Markdown content. You will also learn about testing R code first before rendering the final document.
If you have not already done so, your first task is to create an R Markdown document. Open R Studio and go to File > New File > R Markdown
. A window should pop up where you can fill in the details. It is not important what you put here since you can change it at any time. What is important is that Document and HTML are both selected. We will cover other document types in the future. Hit Ok
and follow the steps in this graphic. Remember, you do not need to add a file extension when you save the document.
Once you have a document built and saved, there should be a .html
file in your working directory. Double-click that file—it should open in your default browser. Each time you re-knit the .Rmd
file, you can just refresh the browser page or double-click the file again.
Your first task is to add some content and format the content with Markdown. This doesn’t need to be anything fancy to start. You can either add text as you go or paste a large amount of text in at once. Dealers choice. You can use the Markdown section of Lesson 0 or the Markdown section from the Resources page for reference.
Now it is time to get some practice modifying code chunk options so you can gain more control over the behavior of code and result display. If you have your own R code you are more than welcome to use it here. I will use the default code chunks that were added to the .Rmd
file. Please see the section on Chunk structure & options from Lesson 2 for more details.
Here are the two default code chunks. As you can see, both have names and the second chunk has a single option.
```{r cars}
summary(cars)
```
```{r pressure, echo=FALSE}
plot(pressure)
```
There are many code chunk options you can control. Which options you use and how you set them will be determined by your needs. Test the behavior of the following options by setting each equal to either TRUE
or FALSE
. Render the document and see if you can figure out what changed. Each of these has a default value so you may not see a change until you set the alternative value.
echo
collapse
eval
prompt
highlight
include
Next, it is a really good habit to check code chunks as you add them. This will ensure that each chunk works, making it easier to track down problems. If you refer to the first image on this page, you can you have options for Chunk Output—Inline and Console. This controls where the output is displayed. Let’s take a quick look at a code chunk in RStudio and see how you test chunks before rendering.
Take a look at the tool bar on the far right. Option 1 is a drop down menu that gives you an alternative way to set code chunk options. Option 2 will Run all Code Chunks Above meaning that RStudio will run all code chunks above the current chunk but not the current chunk itself. And Option 3 will Run the Current Chunk. Incidentally, if you do not see these options it means something is wrong with the chunk.
Go ahead and run the chunk.
You last task is to modify the YAML header to suit your needs and tastes. I would like you to experiment with different options and settings to see what happens in the final document.
?rmarkdown::html_document
or ?html_document
in the Console to see the header options for an HTML document..md
copy of your file..md
file in a text editor. This is the output from knitr
—after all R code has been processed—and what PanDoc uses to generate an HTML file. Keep this file open as you build your document. Pay attention to how your R code is converted to Markdown syntax.In this assignment, you will explore different methods of incorporating tables in your document. The choice of method depends on a) the type of data, b) the amount of data, and c) the desired output. I will cover a few tools for creating tables but please note there are many options out there, so look around and let us know if you find a tool you like.
For each example, I will use the mtcars
data set from the datasets
package. The mtcars
data set has 32 rows and 11 columns. Feel free to load your own data table or use the mtcars
data set.
You will use four different tools in this assignment for making tables. Here is a summary table of of each tool.
Table type | Table size | Formatting Options | Skill Level |
---|---|---|---|
markdown |
small | minimal | beginner |
rmarkdown::paged_table |
large | minimal | beginner |
knitr::kable + kableExtra |
small | extensive | intermediate |
DT + DataTables |
large | extensive | advanced |
The simplest method of building a table is with Markdown syntax. This is a nice option because you can hard code the table right into the document—no need to install and load libraries or write code chunks—and it is easy to implement. The downside is there is minimal functionality available in a Markdown table.
Markdown does not work well for large tables. So I will first grab a subset of mtcars
, specifically the first 4 rows and 3 columns. In my code chunk I add the chunk option comment=""
. This prevents knit from appending a string (default is ##
) to the start of each line of results in the final document.
mpg cyl disp
Mazda RX4 21.0 6 160
Mazda RX4 Wag 21.0 6 160
Datsun 710 22.8 4 108
Hornet 4 Drive 21.4 6 258
Incidentally, the results box above is technically the simplest table you can make, either by calling the data frame mtcars_sub
directly or running print.data.frame(mtcars_sub)
.
Anyway, run this code chunk, copy the results, and make a Markdown table. You can either run the chunk in RStudio without rendering the document (described above) or render the document and copy the results from HTML page. I added a header to the first column. And here is the Markdown table.
model | mpg | cyl | disp |
---|---|---|---|
Mazda RX4 | 21.0 | 6 | 160 |
Mazda RX4 Wag |
21.0 | 6 | 160 |
Datsun 710 | 22.8 | 4 | 108 |
Hornet 4 Drive | 21.4 | 6 | 258 |
Since this is a Markdown table, you can add additional Markdown syntax for formatting. See if you can figure out what syntax I added to my table and add some to your table. Also check out the Tables section of PanDoc User’s Guide for other Markdown table options, including how to add a caption. In addition to pipe_tables
, you can create multiline_tables
grid_tables
, and simple_tables
.
Something to notice is that the Markdown table spans the entire width of the page—even though it does not need all of that space. As far as I know, there is no way to control this behavior without adding additional HTML formatting.
Recommendation Use
Markdown
for small, simple tables where styling is not a concern.
With larger tables, it may not be practical to display the full table inline. So we need a way to shrink a large table so it looks good while still allowing access to the full table.
The next type of table I want you to try are Paged Tables. R Markdown comes with its own built in table function called paged_table
. The paged_table
function allows pagination of rows and columns making it possible to render a large table in a small space.
It is easy to code paged_table
but that ease comes with a small price—limited functionality. Here are the options you do have with paged_table
function.
Option | Description |
---|---|
rows.print |
Maximum rows to print per page. |
max.print |
Maximum rows in the table (defaults to 1000). |
cols.print |
Maximum columns in the table (defaults to 10). |
rownames.print |
Print row names as part of the table. |
A quick side note. You actually need to load the rmarkdown
package for paged_table
to work. Anyway, this is a good time to return to the first code chunk in your .Rmd
file—the chunk called setup
that R Markdown added by default.
I like to use this chunk to load all of the packages I need for my document. Using a single chunk for all of my packages helps me keep my document organized. Notice the setup
chunk has the option include=FALSE
. This prevents the content of the chunk from appearing in the final document, which for me is more stylistically appealing. I can add a sessionInfo()
chunk at the end of my document to report all of the packages so this information is available to the reader. We will cover sessionInfo()
when we get into more depth on the subject of reproducible. If you do not want to load the library you can run the command like this: rmarkdown::paged_table()
.
OK, back to the table. Now I can create a table of mtcars
with the paged_table
function and use an option to limit the number of printed rows to 5
for each page. I used echo=FALSE
in my code chunk to hide the code :). By now you should know where to look for a solution.
Notice that for each column, the column class is printed below the name (text enclosed in < >). This is irritating and related to printing a table from a built-in data frame. I have no idea how to fix this (within the confines of R Markdown) but I will work on a solution.
Recommendation Use
paged_table
for large tables where extensive styling is not a concern.
Knitr comes with its own tool for rendering simple tables called kable
. The documentation for kable
can be found here or by running ?knitr::kable()
in the Console. By itself, kable comes with almost no options. We can extend its functionality with the kableExtra
package and piping syntax from magrittr
. The features of kableExtra
are extensive and I will only touch on a few here. The documentation for kableExtra
can be found here or by running ?kableExtra
in the Console after the package has been installed and loaded.
I highly recommend you learn how to use these packages for making tables.
Again, you will need to load kableExtra
and either load the knitr
package or run the command like this: knitr::kable()
.
First, let’s look at the default kable
table output. We will use the head of the mtcars
data set.
mpg | cyl | disp | hp | drat | wt | qsec | vs | am | gear | carb | |
---|---|---|---|---|---|---|---|---|---|---|---|
Mazda RX4 | 21.0 | 6 | 160 | 110 | 3.90 | 2.620 | 16.46 | 0 | 1 | 4 | 4 |
Mazda RX4 Wag | 21.0 | 6 | 160 | 110 | 3.90 | 2.875 | 17.02 | 0 | 1 | 4 | 4 |
Datsun 710 | 22.8 | 4 | 108 | 93 | 3.85 | 2.320 | 18.61 | 1 | 1 | 4 | 1 |
Hornet 4 Drive | 21.4 | 6 | 258 | 110 | 3.08 | 3.215 | 19.44 | 1 | 0 | 3 | 1 |
Hornet Sportabout | 18.7 | 8 | 360 | 175 | 3.15 | 3.440 | 17.02 | 0 | 0 | 3 | 2 |
Valiant | 18.1 | 6 | 225 | 105 | 2.76 | 3.460 | 20.22 | 1 | 0 | 3 | 1 |
If we tried to render the entire table by omitting head
, we would just get a long, crappy table in our document. Not cool. Lets see if we can jazz this up a bit with kableExtra
.
mpg | cyl | disp | hp | drat | wt | qsec | vs | am | gear | carb | |
---|---|---|---|---|---|---|---|---|---|---|---|
Mazda RX4 | 21.0 | 6 | 160 | 110 | 3.90 | 2.620 | 16.46 | 0 | 1 | 4 | 4 |
Mazda RX4 Wag | 21.0 | 6 | 160 | 110 | 3.90 | 2.875 | 17.02 | 0 | 1 | 4 | 4 |
Datsun 710 | 22.8 | 4 | 108 | 93 | 3.85 | 2.320 | 18.61 | 1 | 1 | 4 | 1 |
Hornet 4 Drive | 21.4 | 6 | 258 | 110 | 3.08 | 3.215 | 19.44 | 1 | 0 | 3 | 1 |
Hornet Sportabout | 18.7 | 8 | 360 | 175 | 3.15 | 3.440 | 17.02 | 0 | 0 | 3 | 2 |
Valiant | 18.1 | 6 | 225 | 105 | 2.76 | 3.460 | 20.22 | 1 | 0 | 3 | 1 |
Here I used the pipe operator (%>%
) to pass the results of kable
to the function kable_styling
—a part of the kableExtra
package. By running kable_styling
as is, I am using defaults for all of the options.
The pipe operator is a powerful tool worth learning.
Back to the table. It certainly looks better than the default kable
version but we are missing the ability to page the table. I would like you to run these two commands and look at the output.
This command will include the whole table.
kable(mtcars) %>%
kable_styling()
And this command will transpose the table (swap rows and columns) using the transpose (t
) function. Remember, mtcars
has 32 rows and 11 columns but when you transpose the table, it has 11 rows and 32 columns. So it is really wide in the transposed state.
kable(head(t(mtcars))) %>%
kable_styling()
I hope you agree that neither of these tables are acceptable, especially the second one. Unfortunately, the kableExtra
package does not come with an option to add pagination. You can however put the table in a fixed-height, fixed-width (or both) box, and make it scrollable. We can do this by using a pipe operator and the scroll_box
function. While we are at it, lets also add tweak some options in kable_styling
to make a more handsome table.
kable(t(mtcars), caption = "*Scrollable kable table.*") %>%
kable_styling(bootstrap_options = c("striped", "hover",
"condensed", "responsive")) %>%
scroll_box(width = "100%", height = "300px")
Mazda RX4 | Mazda RX4 Wag | Datsun 710 | Hornet 4 Drive | Hornet Sportabout | Valiant | Duster 360 | Merc 240D | Merc 230 | Merc 280 | Merc 280C | Merc 450SE | Merc 450SL | Merc 450SLC | Cadillac Fleetwood | Lincoln Continental | Chrysler Imperial | Fiat 128 | Honda Civic | Toyota Corolla | Toyota Corona | Dodge Challenger | AMC Javelin | Camaro Z28 | Pontiac Firebird | Fiat X1-9 | Porsche 914-2 | Lotus Europa | Ford Pantera L | Ferrari Dino | Maserati Bora | Volvo 142E | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
mpg | 21.00 | 21.000 | 22.80 | 21.400 | 18.70 | 18.10 | 14.30 | 24.40 | 22.80 | 19.20 | 17.80 | 16.40 | 17.30 | 15.20 | 10.40 | 10.400 | 14.700 | 32.40 | 30.400 | 33.900 | 21.500 | 15.50 | 15.200 | 13.30 | 19.200 | 27.300 | 26.00 | 30.400 | 15.80 | 19.70 | 15.00 | 21.40 |
cyl | 6.00 | 6.000 | 4.00 | 6.000 | 8.00 | 6.00 | 8.00 | 4.00 | 4.00 | 6.00 | 6.00 | 8.00 | 8.00 | 8.00 | 8.00 | 8.000 | 8.000 | 4.00 | 4.000 | 4.000 | 4.000 | 8.00 | 8.000 | 8.00 | 8.000 | 4.000 | 4.00 | 4.000 | 8.00 | 6.00 | 8.00 | 4.00 |
disp | 160.00 | 160.000 | 108.00 | 258.000 | 360.00 | 225.00 | 360.00 | 146.70 | 140.80 | 167.60 | 167.60 | 275.80 | 275.80 | 275.80 | 472.00 | 460.000 | 440.000 | 78.70 | 75.700 | 71.100 | 120.100 | 318.00 | 304.000 | 350.00 | 400.000 | 79.000 | 120.30 | 95.100 | 351.00 | 145.00 | 301.00 | 121.00 |
hp | 110.00 | 110.000 | 93.00 | 110.000 | 175.00 | 105.00 | 245.00 | 62.00 | 95.00 | 123.00 | 123.00 | 180.00 | 180.00 | 180.00 | 205.00 | 215.000 | 230.000 | 66.00 | 52.000 | 65.000 | 97.000 | 150.00 | 150.000 | 245.00 | 175.000 | 66.000 | 91.00 | 113.000 | 264.00 | 175.00 | 335.00 | 109.00 |
drat | 3.90 | 3.900 | 3.85 | 3.080 | 3.15 | 2.76 | 3.21 | 3.69 | 3.92 | 3.92 | 3.92 | 3.07 | 3.07 | 3.07 | 2.93 | 3.000 | 3.230 | 4.08 | 4.930 | 4.220 | 3.700 | 2.76 | 3.150 | 3.73 | 3.080 | 4.080 | 4.43 | 3.770 | 4.22 | 3.62 | 3.54 | 4.11 |
wt | 2.62 | 2.875 | 2.32 | 3.215 | 3.44 | 3.46 | 3.57 | 3.19 | 3.15 | 3.44 | 3.44 | 4.07 | 3.73 | 3.78 | 5.25 | 5.424 | 5.345 | 2.20 | 1.615 | 1.835 | 2.465 | 3.52 | 3.435 | 3.84 | 3.845 | 1.935 | 2.14 | 1.513 | 3.17 | 2.77 | 3.57 | 2.78 |
qsec | 16.46 | 17.020 | 18.61 | 19.440 | 17.02 | 20.22 | 15.84 | 20.00 | 22.90 | 18.30 | 18.90 | 17.40 | 17.60 | 18.00 | 17.98 | 17.820 | 17.420 | 19.47 | 18.520 | 19.900 | 20.010 | 16.87 | 17.300 | 15.41 | 17.050 | 18.900 | 16.70 | 16.900 | 14.50 | 15.50 | 14.60 | 18.60 |
vs | 0.00 | 0.000 | 1.00 | 1.000 | 0.00 | 1.00 | 0.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.000 | 0.000 | 1.00 | 1.000 | 1.000 | 1.000 | 0.00 | 0.000 | 0.00 | 0.000 | 1.000 | 0.00 | 1.000 | 0.00 | 0.00 | 0.00 | 1.00 |
am | 1.00 | 1.000 | 1.00 | 0.000 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.000 | 0.000 | 1.00 | 1.000 | 1.000 | 0.000 | 0.00 | 0.000 | 0.00 | 0.000 | 1.000 | 1.00 | 1.000 | 1.00 | 1.00 | 1.00 | 1.00 |
gear | 4.00 | 4.000 | 4.00 | 3.000 | 3.00 | 3.00 | 3.00 | 4.00 | 4.00 | 4.00 | 4.00 | 3.00 | 3.00 | 3.00 | 3.00 | 3.000 | 3.000 | 4.00 | 4.000 | 4.000 | 3.000 | 3.00 | 3.000 | 3.00 | 3.000 | 4.000 | 5.00 | 5.000 | 5.00 | 5.00 | 5.00 | 4.00 |
carb | 4.00 | 4.000 | 1.00 | 1.000 | 2.00 | 1.00 | 4.00 | 2.00 | 2.00 | 4.00 | 4.00 | 3.00 | 3.00 | 3.00 | 4.00 | 4.000 | 4.000 | 1.00 | 2.000 | 1.000 | 1.000 | 2.00 | 2.000 | 4.00 | 2.000 | 1.000 | 2.00 | 2.000 | 4.00 | 6.00 | 8.00 | 2.00 |
For the scroll_box
function I set the width = "100%"
rather than specifying a dimension. This ensures the box will always be the width of the page no matter how small the window is.
Even though you cannot make a paged table with kable
, there are many styling options available in the kableExtra
package that makes this method extremely useful and worth learning. Plus, the code is relatively simple to write.
I want to show you one more feature that is not available the other table methods we cover in this Assignment—floating. Let’s first subset the mtcars
data set so we can make a small table. Next we use the full_width
and position
options to control the size and position of the table.
Here is our code chunk.
mtcars_sub <- mtcars[1:7,1:6]
kable(mtcars_sub) %>%
kable_styling(bootstrap_options = "striped",
full_width = FALSE,
position = "float_right")
mpg | cyl | disp | hp | drat | wt | |
---|---|---|---|---|---|---|
Mazda RX4 | 21.0 | 6 | 160 | 110 | 3.90 | 2.620 |
Mazda RX4 Wag | 21.0 | 6 | 160 | 110 | 3.90 | 2.875 |
Datsun 710 | 22.8 | 4 | 108 | 93 | 3.85 | 2.320 |
Hornet 4 Drive | 21.4 | 6 | 258 | 110 | 3.08 | 3.215 |
Hornet Sportabout | 18.7 | 8 | 360 | 175 | 3.15 | 3.440 |
Valiant | 18.1 | 6 | 225 | 105 | 2.76 | 3.460 |
Duster 360 | 14.3 | 8 | 360 | 245 | 3.21 | 3.570 |
Let’s say we have a bunch of text that we want to put side-by-side with this small table. Our subsetted mtcars
data set now has 7 rows and 6 columns. We can make our table smaller by setting full_width = FALSE
in kable_styling
and float the table by setting position = "float_right"
Please study the extensive options available in kable
and kableExtra
and create tables that implement some of the options.
Recommendation Use
kable
+kableExtra
for small tables where extensive styling is desired.
The last option I want to cover for building tables is implemented using the datatable
function from the DT package, an interface to the JavaScript library DataTables. To demonstrate the functionality, I will use a larger data set called USJudgeRatings
from the datasets
package. USJudgeRatings
has 43 rows and 12 columns. This table is too big—horizontally and vertically—to fit on a standard page.
The syntax for the DT::datatable
is more complicated than the other methods but that comes with more extensive functionality.
Working with
DT::datatable
is an advanced level skill. I highly recommend you learn how to use the package, but it will take practice.
Please make sure you are comfortable with the other methods first before trying to use DT::datatable
. I promise, if you do not know what you are doing, this package will cause a lot of frustration. That said, I use it all the time because it is awesome.
Moving on. If we run DT
on the USJudgeRatings
data set without any options the table will spill off the side of the page. Again, not cool. Try to run this command and see what happens.
datatable(USJudgeRatings)
DT::datatable
does not page tables horizontally like the paged_table
command does (described above). We can set the width of the table and add an option that allows horizontal scrolling. For this we use the options
argument. The syntax is to add width = "100%"
followed by options = list()
, where we put a comma separated list of options. For now, we just include scrollX
in our list of options.
Play around with the table a little. As you can see
The Show entries and Search box are added by default. We can decide whether to show these options or not. I will save that for later. For now, I want to leave you with a more stylized DT data table to give you a sense of the possibilities. Don’t worry so much about the code—pay attention to the functionality.
datatable(USJudgeRatings, width = "100%",
extensions = 'Buttons', options = list(
scrollX = TRUE,
dom = 'lfrtipB',
buttons = c('copy', 'csv', 'excel', 'pdf', 'print'),
pageLength = 5,
lengthMenu = c(5, 10, 20, 45)
)
)
I added buttons to download the table to different formats, changed the page length to 5, and changed the values in the Show entries drop down. Play around with the table. There is a lot more to do with this package and we will come back to it often.
Recommendation Use
DT::datatable
for large tables where extensive styling is desired.
That’s all for this assignment.
If you are interested in making your science more transparent and reproducible, you need to provide at least two things—the raw data you generated and the code you used to analyze it. Hopefully it is obvious by now that you need to provide access to both components; neither is particularly useful without the other. We will cover data availability in a future lesson. For this assignment, you will practice several different methods for making your code accessible. The method or methods you choose for your own work will depend on the type of analyses and the type of code.
Your assignment is to implement these methods in your document and play around with the different options.
Knitr comes with a built-in function called purl
, which allows you to extract all the R code from an R Markdown document and convert it to an R script. In order to run purl
, you must either load the knitr library first (i.e., library(knitr)
) and then run purl
or call the function directly by running knitr::purl()
. We will start by discussing some simple options for running purl
and then talk about how you run the command. The basic structure of the command is:
knitr::purl(input="filename.Rmd", documentation = L)
By default, purl
uses the base name of the input file as the base name of the output file. If you want to control this behavior, add the option output="filename.R"
where filename
is whatever you choose.
The option documentation
is an integer that specifies the level (L
) of documentation to add to the script. You have three choices
As far as I know, you cannot run purl
from inside an actual code chunk. I have no idea why nor was I able to find an answer. If have an answer or know a solution, please let us know. There easiest way to generate an R script from an R Markdown document with purl
to is to run the command in the Console like so:
knitr::purl(input="filename.Rmd", documentation = 1)
Run the
purl
command using the three options described above fordocumentation
option and look at the output files.
Inline R expressions are ignored by default. For example, if you had an inline R expression like this `r sqrt(2)`
the expression would not be included in the R script generated by purl
. If you want to include inline expressions in the R script, you need to set the global R option knitr.purl.inline = TRUE
before calling knitr::purl()
.
Remember way back when you generated your initial R Markdown document a default code chunk was added just below the YAML header?
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```
If you add knitr.purl.inline = TRUE
to that code chunk, all inline expressions will be added to the R script.
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE, knitr.purl.inline = TRUE)
```
The last thing I want to mention is that you can prevent certain code chunks from appearing in the R Script by adding the option purl = FALSE
to the chunk. For example, if you add this option to the setup
chunk we just discussed that chunk will not appear in the final document.
```{r setup, include=FALSE, purl = FALSE}
knitr::opts_chunk$set(echo = TRUE, knitr.purl.inline = TRUE)
```
Test the behavior of the different options for purl
on your document. You can also use this funtion on any R Markdown document to quickly retrieve the code. For example, if you download the Rmd file for this page from GitHub you can run purl
to quickly extract the R code.
Recommendation Use
purl
to generate a simple R script version of your R Markdown document after the document is finished and saved. You can then link to the file somewhere in your document.
I hope by now you have seen that any document I present in this course has a link to a GitHub repo where you can download or copy the .Rmd
file. That’s great and all, but wouldn’t it be nice if you could do it right from the HTML page? Well, you can by adding one simple line of code as a property of html_document
in your YAML header —code_download: true
.
output:
html_document:
theme: journal
toc: true
toc_float: true
toc_depth: 3
highlight: tango
code_download: true
You could of course set the property to false
except this is the default value and nothing will change. Look back at the top of this page and you should now see a Code
button in the upper right corner. Click on the drop down and select Download Rmd
and this entire page should be saved to your Downloads directory.
Go ahead and add this option to your YAML header.
I have mentioned a time or two that the benefits of using R Markdown is the ability to execute and display code in your final document. If you go back to the default setup
chunk at the top of your .Rmd
you should see this:
knitr::opts_chunk$set(echo = TRUE)
I discussed this chunk previously in #5 of the Chunk structure & options section in Lesson 2. Briefly, this is a global command that ensures all R code chunks are visible in the final document, unless you escape this behavior by using echo = FALSE
in a particular chunk. Of course, you could also set the global option to FALSE
as is knitr::opts_chunk$set(echo = FALSE)
and then none of the R code would be visible at all. This is a perfectly fine option in some situations but not in others.
Let’s face it—code takes up a lot of space in a document and large code chunks are not particularly pleasing to look at. You may encounter situations where you both want the code available on a page but you also want to hide the code. For this we use a technique called code folding. Near the bottom of Lesson 0, I used some simple HTML to make a section that folds the R code for the Clifford Attractor. There is a little Show/hide
button that allows you to look at the code if you want to; otherwise it is hidden by default. But this approach is a little clunky because you must a) know some HTML and b) include this for every chunk.
R Markdown has a similar functionality for showing and hiding code but it only takes a single line of code added to the YAML header. Again, return to your YAML header and add the argument code_folding:
as a nested property of html_document:
. Your two options for code_folding:
are show
and hide
.
output:
html_document:
theme: journal
toc: true
toc_float: true
toc_depth: 3
highlight: tango
code_download: true
code_folding: hide
Once you add this to the YAML header, the drop down Code
button in the upper right corner of the page should now include Show All Code
and Hide All Code
. This allows the user to specify how they want to view the code in the document. Either property you set for code_folding
will be the default state for the entire document.
You should also notice that a new Code
button appears next to every code chunk in your document. Again, if you want to exclude the code from individual chunks, just set echo = FALSE
for that chunk and the code will not be included at all.
Add
code_folding:
argument to your YAML header and set a property value.
As far as I know there is no way of folding the results of a code chunk or folding individual chunks unless you use HTML.
That’s all for this assignment.
The last major piece of the puzzle is to add graphics to your R Markdown document. Think of this Assignment more as a mini guidebook and less as a to-do list of activities. The reason being, is that the topic of graphics is much too broad to cover in-depth during this course.
For the sake of convenience, I will use figure to refer to any visual representation of data (chart, plot, graph, etc.) made with (R) code and generated when the document is rendered. An image on the other hand, is any external visual product inserted into the document, like a png
or jpg
.
The distinction is that figures are created when the R Markdown document is rendered, and the R code is processed. In contrast, images are added to the rendered document by referencing a file.
Let’s look at an example that demonstrates the difference between a figure and an image. We can use the built-in pressure
data set from the datasets
package and the plot
command to display a temperature-by-pressure plot. Here is the simple line of code followed by the figure that is created when the document is rendered.
Basically, when I knit the R Markdown file (.Rmd
) three things happen. First, knitr
reads the code chunk, creates the plot, and saves the plot as a .png
file. Next, knitr adds a line of code to the subsequent Markdown file (.md
), which links to the .png
file. Finally, PanDoc embeds the .png
file into the HTML document. And just like that, I have a plot in my HTML document from a single line of R code.
Now instead let’s say I need to do some post-processing on the plot. I want the points to be larger and red, but I am too lazy to figure out how to code this in R (you should not be so lazy). I can use the same code to generate the plot in R but this time I will manually save the plot as a png
file. I then make the modifications in a vector graphics program like Inkscape, save the modified file, and then use Markdown code in my .Rmd
file to include the image in the final document. In this case, the plot is added directly to the HTML document by PanDoc rather than first being generated by knitr.
# R code to create and save image
png("files/pressure.png", width = 680, height = 500,
units = "px", pointsize = 16)
plot(pressure)
dev.off()
# Markdown code in the Rmd file to add an image.
![](files/pressure_modified.png)
Image added to the rendered document by referencing a file using Mardown.
Of course, you can add any image you want. The image does not need to be generated by R code or be a png
. For example, here is a Girih tile mosaic I made using the online tool Girih Designer.
![](files/girih.jpg)
Girih tiles are a set of regular polygons (with overlaid strapwork) used in medieval Islamic architecture. Check out this cool Science paper from 2007 that explores the geometry behind Girih tiles.
It is important to understand the distinction between code-generated figures vs. imported images. You need different tools to display and control each type of graphic.
Your goal for this Assignment is to add figures and images to your R Markdown page.
Check out this blog post from ZevRoss on Tips and tricks for working with images and figures in R Markdown documents. I reference this resource a lot.
I think a lot about data visualization and in my opinion, it is one of the most important skills you can develop as you move forward with your own work.
The simple graph has brought more information to the data analyst’s mind than any other device. — John Tukey
Almost any kind of data can be visualized and there are literally dozens and dozens of figure types you can create with R code. This includes maps, networks, flow charts, dendrograms, and so on. In an HTML document, it is possible to create both static and interactive figures. However, data visualization in R is a broad topic and can often get quite complicated. In addition to the numerous chart types, there are many R packages you can use for visualizations. Some, like ggplot2
, have extensive functionality while others (e.g., leaflet) are more appropriate for specific types of data and/or visualizations. Ultimately, the approach you choose will depend on your data and the type of message you want to convey.
My point in saying this is that a comprehensive tutorial on data visualization in R is simply beyond the scope of this Assignment. If there is sufficient interest in particular packages and/or chart types I am happy to write a lesson plan. For now, I would like to focus on a few best practices and strategies that I think are important for creating visualization products using code in your R Markdown document.
This list is far from comprehensive. There is a paper in PLoS Computational Biology from 2014 on Ten Simple Rules for Better Figures, which provides a nice primer on the subject of figure design.
If you want to get deeper into visualization methods and techniques, beyond what we discuss here, I strongly encourage you to learn more about the work of Edward Tufte, widely recognized as a pioneer in the field of data visualization.
For some inspiration, I also recommend visiting Martin Krzywinski data visualization site. The site itself is dense, but Martin’s visualizations are incredible, and the site is updated often.
One of the first things to think about is what kind of chart you want to create. As I mentioned, this will depend on the type of data you have and the message you are trying to convey. There are different factors to consider when selecting a chart type and many online tools available to help narrow down your selection.
The Chart Chooser was originally published in 2009 so it is a bit old now. But it can give you a good place to start when choosing a chart type. Cool Infographics by Randy Krum has a nice DataViz page that provides a more up-to-date collection of data visualization chart choosers, reference guides, and cheat sheets.
Yan Holtz has developed several websites dedicated to data visualization. These resources are AMAZING not to mention that his websites are beautiful. Please look at Yan’s From Data to Viz page. This page is interactive and designed to help you choose the most appropriate chart for your data. It has links to the code, explanations, and common mistakes to avoid.
Assignment: Spend time familiarizing yourself with different data visualization techniques and chart types. Think about your own data and data visualization needs.
The next step is to write the code for your figure. Where possible, I highly recommend that you code all of your figures directly in the R Markdown document. Coding your figures directly is beneficial for several reasons. By creating figures directly in an R Markdown document and including the code you can:
Here is an example of what I mean. On the left is the published version of an NMDS plot from a recent paper. On the right is the raw version of the plot generated with R code only—before any post-processing.
PDF post-processed in Inkscape | R code only saved to PDF |
I hope you agree that the differences between the two are minor. In the final version, I changed the location of the legend, moved some labels around for clarity, modified the font, and tweaked the vector overlays. Again, the goal is to code as much of the figure as possible, relying of vector graphics programs like Inkscape and Illustrator for beautification only.
As I mentioned above, teaching and learning R code to generate figures is currently beyond the scope of this course. That said, I want you to explore the resources listed below. Doing so will give you a better sense of different chart types and the R code used to generate them. If you do not know much R, start by copying code and then playing around with different parameters. And make sure you understand the default setting.
Your Assignment is to select 2-3 different chart types and add the R code to your document. Ideally, you would use your own data but feel free to use the dummy data provided with the examples.
If you have not used R to generate figures before, I highly recommend you first look at the Tutorial on Graphs from the Introduction to R course by Ernesto Bonadies, Lee Dietterich, Tauana Cunha, and Bruno de Medeiros. The tutorial covers some built-in R graphing functions as well as ggplot2
. It’s a great tutorial.
Remember, your visualization needs are particular to you and your needs. Find some charts that look interesting or useful and add them to your document. Of course, I am more than happy to help you implement any type of graph in your document :)
If you want to use other languages, similar pages are available for D3.js (a JavaScript library) and Python.
r-statistics.co by Selva Prabhakaran has Top 50 ggplot2 Visualizations (with full R Code).
Leaflet is one of the most popular open-source JavaScript libraries for interactive maps. Code + Tutorials.
I want to turn now to the topic of color. A question to ask yourself is do I need to use color in this figure? If you are unsure, the answer is probably ‘no’. High-contrast black and white graphics can be more readable than color and also better for copying and printing. So, if you don’t need color then don’t use it.
Edward Tufte is fond of saying that color can be your best ally or your worst enemy. Color is an important visual dimension that can help tell a compelling story. However, when it is not used correctly, color can be counterproductive and lead to confusion about the message a figure is trying to convey.
In my field—microbial ecology—we often rely on color to display information about diversity. Microbial diversity is pretty vast, and it can be difficult to display all of this diversity in a single, static figure. All too often, people misuse color and the literature is full of figures that are very difficult to interpret. I thought about showing you some actual examples but felt that it would be a little mean-spirited. Instead, I made a figure using my own data to demonstrate this point.
Here, each bar represents a different sample and each color a different microbial Class.
When you use too many colors you force your reader to continually go back and forth from the chart to the legend. This is annoying, confusing, and ultimately detracts from the meaning of the figure.
Another problem is that many of us have different abilities to perceive color and/or differences in color. Color blindness, also known as color vision deficiency, is a decreased ability to see color or differences in color. When designing figures, it is important to use a) a relatively small color palette and b) a palette that is friendly to a variety of people.
For more information on the topic, I recommend reading Bang Wong’s Nature Methods paper, Points of view: Color blindness. Wong proposes a color-blind friendly color palette that uses contrasting colors that can be distinguished by a range of people. Consider that roughly 8% of people (mostly males) are color blind. So, what do you think? Do you want Keanu Reeves to understand your figures or not? If so, consider a color scheme that is friendly to all vision types
Wong’s scheme is conservative—there are only 8 colors. Others have developed 12,13, and 15 color palette schemes that offer more options than Wong’s palette. Just be careful—–figures with too many colors can inhibit our ability to discern patterns.
Limited color palettes like these force us to be more careful when deciding what information to target or how many groups to display. For your consideration, I have included a few color-blind friendly color palettes. You can download these colors as plain text by clicking on the palette title.
Color | Name | HEX | RGB |
---|---|---|---|
_______ | black | #000000 | 0, 0, 0 |
_______ | blue | #0072B2 | 0, 114, 178 |
_______ | sky blue | #56B4E9 | 86, 180, 233 |
_______ | reddish purple | #CC79A7 | 204, 121, 167 |
_______ | bluish green | #009E73 | 0, 158, 115 |
_______ | vermillion | #D55E00 | 213, 94, 0 |
_______ | orange | #E69F00 | 230, 159, 0 |
_______ | yellow | #F0E442 | 240, 228, 66 |
Color | Name | HEX | RGB |
---|---|---|---|
_______ | dark grey | #323232 | 50, 50, 50 |
_______ | jazzberry jam | #BF3465 | 191, 52, 101 |
_______ | kepel | #50B29E | 80, 178, 158 |
_______ | light grey | #D9D9D9 | 217, 217, 217 |
_______ | seance | #731683 | 115, 22, 131 |
_______ | mariner | #1C6CCC | 28, 108, 204 |
_______ | capri | #21BCFF | 33, 188, 255 |
_______ | pale plum | #DFA5E5 | 223, 165, 229 |
_______ | saddle brown | #874310 | 135, 67, 16 |
_______ | mars yellow | #DB6D1B | 219, 109, 27 |
_______ | fuego | #B8CE17 | 184, 206, 23 |
_______ | paris daisy | #F4E345 | 244, 227, 69 |
Color | Name | HEX | RGB |
---|---|---|---|
_______ | black | #000000 | 0, 0, 0 |
_______ | blue lagoon | #006E82 | 0, 110, 130 |
_______ | seance | #8214A0 | 130, 20, 160 |
_______ | science blue | #005AC8 | 0, 90, 200 |
_______ | azure radiance | #00A0FA | 0, 160, 250 |
_______ | blush pink | #FA78FA | 250, 120, 250 |
_______ | bright turquoise | #14D2DC | 20, 210, 220 |
_______ | shiraz | #AA0A3C | 170, 10, 60 |
_______ | salem | #0A9B4B | 10, 155, 75 |
_______ | salmon | #FF825F | 255, 130, 95 |
_______ | ronchi | #EAD644 | 234, 214, 68 |
_______ | mint green | #A0FA82 | 160, 250, 130 |
_______ | dairy cream | #FAE6BE | 250, 230, 190 |
Color | Name | HEX | RGB |
---|---|---|---|
_______ | black | #000000 | 0, 0, 0 |
_______ | sherpa blue | #004949 | 0 73 73 |
_______ | persian green | #009292 | 0 146 146 |
_______ | hot pink | #ff6db6 | 255, 109, 182 |
_______ | cotton candy | #ffb6db | 255, 182, 219 |
_______ | pigment indigo | #490092 | 73 0 146 |
_______ | science blue | #006ddb | 0, 109, 219 |
_______ | heliotrope | #b66dff | 182 109 255 |
_______ | malibu | #6db6ff | 109, 182, 255 |
_______ | french pass | #b6dbff | 182 216 255 |
_______ | red berry | #920000 | 146, 0, 0 |
_______ | brown | #924900 | 146 73 0 |
_______ | mango tango | #db6d00 | 219, 109, 0 |
_______ | harlequin | #24ff24 | 36, 255, 36 |
_______ | laser lemon | #ffff6d | 255, 255, 109 |
If you want to create your own color palette (say based on Wong’s colors) you would write the code like this. Then you could pass the character string wong_pal
to any place you need a color palette.
wong_pal <- c("#009E73", "#D55E00", "#F0E442",
"#CC79A7", "#56B4E9", "#E69F00",
"#0072B2", "#7F7F7F", "#000000")
Here is another great article on Coloring for Colorblindness by David Nichols that has an interactive color picker and recommendations for accessible palettes.
Remember, these are only suggestions. Ultimately, it is your choice how to use color and what palettes work best for you needs, I am merely offering some food for thought. Again, color and figure design are very broad—I encourage you to spend some time exploring these topics in more depth with the reference I provided. See where it takes you.
The last thing we need to discuss is how to control the way your plot is rendered in the final document. Individual R packages have different options for controlling various aspects of a plot or chart including color, point size, line thickness, axis labels, etc. Again, this topic is beyond the scope of this lesson but let’s look at an example anyway. We return to our simple example from above using the plot
function with the cars
data set. We can use different built-in options to modify the chart (color added for demonstration purposes only).
plot(cars, main = "Plot Title",
sub = "This is the plot subtitle",
xlab = "temperature (C)",
ylab = "pressure (Pa)",
pch = 24, cex=1.5, col="red", bg="blue", lwd=2)
This code controls the plot style but not how the plot is rendered in the final document. By that I mean the size and alignment of the plot in the HTML document. To control these aspects, you will need to use various code chunk options. Knowing how to code each chunk option and knowing what each does takes some practice. Let’s look at a few of the most common options.
fig.align
: (‘character’) Alignment of figures in the output document (possible values are left
, right
and center
. Note, character values must be enclosed in single quotation marks like so: 'center'
.fig.width
and fig.height
: (default are 7 and 5, respectively; numeric) Width and height of the plot (in inches). Note Only works with R-generated figures.fig.asp
: (default NULL; numeric) The aspect ratio of the plot, i.e. the ratio of height/width; when fig.asp
is specified, the height of a plot (the chunk option fig.height
) is calculated from fig.width
X fig.asp
fig.dim
: (default NULL; numeric) Numeric vector of length 2, it gives fig.width
and fig.height
, e.g., fig.dim
= c(5, 7) is a shorthand of fig.width
= 5, fig.height
= 7; if both fig.asp
and fig.dim
are provided, fig.asp
will be ignored (with a warning)out.width
, out.height
: (NULL; character OR numeric) Width and height of the plot in the final output file (can be different with its real fig.width
and fig.height
, i.e. plots can be scaled in the output document). For example, as a percent out.width = '80%
or in pixels out.width = 600px
.fig.cap
: (NULL; character) Figure caption to be used. Note, character values must be enclosed in single quotation marks like so: 'This is a figure caption'
.OK, let’s use chunk options to change the size and alignment and add a caption. I will hide the R code since it is the same as above. If you want to see the chunk options you need to look at the raw .Rmd
file on the GitHub repo.
If you want to display multiple figures side-by-side, you need to set fig.show='hold'
. This chunk option holds all plots and outputs them at the very end of a code chunk, as opposed to printing them one by one. You also need to set out.width
equal to a percent value divided by the number of plots. For example, suppose we want to display three plots side-by-side, we need to set out.width='33%'
.
```{r grid, fig.show='hold', out.width='33%'}
p1 <- plot(pressure)
p2 <- plot(cars)
p3 <- plot(pressure)
```
There are more sophisticated ways to display multiple figures using something like grid.arrange
(from the gridExtra
package) or plot_grid
(from the cowplot
package). I encourage you to look at these packages.
Play around with these options to see how it affects the final output. Chunk options for figures can be confusing at times but you will get the hang of it with a bit of practice.
That is all I have to say about figures. Again, this is a broad topic and I suggest you look around to see what is possible. If there is something specific you want to do please post it to the Slack channel.
The other type of graphic you may want to use is an image—any externally generated visual product inserted into the document, like a png
or jpg
. When plots are not generated with R code, there is no way for knitr
to capture plots automatically. In this case, you may generate the images manually and then pass their file paths to some other function to include these images in the output. Don’t worry, this section is not long. First I want to cover a few best practices and then go over different ways of displaying and controlling images.
In R Markdown, you can use GIF (Graphic Interchange Format) for simple animations or SVG (Scalable Vector Graphics) for things like logos and maps. These file types are used in special situations and will not be covered here. By far the easiest file types to work with in R Markdown for static images are PNG (Portable Network Graphics) and JPG/JPEG (Joint Photographic Experts Group). There are really no strict rules for when you should use PNG vs. JPG. Generally speaking though:
Another thing to consider is image compression. Nothing slows down a website more than a bulky image. So always compress your images before adding them to your website. There are a ton of online and standalone compression tools available for compression. I use the tools from WebsitePlanet and Optimizilla. Honestly, I haven’t spent a ton of time comparing tools, but I want you to find a couple that work for you.
If you plan on working a lot with images, I strongly encourage you to get familiar with the different file types and compression options.
I would like to cover three basic methods to add images to an R Markdown document.
knitr::include_graphics
magick
packageMarkdown syntax is the easiest way to add an image, however you sacrifice control because there are very few options. I demonstrated the Markdown syntax way back at the beginning of this assignment but at this point it’s probably a distant memory. So, let’s review. The syntax is simple. Start a new line with an !
, followed by open and closed brackets ([]
), and then the relative file path, like so:
![](files/julia.png)
Image of a Julia set, an iterated function system generated using the equation \(f_{c}(z)=z^{2}+c\) where \(c\) is a complex number, in this case \(c=-0.7269 + 0.1889i\). Downloaded from Wikimedia Commons, created by IkamusumeFan, and licensed under CC-BY-SA-4.0.
You can also display multiple images side-by-side using Markdown. In this case, you must specify the width of an image (where \(width \leq 100/n\) & \(n = no. of images\)) and put the Markdown code for each image on the same line (no space between the two tags).
![](files/julia.png){width=50%}![](files/julia2.png){width=50%}
Two different Julia sets. The image on the right (\(c = −0.835 − 0.2321i\)) was downloaded from Wikimedia Commons, created by Simpsons_contributor, and released under the public domain.
We can also use Markdown code to include an image directly from a URL.
![](https://upload.wikimedia.org/wikipedia/commons/0/0a/Subsection_Bifurcation_Diagram_Logistic_Map.png)
A chaotic region from the bifurcation diagram of the logistic map, derived by iterating the equation \(x_{n+1} = rx(1-x_n)\) over all values for \(r[0,4]\). The map was popularized in a paper by the biologist Robert May entitled Simple mathematical models with very complicated dynamics. Image by InXnI and licensed under CC0. Original.
The issue with using Markdown code for images is that there are few options. It is difficult to make modifications without using HTML and CSS.
knitr::include_graphics
If you want a bit more control over the output you can use a built-in knitr
function called include_graphics
. The major advantage of this function is that it is portable, meaning that it works for all document formats that knitr
supports. That way you do not need to use different syntax for different document types to embed an external image. Since we use include_graphics
within a chunk, several chunk options are available for controlling the output. Go to your console window in RStudio and type ?knitr::include_graphics
to see the help menu. As you can see, there are very few options. Most of the control will come from using chunk options.
Note, the double colon (::
) in this command specifies the exact function from that specific package. This prevents any problems from arising if another package also has the same function name. I am not aware of any other packages that have the function include_graphics
but the convention I have seen is the knitr::include_graphics
so I will use that here. It is up to you whether or not you want to use the double colon.
It is important to note that some of the chunk options we used for figures will not work for images.
Specifically, fig.width
, fig.height
, and fig.dim
do not work with images. To control the size, we need to use out.width
and out.height
. Remember, out.width
and out.height
need either a character (e.g., '50%'
) OR a numeric (600px
) value. We also have access to fig.align
for placement and fig.cap
to add a caption. Here is a code chunk using knitr::include_graphics
.
```{r cave1, fig.cap="INSERT CAPTION.", out.width = '100%', fig.align='left'}
knitr::include_graphics("files/cave.jpg")
```
As we did above when we displayed figures side-by-side, we can do the same with images and the include_graphics
function. Here again we fig.show='hold'
and specify the out.width
. We use the c
function to combine values—in this case image file names—into a list. This is analogous to the Markdown syntax we used above fir displaying images side-by-side.
```{r cave_rock, out.width='50%', fig.cap="INSERT CAPTION", fig.show='hold', fig.align='left'}
knitr::include_graphics(c("files/cave.jpg","files/rock.jpg"))
```
As we did with Markdown, we can also use knitr::include_graphics
to add an image directly from the web.
magick
package: Advanced image-processing in RThe magick
package is designed to modernize and simplify high-quality image processing in R by wrapping the ImageMagick STL. First, install magick
in R using this command:
install.packages("magick")
Then add this to your setup code chunk:
library(magick)
I will highlight just a few of the options available in the magick
package, but please know this package has a lot of functionality. Much of what we controlled above with chunk options (e.g., out.width
) can instead be controlled directly with magick
. In addition, you can edit images, add text annotation, include filters and effects, animate, overlay images, etc. This package is definitely worth learning, especially if you plan to work with images.
I will not use any code chunk options to control the behavior of the image so I can demonstrate some of the magick
functionality. For this demonstration I will use an image of the wings of a female Blue-Spotted Hawker, Adversaeschna brevistyla by Jogn Tann licensed under CC BY 2.0. For this demonstration the jpg
was converted to png
using JPG to PNG then imported to Gimp where I cropped the imaged and made the background transparent. OK. Displaying an image is easy using the image_read
command. I will set the variable wings
so I can manipulate the image later on without needing to call the path. We can use the print
command to get some details about the image. If you do not plan to make any changes you can simply run the command without the variable name.
# image_read("files/wings.png") Uncomment to render the image directly.
wings <- image_read("files/wings.png")
print(wings)
## # A tibble: 1 x 7
## format width height colorspace matte filesize density
## <chr> <int> <int> <chr> <lgl> <int> <chr>
## 1 PNG 9759 4455 sRGB TRUE 1063380 72x72
This image is much larger than the page so knitr resized it to fit in the page container.
How about changing the background color and adding a border? This is an example of stringing together multiple commands. Cool.
We can also flip the image and resize proportionally to height of 200px.
Every time we call an image_
-like command, magick
will print the image. So, unless we want a bunch of images we need to assign a variable each time we use an image_
command. Then we can simply call the variable to print the image.
How about we make a set of wings? To do this, I will:
wings.png
file. These are the right-side wings.First I read in the file and assign it to the variable right
. Next, I scale the image using the image_scale
command. Notice I set the dimensions to x400
. This means resize proportionally to height of 200px. If I wanted to resize proportionally to width I would write it like this, 400
.
Now I will crop the image with image_crop
. This was a bit of a pain, but basically I left the height the same by specifying x400
and then set cropped 150px off the left of the image. You really just need to play around with this to get it right.
Finally, I use the c
command to combine the variables into a list and then use image_append
to print them side-by-side. Now we have a set of wings.
Your Assignment is to work through the magick tutorial and use
magick
in your document.
If you have a pretty big document already, I recommend starting a new dummy document for this tutorial. That way you can make changes and render the document more quickly. Work with JPG files, PNG files with a transparent background, and SVGs. You can also use the magick
package with R code generated figures so play with that too. Learn the magick package by working through the tutorial.
That’s all for this Assignment and for Lesson 2. You should now have all the big pieces you need to make an awesome, single-page, R Markdown document.
## R version 4.0.0 (2020-04-24)
## Platform: x86_64-apple-darwin17.0 (64-bit)
## Running under: macOS Catalina 10.15.4
##
## Matrix products: default
## BLAS: /Library/Frameworks/R.framework/Versions/4.0/Resources/lib/libRblas.dylib
## LAPACK: /Library/Frameworks/R.framework/Versions/4.0/Resources/lib/libRlapack.dylib
##
## locale:
## [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] magick_2.3 cowplot_1.0.0 ggplot2_3.3.0 gridExtra_2.3
## [5] scales_1.1.0 DT_0.13 rmarkdown_2.1 kableExtra_1.1.0
## [9] epuRate_0.1 knitr_1.28
##
## loaded via a namespace (and not attached):
## [1] Rcpp_1.0.4.6 highr_0.8 pillar_1.4.4 compiler_4.0.0
## [5] base64enc_0.1-3 tools_4.0.0 digest_0.6.25 jsonlite_1.6.1
## [9] gtable_0.3.0 evaluate_0.14 lifecycle_0.2.0 tibble_3.0.1
## [13] viridisLite_0.3.0 png_0.1-7 pkgconfig_2.0.3 rlang_0.4.6
## [17] cli_2.0.2 rstudioapi_0.11 crosstalk_1.1.0.1 yaml_2.2.1
## [21] xfun_0.13 withr_2.2.0 dplyr_0.8.5 stringr_1.4.0
## [25] httr_1.4.1 xml2_1.3.2 vctrs_0.2.4 htmlwidgets_1.5.1
## [29] hms_0.5.3 tidyselect_1.0.0 webshot_0.5.2 grid_4.0.0
## [33] glue_1.4.0 R6_2.4.1 fansi_0.4.1 purrr_0.3.4
## [37] readr_1.3.1 magrittr_1.5 ellipsis_0.3.0 htmltools_0.4.0
## [41] assertthat_0.2.1 rvest_0.3.5 colorspace_1.4-1 utf8_1.1.4
## [45] stringi_1.4.6 munsell_0.5.0 crayon_1.3.4