Category Archives: TidyX Screen Cast

TidyX Episode 120 – Working within columns in tidyverse

This week, Ellis Hughes and I go over various things you can do with columns in {tidyverse}.

Some of the topics we cover:

  • Selecting columns by both name and number
  • Using starts_with(), ends_with(), and match() functions to select columns with a specific words within their string
  • Using where() to select and rows where a specific condition is met
  • Several approaches to renaming columns
  • Relocating columns to put columns in the order you’d like them to be in
  • Differences between pluck() and pull() functions

To watch the screen cast, CLICK HERE.

To access our code, CLICK HERE.

TidyX Episode 118: Calculating summary statistics using window functions

Ellis Hughes and I continue working on data cleaning and data engineering processes in {tidyverse}. This week, we go over various ways of creating window functions using the {zoo} package. These types of functions are very useful when dealing with signal processing data such as GPS, accelerators, or force plates. In our example, we use the Lahman baseball database to get the home runs per season for Albert Pujols (who recently hit his 700th career HR!) to show windows of rolling average and standard deviation.

If you have any data cleaning or engineering problems that are giving your trouble, feel free to reach out to us and maybe we can work on the problem in one of our upcoming episodes!

To watch our screen cast, CLICK HERE.

To access our code, CLICK HERE.