Interested in learning Machine Learning (with Python)? Start with Kaggle’s ultra-practical and straight-forward ‘Intro to Machine Learning’ tutorial. It only takes 3 hours.
Got a new AWS account? Do these 3 steps now.
A new AWS account is granted, by default, root privileges. That means it has access to everything. Best practices dictate we avoid using root privileges whenever possible: Enable Multi-Factor Authentication, create and use an Administrator user account instead of root. Also, setup billing alerts to avoid surprise charges.
Pass column name as parameter to a function using dplyr
An R function with a parameter that accepts a data.frame column can’t evaluate the column argument until it is first ‘quoted’, followed by an ‘unquote’ within the dyplr function. ‘Quote’ a column using enquo(), then ‘unquote’ it using !!.
Radial data challenge
I stepped up to the Storytelling with Data Challenge for July 2019: identify data that makes sense to plot in a radial view and visualize it.
Viz and me, true love or just friends?
I love great visualizations, but does that mean I should exclusively commit myself to the Art and Science of Viz? Is being a friendly connoisseur good enough?
Add whitespace between RStudio markdown chunks
A little white space between R Markdown sections gives the reader a little pause in the story before changing the subject otherwise all the cells get smooshed together by default and it ends up looking like one long run-on sentence.
Add gradient colors to a bar chart with ggplot()
To emphasize value differences in a bar chart, using gradient colors is a nice option. ggplot makes it easy with scale_fill_gradient2().
Udacity Deep Learning Nanodegree: Completed
Today I completed the Udacity Deep Learning Nanodegree (DLND) program. The program included submitting 5 projects for review, which is the best part because we get our hands dirty. AWS (Amazon Web Services) was introduced, and we got to use AWS SageMaker for free (with Education ‘credit’). All-in-all I feel like I’ve been introduced to … [Read More]
Preserve structure when saving data to files in R
Writing data to a file using write.csv failes to retain column data types. A better way to save/restore data objects to/from files is by using RDS and RData.
ifelse alternative in R
Instead of a cumbersomely nested ifelse statement, use dplyr’s mutate and case_when functions instead.
replace_na not working
Convert data.frame factor columns to character columns before trying to replace NULL values.
Finding NULL values in R
Find NULLs in a dataframe using is.na().