data pipelines in R

Part 1 of this series can be found here.

Loading Packages

####Load Packages####
library(tidyverse)
library(readxl)
library(apaTables)
library(sjPlot)
library(strengejacke)

Loading the log-trace data and self-report survey data. Note that we assign a data set to an object three different times, once for each of the three different datasets.

####Import Data####
# Pre-survey for the F15 and S16 semesters
pre_survey<- pre_survey
# Gradebook and log-trace data for F15 and S16 semesters
course_data <- course_data
# Log-trace data for F15 and S16 semesters — this is for time spent
course_minutes <- course_minutes

1. Pre-survey data Often, survey data needs to be processed in…


Data science in educaction

This is part one of a series of articles in Building data pipelines in education. I recently read an article from medium that really taught me something, to learn data science and grasp the context, you must learn by doing (projects) and write/share/teach on the topic. So I decided to give it a try. This is my first article and I hope to continue writing on Data science, Data Engineering and machine learning.

If education were a building, it would be multi-storied with many — rooms. There are privately and publicly funded schools. There are more than 15 possible grade…

Francis Gichere

Data Scientist at Hawk Data-Hub. Adept user of R and Python with interests in Regression modelling, Design and Analysis of Experiments and Time series analysis.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store