Skip to content
View hathawayj's full-sized avatar

Highlights

  • Pro

Block or report hathawayj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
hathawayj/README.md

Overall Stats Top Langs Visitor LinkedIn

Classes and Materials

I use our byuistats organization to host most of our data science curriculum at BYU-I. The public facing course sites allow you to see some details of the courses. Some of the statistics faculty host their material there as well.

I use our byuidatascience organization to host data that we use in our courses as well as my port of the R for data science book into Python.

My current 'work in progress' course is Big Data Programming and Analytics.

During Fall 2021 I was a visiting teaching fellow at Kennesaw State University. Here is the special topics course on data science in R and Python that I taught.

You can read more about the BYU-I Data Science Program using our BYU-I data science program website

Consulting

Our data science program integrates company work into our courses. Statistical and Data Science Consulting works in collaboration with the RBDCenter to offer experiences to undergraduate students and industry partners.

In addition, I consult on varied data science applications and environmental sampling under DataDriven Llc. and DataThink.io.

  • Data Visualization: I specialize in ggplot2 programming and the principles of data visualization.
  • Tidyverse Programming: You can't visualize complex data without being very handy with dplyr, tidyr, and the other packages of the tidyverse.
  • Pyspark and Big Data Programming: My current heavy consulting focuses on medical records data at scale.
  • Statistical and Machine Learning Analytics: A strong background in machine learning and statistical modeling.
  • Sample size design: For ten years, I was a lead statistician on Visual Sample Plan and managed it for a little over a year before I moved to academia.

Pinned Loading

  1. remark-template remark-template Public template

    Make sure to change the below URL to your repo after you create your repository from this template. Turn on github.io pages from the settings and set your source to master and root.

    HTML

  2. io-pandas-tidyverse io-pandas-tidyverse Public

    R

  3. medium-data medium-data Public

    Some explorations on handling data in Python and R

    Python 5