About the GitHub Repo

Introduction


The code examples provided in this Expo are available in a GitHub repository.

This page describes how to download or clone the repository (repo), the directory structure, and some of the tools to set up and manage the R packages used in the analysis.

This Expo demonstrates how to get started with the suite of tools that we use at Metrum Research Group to ensure traceable and reproducible pharmacometrics research. We proceed step-by-step through fitting a simple non-linear mixed effects model using Stan. We chose this example to illustrate a functional workflow of these tools with an understanding that some standards (e.g., sample sizes, etc) are simplified relative to the complexities of many typical analyses.

Note that the code provided in this GitHub repo may not work exactly as written on your system. The code was written and tested in a Metworx environment; minor modifications may be required to allow for differences in the setup of your modeling environment. If you believe an error is included in any of the scripts, or would like help in troubleshooting the code, you can post an issue to the GitHub repo by following the “Report an issue” link on the right sidebar of this page.

Tools used


MetrumRG packages

Metrum Package Network (MPN) / pkgr Create and manage curated, reproducible R package environments.

CRAN packages

renv Reproducible environments for your R projects.

Browsing and downloading the code

All the code and data for this Expo are located in the GitHub repository. The GitHub web interface allows you to explore the code and data in the project from your web browser.

The entire Expo can be downloaded by clicking the green Code dropdown and selecting Download ZIP.

Users familiar with Git and GitHub can clone the repo and interact with it as they would in any other Git project.

Repository structure

This Expo uses the same directory and file structure as all MetrumRG projects. The repo README briefly outlines the location of key datasets and scripts. Below are descriptions of some of the top-level directories and files in the repo.

  • data/ - Data files with the following subdirectories:
    • data/source/ - Raw source data. This directory is empty because this Expo doesn’t demonstrate the assembly of a derived data set from the source data.
    • data/derived/ - Data sets derived for use in this Expo.
    • data/spec/ - Data specification YAML files. See Introduction to yspec for more details.
  • script/ - Analysis scripts including the .R and .Rmd files are described in Project walkthrough page listing .
  • model/ - Stan model files and model outputs. Typically, there will be subdirectories for different parts of the analysis.
  • deliv/ - Project deliverables with the following subdirectories:
    • deliv/figure/ - Figures, typically pdf and png files. There will be subdirectories for different parts of the analysis.
    • deliv/table/ - Tables, typically as .tex or .pdf, contain subdirectories paralleling those in deliv/figure.
    • deliv/report/ - LaTeX files for creating the final report. MetrumRG uses LaTeX for report writing, though this part of our process is not detailed in this Expo.

Setting up your R environment

MetrumRG manages R packages and their dependencies on a project-by-project level as each project uses its own isolated package library. This accomplishes two objectives:

  • All the packages needed for a project are installed and with the package versions you expected.
  • The R environment can be easily reproduced later, by you or a collaborator, in order to reliably reproduce your analysis.

We do this using three tools: pkgr, MPN, and renv. To learn more about how and why this works well, you can find more information in the Metworx Knowledge Base here.

Although we recommend this approach to manage your R environment, the code in this Expo will run without setting up your R environment in this way.

Install R packages with pkgr

Install all R packages needed to run this analysis using pkgr. If you’re using Metworx, you’ll already have pkgr installed. If you’re not using Metworx, click here for installation instructions.

In your terminal (not your R console) navigate to the top-level of this project repository and run the following:

pkgr install

This will read the pkgr.yml file and begin installing the requested packages. (To learn more about the pkgr.yml configuration file, click here.) When pkgr finishes running, restart your R session and begin working.

There are numerous packages used in this Expo and installing them will take some time. If you are using Metworx, we recommend launching a workflow with at least eight vCPUs on the head node for improved efficiency as pkgr uses all available cores to install packages.

Note that pkgr and renv are already set up in this repo so you can get started by running the pkgr install command. To read more about how to use this approach in your own projects, click here.