2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018

04/17/2014: Using RMarkdown in an Analysis Archive and Retrieval System

RMarkdown is good stuff. Hopefully, you've already heard of RMarkdown. If not, you'll understand it pretty quickly by looking at this simple example:

http://rpubs.com/medined/replacing_part_of_time_series_using_time_based_selection

Essentially, you mix R code with Markdown markup to create a 'living' document. The R code is executed when the page is displayed. The full power of R (and all of its extensions) can be used. There are many examples of this online.

RMarkdown pages can be computer-generated. Imagine if any given analytic documented the intermediate steps from Data Load to Final Visualization using RMarkdown? I bet user confidence in the final product would increase. It would also be trivial for the document to be duplicated and tweaked (draft mode) before being republished. Since RMarkdown is text-based, you could provide human-readable diff reports between analyses. Another advantage of this text-based system would be full-text search across all analyses.

I could also point out the value in being able to produce an analytic report without needing to know Java, Python, or another programming language. Just knowing the math is complex enough.

Update: http://studio.sketchpad.cc/sp/pad/view/ro.9QNw0rsxwki4J/rev.480 - The archive timeline widget allows visitors to view all versions of the source document.

Update: You can do this same kind of thing with Python code. Check out http://ipython.org/notebook.html.

I'm not saying R is the answer to all problems. But this idea of archivable, diffable analytic solutions was interesting.


subscribe via RSS