I just updated my R for Business Analytics site (http://rforanalytics.wordpress.com/ ). Additions are as below you can go to http://rforanalytics.wordpress.com/ for the complete list- What I am trying to do is build a kind of Task View dedicated to Business Analytics (aimed at Business Analyst and Data Scientists) with slightly better HTML ( maybe Markdown later on) and some visual appeal.
—————ADDITIONS TO R FOR ANALYTICS COMPANION SITE
Interviews with R Community
http://rforanalytics.wordpress.com/interviews-with-r-community/
Developers
Jeroen Ooms (OpenCPU)
https://decisionstats.com/2013/03/18/interview-jeroen-ooms-opencpu-rstats/
Christian (Statace)
Ian Fellows (Deducer)
https://decisionstats.com/2013/04/03/interview-dr-ian-fellows-fellstat-com-rstats-deducer/
Jeff Allen (Trestle)
https://decisionstats.com/2013/02/02/interview-jeff-allen-trestle-technology-rstats-rshiny/
Gergely Darcozi (RApporter)
https://decisionstats.com/2013/01/04/the-making-of-a-r-startup-part-1-rstats/
ODBC /Databases for R (including Hadoop and NoSQL)
http://rforanalytics.wordpress.com/useful-links-for-r/odbc-databases-for-r/
R with MongoDB
http://cran.r-project.org/web/packages/rmongodb/rmongodb.pdf
This R package provides an interface to the NoSQL MongoDB database
using the MongoDB C-driver version 0.8
ps-
R with JSON
http://cran.r-project.org/web/packages/jsonlite/index.html
This package is a fork of the RJSONIO package
R with CouchDB
https://github.com/wactbprot/R4CouchDB
R with MonetDB
http://cran.r-project.org/web/packages/MonetDB.R/index.html
MonetDB.R: Connect MonetDB to R
Allows to pull data from MonetDB into R
Cassandra with R
http://cran.r-project.org/web/packages/RCassandra/RCassandra.pdf
Neo4j with R
# Function for querying Neo4j from within R
# from http://stackoverflow.com/questions/11188918/use-neo4j-with-r
query <- function(querystring) {
h = basicTextGatherer()
curlPerform(url = "localhost:7474/db/data/ext/CypherPlugin/graphdb/execute_query",
postfields = paste("query", curlEscape(querystring),
sep = "="), writefunction = h$update, verbose = FALSE)
result <- fromJSON(h$value())
data <- data.frame(t(sapply(result$data, unlist)))
names(data) <- result$columns
return(data)
}
# --------------------------------------
# import all data into neo4j
# --------------------------------------
nrow(venueDataset) # number of venues
https://github.com/RevolutionAnalytics/RHadoop/wiki
RHadoop consists of the following packages:
- NEW! plyrmr – higher level plyr-like data processing for structured data, powered by
rmr
- rmr – functions providing Hadoop MapReduce functionality in R
- rhdfs – functions providing file management of the HDFS from within R
- rhbase – functions providing database management for the HBase distributed database from within R
R with Spark
http://amplab-extras.github.io/SparkR-pkg/
SparkR is an R package that provides a light-weight frontend to use Apache Spark from R. SparkR exposes the Spark API through the RDD
class and allows users to interactively run jobs from the R shell on a cluster.
R with Hive
RHive is an R extension facilitating distributed computing via HIVE query. RHive allows easy usage of HQL(Hive SQL) in R, and allows easy usage of R objects and R functions in Hive.
http://cran.r-project.org/web/packages/RHive/index.html
DDR with R – Rhipe (dormant)
RImpala
A package to connect and run queries on Cloudera Impala (thanks to Mu Sigma)
http://cran.r-project.org/web/packages/RImpala/index.html
Pig with R
http://hortonworks.com/blog/bootstrap-sampling-with-apache-pig/