Revolution Webinar Series #Rstats

Revolution Analytics Webinar-

 

Featured Webinar
David Champagne REGISTER NOW
Presenter David Champagne
CTO, Revolution Analytics
Date Tuesday, December 20th
Time 11:00AM – 11:30AM Pacific 
Click here for the webinar time in your local time zone

Big Data Starts with R

Traditional IT infrastructure is simply unable to meet

the demands of the new “Big Data Analytics” landscape.   Many enterprises are turning to the “R” statistical programming language and Hadoop (both open source projects) as a potential solution. This webinar will introduce the statistical capabilities of R within the Hadoop ecosystem.  We’ll cover:

  • An introduction to new packages developed by Revolution Analytics to facilitate interaction with the data stores HDFS and HBase so that they can be leveraged from the R environment
  • An overview of how to write Map Reduce jobs in R using Hadoop
  • Special considerations that need to be made when working with R and Hadoop.

We’ll also provide additional resources that are available to people interested in integrating R and Hadoop.

 

Upcoming Webinars
Wed, Dec 14th
11:00AM – 11:30AM PT
Revolution R Enterprise – 100% R and MoreR users already know why the R language is the lingua franca of statisticians today: because it’s the most powerful statistical language in the world. Revolution Analytics builds on the power of open source R, and adds performance, productivity and integration features to create Revolution R Enterprise. In this webinar, author and blogger David Smith will introduce the additional capabilities of Revolution R Enterprise.
 Archived Webinars-
Revolution Webinar: New Features in Revolution R Enterprise 5.0 (including RevoScaleR) to Support Scalable Data AnalysisRevolution R Enterprise 5.0 is Revolution Analytics’ scalable analytics platform.  At its core is Revolution Analytics’ enhanced Distribution of R, the world’s most widely-used project for statistical computing.  In this webinar, Dr. Ranney will discuss new features and show examples of the new functionality, which extend the platform’s usability, integration and scalability

 

Graphs in Statistical Analysis

One of the seminal papers establishing the importance of data visualization (as it is now called) was the 1973 paper by F J Anscombe in http://www.sjsu.edu/faculty/gerstman/StatPrimer/anscombe1973.pdf

It has probably the most elegant introduction to an advanced statistical analysis paper that I have ever seen-

1. Usefulness of graphs

Most textbooks on statistical methods, and most statistical computer programs, pay too little attention to graphs. Few of us escape being indoctrinated with these notions:

(1) numerical calculations are exact, but graphs are rough;

(2) for any particular kind of statistical data there is just one set of calculations constituting a correct statistical analysis;

(3) performing intricate calculations is virtuous, whereas actually looking at the data is cheating.

A computer should make both calculations and graphs. Both sorts of output should be studied; each will contribute to understanding.

Of course the dataset makes it very very interesting for people who dont like graphical analysis too much.

From http://en.wikipedia.org/wiki/Anscombe%27s_quartet

 The x values are the same for the first three datasets.

Anscombe’s Quartet
I II III IV
x y x y x y x y
10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58
8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76
13.0 7.58 13.0 8.74 13.0 12.74 8.0 7.71
9.0 8.81 9.0 8.77 9.0 7.11 8.0 8.84
11.0 8.33 11.0 9.26 11.0 7.81 8.0 8.47
14.0 9.96 14.0 8.10 14.0 8.84 8.0 7.04
6.0 7.24 6.0 6.13 6.0 6.08 8.0 5.25
4.0 4.26 4.0 3.10 4.0 5.39 19.0 12.50
12.0 10.84 12.0 9.13 12.0 8.15 8.0 5.56
7.0 4.82 7.0 7.26 7.0 6.42 8.0 7.91
5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89

For all four datasets:

Property Value
Mean of x in each case 9 exact
Variance of x in each case 11 exact
Mean of y in each case 7.50 (to 2 decimal places)
Variance of y in each case 4.122 or 4.127 (to 3 d.p.)
Correlation between x and y in each case 0.816 (to 3 d.p.)
Linear regression line in each case y = 3.00 + 0.500x (to 2 d.p. and 3 d.p. resp.)
But see the graphical analysis –
While R has always been great in emphasizing graphical analysis, thanks in part due to work by H Wickham and others, SAS products and  language has also modified its approach at http://www.sas.com/technologies/analytics/statistics/datadiscovery/
 SAS Visual Data Discovery combines top-selling SAS products (Base SASSAS/STAT® and SAS/GRAPH®), along with two interfaces (SAS® Enterprise Guide® for guided tasks and batch analysis and JMP® software for discovery and exploratory analysis).
 and  ODS Statistical Graphs at
While ODS Statistical graphs is still not as smooth as say R’s GGPLOT2 http://tinyurl.com/ggplot2-book, it still is a progressive step
Pretty graphs make for better decisions too !

 

 

My Digital Trail

Someone I know recently mentioned that I have an extensive Digital Trail. I do.

I have 7863 connections at http://www.linkedin.com/in/ajayohri, 31 likes at https://www.facebook.com/ajayohri and 19 likes at https://www.facebook.com/pages/Ajay-Ohri/157086547679568, 409 friends (and 13 subscribers) at https://www.facebook.com/byebyebyer .On twitter I have 499 followers at http://twitter.com/0_h_r_1 and 344 followers at http://twitter.com/rforbusiness , and even on Google Plus some 617 people circling me at https://plus.google.com/116302364907696741272 (besides 6 other pages on G+)

Even my Youtube channel at http://www.youtube.com/decisionstats is more popular than I am in non-digital life. my non existant video blog at http://videosforkush.blogspot.com/ and my poetry blog at http://poemsforkush.wordpress.com/, and my comments on other social media, and my blurbs on my tumblr http://kushohri.tumblr.com/, and you get a lot of my psych profile.

Why do I do leave so much trail digitally?

For one reason- I was a bit of introvert always and technology set me free, the opportunity to think and yet be relaxed in anonymous chatter.

For the second reason- I am divorced and my wife got my 4 yr old son’s custody. Even though I talk to him once a day for a couple of minutes, somehow I hope when he grows, he reads my digital trail , maybe even these words, on the kind of man I was and the phases and seasons of life I went through.

 

That is all.

 

 

Announcing Jaspersoft 4.5

Message from  Jaspersoft

————————–

Announcing Jaspersoft 4.5:
Powerful Analytics for All Your Data

This new release provides a single, easy-to-use environment designed with the non-technical user in mind — delivering insight to data stored in relational, OLAP, and Big Data environments.New in Jaspersoft 4.5

Broad and Deep Big Data Connectivity
Intuitive drag and drop web UI for performing reporting and analysis against Hadoop, MongoDB, Cassandra, and many more.

Improved Ad Hoc Reporting and Analysis
Non-technical users can perform their own investigation.

Supercharged Analytic Performance
Enhanced push-down query processing and In-Memory Analysis engine improves response times for aggregation and summary queries.

Join us for an in-depth review and demo, showcasing the new features for self-service BI across any data source.

For more information on Jaspersoft 4.5, or any Jaspersoft solution, contact at sales@jaspersoft.com, or            415-348-2380      .

 

Download Jaspersoft 4.5 Today.

. Download your free 30 day evaluation trial now.

download now button

 

 

Software as a Religion ( SaaR)

The decline of organized religion and debate about such matters in the Western Hemisphere has been co-related to the increase in debates and arguments (again mostly) in the Western Hemisphere on software. Be it the PC vs Mac, the Microsofties vs Open Sourcers, the not so evil Google versus fans of Facebook, considerable activity is now being done by human beings in terms of social interaction on the merit’s and demerit’s of each software bundle. Perhaps for the first time in human history these interactions are being captured digitally on medium (that is hopefully longer lasting than papyrus).

Will this lead to newer branches of psychologists, sociologists (Goodwin’s law is too simplistic but an effort)

Even software as a religion is plausible, all they need is another college drop-put whizkid  to find a way to make it effective.

Religion as a software has of course been around for several millennium.

Also see http://goo.gl/smISa

Analyzing AdSense

I ran a recent experiment on my website- subjugating it to CTR ads (and not just the banner ads). Of course there is hardly a choice I have except for Google Adsense-and let me know if you know any reliable alternatives.

This is what the analytics says

So basically 43 ads out of 147,305 Ads were clicked.

This makes a Google Adsense ad/algorithm/you almost 99.971% of the time to ignore it, ( I am assuming some of the 147,362 ads which were not clicked were a bit annoying)

So I apologize to yall -Adsense aint making no sense, as they would say in old Tennessee

Still $12 per month when directed to charity is good enough…I donated some to Wikipedia.

$19.77 AdSense Revenue
$0.42 AdSense Revenue / 1000 Visits
43 AdSense Ads Clicked
0.00 AdSense Ads Clicked / Visit
0.09% AdSense CTR
$0.42 AdSense eCPM
147,305 AdSense Ads Viewed
2.92 AdSense Unit Impressions / Visit
46,585 AdSense Page Impressions
0.99 AdSense Page Impressions / Visit

Google Webinar on Web Analytics

Google webinar on web analytics-

recommended for anyone with anything to do with the WWW

From

http://analytics.blogspot.com/2011/11/webinar-reaching-your-goals-with.html

 

Webinar: Reaching Your Goals with Analytics

 

 

Is your website performing as well as it could be? Do you want to get more out of your digital marketing campaigns, including AdWords and other digital media? Do you feel like you have gaps in your current Google Analytics setup?

We’ve heard from many of our users who want to go deeper into their Analytics — with so much data, it can be hard to know where to look first. If you’d like to move beyond standard “pageview” metrics and visitor statistics, then please join us next Thursday:

Webinar: Reaching Your Goals with Analytics
Date: Thursday, December 1
Time: 11am PST / 2pm EST
Sign up here!

During the webinar, we’ll cover:

  • Key questions to ask for richer insights from your data
  • How to define “success” (for websites, visitors, or campaigns)
  • How to set up and use Goals
  • How to set up and use Ecommerce (for websites with a shopping cart)
  • How to link AdWords to your Google Analytics account

Whatever your online business model — shopping, lead-generation, or pure content — these tools will deliver actionable insights into your buying cycle.

This webinar will be led by Joe Larkin, a technical specialist on the Google Analytics team, and it’s designed for intermediate users of Google Analytics. If you’re comfortable with the basics, but you’d like to do more with your data, then we hope you’ll join us next week!