The Amazing Watson makes Data Science so elementary

I got the email from IBM saying try out Watson, yada yada yada. I was not sure what to expect, so I uploaded the diamonds dataset from the flagship ggplot2 package in R.

A simple benchmark: can IBM Watson data viz beat the best data viz package (ggplot2) in the best statistical language (R)?

To my chagrin and humility, here are the results:

Interface is awesome

Watson actually asks questions which an experienced Data Scientist would ask

The default data visualization is actually superior, but the tabs for customizing appearance need some work.

STEP 1

I just uploaded the dataset, and these were some of the questions Watson asked me.

Screenshot from 2015-07-16 00:25:30

Screenshot from 2015-07-16 00:25:07

STEP 2

Look at how Watson answers one of these questions:

Screenshot from 2015-07-16 00:17:40

Screenshot from 2015-07-16 00:16:29

Screenshot from 2015-07-16 00:15:43

Screenshot from 2015-07-16 00:32:49

STEP 3

I added human input (me) to try to customize it.

Screenshot from 2015-07-16 00:33:12

Screenshot from 2015-07-16 00:33:25

Ajay Ohri Interview by Bigstep

I was recently interviewed by Bigstep as part of their Expert Interview program. Read the interview at the link below and let me know what you think!

http://blog.bigstep.com/big-data-experts-interviews/expert-interview-with-ajay-ohri-on-the-importance-of-big-data/

Accelerating R: RStudio and the new R Consortium

jjallaire, RStudio Blog

To paraphrase Yogi Berra, “Predicting is hard, especially about the future”. In 1993, when Ross Ihaka and Robert Gentleman first started working on R, who would have predicted that it would be used by millions in a world that increasingly rewards data literacy? It’s impossible to know where R will go in the next 20 years, but at RStudio we’re working hard to make sure the future is bright.

Today, we’re excited to announce our participation in the R Consortium, a new 501(c)6 nonprofit organization. The R Consortium is a collaboration between the R Foundation, RStudio, Microsoft, TIBCO, Google, Oracle, HP and others. It’s chartered to fund and inspire ideas that will enable R to become an even better platform for science, research, and industry. The R Consortium complements the R Foundation by providing a convenient funding vehicle for the many commercial beneficiaries of R to give back to the community, and…


KDnuggets Poll - Is RapidMiner used 3 times as much as SAS?

16th annual KDnuggets Software Poll continued to get huge attention from analytics and data mining community and vendors, attracting about 2,800 voters, who chose from a record number of 93 different tools.

From http://www.kdnuggets.com/2015/05/poll-r-rapidminer-python-big-data-spark.html

What seems to be a rather disquieting sampling error:

RapidMiner remains the most popular suite for data mining/data science, but it got fewer votes than last year

The top 10 tools by share of users were

  1. R, 46.9% share (38.5% in 2014)
  2. RapidMiner, 31.5% (44.2% in 2014)
  3. SQL, 30.9% (25.3% in 2014)
  4. Python, 30.3% (19.5% in 2014)
  5. Excel, 22.9% (25.8% in 2014)
  6. KNIME, 20.0% (15.0% in 2014)
  7. Hadoop, 18.4% (12.7% in 2014)
  8. Tableau, 12.4% (9.1% in 2014)
  9. SAS, 11.3% (10.9% in 2014)

I really don't think RapidMiner has three times as many users as SAS. I have no doubts about the credibility of the poll, but there seems to be either sampling bias or something plain wrong here!

And 44.2% of users used RapidMiner last year? (I don't think one in two data miners uses RapidMiner.)

So there is some error here, or maybe different ways of counting who is a user!
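To put numbers on the "three times" claim, here is a minimal Python sketch that works only from the shares quoted in the list above. The names (shares, ratio, changes) are my own, purely for illustration, and are not part of the KDnuggets poll. It computes the RapidMiner-to-SAS ratio for both years and the year-over-year change for each tool. It also sums the 2015 shares. That total is well over 100%, which suggests each voter could pick several tools, so a "share" here is closer to "share of voters who ticked this tool" than to market share.

```python
# Shares of voters (%), copied from the list above: tool -> (2015, 2014).
shares = {
    "R": (46.9, 38.5),
    "RapidMiner": (31.5, 44.2),
    "SQL": (30.9, 25.3),
    "Python": (30.3, 19.5),
    "Excel": (22.9, 25.8),
    "KNIME": (20.0, 15.0),
    "Hadoop": (18.4, 12.7),
    "Tableau": (12.4, 9.1),
    "SAS": (11.3, 10.9),
}


def ratio(tool_a, tool_b, year):
    """Share of tool_a divided by share of tool_b; year is 2015 or 2014."""
    idx = 0 if year == 2015 else 1
    return shares[tool_a][idx] / shares[tool_b][idx]


rm_vs_sas_2015 = ratio("RapidMiner", "SAS", 2015)
rm_vs_sas_2014 = ratio("RapidMiner", "SAS", 2014)

# Change in percentage points from 2014 to 2015 for each tool.
changes = {tool: round(now - prev, 1) for tool, (now, prev) in shares.items()}

# Sum of the 2015 shares; a total above 100 means voters picked several tools.
total_2015 = round(sum(now for now, _ in shares.values()), 1)

print(f"RapidMiner / SAS, 2015: {rm_vs_sas_2015:.2f}x")  # 2.79x
print(f"RapidMiner / SAS, 2014: {rm_vs_sas_2014:.2f}x")  # 4.06x
print(f"RapidMiner change: {changes['RapidMiner']} points")  # -12.7 points
print(f"Sum of listed 2015 shares: {total_2015}%")  # 224.6%
```

So on the quoted figures the ratio is about 2.8 times in 2015 and about 4 times in 2014, and RapidMiner dropped 12.7 points in a single year, a swing that looks more like a change in who voted than a change in who uses the tool.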

Moobhi Review - Piku Emotion in Motion

Shoojit Sircar has written a love poem to the saga of probashi Bongalis, Kolkata longing and the fine yet quixotic and sometimes insular Bong culture. He has relied on shortcuts and stereotypes to finish the story in the time allotted. Deepika looks great with kajal-laced Bengali eyes, but someone needs to tell her to get accent training. Irrfan can act better with his eyes and mouth closed than Karan Johar can act with his entire body.

Amitabh Bachchan just disappears into his role as Bhaskar Da. Moushumi Chatterjee lifts the occasional sag in the story's pace. What a nice story! If only non-Bengalis knew more about Bengali culture than just Bengali sweets.
