SAS Thought Leader declares war on data scientists on Valentine’s Eve


It all started because of the Google Guy, Hal Varian

Feb 25, 2009 – “I keep saying the sexy job in the next ten years will be statisticians.” Hal Varian, The McKinsey Quarterly, January 2009.

Then these guys (Thomas H. Davenport and D.J. Patil) made us sexy, and that too in the Harvard Business Review.

Jill Dyche* is a thought leader. That’s what her job title says, and that too at SAS, which took over her start-up, Baseline Consulting. (* In addition to this, she writes forewords for struggling poets here.)

She says here

If the importance of data scientists is growing with the advent of big data, the sooner we understand what exactly it is they do, the better.

That is fair enough. But to add grievous injury to data scientists, she adds

(For fun I wrote a blog post on being a data scientist’s girlfriend.)

Actually, the blog post was “Why I Wouldn’t Have Sex with a Data Scientist”:

But there’s no use. The data scientist is preoccupied. Preoccupied with finding, accessing, analyzing, validating, cleansing, integrating, provisioning, modeling, verifying, and explaining data to his management, colleagues, end-users, and friends.

And this is the year of the statistician??

These are bare-knuckle tactics. The art of Vaseline Insulting? Perish the thought. Geeks and Data Scientists rule.

Don’t we? And we are perfect, right?

We statisticians (and data scientists and big dataists and data miners and business analysts and …)

are bringing sexy back!

(And we need a hug too.)

The dichotomy in being a writer on open source with a non-open access publisher

  • The publisher adds credibility to your work


  • A self-fulfilling prophecy where researchers want to publish in exclusive journals and closed-access books, for the sole reason that others did so before them, and thereby donate their knowledge and money to the publisher


The dichotomy in being a writer on open source with a non-open access publisher?

  • I write on open source R,
  • and I have been published (one book),
  • and am on contract to write two more (R for Cloud Computing and R for Web and Social Media Analytics).
  • My publisher does have open access journals.
  • But the book is priced at $50. Most of India lives on less than $2 per day. That’s 800 million people in my country alone.

But the publisher is the most reputed in this field. So what are my choices? How do I give more people the choice to read books?

Take open knowledge, curate it, and put it behind a $50 paywall. I am sorry, Aaron. People like me are the reason ……


Easier Tagging for E Commerce by Google Tag Manager

OK, I guess I am a bit late to this, but I really like the concept of Google Tag Manager and the fact that they have a WordPress plugin ready. What does it do? It brings all the tags on your websites into one dashboard. That makes web analytics much easier for marketing people who don’t want to learn regex, JS, etc.


• IT-friendly – Google Tag Manager has lots of features to set your mind
at ease—like user permissions, automated error checking, the Debug
Console, and asynchronous technology. So everything runs efficiently,
with no unpleasant surprises.
• Quick and easy – Users add or change tags whenever they want, to
keep sites running smoothly and quickly. Tags are managed with an
easy-to-use web interface, so there’s no need to write or rewrite site
code following implementation.
• Verified tags & templates – Google Tag Manager makes it easy to
verify that new tags are working properly, so users don’t need to call on
IT to check the tags. Built-in tag templates and automatic error checking
also prevent tags with improper formatting from even being deployed
on your site.
• Swift loading – Google Tag Manager replaces all your measurement
and marketing tags with a single, asynchronously loading tag—so your
tags can fire faster without getting in each other’s way.


Springer Book on Seamless R and C++ Integration with Rcpp and Workshop on Rcpp #rstats

The talented Dirk Eddelbuettel has just announced that his new book is coming out this year. I would recommend this book to anyone trying to use R for analytics, Big Data or customized use cases, especially if they need to cut computation times. Rcpp can speed up R code substantially, because C++ runs closer to the metal.

If you are a business analyst who knows a bit of R but has always been intimidated by C++ programming, this is the right how-to book for you.


Interview – Naveen Gattu, COO and Co-Founder at Gramener #dataviz

Here is an interview with Naveen Gattu, COO and co-founder of Gramener, one of the most happening data science companies.


Ajay- Describe the story so far for Gramener. What have been the key turning points?

Naveen- All the founders of Gramener are first-generation entrepreneurs. We started our careers with IBM and were very successful in our corporate jobs, with hefty pay packages, but there was always a question at the back of our minds: can’t we work for ourselves and have FUN?

With this thought in mind, six of us got together in 2010 to lay the foundation for Gramener. With our consulting experience we wanted to get into business analytics, but we soon realized that many people were doing great analytics without an effective way of presenting it. We wanted to establish a niche for ourselves and create an offering that makes “Data Consumption” easy and joyful.


Our significant milestone was Airtel …

Rexer Analytics- What do you use for Data Mining? #rstats #spss #sas #bigdata

Message from Karl Rexer, whom we have interviewed here. Come on, show your support for your favorite data mining software.


Rexer Analytics has been conducting the Data Miner Survey since 2007.  Each survey explores the analytic behaviors, views and preferences of data miners and analytic professionals.  Over 1300 people from around the globe participated in the 2011 survey.  Summary reports (PDFs of about 40 pages) from previous surveys are available FREE to everyone who requests them by emailing at

Also, highlights of earlier Data Miner Surveys are available online, including best practices shared by respondents on analytic success measurement, overcoming data mining challenges, and other topics.  The FREE Summary Report for this 2013 Data Miner Survey will be available to everyone in the fall of 2013.

Your survey responses are completely confidential.  This research is not being conducted on behalf of any third party, but is solely for Rexer Analytics to disseminate the findings throughout the data mining and analytics community.

To participate, please click on the link below, then click on the “Start Survey” link on the bottom of the webpage.  Please enter the access code in the space provided.  The survey should take approximately 15-20 minutes to complete.  Anyone who has had this email forwarded to them should use the access code in the forwarded email.

Survey Link:

Access Code:  UL3X7
