Saving Private Snowden

No apologies no pardons

Full disclosure on what was stolen and what was not

Helping cooperate to make system more secure

Unconditional surrender

Obama is a lawyer and so is Clinton.  They don’t get privacy but they do get popular opinion.  So a popular campaign for Snowmen pardon needs to be organized.

Personally I doubt if Edward can come come home before 2020

A Tutorial at

I just wrote a tutorial on SAS language here at

SAS was the first statistical language I learnt, followed by R, Python, Julia..

Of course the first language I learnt was in school BASIC

Here is a tutorial on SAS language again for learners, it uses the SAS University Edition

SAS (pronounced “sass”) once stood for “statistical analysis system” but now is known simply as SAS. It is a computer language for statistical computing. Much before the term data science, business analytics and business intelligence was coined, SAS was created at North Carolina State University to do the important and useful task of turning raw data into analysis using code and statistics.

SAS System is a suite of products that SAS Institute has been selling since 1976.

Jim Goodnight has been the CEO and vision behind the growth of SAS language since almost 40 years now. John Sall has made additional contributions to statistics by creating JMP (interviewed here at StatisticsViews )

SAS Institute has consistently ranked as one of the best employers within USA. However recently R and Python languages have challenged traditional share of SAS language in statistical computing, while SPSS has been acquired by IBM.

Tutorial Overview

This tutorial is here to help a reader with learning the simple  SAS language, and perhaps to inspire other languages to be both simple and responsive to a wide diversity of users from beginners to advanced, from academics to enterprises of various sizes and in geographies. The tutorial is based on SAS Studio interface given for free in the SAS University Edition.



Python Tutorial at

Crime Forecasting Challenge : Data Science Contest

The Real-Time Crime Forecasting Challenge seeks to harness the advances in data science to address the challenges of crime and justice. It encourages data scientists across all scientific disciplines to foster innovation in forecasting methods. The goal is to develop algorithms that advance place-based crime forecasting through the use of data from one police jurisdiction.

aims to:

  1. Encourage “nontraditional” crime forecasting researchers to compete against more “traditional” crime forecasting researchers.
  2. Compare available crime forecasting methods.
  3. Improve place-based crime forecasting.

Accordingly, the Challenge will have three categories of contestants: students; individuals/small businesses; and large businesses

This Challenge will be based on the locations listed in calls-for-service (CFS) records provided by the Portland Police Bureau (PPB) for the period of March 1, 2012 through February 28, 2017


  1. Overview
  2. How to Enter
  3. Important Dates
  4. Judges
  5. Judging Criteria
  6. Prizes
  7. Other Rules and Conditions
  8. Prize Disbursement and Challenge Winners
  9. Contact Information
  10. Data for Download




How do I shift to a data science career

I get this question a  lot – How do I shift to a data science career. I have been doing data analysis since 2004 (in SAS) when we used to call it business analytics , and since 2007 in R, Since 2014 in Python,  by when we re branded business analytics as data science. So here are a few basics to people trying to SHIFT to data science.

My answer is learn coding, learn math, and most importantly know when to use what for insights. Data scientists are as good as the insights they create or miss not the code they write.

See this first

A slideshare I put forward last year for Summer School

Do this self examination-

  1. What are you good at – programming , stats, or business
  2. What are you bad at- programming , stats or business
  3. What can you learn and at what proficiency

Learning Programming

Learning R, SAS, Python is easy but there is a confusing clutter of resources out there on the internet.

SAS Language -should be learnt from SAS University Edition and for the SAS Certification Exam.

Dont wanna be SAS Certified (its just 100$ psst)

Here is some free SAS Training by Decisionstats

There is no certification in R or Python, though Hadoop has it just like SAS has it.

For R- learn R and RStudio till you can master some of the code here

Screenshot from 2016-08-23 10-48-29

or see all the R packages here at CRAN VIEWS

For Python-

A shorter tutorial on Python by the author is here

Learn PANDAS and SCIKIT-LEARN  example


Learning Statistics and Techniques


Data Mining in R

Where to learn machine learning

Learning Business

This comes with experiences and domain research and study.


I hope this helps. I will follow with specific answers to specific career questions in data science soon.