a geek rap video about predictive analytics

Left brain thinking is verbal and analytical. Right brain is non-verbal and intuitive, using pictures rather than words

Dr Eric Siegel http://www.cs.columbia.edu/~evs/ demonstrates both here

Eric was an Assistant Professor of Computer Science at Columbia University in New York City from 1997 – 2001.

Ph.D., founder of Predictive Analytics World and Text Analytics World,  Executive Editor of the Predictive Analytics Times,  is the author of the bestselling, award-winning Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die,

Conflict of Interest Disclaimer-I write poetry here

http://poemsforkush.wordpress.com/

Should Agencies like Secret Service protect digital assets of leaders or friends or family during elections or later

The revelation that Russian Intel (or Snowden working with Russian Intel) hacked into H R Climton’s Campaign Surveys should be both bad news as well as good news to cyber activists.

First of all it increases the demand in terms of jobs for legal cyber security

Secondly it pinpoints the need of security as an important component to decision makers at a time when they are most likely to pay attention ( oh ! My website got hacked! You got my attention!)

However the bad news is

Digital Assets of protected members would be secured by same agencies ( or different agencies) – a jurisdiction nightmare

Hacking into friends of friends, family after official government protection over is a crime, but when Govts hack it is difficult for a teenager /friend of teenager  who hacked their Facebook. Ditto goes for Senate Staffers or Staffers combating cyber crime.

Will cyber crime and cyber war between nations make things personal not just business, due to the ease, low cost and plausible deniability

(Note- this is a strategic what if scenario, No Pokemons were hurt during the making of this post)

Psalm 2

Why do the nations conspire[a]
    and the peoples plot in vain?
The kings of the earth rise up
    and the rulers band together

(unrelated bonus

 

Data Science for Olympics and lack of Reproducible Research

Despite the plethora of data generated in Sports, there is not much open data for Olympics and one wonders why if sharing best practices and data openly on what works and what does not can reduce the level of Russian athletes being banned in a cylical cold war era game.

Some links I found useful

http://www.kdnuggets.com/2014/01/data-mining-predict-sochi-winter-olympics-medal-counts.html

Could data mining techniques accurately predict the medal counts at the Olympics? A predictive model could give us an estimate of the number of medals each nation might win; but how close could we get to the actual outcomes? It was a tantalizing project …

Sochi-Ru By Dan Graettinger with Tim Graettinger

• Which nation will bring home the most medals at the upcoming Winter Olympics in Sochi, Russia?

• Will any nation from Africa, South America, or the Middle East finally break through and win a medal?

• Why do some nations win a bundle of medals while others win only a few?

• Can data mining give us the answers to these questions?

and
https://www.ibm.com/developerworks/community/blogs/jfp/entry/data_science_is_hard?lang=en

the Graettinger brothers do? They used a seemingly  standard methodology: learn from the past to predict the future.  More precisely, they used past Olympics results to build a predictive model.  Each country is represented by a feature vector, i.e. a set of quantities drawn form several categories:

  • Economic
  • Population
  • Human Development
  • Geography
  • Religion
  • Politics and Freedom

Then they used a standard technique known as linear regression to find which set of features were best for predicting medal count.  I was reading their blog post with great interest until I saw what were the most meaningful features found by the linear regression algorithm:

  • Geographic area
  • GDP per capita
  • Value of Exports
  • Latitude of Nation’s Capital

and

http://www.discoverycorpsinc.com/winter-olympic-medal-predict_1/?cm_mc_uid=41778113867514699665668&cm_mc_sid_50200000=1469966566

I was able to find data in many categories:

  • Economic
  • Population
  • Human Development
  • Geography
  • Religion
  • Politics and Freedom

Thankfully, there were some good sources out there[f3], and I collected enough data that I felt I had a good chance to predict some meaningful outcomes.  But would it be enough?  There is more than one way to go about predicting the medal count at the Olympics, and the route before me was the “30,000 feet” approach.

So any takers?

Hackers for Hacking the Olympics 🙂

 

 

Latest DecisionStats Intern

Congratulations to our latest intern for completing the intensive internship at DecisionStats . See work done by here here-

https://datascience899.wordpress.com/blog/

Her latest blog post tries to use Python to understand police shootings in USA

https://datascience899.wordpress.com/2016/07/23/python-ii/

https://datascience899.wordpress.com/2016/07/23/operations-in-r/

https://datascience899.wordpress.com/2016/07/23/sql-ii/

 

Previous Interns wrote great Python code and R code

see

https://github.com/decision-stats (Sarah Masud and Farheen)

and

Anshul Gupta

Cricket Analysis – http://nbviewer.ipython.org/gist/anshulkgupta93/39689db8b337c3ccf247

pyCURL- http://nbviewer.ipython.org/gist/anshulkgupta93/a87a0884ada6e9380952

and

Chandan Routray

https://decisionstats.com/2014/08/04/the-first-decisionstats-com-intern/

 

Some points for future interns at DecisionStats-

  1. We normally dont pay interns anything
  2. 80 % interns drop out or are let go because they cannot keep up with the assignments
  3. Remaining 20% usually learn a lot in the intensive program
  4. Internships are like a free boot camp
  5. No more internships till June 2017 because I am trying to write a book
  6. Some research assistantships might be available in December 2016 to help with some code or Lyx formatting for the former
  7. See my LinkedIn profile for reviews given by the 20% interns who manage to stick around
  8. I usually emphasize writing, polyglot tools (both R, SAS and Python) , logical thinking and concise communication for my interns
  9. I usually treat them as students since I dont work for or in a university. That might change as I try and transition out from business to academic research options for a non Phd

 

Movie Review: Sultan

I had just come back after 5 months in the land of the brave and free (  Trumpistan ) so it was my first Bollywood movie in some time and Salman Khan did not disappoint in his annual Id outing. Sultan followed a wrestler who became a wrestler to impress a girl, and then gets separated post losing the girl. Fairly standard sports movie fare of rise, fall and rise again, though with a great Indian twist and I really think they could make Sultan 2.The heroine should be Rhonda Rousy in that one though. Anushka is not credible enough as a skinny wrestling prodigy, though Salman easily does the best acting of his career (minus the dancing)

The songs were ok ok, though the dialogues were amazing. Bhai rocks

https://en.wikipedia.org/wiki/Sultan_(2016_film)

sultan