Dealing with zip files in R #rstats

> setwd("/home/ajay/Downloads")
> a=dir()
> class(a)
[1] "character"
> grep("\\.zip$",a)
[1]  37  38  41  43  88  96 133
> b=grep("\\.zip$",a)
> a[b]
[1] "alissa-coming-soon-v2-0(1).zip"            
[2] "alissa-coming-soon-v2-0.zip"               
[3] "CAX_EMC_Journalist_Data.zip"               
[4] "CAX_EMC_Racer_Data.zip"                    
[5] "matlab_R2015a_glnxa64.zip"                 
[6] "Photos.zip"                                
[7] "unvbasicvapp__9411003__vmx__en__sp0__1.zip"
> unzip("CAX_EMC_Racer_Data.zip")
> c=dir()
> library(Hmisc)
> c[c %nin% a]
[1] "CAX_EMC_Racer_Garmin_Camera.csv" 
[2] "CAX_EMC_Racer_Garmin_Watch_Data.csv"
[3] "CAX_EMC_Racer_Motorcycle_Data.csv"
PS: I know Hadley's convenient wrappR packages are all the rage now, but nothing, I repeat, nothing beats Frank Harrell's and Ripley's cool packages.
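
The same workflow can also be sketched in base R alone. This is a rough sketch, not the method used in the session above: the helper names list_zips() and new_files_after_unzip() are made up for illustration, and setdiff() stands in for %nin% from Hmisc.

```r
# List archives in a directory. The escaped dot and the `$` anchor
# match only names that end in ".zip" (hypothetical helper).
list_zips <- function(path = ".") {
  list.files(path, pattern = "\\.zip$", ignore.case = TRUE)
}

# Extract an archive and return the names of the files that appeared
# in `exdir` as a result (hypothetical helper).
new_files_after_unzip <- function(zipfile, exdir = ".") {
  before <- list.files(exdir)
  unzip(zipfile, exdir = exdir)
  setdiff(list.files(exdir), before)
}

# Small demonstration on empty placeholder files in a scratch directory.
demo_dir <- file.path(tempdir(), "zip_demo")
dir.create(demo_dir, showWarnings = FALSE)
file.create(file.path(demo_dir, c("Photos.zip", "xzip_notes.txt", "report.csv")))
print(list_zips(demo_dir))
```

With a real archive in the working directory, new_files_after_unzip("CAX_EMC_Racer_Data.zip") would list the extracted files. Calling unzip(zipfile, list = TRUE) shows the contents without extracting anything.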

	

DecisionStats Summer School in Delhi 2015 #rstats

This summer, get a foothold in the world of data science. These are in-classroom trainings in Delhi, India, and all prices are in INR only.

If you are interested, apply here:

bit.ly/decisionstats


SUMMER SCHOOL 2015

  • Bring your own device. Hardware: more than 2 GB RAM and more than 20 GB of free hard disk space
  • Eligibility criterion: people interested in a career as a data scientist. No prior skills are required, but statistics and programming can help.
  • Each class is 2.5 hours long, followed by a 1-hour break. Each day has two classes per batch.

Course Details

Dates | 15 – 16 June | 17 – 18 June | 19, 22, 23, 24 June | 25 – 26 June
Course Name | Introduction to Data Science | Introduction to Analytics using Python | Introduction to Analytics using R | SAS Language Fundamentals
Hours | 10 | 10 | 20 | 10
Classes | 4 | 4 | 8 | 4
Days | 2 | 2 | 4 | 2

Cost: 8000, 10000, 12000, 15000, 25000

Taking all four courses gives you a saving of 80% with 50 hours total class time.

The instructor will teach in person and will be available to clear doubts on the spot.

Course Outline

Introduction to Data Science

  • Basics of Data Science
  • Basics of Analytics
  • LTV Analysis
  • LTV Analysis Quiz
  • RFM Analysis
  • RFM Analysis Quiz
  • Basic Stats
  • Introduction to Modeling
  • Introduction to Google Analytics
  • Blogging
  • Web Analytics Quiz

Introduction to Analytics using Python

  • Introduction to Python
  • Introduction to iPython
  • Introduction to Pandas
  • Introduction to iPython Notebook
  • IDE- IDLE and Spyder
  • Python 1 Quiz
  • Data Input
  • Data Analysis
  • Data Summarization
  • Data Visualization
  • Data Output
  • Ipython 2 Quiz

Introduction to Analytics using R

  • Introduction to R
  • Introduction to R Studio
  • Introduction to R
  • Introduction to Rattle
  • Deducer
  • R Quiz 1
  • Data Input
  • Data Analysis
  • Data Summarization
  • Data Visualization
  • Data Output
  • R Quiz 2
  • data.table
  • ggplot
  • sports analytics
  • regression model
  • data mining
  • R Quiz 3
  • social network analysis
  • text mining
  • time series forecasting
  • Using apis
  • association analysis
  • R Quiz 4
  • RODBC
  • sqldf
  • spatial analytics
  • RMarkDown
  • Using Github
  • R Quiz 5

SAS Language Fundamentals

  • Introduction to Interface
  • Introduction to SAS language
  • Data Step
  • Proc Print
  • Proc Means and Proc Freq
  • SAS Quiz 1
  • Proc Univariate
  • Do loops
  • Proc sgplot
  • Proc SQL
  • SAS Macro Language
  • menu driven options
  • ODS Output
  • SAS Quiz 2

If you are interested, apply here:

bit.ly/decisionstats

Has Your Data Become Overwhelming?

Note from Sponsors- Chicago Events from Predictive Analytics


Let Predictive Analytics World Help!

Attend our Chicago event(s) to develop the skills and strategies necessary to take your data to a whole other level.

Predictive Analytics World for Business
June 8 – 11, 2015
PAW Business is the leading cross-vendor event for predictive analytics professionals, managers and commercial practitioners. This conference covers a wide range of business applications for predictive analytics across industry sectors including marketing, credit scoring, insurance, fraud detection, web optimization, and much more. Register Today

Predictive Analytics World for Manufacturing
June 8 – 11, 2015
At PAW Manufacturing, join peers and thought leaders in leveraging new predictive analytics tools and techniques to solve manufacturing problems. Shape manufacturing with predictive analytics.
Register Today

eMetrics Summit
June 8 – 11, 2015
Be part of the eMetrics Summit where marketing analytics practitioners, experts and visionaries discuss capturing and applying insights from data.
Register Today

Predictive Analytics Times Executive Breakfast
June 10, 2015
Join the founder of Predictive Analytics World, Eric Siegel, and exclusive sponsor, Dell Software, to witness a concrete overview of how predictive analytics drives actionable value at the Predictive Analytics Times Executive Breakfast.
* Attendance is Free – Submit Request to Attend


When in doubt, use Einstein


Stupid Emails from CEOs

  • We have 20 million in funding so work with us, can we pay you 25,000 (or 400 $) to build a model for predicting influence score within social networks which we will patent
  • We have 20 million in funding so work with us, can we pay you 100,000 (or 1600 $) to build a model to get healthcare score, which we will not even attribute to you as inventor
  • i am an advisor to this company and can get lots of millions of dollars in funding can i pay you in stock options
  • wow can u just give me 20 minutes of your time which I wont pay for, so I can decide analytics strategy
  • what is the difference between SAP and SAS
  • can we copy existing algorithms in R but make a slight modification and make them proprietary and patent them
  • we dont have any data, cant we just make an algorithm first
  • can we patent a regression model
  • i have 10 years experience in hadoop. lets make a virtual machine for R and R Studio separately which we can install on desktop. basically a 2gb download for a 100 mb software.

falls down. lies very very still.


Doing cricket analysis and web scraping in R using rvest #rstats

I combined the code used here (https://decisionstats.com/2012/04/07/cricinfo-statsguru-database-for-statistical-and-graphical-analysis/ ) with the code used here (http://reed.edu/data-at-reed/resources/R/rvest.html ) to get a more up-to-date version of the cricket analysis, this time using the rvest package by Hadley Wickham.

Note that you can change the parameters in the URL to select the class of records, the team, and the type of record (batting or fielding).
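
As a rough illustration, the query string can be assembled from its parts. The helper name statsguru_url() is hypothetical. The parameter names (class, team, template, type) and the default values are copied from the URL used in this post, and which other values the site accepts is not verified here.

```r
# Build a Statsguru URL from its query parameters (hypothetical helper).
# Defaults reproduce the URL used in this post.
statsguru_url <- function(team = 6, type = "batting", class = 1) {
  paste0(
    "http://stats.espncricinfo.com/ci/engine/stats/index.html",
    "?class=", class,
    ";team=", team,
    ";template=results",
    ";type=", type
  )
}

# Same page as in the post, then the fielding records for the same team.
print(statsguru_url())
print(statsguru_url(type = "fielding"))
```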

I found the data to be in the third HTML table on the page, just by trying each table in turn.

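library(rvest)
# read_html() replaces html(), which is deprecated in later rvest releases
page=read_html("http://stats.espncricinfo.com/ci/engine/stats/index.html?class=1;team=6;template=results;type=batting")
# take the third <table> on the page and convert it to a data frame
data <- page %>%
 html_nodes("table") %>%
 .[[3]] %>%
 html_table()
head(data)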
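
Instead of guessing the table index, one option is to parse every table and compare their sizes. The sketch below runs on a small made-up HTML string rather than the live page, so the table contents and the three-table layout are assumptions for illustration only.

```r
library(rvest)

# Toy page with three tables; only the last one holds record-like data.
toy <- read_html("<html><body>
  <table><tr><th>nav</th></tr><tr><td>home</td></tr></table>
  <table><tr><th>filter</th></tr><tr><td>all</td></tr></table>
  <table><tr><th>Player</th><th>Runs</th></tr>
         <tr><td>A</td><td>100</td></tr>
         <tr><td>B</td><td>50</td></tr></table>
</body></html>")

# html_table() on a set of nodes returns one data frame per table.
tables <- toy %>% html_nodes("table") %>% html_table()

# One row per table: number of rows, number of columns.
sizes <- t(sapply(tables, dim))
print(sizes)
```

The table with the most rows and columns is usually the one worth inspecting first with head().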

also see

http://codeforsacramento.org/blog/tutorial/2015/01/31/webscraping-with-r.html
http://renkun.me/pipeR-tutorial/Examples/rvest.html

Summer School in Analytics in Delhi

A comprehensive summer program is being offered by DecisionStats.org . It will involve multiple languages for analytics including Python, SAS, R, and will also equip you with social media skills, web analytics and social media analytics.

It is a classroom-based training and is aimed only at students who can attend classes in Hauz Khas Village, Delhi.

Here is the full message:

We will conduct a summer workshop in analytics. It will be a vigorous paid certificate program. After the program, we may offer internships to some of you.
Kindly fill this form and also forward it to your peers. Please cascade to your social media network and anyone you feel who could benefit from analytics training.

Interns for DecisionStats – a cutting edge analytics firm

We have the annual summer internship back at DecisionStats. This year we especially need Graphic Designing Interns and people who want to be Data Scientists.

Apply at info@decisionstats.org or via the link below.

 

1) We now have a separate arm for Training and Consulting at http://decisionstats.org Basically we have hived off that business separately. We also have a new office in Hauz Khas Village.

2) Last year (first year of internships) our Intern Chandan from IIT KGP made this (http://www.slideshare.net/ajayohri/decisionstatscom-data-science-virtual-internship ) and this ( http://www.slideshare.net/ajayohri/python-for-r-users ) .  He had no knowledge of either R or Python before he began.

3) Preference will be given to people who can come to the office rather than telecommute.

 

 

http://internshala.com/internship/detail/multiple-profiles-management-graphic-design-internship-in-delhi-at-decisionstats1429240830

 

About Decisionstats (http://decisionstats.com):

Data Science and Analytics Website that deals in cutting edge research, consulting, writing and speaking assignments

About the Internship:

The communication intern will proofread, edit and write content including blog posts and social media. The intern will be given on-the-job training in social media, web analytics and search engine optimization, as well as an understanding of digital business. The only requirements are learnability, truthfulness and a good command of English.

The graphic design intern will create and edit graphics including icons, logos, posters and infographics. The intern will be given on-the-job training in designing in a real-time environment, web analytics and search engine optimization, as well as an understanding of digital business. The only requirements are learnability, truthfulness and a good command of design.

The management intern will create, edit and maintain schedules and assist in coordination. The intern will be given on-the-job training in managing in a start-up environment, web analytics and search engine marketing, as well as an understanding of digital business. The only requirements are learnability, truthfulness, passion and good management skills.

The data science intern will carry out data science research and assist in writing. The intern will be given on-the-job training in data science and analytics. The only requirements are learnability, truthfulness, and a passion for writing code and hacking problems on the fly.

# of Internships available: 4
Who can apply:

The internships require people who are serious about their careers, can devote the agreed-upon hours per week and can meet deadlines. Preference will be given to candidates from established institutes and on the basis of prior academic record.

Streams:
Analytics, Design, Engineering Management, English, Humanities, Management, Engineering