Many students want to get an internship in data science.
Here is a list of free resources and things to do to help you prepare before the interview.
Not including political parties or state governments, here is a list of the top 5 unethical companies in India in terms of controversies:
- Company 1 - can’t say, because they bought ads on my media channel
- Company 2 - censored, because they are friends of the government
- Company 3 - threatened us with goondas as well as lawsuits
- Company 4 - included only so we can play with its stock price
- Company 5 - a company on its way down, with no friends to help
That is how the financial media reports news on companies in India. This is because, unlike in the United States, our SEBI (the equivalent of the SEC) does not investigate insider trading with the same zeal as Preet Bharara does.
Sorry for the spam.
- Sometimes VC money is like BC money
- we can build the Taj Mahal in Excel
- angel investors all died and went to heaven where they are watching Indra and Maneka
- devil is in details of everything they asked you to just sign saying, arrey yaar, this is a standard form
- hiring is always just in time
- Zuckerberg is the 10th avatar of Vishnu
- a million dollars is not cool. A billion dollars is cool. A billion people is even hotter
- lying is number 3 sport after cricket and politics discussion
- everybody criticizes anything
- Steve Jobs came to India as a teenager. So Indians can claim credit
- every Indian American who became successful did so because of his Indian DNA, as per our newspapers
- everybody knows anybody. Or says they do
- every developer can learn Hadoop in 2 days, just payment in advance
- every client’s cheque is delayed at least once
- Dilbert was a graduate from IIT
- In India there is no ration on passion, but often we have a passion for ration
I was there on June 10 to attend the hands-on session cum event for IBM’s PaaS offering, IBM Bluemix. The event took place at Cercles (http://cercles.co.in/), a relatively new startup coworking space in Hauz Khas Village, Delhi. Since this was just a 5-minute walk from where I currently live, I attended along with an intern and a colleague from my new training company (http://decisionstats.org). The event was nicely organized, the infrastructure was good, and the speakers were quite awesome. To read what really happened, you can see the summary at the clouddelhi hashtag.
One thing I noticed is that R is not really given much attention in Bluemix. I particularly found the IBM Watson APIs (which are RESTful) to be a great case for #rstats packages.
Bluemix has a nice interface, and they are offering a 30-day free trial, which is quite low compared to the 1 year offered by AWS. IBM is focussed on hybrid cloud for enterprises, and opportunities for people like us depend on becoming ISVs (Independent Software Vendors) or Partners in the IBM ecosystem. https://twitter.com/hashtag/CloudDelhi?src=hash
Fortunately, I didn’t have to speak. I liked Cercles well enough to book a seat for my startup for the next month, something I had not done before despite considering two or three other co-working hubs in Delhi-Gurgaon.
An additional theme was Women in Tech. I found some of the reactions there interesting. Perhaps governments need to adopt the Women in Tech theme, but they seem ignorant and uninformed, as corporations tweak their own policies to gain and retain talent rather than advise policy makers on how to create a better ecosystem.
I am holding a workshop near my residence in Hauz Khas Village, Delhi, India.
It is to teach Marketing via Internet, Branding via Blogging and other fun stuff to make money while having fun on the internet.
Here is the event
A 2 hour Workshop on Web Analytics and Social Media Analytics
By Ajay Ohri, Founder of DecisionStats and a Technology expert with two published books and blogs with lakhs of views.
10th April 2015, Friday
Venue: Hauz Khas Village
Spread your credentials to the entire world by Blogging! It becomes easy for professionals to prove their worth by showing their knowledge through their blog. A good blog earns money too. It definitely acts as your launching pad in the world.
Build Effective Profiles
93% of recruiters review social profiles of candidates while recruiting. Learn how to build awesome profiles on LinkedIn.
Web Analytics (Google Analytics) – Measure your traffic in websites. Improve your web traffic through organic Search Engine Optimization and Search Engine Marketing (Adwords).
Learn how to Run Marketing Campaigns on Twitter & Facebook Pages
Facebook can help businesses grow and succeed through its marketing features. Use Twitter hashtags and campaigns for promoting your product.
Step into a world of unlimited possibilities with digital analyst training.
Entry fee: Rs 100 per person (with student I-card), Rs 200 (normal)
Contact: Sonam Tuteja – 9999659616 email@example.com
New Zealand just made it to their first ever World Cup final (yes, it is cricket), and they made it with a thrilling six (like a home run) off the last ball. Congrats to New Zealand. Of course, R was created in New Zealand too, and Hadley Wickham is from New Zealand.
I recently installed the rvest package from https://github.com/hadley/rvest and it’s now on CRAN as well.
    library(rvest)
    lego_movie <- html("http://www.imdb.com/title/tt1490017/")

    rating <- lego_movie %>%
      html_nodes("strong span") %>%
      html_text() %>%
      as.numeric()
    rating
    #>  7.9

    cast <- lego_movie %>%
      html_nodes("#titleCast .itemprop span") %>%
      html_text()
    cast
    #>  "Will Arnett" "Elizabeth Banks" "Craig Berry"
    #>  "Alison Brie" "David Burrows" "Anthony Daniels"
    #>  "Charlie Day" "Amanda Farinos" "Keith Ferguson"
    #>  "Will Ferrell" "Will Forte" "Dave Franco"
    #>  "Morgan Freeman" "Todd Hansen" "Jonah Hill"

    poster <- lego_movie %>%
      html_nodes("#img_primary img") %>%
      html_attr("src")
    poster
    #>  "http://ia.media-imdb.com/images/M/MV5BMTg4MDk1ODExN15BMl5BanBnXkFtZTgwNzIyNjg3MDE@._V1_SX214_AL_.jpg"
The most important functions in rvest are:
- Create an html document from a url, a file on disk or a string containing html with
- Select parts of a document using css selectors: html_nodes(doc, "table td") (or if you’re a glutton for punishment, use xpath selectors with html_nodes(doc, xpath = "//table//td")). If you haven’t heard of selectorgadget, make sure to read vignette("selectorgadget") to learn about it.
- Extract components with html_tag() (the name of the tag), html_text() (all text inside the tag), html_attr() (contents of a single attribute) and
- (You can also use rvest with XML files: parse with xml(), then extract components using
- Parse tables into data frames with
- Extract, modify and submit forms with
- Detect and repair encoding problems with
- Navigate around a website as if you’re in a browser with submit_form() and so on. (This is still a work in progress, so I’d love your feedback.)
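As a rough illustration of the select, extract and parse-table steps listed above, here is a minimal sketch. It makes several assumptions: the inline HTML page and its numbers are made up purely for the example, the variable names (page, teams, links, scores) are hypothetical, and it uses read_html(), which newer rvest releases provide in place of html(). It is not taken from the rvest documentation.

```r
library(rvest)

# A made-up inline page (hypothetical data) so the sketch runs offline.
page <- read_html(paste0(
  "<html><body>",
  "<a class='team' href='/nz'>New Zealand</a>",
  "<a class='team' href='/aus'>Australia</a>",
  "<table id='scores'>",
  "<tr><th>team</th><th>runs</th></tr>",
  "<tr><td>New Zealand</td><td>250</td></tr>",
  "<tr><td>Australia</td><td>240</td></tr>",
  "</table>",
  "</body></html>"
))

# Select nodes with a css selector, then extract their text.
teams <- page %>% html_nodes("a.team") %>% html_text()

# Extract a single attribute from the same nodes.
links <- page %>% html_nodes("a.team") %>% html_attr("href")

# Parse one table node into a data frame.
scores <- page %>% html_node("#scores") %>% html_table()

print(teams)
print(links)
print(scores)
```

For anyone used to readHTMLTable() from the XML package, html_table() may be the closest counterpart in rvest, since both turn an HTML table into a data frame.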
While Hadley Wickham seems busy with reading Excel files (see https://github.com/hadley/readxl), maybe using rvest can help with more sports analysis now!
Meanwhile, I am searching for an equivalent of readHTMLTable.