Spring Cleaning – What I wrote

 

A partial list of writings by me over the years

 

1) Big Data Initiatives in Developing Nations

 

Can big data, open data, and programs such as the Aadhaar Project enhance lives in underprivileged segments of society? March 2015

http://www.ibmbigdatahub.com/blog/big-data-initiatives-developing-nations

2) Downsides Dampen Open-Source Analytics September 2011 http://www.allanalytics.com/author.asp?section_id=1408&doc_id=233454

 

3) KDNuggets – Articles on Data Science

 

  1. Using Python and R together: 3 main approaches December 2015

 

  1. Interview: Ingo Mierswa, RapidMiner CEO on “Predaction” and Key Turning Points  June 2014
  2. Guide to Data Science Cheat Sheets 2014/05/12
  3. Book Review: Data Just Right 2014/04/03
  4. Exclusive Interview: Richard Socher, founder of etcML, Easy Text Classification Startup 2014/03/31
  5. Trifacta – Tackling Data Wrangling with Automation and Machine Learning 2014/03/17
  6. Paxata automates Data Preparation for Big Data Analytics 2014/03/07
  7. etcML Promises to Make Text Classification Easy  2014/03/05
  8. Wolfram Breakthrough Knowledge-based Programming Language – what it means for Data Science? 2014/03/02

Programmable Web- Articles on APIs

 

  1. Keen IO Helps Developers Solve Custom Analytics Needs 06-09-2014
  2. Scoreoid Aims to Gamify the World Using APIs 01-27-2014
  3. Plot.ly’s Plot to Visualize More Data 01-22-2014
  4. LumenData’s Acquisition of Algorithms.io is a Win-Win 01-08-2014
  5. Yactraq API Sees Huge Growth in 2013 01-06-2014
  6. Scrape.it Describes a Better Way to Extract Data 12-20-2013
  7. Exclusive Interview: App Store Analytics API 12-04-2013
  8. APIs Enter 3d Printing Industry 11-29-2013
  9. PW Interview: José Luis Martinez of Textalytics 11-06-2013
  10. PW Interview Simon Chan PredictionIO 11-05-2013
  11. PW Interview: Scott Gimpel Founder and CEO FantasyData.com 10-23-2013
  12. PW Interview Brandon Levy, cofounder and CEO of Stitch Labs 10-08-2013
  13. PW Interview: Jolo Balbin Co-Founder Text Teaser 09-18-2013
  14. PW Interview:Bob Bickel CoFounder Redline13 07-29-2013
  15. PW Interview : Brandon Wirtz CTO Stremor.com 07-04-2013
  16. PW Interview: Andy Bartley, CEO Algorithms.io 06-04-2013
  17. PW Interview: Francisco J Martin, CEO BigML.com 05-30-2013
  18. PW Interview: Tal Rotbart Founder- CTO, SpringSense 05-28-2013
  19. PW Interview: Jeh Daruwala CEO Yactraq API, Behavorial Targeting for videos 05-13-2013
  20. PW Interview: Michael Schonfeld of Dwolla API on Innovation Meeting the Payment Web 05-02-2013
  21. PW Interview: Stephen Balaban of Lamda Labs on the Face Recognition API 04-29-2013
  22. PW Interview: Amber Feng, Stripe API, The Payment Web 04-24-2013
  23. PW Interview: Greg Lamp and Austin Ogilvie of Yhat on Shipping Predictive Models via API 04-22-2013
  24. Google Mirror API documentation is open for developers 04-18-2013
  25. PW Interview: Ricky Robinett, Ordr.in API, Ordering Food meets API 04-16-2013
  26. PW Interview: Jacob Perkins, Text Processing API, NLP meets API 04-10-2013
  27. Amazon EC2 On Demand Windows Instances -Prices reduced by 20% 04-08-2013
  28. Amazon S3 API Requests prices slashed by half 04-03-2013
  29. PW Interview: Stuart Battersby, Chatterbox API, Machine Learning meets Social 04-02-2013
  30. PW Interview: Karthik Ram, rOpenSci, Wrapping all science APIs 03-20-2013
  31. Viralheat Human Intent API- To buy or not to buy 03-13-2013
  32. Interview Tammer Kamel CEO and Founder Quandl 03-07-2013
  33. YHatHQ API: Calling Hosted Statistical Models 03-04-2013
  34. Quandl API: A Wikipedia for Numerical Data 02-25-2013
  35. Amazon Redshift API is out of limited preview and available! 02-18-2013
  36. Windows Azure Media Services REST API 02-14-2013
  37. Data Science Toolkit Wraps Many Data Services in One API 02-11-2013
  38. Diving into Codeacademy’s API Lessons 01-31-2013
  39. Google APIs finetuning Cloud Storage JSON API 01-29-2013
  40. Interview Hilary Mason Chief Scientist bitly 01-28-2013
  41. Interview: Viralheat CEO Raj Kadam on API Growth 01-22-2013
  42. Google Compute API – Affordable Computing at Google Scale 01-17-2013
  43. Ergast API Puts Car Racing Fans in the Driver’s Seat 12-05-2012
  44. Springer APIs- Fostering Innovation via API Contests 11-20-2012
  45. Statistically programming the web – Shiny,HttR and RevoDeploy API 11-19-2012
  46. Google Cloud SQL API- Bigger ,Faster and now Free 11-12-2012
  47. A Look at the Web’s Most Popular API -Google Maps API 10-09-2012
  48. Cloud Storage APIs for the next generation Enterprise 09-26-2012
  49. Last.fm API: Sultan of Musical APIs 09-12-2012
  50. Socrata Data API: Keeping Government Open 08-29-2012
  51. BigML API Gets Bigger 08-22-2012
  52. Bing APIs: the Empire Strikes Back 08-15-2012
  53. Google Cloud SQL: Relational Database on the Cloud 08-13-2012
  54. Google BigQuery API Makes Big Data Analytics Easy 08-07-2012
  55. Your Store in The Cloud -Google Cloud Storage API 08-01-2012
  56. Predict the future with Google Prediction API 07-30-2012
  57. The Romney vs Obama API 07-27-2012

 

StatisticsViews

http://www.statisticsviews.com/details/feature/8868901/A-Tutorial-on-Python.html

 

CONFERENCES AND TALKS

1) Big Data Big Analytics http://krishnarajpm.com/bigdata/abstract.pdf Workshop on Statistical Machine Learning and Game Theory Approaches for Large Scale Data Analysis, 9 July 2012 – 14 July 2012, at Bangalore, India. Sponsored by Mathematical Sciences, Division of Science and Engineering Research Board,

Department of Science & Technology, Government of India. (Sponsored airfare, hotel accommodation, honorarium)

SLIDES Big data Big Analytics

2) Data Analytics using the Cloud- Challenges and Opportunities for India at 1st International Symposium on Big Data and Cloud Computing Challenges(ISBCC-2014) March 27-28, 2014 VIT University, Chennai, India Sponsored by BRNS (flight)

http://chennai.vit.ac.in/isbcc/

SLIDES Data analytics using the cloud challenges and opportunities for india from Ajay Ohri

3) Open Source Analytics at OSSCamp 2014 http://osscamp.in/

http://osscamp.in/events/6/open-source-analytics-overview-r-python-and-others

SLIDES- Open source analytics from Ajay Ohri

4) Society for Industrial and Applied Mathematics- Delhi Technological University Evolute 2015 : Annual Symposium Speaker

5) Talk on Analytics as a profession at Indian Institute of Technology Delhi

Learning R and Teaching R from Ajay Ohri

Workshops

Pre-Placement training workshop for Economics Students, Delhi School of Economics.

A Workshop on R from Ajay Ohri

Books

R for Business Analytics http://www.springer.com/us/book/9781461443421

R for Cloud Computing : A Data Science Approach http://www.springer.com/us/book/9781493917013

Revolution Analytics ( Microsoft) Corporate Blog

http://blog.revolutionanalytics.com/2011/08/9-more-ways-to-bring-data-into-r.html

http://blog.revolutionanalytics.com/2012/11/using-r-in-the-human-resources-department.html

 

Journal Articles

Journal of Statistical Software

https://www.jstatsoft.org/article/view/v066b04

Technometrics

Technometrics, Vol. 55 (3), August, 2013

http://amstat.tandfonline.com/doi/abs/10.1080/00401706.2013.822219

 

Major Media

I have been cited by Wired Magazine and ReadWriteWeb for espousing a marketplace for algorithms.

http://www.wired.com/2014/08/algorithmia/

http://readwrite.com/2011/06/01/an-app-store-for-algorithms/

 

Interviews (of Ajay Ohri)

  1. Big Step Interview July 2015  Expert Interview with Ajay Ohri on the Importance of Big Data http://blog.bigstep.com/big-data-experts-interviews/expert-interview-with-ajay-ohri-on-the-importance-of-big-data/
  2. AnalyticsVidhya Feb 2015 Interview with Industry expert – Ajay Ohri, Founder, decisionstats.com http://www.analyticsvidhya.com/blog/2015/02/interview-expert-ajay-ohri-founder-decisionstats-com/
  3. AnalyticsIndia Magazine Nov 2012 Interview – Ajay Ohri, Author “R for Business Analytics” http://analyticsindiamag.com/interview-ajay-ohri-author-r-for-business-analytics/
  4. HRTechEurope More R in HR Nov 2012 http://blog.hrtecheurope.com/more-r-in-hr/
  5. Data Mining Research Jan 2011 Interview Data Mining Research interview: Ajay Ohri http://www.dataminingblog.com/data-mining-research-interview-ajay-ohri/
  6. AnalyticBridge Apr 2008 Interview with Ajay Ohri, Data Mining Consultant from India http://www.analyticbridge.com/group/interviews/forum/topics/2004291:Topic:11703

Writing for kdnuggets.com

I have been writing freelance for kdnuggets.com

It’s a great learning experience for me and helps me become a better writer, especially on analytics and programming.

Here is a list of the articles. Interviews are in bold, and I will keep updating this list as new ones are added.

  1. Interview: Ingo Mierswa, RapidMiner CEO on “Predaction” and Key Turning Points June 2014

  2. Guide to Data Science Cheat Sheets 2014/05/12

  3. Book Review: Data Just Right 2014/04/03
  4. Exclusive Interview: Richard Socher, founder of etcML, Easy Text Classification Startup 2014/03/31
  5. Trifacta – Tackling Data Wrangling with Automation and Machine Learning 2014/03/17
  6. Paxata automates Data Preparation for Big Data Analytics 2014/03/07
  7. etcML Promises to Make Text Classification Easy  2014/03/05
  8. Wolfram Breakthrough Knowledge-based Programming Language – what it means for Data Science? 2014/03/02

Writing on APIs for Programmable Web

I have been writing freelance on APIs for Programmable Web. Here is an updated list of the articles; many of them should interest analytics users. Note: some of these are interviews, and they are in bold. Note to regular readers: I keep updating this list, and with each update I bring it to the front page, then let newer blog postings slide it down!

Scoreoid Aims to Gamify the World Using APIs January 27th, 2014

Plot.ly’s Plot to Visualize More Data January 22nd, 2014

LumenData’s Acquisition of Algorithms.io is a Win-Win January 8th, 2014

Yactraq API Sees Huge Growth in 2013  January 6th, 2014

Scrape.it Describes a Better Way to Extract Data December 20th, 2013

Exclusive Interview: App Store Analytics API December 4th, 2013

APIs Enter 3d Printing Industry November 29th, 2013

PW Interview: José Luis Martinez of Textalytics November 6th, 2013

PW Interview Simon Chan PredictionIO November 5th, 2013

PW Interview: Scott Gimpel Founder and CEO FantasyData.com October 23rd, 2013

PW Interview Brandon Levy, cofounder and CEO of Stitch Labs October 8th, 2013

PW Interview: Jolo Balbin Co-Founder Text Teaser  September 18th, 2013

PW Interview:Bob Bickel CoFounder Redline13 July 29th, 2013

PW Interview : Brandon Wirtz CTO Stremor.com   July 4th, 2013

PW Interview: Andy Bartley, CEO Algorithms.io  June 4th, 2013

PW Interview: Francisco J Martin, CEO BigML.com 2013/05/30

PW Interview: Tal Rotbart Founder- CTO, SpringSense 2013/05/28

PW Interview: Jeh Daruwala CEO Yactraq API, Behavorial Targeting for videos 2013/05/13

PW Interview: Michael Schonfeld of Dwolla API on Innovation Meeting the Payment Web  2013/05/02

PW Interview: Stephen Balaban of Lamda Labs on the Face Recognition API  2013/04/29

PW Interview: Amber Feng, Stripe API, The Payment Web 2013/04/24

PW Interview: Greg Lamp and Austin Ogilvie of Yhat on Shipping Predictive Models via API   2013/04/22

Google Mirror API documentation is open for developers   2013/04/18

PW Interview: Ricky Robinett, Ordr.in API, Ordering Food meets API    2013/04/16

PW Interview: Jacob Perkins, Text Processing API, NLP meets API   2013/04/10

Amazon EC2 On Demand Windows Instances -Prices reduced by 20%  2013/04/08

Amazon S3 API Requests prices slashed by half  2013/04/02

PW Interview: Stuart Battersby, Chatterbox API, Machine Learning meets Social 2013/04/02

PW Interview: Karthik Ram, rOpenSci, Wrapping all science APIs 2013/03/20

Viralheat Human Intent API- To buy or not to buy 2013/03/13

Interview Tammer Kamel CEO and Founder Quandl 2013/03/07

YHatHQ API: Calling Hosted Statistical Models 2013/03/04

Quandl API: A Wikipedia for Numerical Data 2013/02/25

Amazon Redshift API is out of limited preview and available! 2013/02/18

Windows Azure Media Services REST API 2013/02/14

Data Science Toolkit Wraps Many Data Services in One API 2013/02/11

Diving into Codeacademy’s API Lessons 2013/01/31

Google APIs finetuning Cloud Storage JSON API 2013/01/29

2012
Ergast API Puts Car Racing Fans in the Driver’s Seat 2012/12/05
Springer APIs- Fostering Innovation via API Contests 2012/11/20
Statistically programming the web – Shiny,HttR and RevoDeploy API 2012/11/19
Google Cloud SQL API- Bigger ,Faster and now Free 2012/11/12
A Look at the Web’s Most Popular API -Google Maps API 2012/10/09
Cloud Storage APIs for the next generation Enterprise 2012/09/26
Last.fm API: Sultan of Musical APIs 2012/09/12
Socrata Data API: Keeping Government Open 2012/08/29
BigML API Gets Bigger 2012/08/22
Bing APIs: the Empire Strikes Back 2012/08/15
Google Cloud SQL: Relational Database on the Cloud 2012/08/13
Google BigQuery API Makes Big Data Analytics Easy 2012/08/05
Your Store in The Cloud -Google Cloud Storage API 2012/08/01
Predict the future with Google Prediction API 2012/07/30
The Romney vs Obama API 2012/07/27

Some tips on creating a useful blog for beginners

1) Blog post titles should be self-explanatory

2) Use categories and tags for better navigation

3) Use a theme which attracts, not distracts

4) Simple language in blog writing works best

5) Useful blogs get more traffic than autobiographical blogs, unless you are a celebrity.

6) People who enjoy writing blogs create better blogs

7) Writing a blog is like jogging. Do it every day, even when it’s boring and painful. Or do it as much as your schedule permits.


How to be a better writer


Background: I wrote this by accident while trolling on Quora. I was not confident about what I wrote; in fact I posted it anonymously, except people kept asking me why! It was pure serendipity. I wrote it in less than 4 minutes and submitted it without thinking, then edited it once based on feedback.

Someone clearly smarter than me turned my tips for writing into a picture http://amandaonwriting.tumblr.com/post/54265230509

and it became popular on Tumblr just like it did on Quora!

Apparently, if someone like Wil Wheaton likes your words, they can go viral! It has 41799 notes (reblogs + hearts) on Tumblr as of now.

http://wilwheaton.tumblr.com/post/54699823961/torteen-great-advice-to-writers

Words. Reposted by a member of STAR TREK: NG. I can now die a happy Geek! The Internet is a funny thing!

Thank you everyone! Now if only Google learnt to include OCR for Images as part of text search!

  1. Write 50 words. That’s a paragraph.
  2. Write 400 words. That’s a page.
  3. Write 300 pages. That’s a manuscript.
  4. Write every day. That’s a habit.
  5. Edit and rewrite. That’s how you get better.
  6. Spread your writing for people to comment. That’s called feedback.
  7. Don’t worry about rejection or publication. That’s a writer.
  8. When not writing, read. Read from writers better than you. Read and perceive.

But overall, just write more to get better.
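The arithmetic behind the first three tips can be sketched in a few lines of Python. The word and page counts are taken from the list above; the daily pace passed to `days_to_manuscript` is a hypothetical input, not something the tips prescribe.

```python
# Figures from the tips above.
WORDS_PER_PARAGRAPH = 50
WORDS_PER_PAGE = 400
PAGES_PER_MANUSCRIPT = 300


def manuscript_words() -> int:
    """Total words in a manuscript of 300 pages at 400 words per page."""
    return WORDS_PER_PAGE * PAGES_PER_MANUSCRIPT


def days_to_manuscript(words_per_day: int) -> int:
    """Days needed at a steady daily pace (hypothetical input), rounded up."""
    if words_per_day <= 0:
        raise ValueError("words_per_day must be positive")
    # Ceiling division without importing math.
    return -(-manuscript_words() // words_per_day)


# How many 50-word paragraphs fill one 400-word page.
paragraphs_per_page = WORDS_PER_PAGE // WORDS_PER_PARAGRAPH

if __name__ == "__main__":
    print("Words in a manuscript:", manuscript_words())
    print("Paragraphs per page:", paragraphs_per_page)
    print("Days at one page a day:", days_to_manuscript(WORDS_PER_PAGE))
```

At one page a day, the habit in tip 4 gets you to a manuscript in well under a year, which is the whole point of writing every day.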

1887+ votes on Quora!! :) Probably my most viewed content ever!

61036 people have viewed this answer!

https://www.quora.com/Writing/How-can-we-improve-our-writing-skill/answers/2048810?__snids__=94784810&__nsrc__=1

Also it got a mention here-

http://bookmarketingbestsellers.com/ajay-ohri-the-8-rules-of-writing/

Now I think I should take some of my own advice and get back to writing.

The dichotomy in being a writer on open source with a non-open access publisher

  • The publisher adds credibility to your work

versus

  • A self-fulfilling prophecy where researchers want to publish in exclusive journals and closed-access books for the sole reason that others did so before them, and thereby donate their knowledge and money to the publisher

[Image: Aaron Swartz]

The dichotomy in being a writer on open source with a non-open access publisher?

  • I write on open source R,
  • and I have been published (one book)
  • and am on contract to write two more (R for Cloud Computing) and (R for Web and Social Media Analytics)
  • My publisher does have open access journals.
  • But the book is at $50. Most of India lives on less than $2 per day. That’s 800 million people in my country alone.

But the publisher is the most reputed in this field. So what are my choices? How do I get more people to have the choice to read books?

Take open knowledge, curate it, and put it behind a $50 paywall. I am sorry, Aaron. People like me are the reason ……

 

Writing a technical book

This is a fairly concise collection of notes on how to write a technical book. It may seem arrogant for a one-book author like me to do so, but I get a lot of queries on this, and there seems to be a fair amount of information asymmetry about the process. I have experience with getting rejected and accepted in both creative and technology domains, but I will keep this post fairly tech specific.

Books I have Written-(click on images to go to the book site)

Cred-

Poetry (Self Published)

In Case I Don't See You Again
Corporate Poetry
Poets & Hackers (e-book)

Technology (Published)

R for Business Analytics

(Currently Writing)

R for Cloud Computing (Springer) – Due 2013
R for Web Analytics and Social Media Analytics (Springer) – Due 2014

Top 5 Myths on Writing and Getting Published

  • Publishers don’t like unsolicited manuscripts.

Well, they don’t like unsolicited manuscripts from total unknowns. This is also very domain specific. If you are writing a novel, or a poetry book, or a technical book, approval rates will depend on current interest in that domain.

Advice: if you are a first-time author-to-be, choose as your niche a domain which you are passionate about and which has been generating some buzz lately. It could be Python, D3, R etc.

  • Publishers get all the money

No, they don’t make that much money compared to a Hollywood studio. Yes, books are expensive, but they are basically funding a whole supply chain that may or may not be efficient. Your book is subsidizing all the books that didn’t sell. Proofreading and editing are not very glamorous jobs, but they take a long time and are expensive. I have much more respect for editors now than I did, say, 3 years ago. The ultimate in supply chain efficiency would be if each and every hard copy was printed on demand, and each and every soft copy was priced efficiently given pricing elasticity. Pricing analytics on dynamic book pricing (like on Amazon)... hmm

  • Writers get all the money

You would be lucky to get more than 14% of the gross selling price of a hard copy or more than 40% of an electronic book. If you want to make money, don’t write technical books; write white papers and make webinars.

  • Writers get no money

You don’t make money by writing a technical book, but your branding does go up significantly, and you can now charge for training, webinars, talks, conferences, white papers, articles. These alternatives can help you survive.

  • I got a great idea- but I keep getting rejected. That guy had a lousy idea, but he keeps writing.

THAT guy wrote a great proposal, spent time building his brand, and wrote interesting stuff. Publishers like to sell books, not ideas. Writer jealousy and insecurity are part of the game. You have a limited amount of energy in a day: spend it writing or spend it reading. Ideally do both.
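To put rough numbers on the two royalty myths above, here is a minimal sketch. The 14% and 40% ceilings come from the text; the prices and copy counts are purely hypothetical inputs for illustration, and real contracts will differ.

```python
# Upper-end royalty rates quoted above ("lucky to get more than").
HARDCOPY_ROYALTY_CAP = 0.14
EBOOK_ROYALTY_CAP = 0.40


def max_royalty(price: float, copies: int, rate: float) -> float:
    """Best-case author earnings: gross price x copies sold x royalty rate."""
    if price < 0 or copies < 0 or not 0 <= rate <= 1:
        raise ValueError("price and copies must be non-negative, rate in [0, 1]")
    return price * copies * rate


if __name__ == "__main__":
    # Hypothetical scenario: 1,000 hard copies at $50 and 1,000 ebooks at $30.
    hard = max_royalty(50.0, 1000, HARDCOPY_ROYALTY_CAP)
    soft = max_royalty(30.0, 1000, EBOOK_ROYALTY_CAP)
    print(f"Hard copy royalties (best case): ${hard:,.2f}")
    print(f"Ebook royalties (best case):     ${soft:,.2f}")
```

Even in the best case, a thousand copies sold does not pay for the months spent writing, which is why the branding payoff matters more than the royalty cheque.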

Book Publication

The book publication process has three parts-

1) Proposal

2) Manuscript

3) Editing

1) Proposal- Write an awesome proposal. Take tips from the publisher website. Choose the publisher that is most interested in publishing the topic (hint: go to all the websites). Publisher websites confusing you yet? Jump to the FAQ.

Some publishers I think relevant to technical books-

http://www.springer.com/authors/book+authors/faq+for+book+authors?SGWID=0-1725014-0-0-0

http://support.sas.com/community/authors/index.html

http://oreilly.com/oreilly/author/intro.csp

2) Manuscript- Write daily. 300 words. 300 times. That’s a manuscript. It is tough for people like us. Hemingway had it easy. I used a LaTeX GUI called LyX for writing http://www.lyx.org/. You may choose your own tool, style, time of day or night, cafe or room to spur your creative juices.

3) Editing- You will edit, chop, re-edit and rewrite a book many times. It is ok. My advice is to make it readable. Think of a non-technical person and try to explain your book to them to clear your ideas.

Once your proposal is accepted, you sign a contract for royalty and copyright.

Once the contract is signed, you write the manuscript. This also involves a fair amount of research, citations, and folder management to keep your book figures and your citations ready. I generally write the citation then and there within the book, and then organize them later chapter by chapter. Un-cited work leads to charges of plagiarism, which is the buzz kill for any author. Write, Cite, Rewrite.

You will also need to create an index (can be done by software) so people can navigate the book better, and an appendix for hiding all the stuff you couldn’t leave behind.

Once you submit the manuscript, you choose the cover, discuss the rewrites with the editor, make the changes suggested, resend the manuscript files, and count till six months for publication. Send copies to people you like who can help spread the word on your book. Wait for reviews, engage with positivity with everyone, then wait for sales figures. Congrats, you are a writer now!