Web Analytics is funny statistics

I have a simple question for my Web Analytics software. I want to know who is reading what, and how much are they being impacted ?

In return my Web Analytics gives me dashboards that can be line charts, bar plots, path diagrams (including Google Analytics).

  • Some questions for my Web Analytics to answer-
  • Will it count 500 CEOs reading my blog as less significant as 5000 college students. Thats not a problem if I am on a social network or is it?
  • I get 15000 unique viewers every month . How many people is that? Does that mean the same 500 people visited every day. Does it mean every day a different 500 people visited. Yes I know Google Analytics has some kind of pie chart (horrible) split and returning and new users- but HOW MANY PEOPLE DID I reach?
  • What did they do after the read my blog? Where did they go? Google shares Adsense revenue. Can it share data too- lets call it DataSense. Even create a new internet data bureau (like we have credit data bureau for financial data)
  • How can I use the web analytics software to give me a forecast of future traffic ( by a time series plot with an added regressor of number of posts per category type ?)
  • How can I get some ANALYTICS to take a decision from the web analytics- (A Siri for Web Analytics?-  You last posted X days ago. Please consider posting. Please consider delaying posting to a more appropriate time?)
  • Is there more to life for a blogger than views and visitors. Is there some way we can measure satisfaction?
  • Is there a SEO penalty for boasting on blog traffic boasting when meeting another blogger. Is there a SEO incentive for openly sharing your web statistics
  • Can Google Analytics give a big data dump for open data analytics (sigh). Can you use custom JS libraries for making your own dashboard with GA

Screenshot from 2015-10-22 08:01:27


You wake up every day with a bank balance of 12 hours of productive work time. every night as you go to sleep the balance goes to zero. you wake up every day with a finite energy balance of a few kilojoules willing to be expended. the balance is upto you but it cannot be carried over the next day.

You wake up with choices and you go to sleep with having made the decisions on the choices. You can focus on what lies forward and stay positive.OR . You can swallow the negativity and be swallowed in it’s swamp.

Intelligent men can make bad choices. Choices that you make today will be with you in the future.

repair the world

The phrase “Tikkun Olam” literally means “world repair.” It is commonly used to refer to the pursuit of social action and social justice.  The concept behind it originate in kabbalah, in the teachings of the 16th century mystic Isaac Luria.

One of the ways you can repair the world is by building good products to replace the ones that are currently in it.  Facebook replaced Orkut and MySpace and its easy user interface and user experience enabled grandmoms to see the pictures of their children from a distance, for old friends to reunite, for global entrepreneurship to be energized. Google Search repaired the world, by pioneering Online Advertising, by  investing in free video sharing, by investing and mentoring and inspiring numerous startups and of course with their free and open source Android system that lowered the cost of smartphone ownership.

Hollywood tries to repair the world by repeatedly portraying certain values. With an immense audience, Hollywood has great soft power beyond glamor or superficial entertainment. Recently it has started portraying repair the world science and global cooperation rather than American dominance.

The Walk is a great movie. Robin (Joseph Gordon-Levitt) takes in help from his accomplices  Gandhi (Ben Kingsley), a George Carlin lookalike, a mathematician (yay!!), a pretty french girl fulfills his childhood dream. He audaciously walks the tallest buildings in the world , the Twin Towers at the World Trade Center. Based on a real-life story, The Walk shows an inspired French Man was instrumental in the World Trade Center’s birth as the center of the world’s greatest city.  The movie is more of a poem of the architecture of the buildings too and it’s haunting to think why the actions of 19 angry men led by a stupid ideology of hatred and destruction killed that dream. I have seen the city of New York once in 2009 but it was between trade towers at the moment. Bin Laden wanted to repair the Muslim World’s dominance of corrupted dynasties supported by the West. The West reacted by unleashing war, and then squandering the entire goodwill by extending it to Iraq, a mess that endures today. In spite of everything,  people are truly good at heart -that is a concept that only an idealistic teenager can have.

To repair the world, you need to assist the ones trying to heal it and you need to thwart the ones trying to steal it. Repairing the world means stopping the ones sapping the climate, the environment and the planet of precious resources.

No force can stop an idea whose time can come, but as time changes the ideas to repair the world change. What have you done to repair the world today?

How Jupyter Ipython threaten the dominance of RStudio for data science developers

RStudio is the clear market leader in IDE used by developers for R data science.

R is the clear market leader for data science.

Python can do with more wrappers for R like packages.

But Jupyter is awesome (once you get it working!)

Screenshot from 2015-10-20 14:29:53

Hopefully, multi core stuff and cloud hosted stuff should be easy too. Google Cloud Data Labs with hosted Jupyter is just the first step. see https://cloud.google.com/datalab/

One of the best things I like about Jupyter OVER RStudio’s interface is the ability to divide code blocks in cells. In addition the ability to install new packages from with RSTUDIO really helps me over the Jupyter. The syntax prompt in latest version of RSTUDIO is something I wish JUPYTER really worrks on.

Can we have a RSTUDIO like interface to working with Python. Yes Yhat made one and called it RODEO. This is because the interface is based on the ACE editor ( yes esseentially RStdudio the company married ACE Editor to Hadley Wickham to get RSTUDIO the product 😉  . Shiny was wonderful but for scalable data science Python and Java help me just as much as R does for BIg DATA ANALYSIS) Scalability is the key here! Rpubs isnt as popular as NBviewer is and now we can wrap markdown within a Jupyter notebook

Screenshot from 2015-10-20 14:28:03

Screenshot from 2015-10-20 14:27:36

Screenshot from 2015-10-15 18:37:41

can Jupyter help in my data science work more than RStudio? These are early days but I prefer a cross platform cross language ( Julia, Python and R) solution anyday. Provided it works just as seamlessly than the established market leader RStudio.

BIG DATA ANALYTICS is where I clearly see JUPYTER help data scientists more than RStdudio as you can use the IRKERNEL. I am especially hoping to see the Spark Kernel , JS Kernel  https://www.npmjs.com/package/ijavascript  and others be  more production ready for business enterprises.


A version of the Spark Kernel is deployed as part of the Try Jupyter! site. Select Scala 2.10.4 (Spark 1.4.1) under the New dropdown. Note that this version only supports Scala.


Python/Jupyter kernels:

The Kernel Zero, is of course IPython, which you can get though ipykernel, and still comes (for now) as a dependency of jupyter. The IPython kernel can be thought as a reference implementation, here are other available kernels:

Name Link Jupyter/IPython Version Language(s) Version 3rd party dependencies
ICSharp https://github.com/zabirauf/icsharp Jupyter 4.0 C# 4.0+ scriptcs
IRKernel http://irkernel.github.io/ IPython 3.0 R 3.2 rzmq
SageMath http://www.sagemath.org/ IPython 3.2 Any

Screenshot 2015-10-20 14.07.08 (1)

Screenshot 2015-10-17 18.10.10


Screenshot 2015-10-20 11.17.52

The God of Big Data

An Omniscient God should be someone who can analyze all data all the time in any form. Think of a twitter created by the alpha waves of the thoughts of your brain. God is following you. Like Tom of MySpace , HE is friends to all of us. Or thats what the definition of GOD is supposed to be

super small sensitive sensors generate data that are analyzed by algorithms that present results for decision making by angels. sensors embedded in your brain your mobile your watch your laptop.

What do humans do with this data? Do they give you a personalized learning program on what to read today. Do they give you a personalized exericse and food program on how to stay healthy today? or do they just ask you to click click click, so you can maybe convert a bt, and then buy buy buy.

Buying generates debt and economic activity and puts pressure on the buyer to keep working harder so he can come to play the buying game in conspicuous consumption. Does the product you buy list your carbon impact. No that will lower sales. Does it give you a rush of chemicals approximating happiness. Of course retail therapy is the best psycho therapy.

A merciful loving God that has access to all data would intervene not only yo keep us buying and working, but to keep us informed of choices we make and choices we disdain. On how to be better educated, better informed, better health and better impact on government and enironment.

Google failed us as the God of all Data. Alphabet is just a primal soup right now.

The God of Big Data? Now accepting applications.

a little knowledge is a unicorn startup

I see them all day and I see them from morning to evening. Bright-eyed young men on one side, cynical middle-aged men smiling on the other side of the table. Playing a game that is half-way between strip poker and Russian roulette. The one where one side claims they are trying to change the world by giving the other side 10X on their investment. If you could only sign the cheque Mr Middle Aged Man, and if you could only could rub your nose with powder Mr Young Man.Being a data scientist can be an advantage in these oxygen sucking bone marrow draining meetings as you play umpire to Sharapova grunting to beat Hingis in this tennis match. You can do a tech jargon dump, avoid questions by claiming just to be the tech guy or you can give them crazy brilliant scientist with the bad joke routine

What grates my teeth , gets my goat Billy, irritates my allergic cough and what really annoys me is when the potential investor tries to show off their technology chutzpah and when the-the potential investee starts sucking icecream just to stay afloat in the game. Life is unfair and Karma is a bitch cliche, but when I feel like shouting at the inane conversations in the meetings I have to attend, I amuse myself by calculating how much money I am making every second ( it’s not much , but we have a long term plan).

A little knowledge is a dangerous thing. Young men  often shoot more of this not knowing enough, but old men who just won’t do a google search before they speak inane is what drives me  temp insane. Do you really want to hand over 25% of your company to this guy for 300K kid.

That’s right, the valuations in Indian startups are so mind blowing fucking low, that all the failed VC’s or the homesick desis return home and over night go from upper middle-class suburb desperate househusbands to indian angel investors returned from cowboy land to the glorious motherland to do what is right by her. They used to be proud first generation Americans till they returned to become Wily Oriental Gentlemen.

Technology startups in India are hot, but I think it’s not a bubble because valuations are low in dollars, but its a correlated and cointegrated bubble with the hash they smoke in San Carlos streets.


  • the tech talent is shoddy (crumbs after the outsourcing lions have been fed )
  • and its all wild wild east laisse faire (in one of the most corrupt countries still)
  • and all the  sex  (500 million people less than 30 in India , cough)
  • and all the Opium Triangle drugs make this more of a Mad Man on TV than a MAD magazine on live street.

Google your jargon, kid and google your bullshit unicorn breeders.

In India, cow dung is holy, but bullshit is not.

That’s not even a metaphor. Its reality and reality bites the slumdog millionaires faster than it bites the husband of a Kardashian.

Cheques and Balances for Startups

The concept of Western democracy rests in the idea of checks and balances. Judiciary (Law Courts)acts as check to Executive (Governments) and in turn balance is provided by Legislature ( Senate /Parliament). To counter the friction between plebians and patricians in a society, Legislature is further divided into Lower House (House of Commons, Congress, Lok Sabha) and Upper House ( House of Lords, Senate, Rajya Sabha). This divide and then rule policy serves as a risk mitigation force in governance for countries. The risks of course are unstable, violent and unpredictable changes in operation (like revolutions, coups, revolts, dictatorships)

Organizations are famously known to be non-democratic. Startups even less so. Yet every team member in a technology startup is valuable and that value increases after they have got on the job training in cutting edge technologies and exposed to solving interesting problems. You balance their tendency to churn by giving good food, great culture, flexible timing. An important part is stock options granted timely and in a fair manner, and salaries paid. Cheques check attrition in startups. A healthy bank balance in the organization is the only thing that balances the uncertainty and insecurity of working in unstructured dynamic environments like technology startups.

Learning must be balanced by earning for young members of each startup. Older experienced members of startup need checks and balances of  a different sort as they have multiple opportunities in employability and opportunity costs in getting a lower startup salary. Older members of a startup , okay, lets call them seasoned members of a startup have more obligations like student debt, house debt, child education etc.

I wonder if the level of entrepreneurship in a society would increase if we had lesser student debt, by giving cheaper education, by using mobile, internet and video more. Trainings in various kinds of coding languages should also be increased in accessibility and duration. I see many technology stacks being coded in bad choices of platforms solely because the techies were not prompted to enroll in the right paid program for online self-learning. So for example if there is shortage of Ruby on Rails or Python developers in data science in India, rather than invest a small 50$ per employee per month in online based training, companies continue to work with legacies of LAMP, and other legacy software.

You just can’t google your way to being a data scientist or a big data engineer. Trainings are just as important as good food for members in your startup.

What balances the benevolent dictatorship of the passion of the founder. A seasoned mentor in the team. What checks people from hire and fire and reneging on stock option promises. Defined processes established by a compliance person.

It is upto you the founder, or the investor on what kinds of checks and balances you want to create in your startup. Do you want the startup to look like Canada or do you want it to be like Syria? Choose the cheques, checks, and balances accordingly.