A SunBurst of Insight

a nice addition to Big Data Visualization- sunbursts (which I have covered in the Dat Viz chapter of my R book)

Great work by BigML.com

davidgerster's avatarThe Official Blog of BigML.com

This is a guest post by David Gerster (@gerster), a data scientist and investor in BigML.

I work at a consumer web company, and recently used BigML to understand what drives return visits to our site. I followed Standard Operating Procedure for data mining, sampling a group of users, dividing them into two classes, and creating several features that I hoped would be useful in predicting these classes. I then fed this training data to BigML, which quickly and obediently produced a decision tree:

decision_tree

Next I used BigML’s interface to examine the tree’s many subsets, shown as “nodes” in the diagram above. I moused over a node at the top of the tree and saw that it achieved high separation for a large fraction of the training set:

Shhh, I'm hunting nodes!

This one node covered 58% of the data, and separated the two classes with 73% confidence. (“Confidence” is a measure…

View original post 546 more words

Predictive Analytics World goes to Chicago

Message from our Sponsors and my favorite Analytics conference ( only if I could attend a cool analytics conference nearby in Asia (singapore/turkey?)  -sighs) Even useR wont come to Asia ever?-

This is the number 1 conference for analytics in the world and it is next month in Chicago, USA? So you think you have the best analytics software or product or service. Here is where you can find it out!

It’s time to amp-up your analytics strategy. It’s time to beef up your analytics strategy by attending Predictive Analytics World Chicago, June 10-13, 2013. With over 30 case studies from leading organizations across a spectrum of industries, this is the must-attend event for anyone serious about their analytics strategy.

Here’s what your peers had to say about their experience at PAW:

“Great speakers, interesting content, and great networking. PAW conferences are among my favorite analytic events!”
– Karl Rexer, Ph.D. Rexer Analytics“This vendor neutral conference always gives me tangible ideas I can put to work right away.”
– Greg Hayworth, Humana

“Predictive Analytics World did a great job keeping up with the trends in Predictive Modeling. There were also plenty of opportunities to learn about the most valuable resources available to data scientists.”
– Conor Sontag, Marketing Evolution

“People who are in analytics must join Predictive Analytics World and see the state of the art projects.”
– Burak Buyuktombak, Avea Telecommunication Services (Turkey)

And there is more where that came from.

Who’s attending PAW Chicago 2013?

Here are just a few of the many companies attending:

Whose attending PAW Chicago

And many more!

Registration options for all budgets.

PAW Chicago has a variety of conference pass options available to meet budgets of all sizes.

Learn more about pricing and how to register.

Register Now!

2013 Chicago Sponsors
Follow Us on Twitter Be a Fan on Facebook LinkedIn Group Live Twitter Feed

What if Google decides to make Google Contact Lenses

From the Gamification course in Coursera- what a brave new world this could be if we could create minute contacts of Google Glass?

Sight from Sight Systems on Vimeo.

Iron Man 3 Review – Oh what a tangled web we weave!

Iron Man 3 has more. Twice the villains. Five times the Iron Men. More girls for Tony Stark. Pepper Pots finally gets some action scenes. But I like the guy with the English accent the best- even better than Australian hunk. Anything more and you could end up with a spoilt ending. Hollywood gives us movie sequels knowing we are suckers for watching how the story will turn out. This edition though was not so great as the last Avengers universe franchise. Go with lowered expectations or with your kids, and you will be okay.

On and one more thing! At the end of ALL the credits- the Dr Bruce Banner makes a 2 minute nice appearance. Witty! The movie just make a big hit financially but alarm bells should ring in the Avenger sequel universe. Especially since Star Trek and Super Man are just here!

Iron Man 3 Teaser Poster

Top five ways to do business unethically in India

Over a decade long career , I have often been reminded of this saying from erstwhile mentors in long forgotten consulting email group- It is not WHAT you KNOW, it is WHO you KNOW. The power of WHO you KNOW can defeat even what you know , have learnt or worked hard at. Accordingly these are some wry observations on how businesses sometimes take shortcuts in India, and the whys and wherefores.

1) Regulatory Arbitrage due to Lack of Regulatory Oversight- This is especially true in terms of labor practices. This includes under-paying Caucasians and non -Indians for internships , or jobs (in the name of sponsoring the work visa). India is an extremely inexpensive place to stay in, but it is sometimes unfriendly (in terms of laws not people) to people visiting from the West. This ranges from amusing things to paying 10 times the price for non Indian visitors to Taj Mahal- to not so funny things as paying them lower salaries because they need  a reason to stay on. Unfortunately this is true in many countries -underpaying aliens, but it is much better regulated in the West.

2) Stealing Intellectual Property– I have often known people to steal presentations and even excel macros from the place they were working to the new place. Almost no one gets prosecuted for intellectual property theft (unless you are caught with 10,000 pirated music or film cds)

3) Using Pirated Softwares – Lack of awareness of FOSS means many SMEs use shortcuts including downloading software from Pirate Bay and using this to work for clients in the West. Example- This could be as simple as downloading SAS software from Internet, or using WPS software for training and mis-representing SAS Institute’s name. (added confusion due to SAS -software,company,language ) . There are other major companies who suffer from this too, notably Microsoft.

This could be as complex as using academic versions of enterprise software for businesses purposes. In each case because of the geography, legal risk is quite low, and returns quite high from pirated software. It also helps lower the unethical vendor’s quotation of prices compared to the one who is doing it straight.

One way to avoid this is –ask your vendor to show you copy of how many legal licence’s for software. It can also help in cutting down exaggerated bench strength claims of vendors, as sometimes businesses hire many people and then put them on internal projects.

4) Illegal Trade practices- This include making employees sign a 1 year bond for not leaving the company after they have visited the West for company work- in the name of training . This also includes abusing the loopholes in various types of visa.

5) Ignoring signed contracts and negotiating to lower prices at every step illegally, in collusion with other vendors ( there is no effective anti -trust act ) and using the complete inadequate and lengthy nature of filing court cases in India.
Almost every non Indian client I know pays on time- almost every Indian client I know needs reminders. This is more of a mindset problem , knowing the reluctance to file lawsuits in India given slow progress in the courts ( India has 1.2 billion people and per capita access to judges and lawyers is quite low). The buzz word is- How much can we settle this? Lets do a settlement!

In the long run, this is choking off growth and potential of SMEs in India. In a continuing series- I will help the non Indian users with ways to use technology for legal remedies  in India for intellectual property  along with known case studies and examples.

Using a Linux only package in Windows #rstats

Here is some R code for using a R package that has only a tar.gz file available (used to load R packages in Linux) and no Zip file available (used to load R packages in Windows).

Step 1- Download the tar.gz file.

Step 2 Unzip it (twice) using 7zip

Step 3 Change the path variable below to your unzipped, downloaded location for the R sub folder within the package folder .

Step 4 Copy and Paste this in R

Step 5 Start using the R package in Windows (where 75% of the money and clients and businesses still are)

Caveat Emptor- No X Dependencies (ok!)

path="C:\\Users\\KUs\\Desktop\\segue\\R"
b=dir(path)
c=length(b)
for (i in 1:c){source(gsub(" ","",paste(path,"\\",b[i])))}
ls()

 

R2D2

Adding a + to the bit.ly link you get to get analytics on your spammers

Just add a + sign to any bit.ly link and you get to see associated analytics for that link.

you can get information (traffic, referrers, locations, conversations) about any Bit.ly link simply by taking the short URL and adding a “+” at the end (minus the quotes)

Click on the image below and notice the + sign in the URL.

Read more here this can be useful than just fun-

Using Bit.ly for Spying, Link Building and Happiness

Unrelated- I interview Hilary Mason, Analytics legend and Bit.ly Chief Scientist here –

Interview Hilary Mason Chief Scientist bitly

nah