Creating Online Communities

Sometime back I had asked the question- How much do you think would it be to have the top 100 bloggers on SAS language on the same page, in a manner that the RSS feeds get updated on their own. The answer is here-

Wordframe. I had covered this software before in comparison to Ning.com and they proved favorable.

This is a small startup, East Europe and US based since more than a decade and very hard working. They allegedly wanted to become open source and had plans to create third party applications when I checked with them in January but this may be on hold for a new product launch.

Would sas.com pay for 1000$ set up fee and 200$ monthly fee for getting the top 100 SAS bloggers on their sascommunity.org website.

Would oracle pay for 1000$ set up fee and 200$ monthly fee for getting the top 100 Oracle bloggers on a website sponsored by them.

How much would Aster Data pay for say 100 bloggers about Hadoop ( ahem- assuming there are 100 people who CAN blog about Hadoop- a bit like Einstein’s 5 people in the world can understand his theory of relativity).

Check this site out.

image

Advertising Feed back Loop: Facebook and Adwords

In Systems design a feedback loop is something that measures the difference between expected and actual output and adds it back to the system to enhance it’s performance

For a better more accurate definition see http://en.wikipedia.org/wiki/Feedback

Here is how Facebook measures the satisfaction whether I got for an ad on my page- note fraudelent or systematic cheaters would be eliminated by the law of averages.

It places a very small thumbs up or thumbs down on an ad on right hand corner.

When I place a thumbs down on an ad ( note it shows only graphical ads)-it asks me why i didnt like the ad

Was it

1) Misleading
2) Offensive
3) Uninteresting
4) Irrelevant
5) Repetitive
6) Other- allows text mining of free form allowing subjective feedback.

However if I type say Business Intelligence on Google – I get the following

http://www.google.co.in/search?hl=en&q=business+intelligence&meta=&aq=0&oq=business+intell

Open Source Ad on Pentaho at prime spot just above the results. Makes me wonder how much open source will spend.

Search Results-
Wikepedia entry usually at the top ( but no ads there*)

Search Optimization means SAS Business Intelligence is now fifth from top- which is the first mention of a BI vendor there ( Note cost to SAS is zero as you can see below). In the remaining top 4, two are similar http://www.businessintelligence.com and http://www.businessintelligence.co.uk as the title page tag remains more important than sheer consumer behaviour to Pagerank.

Right Column Sponsored Links ( These guys spend more money than SAS to be on the sponsored links and still end up in a less favourable ad page than in top vendor on search results page)-

1 http://www.brgglobal.com
2 SAP.com/India
3 IBM.com/cognos
4 http://www.BusinessObjects-India.com
5 http://www.softsmith.com
6 http://www.syncfusion.com/BizIntell
7 http://www.intelligententerprise.com/

(Note from Ajay*- A senior IIM Alumnus CK Prahlad (the first batch in fact) – first expounded the concept of core competence-

Now-what is Google’s core competence.
I believe that Google should simply donate all its information on wikipedia, and offer Google Knol to it- rather than just wallow in sunk cost theory or expect an immediate return from this transaction. It will get lots of goodwill- and maybe the Google Wikipedia Knol will take off.

Similarly sell Orkut to Facebook and just take the advertising and search functions to be combined in a seperate spin off called Google Search before Microsoft forces the Europeans and Justice department to do so.

Or continue to moon the giant.)

I didnt like the Adwords ads or the order they were in. But there is no way in Adwords for me to give feedback to it. Facebook listens. Will Adwords listen.

Interview Dominic Pouzin Data Applied

Here is an interview with Dominic Pouzin, CEO of http://www.data-applied.com which is a startup making waves in the fields of Data Visualization.
meAjay – Describe your career in applied science. What made you decide to pursue a career in science? Some people think that careers in science are boring. How would you convince a high school student to choose a career in science?

Dominic- It’s important to realize that we are surrounded by products of science and engineering. By products of science, I mean bridges we cross on our way to work, video games we play for entertainment, or even the fabric of clothes we wear. Anyone who is curious should want to know how things really work. In that case, a scientific education makes sense, because it provides the tools necessary to understand and improve our world. I would also argue that a scientific training can also be a stepping stone towards high levels of achievements in other fields. For example, to become a financial wizard, a top patent attorney, or direct large clinical trials, a scientific education serves as a strong foundation. In addition, it’s probably easier to switch from science to another field than the other way round. Who wants to learn about matrix calculus in their forties? In my case, I graduated with a Masters in Computer Science degree, and spent 10 years at Microsoft leading software development teams for the Windows server, Exchange server, and Dynamics CRM product lines. I wish that, along the way, I had found time for a PhD in data mining, but years of practical software engineering experience also has its advantages.

Ajay- What advice would you give to someone who just got laid off, and is pondering whether he should / should not start a business?

Dominic- Working for a large company used to mean trading some autonomy for more stability and access to a wide array of resources. However, in this economy, the terms of the equation have changed. Many workers who lost their jobs found that this stability had disappeared. Others found that resources have become scarcer due to shrinking budgets. With this shift in the balance, entrepreneurship starts becoming more appealing.

Creating your own business might sound daunting, but for example creating a US Washington State LLC takes about 15 minutes, costs 200 dollars, and only requires an Internet connection. Managing payroll may sound like a big headache, but again specialized companies can handle all payroll matters on your behalf for only a few dollars a month. So while this part is relatively easy, you also need two things which are more difficult to come by:

a/ an unshakable belief in what you are trying to achieve, and

b/ a willingness to handle anything that comes your way.

You need to think like a commando solider who just landed on a beach: you’ve got great skills, but you’re alone, and can’t afford to fail. Practically, you may find yourself working for weeks or months with little or no income, and friends and family thinking that you are wasting your time. So, if necessary, try finding a co-founder to boost your confidence and motivate one another. Also, unless you want to spend most of your time chasing people for money, personal savings are a must.

Ajay- So describe your company. How does data visualization work? What differentiates your company from so many data visualization companies?

Dominic- We’re trying to stir things up a bit in terms of making it easier for regular business users to benefit from data mining. For example, we enable new “BI in the cloud” scenarios by allowing users to simply point a browser to access analysis results, or by allowing applications to submit and analyze data using an XML-based API. Built-in collaboration features, and more interactive visualizations, are also definitely part of our story.

Finally, while we focus on data mining (ex: time series forecasting, association rule mining, decision trees, etc.), we also make available other things such as pivot charts or tree maps. No data mining algorithm there, but why should business users care as long as the insight is there?

dataapplied_overview-500x326

To answer your question about visualization, most packages offer basic features such as the ability to pick colors, or to change labels, etc. For differences to emerge, you have to ask the right questions.

*

Access: does visualization require an application to be installed on each computer? Our visualization work directly from a web page, so there is nothing to install (and upgrades are automatic).
*

Search: can visualization results be searched, so as to enable drill-down scenarios? In the age of Google, we enable search everywhere, so that views can be constrained to what the user is looking for.
*

Collaboration: can visualization results be tagged using comments, or shared with other users while securely controlling access, etc.? Visualization is only a starting point – chances are that you will need to talk to someone before analysis is complete – so we offer plenty of collaboration features.
*

Export:how easy is it for a business user to present analysis results to management in a way that is understandable? We make it easy to export visualization content to a shared gallery, and as presentation-ready images.

There are a couple of other things we do as well in terms of interaction (ex: zoom, select, focus, smart graph layout), and a couple we don’t have yet (ex: geo-mapping, export to PDF).

But in conclusion, I would say that useful data visualization is as much about the way you present data (and that must be compelling!), as it is about how one accesses, searches, secures, shares, or exports visualizations.

Ajay- The technology sector was hit the hardest by the immigration of skilled workers. As a technology worker, what do you have to say about immigration? What do you have to say about outsourcing? Do you have any plans for selling your products outside the United States?

Dominic- I am a US permanent resident, half French, half British, and my wife is Indian. So you won’t find it surprising to hear that I am in favor of immigration. In 1996, as an engineering student in France, I made the unusual choice to study one year at the Indian Institute of Technology (Delhi).

In fact, I was the only one in my engineering college (France’s largest) to select India as a destination (my friends all went to the US, UK, Australia, Germany, etc.). Now that India has become a recognized player in the IT field, several dozen students from the same engineering college chose India as a destination. So I guess the immigration is starting to flow both ways!

Also, among the people I used to work with at Microsoft and who left to start a company, a good proportion are immigrants. So it’s important to recognize that immigrants not only help fill high-tech positions, but also create jobs.

Finally, as an entrepreneur trying to keep costs low, outsourcing is a tool you can’t afford to ignore. For example, websites such as http://www.elance.com provide easy access to the global marketplace. For those worried about quality, it’s possible to review customer ratings and portfolios. We keep track of visitors coming to our website, and the majority of the visitors to date have been from outside the US.

Ajay-  What is the basic science used by your company’s product?

Dominic – We use a client / server model. On the server, at the lowest level, we use SQL databases (accessed using ODBC), acting as data and configuration repositories.

Immediately above that sits a computing layer, which offers scalable, distributed data mining algorithms. We implement algorithms which scale well with the number of rows and attributes, but also properly handle a mix of discrete / numeric / missing values.

For example, just for clustering, the literature has some incredibly powerful algorithms (ex: WaveCluster, an algorithm based on wavelet transforms), but which also fail as soon as you enter real-world situations (ex: some fields are discrete).

On top of the computing layer sits a rich, secure web-based XML API, which allows users to manipulate analysis and collaboration objects, while enforcing security.

For the client, we built a web-based visualization application using Microsoft Silverlight. To ensure client / server communications are as efficient as possible, we use a fair amount of data compression and caching.

Ajay-  Who are your existing clients and what is the product launch plan for next year?

Dominic- We’re only in alpha mode right now, so our next customers are in fact beta testers. We’re still busy adding new features. It’s good to be small and nimble, it allows us to move quickly. Sorry, I can’t confirm any launch date yet!

Ajay-  What does the CEO of a startup company do, when he has free time (assuming he has any)?

Dominic- When you spend most of your time working on analytics, it’s sometimes hard to leave your analytical brain at work.

For example, I am sure that readers who come to your website and visit a casino can’t help themselves and immediately start calculating the exact odds of winning (instead of just having fun).

Among other things, I enjoy challenging friends to programming puzzles (actually, they’re recycled Microsoft interview questions). My current bedtime reading is a book about data compression. I think you got the picture!

******************************************************************

Dominic is currently making promising data visualization products at http://data-applied.com/ .To read more about him, please visit his profile page http://www.analyticbridge.com/profile/DominicPouzin

For the past one week I have been fending off repeated denial of service attacks on Decisionstats.com

My email addresses are widely known thanks to my participation on technical forums and my willingness to reach out and make friends ( 8500+ on Linkedin alone).

While the attacks were particularly vicious including email accounts, server changes, ip addresses to deny certain geographies, poisoned RSS feed/scripts, and scripts in folders to re direct posts.

Thanks to the fine team supporting me and some friendly advice from bloggers at Google- the worst is over. I have not hesitated to bash Google at will over its big size and even added a few cartoons on them – (even now I think they should sell Orkut to Facebook and add Twitter to create a spinoff Google Social Media for an IPO and take that particularly search and advertising)

But Google Search and Gmail and Google Analytics and Particularly Youtube are free- and thanks to them knowledge has spread way around the world. Also working behind the scenes the Google team spends a huge amount fighting spam and all for free. Or a bit of advertising on a website.

Thanks a lot Google men, and here is a You Tube video ( which you may have seen many times before but still) that celebrates the one ness of a global online community ( despite the occasional sith attacks)

saP or saS or sasR or saaS

Some pending news and posts- It appears that the company SAP is moving closer to major acquisitions. This includes launching more and more applications that are analytical in nature as well coming together in an alliance with hardware major Teradata. Teradata off course is a very close partner to SAS Institute. So could SAP and SAS and or Terdata be moving closer to a major announcement on BI and BA merging.

The open source database movement with Hadoop is the one which can be the real game changer in the managed database industry and AsterData is the company to watch here.

However R with its modular extensions is a different paradigm in language developement and SAS no longer has the nimbleness or flexibity in creating such apps- at the same time it has lost a fair deal of credibility in the young academia (due to R) as well cost sensitive consumers (due to WPS)

The succession issue of Jim Goodnight continues to be the biggest problem for SAS Institute- Jim is not getting younger and his second line is not expected to be of the same class as the Sall/ Goodnight partnership. Of all the major companies in software, Jim Goodnight stood alone in remaining private and thus managed to escape distractions of share prices while building up the franchise. Surviving oil shocks, cold wars, three recessions Mr Goodnight has cared for his local community as well despite being active in SAS and fending off sustained attempts by open source languages.

. An automatic partner for Mr Goodnight should have been Google or even Google Labs with the Brin/Page duo being the top data miners ( commerically) of this generation as Sall/Goodnight were 30 years ago.

SAP may spend a lot of its cash but the supply chain paradigm is best served by SaaS and exemplified by Salesforce.com and Force.com developers.

As the ancient Chinese said- May you live in interesting times.