Google Custom Search

Here is a revised version of the Custom Search Engine that I first talked of last year- this year it now includes Business Intelligence Sites.

Try it out and let me know of you want to help create a customized Data Mining Engine- Note it already has 800 plus analytics and Business Intelligence Sites.

I got much better results than Google when searching for R, but thats to be expected 🙂

Building KXEN Models on Ubuntu

Doing analytics on Linux sometimes seems user unfriendly but the reality is itis not so- and it is actually cheaper for you as you can focus on the analytical software rather than the Operating System licensing costs.

virtualization-kvm-ubuntu
kxen2

virtualization-kvm-ubuntu

Note: The software used in this were KXEN Linux version 2.4 and Ubuntu Hardy Heron.

Using KXEN on an Ubuntu Linux proved surprisingly even more easy. Thanks to some excellent help provided by the KXEN support team and some discussions with KXEN’s head of Research,Bertrand the following 5 step procedure should help you start building models in KXEN right away.

Using Ubuntu has the added advantages of security, low costs as well as all the ease of a Graphical User Interface-

1)Backward Compatability

$ sudo apt-get install libstdc++5

(it will then ask the password).

2)Installing Java

$ sudo apt-get install sun-java6-jre

3) Download and Unzipping the Software
Download the zipped folder from the KXEN Download Site.
Unzip the Linux Version of KXEN system from the Download Site- This creates the master folder (example Kxen_X86-Linux-2.4.21-4.Elsmp_v5_0_3 )

4)Licensing
Run the KXEN Node Generator in KxNodeCodeGenerator folder within the master folder above.
The new file KXEN Node.txt is then sent back to support team and they send the License_nl.cfg

5)Installation ( for stand alone client)

Install a JVM 1.4.2 and export the java/bin directory in the $PATH environment variable.

The exe is located in the folder KJWizardJNI- The following commands
$ cd KJWizardJNI
$ PATH=/opt/j2sdk1.4.2_10/bin:$PATH ; export PATH
$ ./KJWizardJNI.sh

Run KXEN Models Happily ever after !!!

Note KXEN offers the ability to export models in a variety of formats including PMML, SAS, SQL and other languages

Disclaimer- I am a consultant on social media to KXEN

Google Custom Search

Here is a revised version of the Custom Search Engine that I first talked of last year- this year it now includes Business Intelligence Sites.

Try it out and let me know of you want to help create a customized Data Mining Engine- Note it already has 800 plus analytics and Business Intelligence Sites.

I got much better results than Google when searching for R, but thats to be expected 🙂

saP or saS or sasR or saaS

Some pending news and posts- It appears that the company SAP is moving closer to major acquisitions. This includes launching more and more applications that are analytical in nature as well coming together in an alliance with hardware major Teradata. Teradata off course is a very close partner to SAS Institute. So could SAP and SAS and or Terdata be moving closer to a major announcement on BI and BA merging.

The open source database movement with Hadoop is the one which can be the real game changer in the managed database industry and AsterData is the company to watch here.

However R with its modular extensions is a different paradigm in language developement and SAS no longer has the nimbleness or flexibity in creating such apps- at the same time it has lost a fair deal of credibility in the young academia (due to R) as well cost sensitive consumers (due to WPS)

The succession issue of Jim Goodnight continues to be the biggest problem for SAS Institute- Jim is not getting younger and his second line is not expected to be of the same class as the Sall/ Goodnight partnership. Of all the major companies in software, Jim Goodnight stood alone in remaining private and thus managed to escape distractions of share prices while building up the franchise. Surviving oil shocks, cold wars, three recessions Mr Goodnight has cared for his local community as well despite being active in SAS and fending off sustained attempts by open source languages.

. An automatic partner for Mr Goodnight should have been Google or even Google Labs with the Brin/Page duo being the top data miners ( commerically) of this generation as Sall/Goodnight were 30 years ago.

SAP may spend a lot of its cash but the supply chain paradigm is best served by SaaS and exemplified by Salesforce.com and Force.com developers.

As the ancient Chinese said- May you live in interesting times.

SPSS launches two more PASWs

Just got news from the Chicago school of analytics, or the company known as SPSS. they have decided to lauch two more PASW products and you can see this from the release itself.

SPSS Inc. and the value of Predictive Analytics.

This week we announced PASW Data Collection 5.6 feedback management and survey research software, and PASW Collaboration & Deployment Services 4, our integrated platform to share, manage, automate and integrate analytic assets directly into business processes.

PASW Data Collection 5.6 (formerly Dimensions)

* The use of surveys to capture “Voice of the Customer” across multiple touch-points is integral to bringing data about people’s attitudes into analytical decision-making to improve customer intimacy.
* PASW Data Collection 5.6 supports the entire survey lifecycle — from authoring to managing the data collection process to survey reporting and analysis — supporting global, multichannel research and feedback collection.
* New functionality includes data entry capabilities, an enhanced authoring interface suitable for the novice and the research professional, and new phone-based interviewing capabilities designed to shape the modern survey research call center. This release also further extends the enterprise readiness of the data collection platform with enhancements to performance and security.

You can read the press release at http://www.spss.com/press/template_view.cfm?PR_ID=1088

PASW Collaboration and Deployment Services 4 (formerly Predictive Enterprise Services)

* The platform automates analytical processes for greater consistency and control, and deploys results to business users, consumers or directly into operational systems to reduce customer churn, improve marketing campaigns or identify cases of fraud.
* PASW Collaboration and Deployment Services 4 provides the foundation to integrate analytics into key business processes, so the right decisions are made and the best actions are taken on a consistent, repeatable basis.
* New functionality includes enhanced collaboration capabilities that provide more options for publishing analytical results; enhancements to the Automation Service with additional integration options; and a Real-time Scoring Service to deploy analytical scores into existing applications.

You can read the full press release at http://www.spss.com/press/template_view.cfm?PR_ID=1087

Interview KXEN Bruno Delahaye

In my continuing coverage of KXEN, the plucky company that has managed to revolutionize analytics automation and social network analysis- Here is an interview with KXEN’s Vice President Bruno Delahaye.

246ee7c

Ajay – What is the best feature you like in KXEN. – both   as a company and as a product.

Bruno- Well actually what I like the most about KXEN is the will to make a difference. This is true at different levels of course: each individual within the company is trying to make things happen. For employees at KXEN this is not just a job: they want to change the game! The product side is naturally cascading from this. We are not simply recoding existing algorithms like some of our competitors are doing, instead we are looking in every domain of predictive and descriptive analytics where we can deliver higher value to our customers. When customers, thanks to the automation we provide, come back to us stating that they manage to increase their modeling productivity by 10 or even 50 compared to their previous modeling process we really think that what we provide is changing the game. Also, the fact that we have well over 500 customers globally today is proving that our customers recognize this as well!

Ajay : What areas has KXEN been most suitable for ? Biggest success story so far.
Bruno- KXEN has been very successful for 2 types of customers. We have been very successful in companies with mature Data Mining practices, companies that have realized that they need to move from a fully hand crafted approach to a more industrialized one in order to answer business requirements. As an example, lots of large companies run 10s of marketing campaigns per month and actually use data mining for only 1 or 2 at best… once organizations have understood the power of Data Mining they certainly want to target each campaign. Only KXEN can provide the level of automation required for this. On the other side, new data mining users (either new companies or new departments in a company) are also very eager to use KXEN. The learning curve with KXEN is so quick that it enables them to use their existing team (the ones that are aware of the business issues) and make them run within few days successful churn management programs or rebuild their customer segmentation in a reliable manner.

If you were expecting figures here, some Vodafone entities are claiming that they reduce churn in some customer segments by more than 10% by implementing KXEN. Unicredit in Austria mentioned that due to KXEN they gained an additional 50m€ per season….as you can guess the success of our customers always brighten our days.

Ajay : What areas would you rather not recommend KXEN? What other software would you recommend in those cases ?

Bruno- Well, I would recommend to use KXEN in every area of course, nevertheless where we have been less successful so far is with companies where time pressure to deliver analysis is lower. Basically, research departments tend to use more softwares like SAS EM or SPSS Clementine that are more methods/algorithms oriented rather than results oriented.

Ajay : What is the biggest challenge you have faced while introducing KXEN to a wider audience.
Bruno- The bigger challenge we have is in building domain expertise, it is indeed very difficult to build knowledge of our teams at the same time in Customer Lifecycle Analytics, in HRM, SCM… that is where building a confident relationship with the customer is so important. We have to prove to our prospect very early in the discussions that with KXEN they will make significant steps forward! This is also where our partner are so important to us. KXEN works with international as well as local partners with specific expertise to help our customers make the best possible use of the KXEN Data mining software to insure a high and fast ROI.

Ajay -Do you think the text mining as well as the Data Fusion approach can work for online web analytics, search engines or ad targeting?

Bruno- The data fusion approach is certainly one that makes sense for online web analytics. Analyzing the sequence of events rather than just taking into account whether an event occurs is actually a very powerful way to predict customer behavior or in this case the next click or the next action that is going to be made. I am not in this case claiming that everything has to be real-time as this could be the cause of the creation of weak or even unreliable/non stable models. Instead what we recommend our customer to do is to split the learning part that can be made off-line from the deployment that needs to be done real-time.

Ajay- Describe the relationships of KXEN with other members of the business intelligence community in terms of alliances.

Bruno- KXEN is a very good complement to BI vendors. We are actually partnering with several Data warehouse Vendors. For Data warehouse, the equation is quite simple they allow customers to structure and store the data but to provide real ROI, solutions need to be plugged on top of them. Setting a Data warehouse if you do not use the stored data is just another cost, what KXEN does is enabling to take advantage of the data asset to build customer segments that you will use to define your marketing mix, or simply target your customer either for cross-selling, up-selling or retention/loyalty purposes. The same is valid for credit scoring, fraud detection….

Case Study- Assume I have 50000 leads daily on a Car buying website. How would KXEN help me in scoring the model (as compared to other online based scoring solutions). Is it technically possible for me to install KXEN on Windows/ Other instances in remote computing like Amazon EC2 and not a server sitting somewhere.

The key difference, I believe, is that with KXEN you will indeed be able to do this even if you are not a data mining expert, if you want to use the results of yesterday’s campaigns to rebuild a model and if you can only afford to spend 10 minutes on this task every day. At the end of the day what we allow our users is to answer their business questions within the time frame they have rather than trying to convince them that they do not really need to do so many analysis for their business to run successfully

Ajay- And that was Bruno, VP, EMEA, KXEN. His profile can be seen here

http://www.linkedin.com/in/brunodelahaye

Bruno Delahaye manages KXEN’s operations for Continental Europe, Middle East, Africa and South America at KXEN. He is responsible for identifying and managing key partnership opportunities and developing the overall strategy for new partnerships.

For more on KXEN please go to http://www.kxen.com, you may need to regsiter to download their properietary white papers on Structural Risk Management or Text Mining.

Conflict of Interest Disclaimer-I am a consultant to KXEN as a social media consultant. Chairman Roger Hadaad was one of the first Chairman of a major corporation to agree to give interview to this small blog.