Running R GUI on Google Compute

I wanted to run R GUIs ( rattle, Rcmdr, Deducer) on my Google Compute Instance, but didnt know how to figure out how to enable x11.

Initially I just tried to enable x11 forwarding in the local ssh (Ubuntu) and remote sshd( GCE), but it still needed some more.

Note I use gedit to edit files locally ( since it is easier) and vi to edit files remotely ( because I didnt have a graphical environment there yet) . I used vi help from the link here  (basically sudo vi filename opens the file in Linux, you scroll down and press Insert to write your changes, then hit escape, then write this to save and quit :qw ( or :q! to NOT save and quit), your mouse is quite useless and the arrow keys dont help much in vi- I assure you that)

[local]
/etc/ssh_config or ~/.ssh/config
ForwardX11 yes

restarted local ssh

[remote]
/etc/sshd_config
X11Forwarding yes

restarted remote sshd

Well this is how it is done- following is a copy and paste from actual discussion-

here are two steps you have to do in order to run X-windows applications on your instance.

1) You have to install some X-windows applications on your instance.  I used the command
sudo apt-get install xterm
which works on Ubuntu.  On Centos, you would use the command
yum install xterm
but I didn’t test that.
2) You have to create an X-windows tunnel through SSH.  You do that with the -X switch to the gcutil ssh command:
 gcutil ssh –ssh_arg -X INSTANCE
When you login to the instance, verify that the tunnel is in place.
$rman@test-pd:~$ echo $DISPLAY
localhost:10.0
rman@test-pd:~$
By way of contrast, this is what it looks like if the tunnel didn’t work:
rman@test-pd:~$ echo $DISPLAY
rman@test-pd:~$

Hat Tip- gce discussion group on google groups  https://groups.google.com/forum/#!forum/gce-discussion  and Jeff Silverman from the GCE team.

Interviews and Reviews: More R #rstats

I got interviewed on moving on from Excel to R in Human Resources (HR) here at http://www.hrtecheurope.com/blog/?p=5345

“There is a lot of data out there and it’s stored in different formats. Spreadsheets have their uses but they’re limited in what they can do. The spreadsheet is bad when getting over 5000 or 10000 rows – it slows down. It’s just not designed for that. It was designed for much higher levels of interaction.

In the business world we really don’t need to know every row of data, we need to summarise it, we need to visualise it and put it into a powerpoint to show to colleagues or clients.”

And a more recent interview with my fellow IIML mate, and editor at Analytics India Magazine

http://analyticsindiamag.com/interview-ajay-ohri-author-r-for-business-analytics/

AIM: Which R packages do you use the most and which ones are your favorites?

AO: I use R Commander and Rattle a lot, and I use the dependent packages. I use car for regression, and forecast for time series, and many packages for specific graphs. I have not mastered ggplot though but I do use it sometimes. Overall I am waiting for Hadley Wickham to come up with an updated book to his ecosystem of packages as they are very formidable, completely comprehensive and easy to use in my opinion, so much I can get by the occasional copy and paste code.

 

A surprising review at R- Bloggers.com /Intelligent Trading

http://intelligenttradingtech.blogspot.in/2012/10/book-review-r-for-business-analytics.html

The good news is that many of the large companies do not view R as a threat, but as a beneficial tool to assist their own software capabilities.

After assisting and helping R users navigate through the dense forest of various GUI interface choices (in order to get R up and running), Mr. Ohri continues to handhold users through step by step approaches (with detailed screen captures) to run R from various simple to more advanced platforms (e.g. CLOUD, EC2) in order to gather, explore, and process data, with detailed illustrations on how to use R’s powerful graphing capabilities on the back-end.

Do you want to write a review too? You can visit the site here

http://www.springer.com/statistics/book/978-1-4614-4342-1

 

JSS launches special edition for GUI for #Rstats

I love GUIs (graphical user interfaces)- they might be TCL/TK based or GTK based or even QT based. As a researcher they help me with faster coding, as a consultant they help with faster transition of projects from startup to handover stage  and as an R  instructor helps me get people to learn R faster.

I wish Python had some GUIs though 😉

 

from the open access journal of statistical software-

JSS Special Volume 49: Graphical User Interfaces for R

Graphical User Interfaces for R
Pedro M. Valero-Mora, Ruben Ledesma
Vol. 49, Issue 1, Jun 2012
Submitted 2012-06-03, Accepted 2012-06-03
Integrated Degradation Models in R Using iDEMO
Ya-Shan Cheng, Chien-Yu Peng
Vol. 49, Issue 2, Jun 2012
Submitted 2010-12-31, Accepted 2011-06-29
Glotaran: A Java-Based Graphical User Interface for the R Package TIMP
Joris J. Snellenburg, Sergey Laptenok, Ralf Seger, Katharine M. Mullen, Ivo H. M. van Stokkum
Vol. 49, Issue 3, Jun 2012
Submitted 2011-01-20, Accepted 2011-09-16
A Graphical User Interface for R in a Rich Client Platform for Ecological Modeling
Marcel Austenfeld, Wolfram Beyschlag
Vol. 49, Issue 4, Jun 2012
Submitted 2011-01-05, Accepted 2012-02-20
Closing the Gap between Methodologists and End-Users: R as a Computational Back-End
Byron C. Wallace, Issa J. Dahabreh, Thomas A. Trikalinos, Joseph Lau, Paul Trow, Christopher H. Schmid
Vol. 49, Issue 5, Jun 2012
Submitted 2010-11-01, Accepted 2012-12-20
tourrGui: A gWidgets GUI for the Tour to Explore High-Dimensional Data Using Low-Dimensional Projections
Bei Huang, Dianne Cook, Hadley Wickham
Vol. 49, Issue 6, Jun 2012
Submitted 2011-01-20, Accepted 2012-04-16
The RcmdrPlugin.survival Package: Extending the R Commander Interface to Survival Analysis
John Fox, Marilia S. Carvalho
Vol. 49, Issue 7, Jun 2012
Submitted 2010-12-26, Accepted 2011-12-28
Deducer: A Data Analysis GUI for R
Ian Fellows
Vol. 49, Issue 8, Jun 2012
Submitted 2011-02-28, Accepted 2011-09-08
RKWard: A Comprehensive Graphical User Interface and Integrated Development Environment for Statistical Analysis with R
Stefan Rödiger, Thomas Friedrichsmeier, Prasenjit Kapat, Meik Michalke
Vol. 49, Issue 9, Jun 2012
Submitted 2010-12-28, Accepted 2011-05-06
gWidgetsWWW: Creating Interactive Web Pages within R
John Verzani
Vol. 49, Issue 10, Jun 2012
Submitted 2010-12-17, Accepted 2011-05-11
Oscars and Interfaces
Antony Unwin
Vol. 49, Issue 11, Jun 2012
Submitted 2010-12-08, Accepted 2011-07-15

Interview Rapid-I -Ingo Mierswa and Simon Fischer

Here is an interview with Dr Ingo Mierswa , CEO of Rapid -I and Dr Simon Fischer, Head R&D. Rapid-I makes the very popular software Rapid Miner – perhaps one of the earliest leading open source software in business analytics and business intelligence. It is quite easy to use, deploy and with it’s extensions and innovations (including compatibility with R )has continued to grow tremendously through the years.

In an extensive interview Ingo and Simon talk about algorithms marketplace, extensions , big data analytics, hadoop, mobile computing and use of the graphical user interface in analytics.

Special Thanks to Nadja from Rapid I communication team for helping coordinate this interview.( Statuary Blogging Disclosure- Rapid I is a marketing partner with Decisionstats as per the terms in https://decisionstats.com/privacy-3/)

Ajay- Describe your background in science. What are the key lessons that you have learnt while as scientific researcher and what advice would you give to new students today.

Ingo: My time as researcher really was a great experience which has influenced me a lot. I have worked at the AI lab of Prof. Dr. Katharina Morik, one of the persons who brought machine learning and data mining to Europe. Katharina always believed in what we are doing, encouraged us and gave us the space for trying out new things. Funnily enough, I never managed to use my own scientific results in any real-life project so far but I consider this as a quite common gap between science and the “real world”. At Rapid-I, however, we are still heavily connected to the scientific world and try to combine the best of both worlds: solving existing problems with leading-edge technologies.

Simon: In fact, during my academic career I have not worked in the field of data mining at all. I worked on a field some of my colleagues would probably even consider boring, and that is theoretical computer science. To be precise, my research was in the intersection of game theory and network theory. During that time, I have learnt a lot of exciting things, none of which had any business use. Still, I consider that a very valuable experience. When we at Rapid-I hire people coming to us right after graduating, I don’t care whether they know the latest technology with a fancy three-letter acronym – that will be forgotten more quickly than it came. What matters is the way you approach new problems and challenges. And that is also my recommendation to new students: work on whatever you like, as long as you are passionate about it and it brings you forward.

Ajay-  How is the Rapid Miner Extensions marketplace moving along. Do you think there is a scope for people to say create algorithms in a platform like R , and then offer that algorithm as an app for sale just like iTunes or Android apps.

 Simon: Well, of course it is not going to be exactly like iTunes or Android apps are, because of the more business-orientated character. But in fact there is a scope for that, yes. We have talked to several developers, e.g., at our user conference RCOMM, and several people would be interested in such an opportunity. Companies using data mining software need supported software packages, not just something they downloaded from some anonymous server, and that is only possible through a platform like the new Marketplace. Besides that, the marketplace will not only host commercial extensions. It is also meant to be a platform for all the developers that want to publish their extensions to a broader community and make them accessible in a comfortable way. Of course they could just place them on their personal Web pages, but who would find them there? From the Marketplace, they are installable with a single click.

Ingo: What I like most about the new Rapid-I Marketplace is the fact that people can now get something back for their efforts. Developing a new algorithm is a lot of work, in some cases even more that developing a nice app for your mobile phone. It is completely accepted that people buy apps from a store for a couple of Dollars and I foresee the same for sharing and selling algorithms instead of apps. Right now, people can already share algorithms and extensions for free, one of the next versions will also support selling of those contributions. Let’s see what’s happening next, maybe we will add the option to sell complete RapidMiner workflows or even some data pools…

Ajay- What are the recent features in Rapid Miner that support cloud computing, mobile computing and tablets. How do you think the landscape for Big Data (over 1 Tb ) is changing and how is Rapid Miner adapting to it.

Simon: These are areas we are very active in. For instance, we have an In-Database-Mining Extension that allows the user to run their modelling algorithms directly inside the database, without ever loading the data into memory. Using analytic databases like Vectorwise or Infobright, this technology can really boost performance. Our data mining server, RapidAnalytics, already offers functionality to send analysis processes into the cloud. In addition to that, we are currently preparing a research project dealing with data mining in the cloud. A second project is targeted towards the other aspect you mention: the use of mobile devices. This is certainly a growing market, of course not for designing and running analyses, but for inspecting reports and results. But even that is tricky: When you have a large screen you can display fancy and comprehensive interactive dashboards with drill downs and the like. On a mobile device, that does not work, so you must bring your reports and visualizations very much to the point. And this is precisely what data mining can do – and what is hard to do for classical BI.

Ingo: Then there is Radoop, which you may have heard of. It uses the Apache Hadoop framework for large-scale distributed computing to execute RapidMiner processes in the cloud. Radoop has been presented at this year’s RCOMM and people are really excited about the combination of RapidMiner with Hadoop and the scalability this brings.

 Ajay- Describe the Rapid Miner analytics certification program and what steps are you taking to partner with academic universities.

Ingo: The Rapid-I Certification Program was created to recognize professional users of RapidMiner or RapidAnalytics. The idea is that certified users have demonstrated a deep understanding of the data analysis software solutions provided by Rapid-I and how they are used in data analysis projects. Taking part in the Rapid-I Certification Program offers a lot of benefits for IT professionals as well as for employers: professionals can demonstrate their skills and employers can make sure that they hire qualified professionals. We started our certification program only about 6 months ago and until now about 100 professionals have been certified so far.

Simon: During our annual user conference, the RCOMM, we have plenty of opportunities to talk to people from academia. We’re also present at other conferences, e.g. at ECML/PKDD, and we are sponsoring data mining challenges and grants. We maintain strong ties with several universities all over Europe and the world, which is something that I would not want to miss. We are also cooperating with institutes like the ITB in Dublin during their training programmes, e.g. by giving lectures, etc. Also, we are leading or participating in several national or EU-funded research projects, so we are still close to academia. And we offer an academic discount on all our products 🙂

Ajay- Describe the global efforts in making Rapid Miner a truly international software including spread of developers, clients and employees.

Simon: Our clients already are very international. We have a partner network in America, Asia, and Australia, and, while I am responding to these questions, we have a training course in the US. Developers working on the core of RapidMiner and RapidAnalytics, however, are likely to stay in Germany for the foreseeable future. We need specialists for that, and it would be pointless to spread the development team over the globe. That is also owed to the agile philosophy that we are following.

Ingo: Simon is right, Rapid-I already is acting on an international level. Rapid-I now has more than 300 customers from 39 countries in the world which is a great result for a young company like ours. We are of course very strong in Germany and also the rest of Europe, but also concentrate on more countries by means of our very successful partner network. Rapid-I continues to build this partner network and to recruit dynamic and knowledgeable partners and in the future. However, extending and acting globally is definitely part of our strategic roadmap.

Biography

Dr. Ingo Mierswa is working as Chief Executive Officer (CEO) of Rapid-I. He has several years of experience in project management, human resources management, consulting, and leadership including eight years of coordinating and leading the multi-national RapidMiner developer team with about 30 developers and contributors world-wide. He wrote his Phd titled “Non-Convex and Multi-Objective Optimization for Numerical Feature Engineering and Data Mining” at the University of Dortmund under the supervision of Prof. Morik.

Dr. Simon Fischer is heading the research & development at Rapid-I. His interests include game theory and networks, the theory of evolutionary algorithms (e.g. on the Ising model), and theoretical and practical aspects of data mining. He wrote his PhD in Aachen where he worked in the project “Design and Analysis of Self-Regulating Protocols for Spectrum Assignment” within the excellence cluster UMIC. Before, he was working on the vtraffic project within the DFG Programme 1126 “Algorithms for large and complex networks”.

http://rapid-i.com/content/view/181/190/ tells you more on the various types of Rapid Miner licensing for enterprise, individual and developer versions.

(Note from Ajay- to receive an early edition invite to Radoop, click here http://radoop.eu/z1sxe)

 

New book on BigData Analytics and Data mining using #Rstats with a GUI

Joseph Marie Jacquard
Image via Wikipedia

I am hoping to put this on my pre-ordered or Amazon Wish list. The book the common people who wanted to do data mining with , but were unable to ask aloud they didnt know much.  It is written by the seminal Australian authority on data mining Dr Graham Williams whom I interviewed here at https://decisionstats.com/2009/01/13/interview-dr-graham-williams/

Data Mining for the masses using an ergonomically designed Graphical User Interface.

Thank you Springer. Thank you Dr Graham Williams

http://www.springer.com/statistics/physical+%26+information+science/book/978-1-4419-9889-7

Data Mining with Rattle and R

Data Mining with Rattle and R

The Art of Excavating Data for Knowledge Discovery

Series: Use R

Williams, Graham

1st Edition., 2011, XX, 409 p. 150 illus. in color.

  • Softcover, ISBN 978-1-4419-9889-7

    Due: August 29, 2011

    54,95 €
  • Encourages the concept of programming with data – more than just pushing data through tools, but learning to live and breathe the data
  • Accessible to many readers and not necessarily just those with strong backgrounds in computer science or statistics
  • Details some of the more popular algorithms for data mining, as well as covering model evaluation and model deployment

Data mining is the art and science of intelligent data analysis. By building knowledge from information, data mining adds considerable value to the ever increasing stores of electronic data that abound today. In performing data mining many decisions need to be made regarding the choice of methodology, the choice of data, the choice of tools, and the choice of algorithms.

Throughout this book the reader is introduced to the basic concepts and some of the more popular algorithms of data mining. With a focus on the hands-on end-to-end process for data mining, Williams guides the reader through various capabilities of the easy to use, free, and open source Rattle Data Mining Software built on the sophisticated R Statistical Software. The focus on doing data mining rather than just reading about data mining is refreshing.

The book covers data understanding, data preparation, data refinement, model building, model evaluation,  and practical deployment. The reader will learn to rapidly deliver a data mining project using software easily installed for free from the Internet. Coupling Rattle with R delivers a very sophisticated data mining environment with all the power, and more, of the many commercial offerings.

Content Level » Research

Keywords » Data mining

Related subjects » Physical & Information Science

Related- https://decisionstats.com/2009/01/13/interview-dr-graham-williams/

The long tail of the internet

On a whim, I took the all time stats of my blog posts (more than 1000 posts) , and tried to plot their distribution.

Basically I copied and pasted all the data in a Google docs spreadsheet. and I created dummy codes (like URL1, URL2…. URL 500)

Next I  downloaded the….

I wasnt in the mood for downloading and uploading stuff so I decided to use GGPLOT using Jeroen’s Application at http://www.stat.ucla.edu/~jeroen/

I used the mirror server that Dataspora provides as I have had latency issues with Jeroen’s website.

I got this error while trying to connect the Dataspora App to my Google spreadsheet

The page you have requested cannot be displayed. Another site was requesting access to your Google Account, but sent a malformed request. Please contact the site that you were trying to use when you received this message to inform them of the error. A detailed error message follows:

The site “http://dataspora.com” has not been registered.

Oh dear! Back to Jeroen’s /UCLA’s page.

http://rweb.stat.ucla.edu/ggplot2/

I get this warning but it still manages to log in

This website has not registered with Google to establish a secure connection for authorization requests. We recommend that you continue the process only if you trust the following destination:

http://rweb.stat.ucla.edu/R/googleLogin?domain=rweb.stat.ucla.edu

wow it works! thats cloud computing now so I wonder why Google and Amazon continue to ignore the rApache, and Jeroen’s cloud app . Surely their Google Fusion Tables can be always improved or tweaked. Not to mention the next gen version of R which will have its own server

Pretty cool screenshot (but click to see more)

I get the following pretty graph. Hadley Wickham would be ashamed of me by now.

What went wrong- well one page has 36000 views . Scale is the key to graphical coherence . So I redo- delete home page in Google spreadsheet ,reimport replot. ( I didnt know how to modify data in the cloud app, maybe we need a cloud PlyR) I redo it again as I have a big outlier-The top 10 Statistical GUI article which ironically has only 5 GUIs in that article but hush dont tell to high quality search engine)

So again Belatedly I discover something called layer in ggplot.

Base Graphics engine has really spoilt me to write short functions for plots.

I give up. I rather prefer hist() I go to my favorite GUI Rattle, but it has some dating issues with the dll of GTK+

So I go to John Fox’s simple GUI. R Commander- is the best GUI if you use Occam’s Razor, and I am using Occam’s Chainsaw now.

I get the analysis I want in 12 secs


Summary- GGPLot is more complicated than base graphics engine.

Deducer GUI is not as simple too

R Commander is the best GUI because it retains simplicity

Ignore long tail of internet only at your peril

Almost 2/3 rds of my daily traffic of 400+ comes from old archived content That is why Search Engine Optimization and Alerts for Keywords are CRITICAL for any poor soul trying to write on a blog (which has no journal like prestige nor rewards)

If you make life easier for the search engine, it being a fair chap, rewards you well

Existing web traffic estimates like Comscore and Google Trends ignore this long tail

Comments are welcome (Data is pasted below of 500 rows X 2 columns if you can come up with a better analysis)

Since SAS has ignored web analytics and Google Analytics is hmm hmm,  this could be an area of opportunity for R developers as well to create a web analytics package.

Title
Views
Home page 36,185
Top 10 Graphical User Interfaces in Statistical Software More stats 8,264
Matlab-Mathematica-R and GPU Computing More stats 2,166
Wealth = function (numeracy, memory recall) More stats 2,162
The Top Statistical Softwares (GUI) More stats 2,118
About DecisionStats More stats 1,902
Libre Office More stats 1,770
Using Facebook Analytics (Updated) More stats 1,446
Windows Azure vs Amazon EC2 (and Google Storage) More stats 1,386
Interview Hadley Wickham R Project Data Visualization Guru More stats 1,204
Test drive a Chrome notebook. More stats 1,201
Interview Professor John Fox Creator R Commander More stats 1,190
Top ten RRReasons R is bad for you ? More stats 1,178
SAS Institute files first lawsuit against WPS- Episode 1 More stats 1,131
R Package Creating More stats 1,104
Interfaces to R More stats 1,039
Using Red R- R with a Visual Interface More stats 950
Google Maps – Jet Ski across Pacific Ocean More stats 922
Norman Nie: R GUI and More More stats 851
Not so AWkward after all: R GUI RKWard More stats 805
Running R on Amazon EC2 More stats 786
Startups for Geeks More stats 785
Creating a Blog Aggregator for free More stats 749
Cloud Computing with R More stats 676
Rapid Miner- R Extension More stats 671
Parallel Programming using R in Windows More stats 664
Revolution R for Linux More stats 645
Red R 1.8- Pretty GUI More stats 638
John Sall sets JMP 9 free to tango with R More stats 601
Wordle.net More stats 597
Funny Images from India More stats 571
R is an epic fail or is it just overhyped More stats 568
Great article on Notepad++ and R in R Journal More stats 564
Certifications in Analytics and Business Intelligence More stats 548
R Excel :Updated More stats 542
Enterprise Linux rises rapidly:New Report More stats 537
So which software is the best analytical software? Sigh- It depends More stats 520
Funny Photo :It happens only In India More stats 518
Creating 3D Graphs with Data in R More stats 507
SPSS /PASW Certification – Free until Sept 15 More stats 497
Interview :Dr Graham Williams More stats 476
GNU PSPP- The Open Source SPSS More stats 474
Professors and Patches: For a Betterrrr R More stats 467
Running R on Amazon EC2 :Windows More stats 462
WPS response to SAS Lawsuit More stats 458
R language on the GPU More stats 450
KXEN and a Data Mining Survey More stats 449
News on R Commercial Development -Rattle- R Data Mining Tool More stats 449
WPS ( Alternative SAS Language Software) Pricing More stats 447
Kill R? Wait a sec More stats 445
SAS Institute lawsuit against WPS Episode 2 The Clone Wars More stats 442
How to be a BAD blogger? More stats 435
ROC Curve More stats 431
Bulls ,Bears ,Tigers and Asses More stats 424
Trrrouble in land of R…and Open Source Suggestions More stats 422
Interview- BI Dashboards dMINE Sanjay Patel More stats 417
Top Seven Reasons :Why Outsourcing is Bad for India More stats 408
Interviews @Decisionstats More stats 407
Running a R GUI,and parallel programming on Amazon EC2 More stats 394
Unbreakable Oracle Linux- and Unshakable-Libre Office- More stats 393
IBM SPSS 19: Marketing Analytics and RFM More stats 387
Analyzing SAS Institute-WPS Lawsuit More stats 377
Hive Tutorial: Cloud Computing More stats 377
R and Hadoop More stats 374
Graphics Presentations More stats 373
Sector/ Sphere – Faster than Hadoop/Mapreduce at Terasort More stats 370
Benchmarking GNU R: DirkE’s view and a Ninja wishlist More stats 363
Webfocus RStat: Pervasive BI using R More stats 363
Open Source Business Intelligence: Pentaho and Jaspersoft More stats 362
How to do Logistic Regression More stats 362
CommeRcial R- Integration in software More stats 359
So what’s new in R 2.12.0 More stats 357
Interview Michael J. A. Berry Data Miners, Inc More stats 356
Data Mining through the Android More stats 352
Newer version of Alternative SAS / WPS 2.4 launched More stats 350
How to Analyze Wikileaks Data – R SPARQL More stats 348
JMP 9 releasing on Oct 12 More stats 343
The R Online WikiBook More stats 340
Hadley’s tutorials on R Visualization More stats 340
Interview Tasso Argyros CTO Aster Data Systems More stats 339
Parsing XML files easily More stats 337
A Software Called Rattle More stats 335
Which software do we buy? -It depends More stats 329
Jim Goodnight on Open Source- and why he is right -sigh More stats 328
SAS/Blades/Servers/ GPU Benchmarks More stats 326
R Commander Plugins-20 and growing! More stats 324
10 iPhone Apps you can actually use ( and dont have to pay for) More stats 316
R Modeling with huge data More stats 315
The Popularity of Data Analysis Software More stats 315
Interview Donald Farmer Microsoft More stats 307
Learning SAS for free More stats 305
Comparing Base SAS and SPSS More stats 304
Towards better Statistical Interfaces More stats 302
Making NeW R More stats 301
Using Code Snippets in Revolution R More stats 300
R Apache – The next frontier of R Computing More stats 298
Using JMP 9 and R together More stats 297
Doing Time Series using a R GUI More stats 295
Amazon announces Micro Instances for cloud computing More stats 295
Top 5 Free Music Websites More stats 295
Web R- Elastic R and RevoDeploy R More stats 291
R for Stats : Updated More stats 290
Heritage Health Prize- Data Mining Contest for 3mill USD More stats 289
Google AppInventor -Android and Business Intelligence More stats 281
Top R Interviews More stats 278
An Introduction to Data Mining-online book More stats 272
Interview Jim Davis SAS Institute More stats 272
Economic: Indian Caste System -Simplification More stats 271
Rattle Re-Introduced More stats 271
KXEN – Automated Regression Modeling More stats 267
Movie Review- Inglorious Basterds More stats 267
Interview :Doug Savage ,Creator SavageChickens.com More stats 261
IPSUR – A Free R Textbook More stats 258
SAS with the GUI Enterprise Guide (Updated) More stats 256
Trying out Google Prediction API from R More stats 256
Segmenting Models : When and Why More stats 253
Using R and Excel Together More stats 253
R Oracle Data Mining More stats 253
KNIME More stats 253
Using PostgreSQL and MySQL databases in R 2.12 for Windows More stats 250
Fighting Back -The Net, Social Media, Spam, Identity Theft, Terrorism More stats 249
Libre Office (Beta) 3 Launched More stats 248
India to make own DoS -citing cyber security More stats 247
Interview Dominic Pouzin Data Applied More stats 242
R releases new version R 2.9.2 More stats 240
SAS to launch SAS/IML with R ( updated) More stats 239
Playing with Playwith- R Package for Interactive Data Visualizations More stats 234
Predictive Analytics World Conference More stats 231
Analytics and BI for small biz More stats 231
Interview Jeanne Harris Co-Author -Analytics at Work and Competing with Analytics More stats 230
Using R for Time Series in SAS More stats 228
General Electric ‘s breach of the spirit and letter of integrity More stats 227
Interview Luis Torgo Author Data Mining with R More stats 222
Browser Based Model Creation More stats 222
Interview James Dixon Pentaho More stats 221
Thoughts on WPS, SAS , R More stats 220
Choosing R for business – What to consider? More stats 220
Buying SAS Institute More stats 219
Google: Prediction API and other cool stuff More stats 218
Interview : R For Stata Users More stats 216
Viva Libre Office More stats 216
Top 10 Games on Linux -sudo update More stats 214
When China overtook India- using DEDUCER More stats 214
KDD 2009 : Demos More stats 211
Interview Dean Abbott Abbott Analytics More stats 210
Statistically Speaking More stats 203
Data Visualization using Tableau More stats 203
SAS and JMP : Visual Data Discovery More stats 203
High Performance Computing and R More stats 200
Troubleshooting Rattle Installation- Data Mining R GUI More stats 194
Google Realtime Live Updates on Egypt Yemen Tunisia Jordan.. More stats 192
New Deal in Statistical Training More stats 191
Interview Ken O Connor Business Intelligence Consultant More stats 190
Karmic Koala versus Windows 7 More stats 189
Interview Shawn Kung Sr Director Aster Data More stats 189
Pun on Putin More stats 189
Towards better analytical software More stats 188
Dryad- Microsoft’s answer to MR More stats 188
Analyzing Indian – Chinese Relationships More stats 188
LibreOffice News and Google Musings More stats 186
Special Issue of JSS on R GUIs More stats 184
Using Google Docs for Web Scraping More stats 181
Using Reshape2 for transposing datasets in R More stats 180
IBM Buys Netezza More stats 180
Libreoffice 3.3 released More stats 180
Google moving on from MapReduce: rest of world still catching up More stats 179
Linux= Who did what and how much? More stats 176
Interview Carole Jesse Experienced Analytics Professional More stats 176
HIRE ME More stats 175
Test Drive a Google Chrome Notebook: Last Two Days left More stats 174
Q&A with David Smith, Revolution Analytics. More stats 174
R , Ubuntu, RCmdr Updates More stats 173
Interview KNIME Fabian Dill More stats 173
Big Data and R: New Product Release by Revolution Analytics More stats 173
Automated Content Aggregation More stats 173
R or SAS —– R and SAS ? More stats 170
Graphs More stats 169
How to use Oracle for Data Mining More stats 169
Carolina and SAS More stats 166
Interview John Sall Founder JMP/SAS Institute More stats 165
Aster Data hires Quentin Gallivan as CEO More stats 165
Oracle for possible takeover of REvolution Computing More stats 164
The Best and Worst Graphs Ever More stats 163
Statistical Analysis with R- by John M Quick More stats 163
Growing Rapidly: Rapid Miner 4.5 More stats 161
SAP and BI on Demand More stats 161
Google Snappy More stats 161
Google Refine More stats 161
Scoring SAS and SPSS Models in the cloud More stats 159
Hey Professor, I am not a Monkey More stats 157
REVolution Computing fails to create a Revolution More stats 156
SAS Lawsuit against WPS- Application Dismissed More stats 156
KDNuggets Poll on SAS: Churn in Analytics Users More stats 154
SAS Early Days More stats 154
Interview James Taylor Decision Management Expert (Updated) More stats 151
Google Books Ngram Viewer More stats 148
Review – R for SAS and SPSS Users More stats 148
New R Journal Edition More stats 146
Here comes PySpread- 85,899,345 rows and 14,316,555 columns More stats 145
Interview Karl Rexer -Rexer Analytics More stats 144
Poem: The Extroverted Engineer More stats 144
Hearst DataMining Challenge More stats 144
This Is It More stats 142
Interview Timo Elliott SAP More stats 141
The Blind Side – Movie Review More stats 141
Data Mining Survey Results :Tools and Offshoring More stats 140
Going Deap : Algols in Python More stats 140
ADVERTISE More stats 139
Interview Jeff Bass, Bass Institute (Part 2) More stats 139
Interview Jim Harris Data Quality Expert OCDQ Blog More stats 139
Do Monkeys Pay for Sex? More stats 138
Privacy Browsing Extensions in Google Chrome More stats 137
China biggest threat to Indian Software in 5 years: Indian Tech CEO More stats 136
Software HIStory: Bass Institute Part 1 More stats 135
Grenier’s Theory for Competitiveness More stats 134
Interview Charlie Berger Oracle Data Mining More stats 134
Karmic Koala Ubuntu/Linux 9.2 Preview More stats 133
Analytics and Journals More stats 133
Using Code Editors in R More stats 132
Interview Stephanie McReynolds Director Product Marketing, AsterData More stats 132
Amcharts- Cool Charts Web Editor More stats 130
Mapreduce Book More stats 128
Interesting R competition at Reddit More stats 127
Color of Statistics More stats 127
Amazon goes free for users next month More stats 127
#3443 (deleted) More stats 127
Interview Sarah Blow – Girly Geekdom Founder More stats 126
Social Network Analysis: Using R More stats 126
Interview Thomas C. Redman Author Data Driven More stats 126
Audio Interview Anne Milley , Part 1 More stats 124
Advanced Analytics on Multi-Terabyte Datasets- Conferences More stats 123
Geek Humour More stats 123
John M. Chambers Statistical Software Award – 2011 More stats 122
My friend -The Computer More stats 120
M2009 Interview Peter Pawlowski AsterData More stats 118
R Journal Dec 2010 and R for Business Analytics More stats 118
Top ten RRReasons R is bad for you ? More stats 116
Interview Michael Zeller,CEO Zementis on PMML More stats 115
Fast R Graphics More stats 114
New Google Ad Planner More stats 114
Making Sense: Hadoop and MapReduce More stats 114
Using SAS/IML with R More stats 114
Facebook App by SAP Crystal Reports More stats 113
Whats behind that pretty SAS Blog? More stats 113
Interview Alison Bolen SAS.com More stats 113
Ajay @ arts More stats 112
My latest creation More stats 112
Indian Crabs – A story More stats 112
Open Source’s worst enemy is itself not Microsoft/SAS/SAP/Oracle More stats 112
Google Cloud Print -print documents from the internet More stats 111
WPS and SAS- A rah-rah comparison More stats 110
Facebook Gmail Killer Threatens to commit Hara Kari live on AOL Techcrunch if unsucessful More stats 110
Open Source and Software Strategy More stats 109
Windows Azure and Amazon Free offer More stats 108
R for Analytics is now live More stats 108
Open Source Compiler for SAS language/ GNU -DAP More stats 107
Using Chromium /Chrome on Ubuntu Linux More stats 107
Interview John Moore CTO, Swimfish More stats 106
Nice BI Tutorials More stats 106
Creating Customized Packages in SAS Software More stats 106
Business Analytics Analyst Relations /Ethics/White Papers More stats 105
Web Crawling Automation More stats 105
The SAS-WPS Lawsuit- Preliminary Hearing More stats 105
Handling time and date in R More stats 105
KXEN Update More stats 104
MapReduce Analytics Apps- AsterData’s Developer Express Plugin More stats 104
+ 1 your website -updated More stats 103
Movie Review- Peepli Live More stats 103
Better Data Visualization in WordPress.com Stats More stats 102
Customizing your R software startup More stats 102
LibreOffice Beta 2 (Office Fork off Oracle) launches! More stats 102
KXEN Case Studies : Financial Sector More stats 102
Deleting Twitter, Facebook,LinkedIn- Accepting Life More stats 102
Google Street View shows Gladiators fighting More stats 101
Carole-Ann’s 2011 Predictions for Decision Management More stats 101
Amazon goes HPC and GPU: Dirk E to revise his R HPC book More stats 101
Happy Thanksgiving Id More stats 101
Interview Phil Rack WPS Consultant and Developer More stats 100
SPSS launches two more PASWs More stats 99
Interview David Smith REvolution Computing More stats 99
Data Mining with R More stats 97
Dataset too big for R ? More stats 97
How Jesus saved my Butt More stats 97
Interview Evan Levy Baseline Consulting More stats 97
The Latest GUI for R- BioR More stats 96
WPS Version 2.5.1 Released – can still run SAS language/data and R More stats 96
SAS legal falls flat against WPS again: Technical Grounds More stats 95
World Programming System:300 pounds for The power of SAS language More stats 94
KNIME and Zementis shake hands More stats 93
Interview Eric Siegel, Phd President Prediction Impact More stats 93
Interview Sarah Burnett BI Analyst,Ovum group More stats 92
Quantifying Analytics ROI More stats 92
PSPP – SPSS ‘s Open Source Counterpart More stats 91
PySpread Magic More stats 91
Interview SPSS Olivier Jouve More stats 91
Interesting Data Visualization:Friendwheels More stats 91
R on Windows HPC Server More stats 90
The declining market for Telecommunication Churn Models More stats 90
Getting Inside R More stats 90
The Big Data Summit Agenda More stats 90
Review: Clash of the Titans More stats 89
Red Hat worth 7.8 Billion now More stats 89
Movie Review : Rajneeti (Politics) More stats 89
3 Idiots: Insight to Indian Engineer Campus Life More stats 89
The Comic Water Games (aka Common Wealth Games) More stats 88
Computer Education grants from Google More stats 88
Challenges of Analyzing a dataset (with R) More stats 87
Input Data in R using the top 3 R GUI More stats 86
Complex Event Processing- SASE Language More stats 85
Interview with Anne Milley, SAS II More stats 85
Data Mining Presentation at M2009 by Dr Vincent Granville More stats 85
Brief Interview Timo Elliott More stats 85
Mapping Health Statistics at CDC.gov More stats 85
Amazon’s Turks Mturk.com More stats 84
Business Intelligence and Stat Computing: The White Man’s Last Stand More stats 84
Movie Review- Dabangg More stats 84
Movie Review: Sherlock Holmes More stats 84
SAS Data Mining 2009 Las Vegas More stats 83
Chinese Fortune Cookies More stats 83
SPSS and R More stats 83
Manjunath- A Batchmate on my mine More stats 82
Data Mining 2010:SAS Conference in Vegas More stats 81
DirkE and JD swoon about Shane’s MOM in Room 106 while writing R code More stats 81
SAS to R Challenge: Unique benchmarking More stats 81
S A S GOOD LIFE UNDER SIEGE – NYT More stats 81
Pentaho and R: working together More stats 81
Interview John F Moore CEO The Lab More stats 80
Ways to use both Windows and Linux together More stats 80
Brief Interview with James G Kobielus More stats 80
For R Writers- Inside R More stats 79
Using Ipod and Iphone with your Ubuntu Laptop More stats 79
Webcasts: Oracle Data Mining More stats 79
The Cloud OS is finally here or is it?: Karmic Koala More stats 79
Movie Review: Lafangey Parinday (Rouge Birds) More stats 79
SAS announcement in education initiatives More stats 78
Using R from within Python More stats 78
Event: Predictive analytics with R, PMML and ADAPA More stats 78
Interesting R and BI Web Event More stats 78
Bruno Aziza, Microsoft Global BI Lead joins PAW Keynote More stats 77
Common Analytical Tasks More stats 77
RWui :Creating R Web Interfaces on the go More stats 77
R Successor Language ‘Tea’ announced More stats 76
Learning SPSS for SAS users More stats 76
Protovis a graphical toolkit for visualization More stats 76
Interview Paul van Eikeren Inference for R More stats 75
Data Visualization: Central Banks More stats 75
Oracle Data Mining 11 G R2 More stats 75
Interview Peter J Thomas -Award Winning BI Expert More stats 75
Weak Security in Internet Databases for Statisticians More stats 74
Open Source Cartoon More stats 74
Top Ten Graphs for Business Analytics -Pie Charts (1/10) More stats 74
SAS Sentiment Analysis wins Award More stats 74
JMP Genomics 5 released More stats 74
Short Interview Jill Dyche More stats 73
Interview David Katz ,Dataspora /David Katz Consulting More stats 73
PMML 4.0 More stats 73
Ponder This: IBM Research More stats 72
PAW Videos More stats 71
PASW 13 :The preview More stats 71
Cisco SocialMiner More stats 70
Review-The Dark knight More stats 70
MapReduce Patent Granted More stats 70
Cloud Computing and GPU ( and some stats softwares) More stats 70
IBM Business Analytics Forum More stats 70
And now- The Business Analytics Summit More stats 70
Creating an Anonymous Bot More stats 69
R and SAS in Twitter Land More stats 69
Interview:Richard Schultz , CEO REvolution Computing More stats 69
China -United States -The Third Opium War More stats 68
Quick-R and Statmethods.net More stats 68
R Node- and other Web Interfaces to R More stats 68
Life Mojo – A Health Startup More stats 68
Using Views in R and comparing functions across multiple packages More stats 68
Another R Tutorial More stats 67
Interview Karen Lopez Data Modeling Expert More stats 67
QGIS and R More stats 66
Christmas Carol: The Best Software (BI-Stats-Analytics) More stats 66
Software Lawsuits :Ergo More stats 66
STEM is cool More stats 65
Date Night More stats 65
More Advanced SAS Modeling Procs More stats 65
The Big Data Event- Why am I here? More stats 65
Interview Gary Cokins SAS Institute More stats 65
Browser based Music Creation More stats 64
Interview Steve Sarsfield Author The Data Governance Imperative More stats 63
GrapheR More stats 63
Google Web Intelligence (Beta) More stats 61
Data Mining 2009 Interviews- Terry Whitlock, BlueCross BlueShield of TN More stats 60
Audio Interviews -Dr. Colleen McCue National Security Expert More stats 60
Red R- A new beginning More stats 59
YouTube Features: Audio Swap, Mobile posts and Themes More stats 59
R for Predictive Modeling:Workshop More stats 59
KDD2009: Papers Research and Industrial More stats 58
Chapman/Hall announces new series on R More stats 58
Data Visualization and Politics More stats 58
T Shirts Design More stats 58
Jump to JMP: Using Data Analysis in a visual manner More stats 58
Aster Analytics and MapReduce.org More stats 57
OK Cupid Data Visualization- Flow Chart to your Heart More stats 57
R for SAS and SPSS Users More stats 57
Carbon Footprints in the snow More stats 57
Summer School on Uncertainty Quantification More stats 57
High Performance Computing within R: Tutorial More stats 57
Running Stats Softwares on Clouds More stats 57
Amazing Data Visualization- UN Counter Terrorism More stats 56
Cloud MapReduce More stats 56
Statistical Features in WPS More stats 56
An R Package only for SAS Users More stats 56
R is Ready for Business™ More stats 55
A Google App for Sales- ERPLY More stats 55
Rexer Analytics Annual Data Miner Survey More stats 55
Cartoons on R More stats 55
American Decline- Why outsourcing doesnt make sense More stats 55
Friday Cartoon Series- New More stats 55
What softwares do you plan to use/learn in the next one year? More stats 54
Great App for Online Sketching More stats 54
September Roundup by Revolution More stats 54
Using Firesheep on Campus, Caltrain and beyond More stats 54
Decisionstats Interview at Big Data Summit, AsterData More stats 53
Learning Hadoop More stats 53
The White Man’s Burden-Poem More stats 53
Curt Monash on Analytics with MapReduce More stats 53
To R or Not to R : Data Mining and CRM for Free More stats 52
Algorithms and Ads: No Free Lunches and Hill Climbing More stats 52
Interview: Roger Haddad, Founder of KXEN Automated Modeling Software More stats 52
Google and Me on Privacy and Openness More stats 52
MapReduce.org More stats 52
Why do bloggers blog ? More stats 52
Live Streaming for Free : UStream More stats 51
Light Cycle of Tron review More stats 51
Lyx Releases 2 More stats 51
Interview – Anne Milley, SAS Part 1 More stats 51
SAS News More stats 51
KXEN EMEA User Conference 2010-Success in Business Analytics More stats 51
2011 Forecast-ying More stats 51
Kill Analytics More stats 50
Social Media Analysis Toolkit More stats 50
Multi State Models More stats 50
R and Cloud Computing More stats 50
Dataists shake up R community with a rocking contest More stats 50
Interview Anne Milley JMP More stats 49
Movie Review: Between the Folds More stats 49
Jokes in Economics More stats 49
Interview Ajay Ohri Decisionstats.com with DMR More stats 49
One more Y Tube Video More stats 49
Happy Diwali /Google Music More stats 48
SPSS Directions : Rexer Survey Results More stats 48
Redlining in Internet Access and notes on Regression Models More stats 48
Poem : A Poets Life More stats 48
Predictive Analytics World More stats 48
Interview- Phil Rack More stats 48
Building KXEN Models on Ubuntu More stats 48
New Year Resolution Presentation More stats 48
Adobe gulps Omniture More stats 47
SAS Modeling Procs More stats 47
Oracle Open World/ RODM package More stats 47
KDNuggets Survey on R More stats 47
IBM and Revolution team to create new in-database R More stats 47
SAS Institute invests in R project More stats 46
Not just a Cloud More stats 46
New Version of R released: R 2.10.1 More stats 46
Review- Iron Man2 More stats 46
Online Analytics: Monte Carlo Simulation More stats 45
Predictive Forecasting in Commercial Applications More stats 45
The Race -by D.H Groberg More stats 45
SAS Scoring Accelerators More stats 45
IBM launches Smart Analytics Cloud More stats 45
Reactions to IBM -SPSS takeover. More stats 45
Zementis partners with R Analytics Vendor- Revo More stats 44
A Missing Mandelbrot Who Dun It More stats 44
Downloading your Facebook Photos More stats 44
Android Tutorial More stats 44
The Mommy Track More stats 44
My First You Tube Video: Courtesy the competiton on VOLNIGHT by Univ of Tennessee More stats 44
Born in the USA? More stats 43
Interview Eric A. King President The Modeling Agency More stats 43
Interview Augusto Albeghi (Straycat) —Founder Straysoft More stats 43
Why Cloud? More stats 43
Innovative ways of Calculus: Gifting a comic set for Christmas More stats 43
To find the best chaat or paan shop More stats 43
Google unleashes Fusion Tables More stats 42
Using SAS and C/C++ together More stats 42
Whats new in the latest version of R More stats 42
Bollywood 101 More stats 42
Who will forecast for the forecasters? More stats 42
Learning R Easily :Two GUI’s More stats 41
Harvard DropOut Writes Open Letter- His Startup has 350m users More stats 41
BI Software More stats 41
How to read blogs in Indonesian and Chinese! More stats 41
Window to a Blue Cloud: Azure Pricing More stats 41
China bans Chinese Food for Googleplex More stats 41
SAS Program for Students More stats 41
The Year 2010 More stats 40
What do you want to know in data analytics? More stats 40
America’s Data Book: Census Abstract 2011 More stats 40
Big Data Management and Advanced Analytics More stats 40
AsterData partners with Tableau More stats 40
Using R from other Software More stats 40
SAS on Fraud More stats 40

Protected: Whats behind that pretty SAS Blog?

This content is password protected. To view it please enter your password below:

%d bloggers like this: