professor – Page 2 – DECISION STATS

Google – Turns the Page

Duderstadt Center "The Dude", which ... — Image via Wikipedia

Larry Page
Co-Founder and President, Products

Larry Page was Google’s founding CEO and grew the company to more than 200 employees and profitability before moving into his role as president of products in April 2001. He continues to share responsibility for Google’s day-to-day operations with Eric Schmidt and Sergey Brin.

The son of Michigan State University computer science professor Dr. Carl Victor Page, Larry’s love of computers began at age six. While following in his father’s footsteps in academics, he became an honors graduate from the University of Michigan, where he earned a bachelor’s degree in engineering, with a concentration on computer engineering. During his time in Ann Arbor, Larry built an inkjet printer out of Lego™ bricks.

While in the Ph.D. program in computer science at Stanford University, Larry met Sergey Brin, and together they developed and ran Google, which began operating in 1998. Larry went on leave from Stanford after earning his master’s degree.

In 2002, Larry was named a World Economic Forum Global Leader for Tomorrow. He is a member of the National Advisory Committee (NAC) of the University of Michigan College of Engineering, and together with co-founder Sergey Brin, Larry was honored with the Marconi Prize in 2004. He is a trustee on the board of the X PRIZE, and was elected to the National Academy of Engineering in 2004.

and no coincidence but it reminded me of the Metallica video- Turn the Page. Forgive the Pun, herr Eric

https://www.youtube.com/watch?v=dOibtqWo6z4

Meet Larry Page, Google’s New CEO (huffingtonpost.com)
Larry Page: Google’s king of search (guardian.co.uk)
Sergey Brin: We’ve Touched 1 Percent Of What Social Search Can Be (techcrunch.com)
Everything You Need To Know About Larry Page, Google’s New CEO (GOOG) (businessinsider.com)
Co-founder Larry Page to Become CEO of Google (slog.thestranger.com)
How Larry Page’s Google Will Function (mashable.com)
Google Shakeup: Larry Page Takes Over as CEO (sfist.com)
TechCrunch Interview With Eric Schmidt, Larry Page And Sergey Brin (techcrunch.com)
Google CEO Schmidt to step down, hand reins to Larry Page (techflash.com)

Interview Luis Torgo Author Data Mining with R

Example of k-nearest neighbour classification — Image via Wikipedia

Here is an interview with Prof Luis Torgo, author of the recent best seller “Data Mining with R-learning with case studies”.

Ajay- Describe your career in science. How do you think can more young people be made interested in science.

Luis- My interest in science only started after I’ve finished my degree. I’ve entered a research lab at the University of Porto and started working on Machine Learning, around 1990. Since then I’ve been involved generally in data analysis topics both from a research perspective as well as from a more applied point of view through interactions with industry partners on several projects. I’ve spent most of my career at the Faculty of Economics of the University of Porto, but since 2008 I’m at the department of Computer Science of the Faculty of Sciences of the same university. At the same time I’ve been a researcher at LIAAD / Inesc Porto LA (www.liaad.up.pt).

I like a lot what I do and like science and the “scientific way of thinking”, but I cannot say that I’ve always thought of this area as my “place”. Most of all I like solving challenging problems through data analysis. If that translates into some scientific outcome than I’m more satisfied but that is not my main goal, though I’m kind of “forced” to think about that because of the constraints of an academic career.

That does not mean I’m not passionate about science, I just think there are many more ways of “doing science” than what is reflected in the usual “scientific indicators” that most institutions seem to be more and more obsessed about.

Regards interesting young people in science that is a hard question that I’m not sure I’m qualified to answer. I do tend to think that young people are more sensible to concrete examples of problems they think are interesting and that science helps in solving, as a way of finding a motivation for facing the hard work they will encounter in a scientific career. I do believe in case studies as a nice way to learn and motivate, and thus my book 😉

Ajay- Describe your new book “Data Mining with R, learning with case studies” Why did you choose a case study based approach? who is the target audience? What is your favorite case study from the book

Luis- This book is about learning how to use R for data mining. The book follows a “learn by doing it” approach to data mining instead of the more common theoretical description of the available techniques in this discipline. This is accomplished by presenting a series of illustrative case studies for which all necessary steps, code and data are provided to the reader. Moreover, the book has an associated web page (www.liaad.up.pt/~ltorgo/DataMiningWithR) where all code inside the book is given so that easy copy-paste is possible for the more lazy readers.

The language used in the book is very informal without many theoretical details on the used data mining techniques. For obtaining these theoretical insights there are already many good data mining books some of which are referred in “further readings” sections given throughout the book. The decision of following this writing style had to do with the intended target audience of the book.

In effect, the objective was to write a monograph that could be used as a supplemental book for practical classes on data mining that exist in several courses, but at the same time that could be attractive to professionals working on data mining in non-academic environments, and thus the choice of this more practically oriented approach.

Regards my favorite case study that is a hard question for an author… still I would probably choose the “Predicting Stock Market Returns” case study (Chapter 3). Not only because I like this challenging problem, but mainly because the case study addresses all aspects of knowledge discovery in a real world scenario and not only the construction of predictive models. It tackles data collection, data pre-processing, model construction, transforming predictions into actions using different trading policies, using business-related performance metrics, implementing a trading simulator for “real-world” evaluation, and laying out grounds for constructing an online trading system.

Obviously, for all these steps there are far too many options to be possible to describe/evaluate all of them in a chapter, still I do believe that for the reader it is important to see the overall picture, and read about the relevant questions on this problem and some possible paths that can be followed at these different steps.

In other words: do not expect to become rich with the solution I describe in the chapter !

Ajay- Apart from R, what other data mining software do you use or have used in the past. How would you compare their advantages and disadvantages with R

Luis- I’ve played around with Clementine, Weka, RapidMiner and Knime, but really only playing with teaching goals, and no serious use/evaluation in the context of data mining projects. For the latter I mainly use R or software developed by myself (either in R or other languages). In this context, I do not think it is fair to compare R with these or other tools as I lack serious experience with them. I can however, tell you about what I see as the main pros and cons of R. The main reason for using R is really not only the power of the tool that does not stop surprising me in terms of what already exists and keeps appearing as contributions of an ever growing community, but mainly the ability of rapidly transforming ideas into prototypes. Regards some of its drawbacks I would probably mention the lack of efficiency when compared to other alternatives and the problem of data set sizes being limited by main memory.

I know that there are several efforts around for solving this latter issue not only from the community (e.g. http://cran.at.r-project.org/web/views/HighPerformanceComputing.html), but also from the industry (e.g. Revolution Analytics), but I would prefer that at this stage this would be a standard feature of the language so the the “normal” user need not worry about it. But then this is a community effort and if I’m not happy with the current status instead of complaining I should do something about it!

Ajay- Describe your writing habit- How do you set about writing the book- did you write a fixed amount daily or do you write in bursts etc

Luis- Unfortunately, I write in bursts whenever I find some time for it. This is much more tiring and time consuming as I need to read back material far too often, but I cannot afford dedicating too much consecutive time to a single task. Actually, I frequently tease my PhD students when they “complain” about the lack of time for doing what they have to, that they should learn to appreciate the luxury of having a single task to complete because it will probably be the last time in their professional life!

Ajay- What do you do to relax or unwind when not working?

Luis- For me, the best way to relax from work is by playing sports. When I’m involved in some game I reset my mind and forget about all other things and this is very relaxing for me. A part from sports I enjoy a lot spending time with my family and friends. A good and long dinner with friends over a good bottle of wine can do miracles when I’m too stressed with work! Finally,I do love traveling around with my family.

Luis Torgo

Short Bio: Luis Torgo has a degree in Systems and Informatics Engineering and a PhD in Computer Science. He is an Associate Professor of the Department of Computer Science of the Faculty of Sciences of the University of Porto. He is also a researcher of the Laboratory of Artificial Intelligence and Data Analysis (LIAAD) belonging to INESC Porto LA. Luis Torgo has been an active researcher in Machine Learning and Data Mining for more than 20 years. He has lead several academic and industrial Data Mining research projects. Luis Torgo accompanies the R project almost since its beginning, using it on his research activities. He teaches R at different levels and has given several courses in different countries.

For reading “Data Mining with R” – you can visit this site, also to avail of a 20% discount the publishers have generously given (message below)-

For more information and to place an order, visit us at http://www.crcpress.com. Order online and apply 20% Off discount code 907HM at checkout. CRC is pleased to offer free standard shipping on all online orders!

link to the book page http://www.crcpress.com/product/isbn/9781439810187

Price: $79.95
Cat. #: K10510
ISBN: 9781439810187
ISBN 10: 1439810184
Publication Date: November 09, 2010
Number of Pages: 305
Availability: In Stock
Binding(s): Hardback

Finally! A practical R book on Data Mining: “Data Mining With R, Learning with Case Studies,” by Luis Torgo (r-bloggers.com)
INFORMS Data Mining Competition leaders used Open Source software (r-bloggers.com)
Is Data-Mining Free Speech? The Supreme Court Agrees to Decide a Crucial Case (dailyfinance.com)
Mining of Massive Data Sets (kinlane.com)
Case Study (jonathanlewis.wordpress.com)
Statistical Aspects of Data Mining (kinlane.com)
5 of the Best Free and Open Source Data Mining Software (junauza.com)
US top court to decide state drug data mining law (reuters.com)
Data-mining Google Books: Does the Reader Have To Be Human? (scholarlykitchen.sspnet.org)
Data Mining Competitions | TunedIT (tunedit.org)

Book Reviews- Hindu Myths- Mere Christianity

A statue of Hindu deity Shiva in a temple in B... — Image via Wikipedia

Over the month long break I took, I was helping firm up my ideas for R for Analytics , I also took a break and read some books. Here are brief reviews of two, three of them-

1) Hindu Myths

This is a classical book translated from original Sanskrit written by Professor Wendy O Flaherty of University of Chicago. I found some of the older myths very interesting in terms of contradictions, retelling the same story in a modified way by another classic, the beautiful poetic and fantastic imagery evoked by Hindu myths. Some stories are as relevant in prayers, fasts and religious ceremonies as they were around 11000 years while most have morphed , edited or even distorted.

It should help the non Indian reader understand why hundreds of millions of conservative Indians worship Shiv Ling ( or literally an idol of the Phallus of Shiva), the Hindu two cents of creation of the universe, and the somewhat fantastic stories on super heroes /gods/ in the ancient world.

The book suffers from a few drawbacks in my opinion-

1) Sanskrit is a bit like Latin- you can lose not just the flavor but original meaning of words and situational context. Some of the stories made better sense when i read a more recent Hindi translation.

2) An excessive emphasis on sexual imagery rather than emotional imagery. The author seems wonder struck to read and translate ancient indians were so matter of fact about physical relationships. However the words were always written in discrete poetic than crass soft pornography.

3) Almost no drawings or figures. This makes the book a bit dense to read at 300 pages.

I liked another book on Hindu Myths (Myth= Mithya which I read in 2009) and you can see if you can read it if you find the topic interesting.

A Handbook of Hindu Mythology

Hindus have one God.
They also have 330 million gods: male gods, female gods, personal gods, family gods, household gods, village gods, gods of space and time, gods for specific castes and particular professions, gods who reside in trees, in animals, in minerals, in geometrical patterns and in man-made objects.
Then there are a whole host of demons.
But no Devil.

Mere Christianity by C S Lewis is a classic book on reinterpreting Christianity in modern times. However the author wrote this when World War 2 was on and it seems more like a British or Anglo Saxon interpretation of beliefs of Christ Jesus– who was actually a Jewish teacher born in Middle East Asia.

While the language and reading makes it much easier to read- it is recommended more at Western audiences, than Eastern ones, as it seems some of the parables are a more palatable re interpretation of the New Testament. The Bible is a deceptively easy book to read, the language is short and beautiful-and the original parables in the Gospels remain powerful easy to understand.

C S Lewis tends to emphasize morality than religiosity or faith, and there is not much comparison with any other faith or alternative morality. Dumbing down the Bible so as to market it better to reluctant consumers seems to be Mr Lewis intention and it is not as scholarly a work as an exercise in pure prose.

However it is quite good as a self improvement book and is quite better than the “You Can Win” kind of books or even business concept books.

Note- I find reading books on religion as good exercises in reading the fountain source of philosophies. As a polytheist- I tend to read more than one faith.

The Hindus: An Alternative History by Wendy Doniger – review (guardian.co.uk)
Newsweek Depicts Obama as Hindu Deity (foxnews.com)
“Hindu’s want to take back yoga” and related posts (christianresearchnetwork.com)

The Year 2010

My annual traffic to this blog was almost 99,000 . Add in additional views on networking sites plus the 400 plus RSS readers- so I can say traffic was 1,20,000 for 2010. Nice. Thanks for reading and hope it was worth your time. (this is a long post and will take almost 440 secs to read but the summary is just given)

My intent is either to inform you, give something useful or atleast something interesting.

see below-

	Jan	Feb	Mar	Apr	May	Jun

2010	6,311	4,701	4,922	5,463	6,493	4,271

Jul	Aug	Sep	Oct	Nov	Dec	Total

5,041

5,403

17,913

16,430

11,723

10,096

98,767

Sandro Saita from http://www.dataminingblog.com/ just named me for an award on his blog (but my surname is ohRi , Sandro left me without an R- What would I be without R :)) ).

Aw! I am touched. Google for “Data Mining Blog” and Sandro is the best that it is in data mining writing.

”

DMR People Award 2010
There are a lot of active people in the field of data mining. You can discuss with them on forums. You can read their blogs. You can also meet them in events such as PAW or KDD. Among the people I follow on a regular basis, I have elected:

Ajay Ori

He has been very active in 2010, especially on his blog . Good work Ajay and continue sharing your experience with us!”

What did I write in 2010- stuff.

What did you read on this blog- well thats the top posts list.

2009-12-31 to Today

Title		Views
Home page		21,150
Top 10 Graphical User Interfaces in Statistical Software		6,237
Wealth = function (numeracy, memory recall)		2,014
Matlab-Mathematica-R and GPU Computing		1,946
The Top Statistical Softwares (GUI)		1,405
About DecisionStats		1,352
Using Facebook Analytics (Updated)		1,313
Test drive a Chrome notebook.		1,170
Top ten RRReasons R is bad for you ?		1,157
Libre Office		1,151
Interview Hadley Wickham R Project Data Visualization Guru		1,007
Using Red R- R with a Visual Interface		854
SAS Institute files first lawsuit against WPS- Episode 1		790
Interview Professor John Fox Creator R Commander		764
R Package Creating		754
Windows Azure vs Amazon EC2 (and Google Storage)		726
Norman Nie: R GUI and More		716
Startups for Geeks		682
Google Maps – Jet Ski across Pacific Ocean		670
Not so AWkward after all: R GUI RKWard		579
Red R 1.8- Pretty GUI		570
Parallel Programming using R in Windows		569
R is an epic fail or is it just overhyped		559
Enterprise Linux rises rapidly:New Report		537
Rapid Miner- R Extension		518
Creating a Blog Aggregator for free		504
So which software is the best analytical software? Sigh- It depends		473
Revolution R for Linux		465
John Sall sets JMP 9 free to tango with R		460

So how do people come here –

well I guess I owe Tal G for almost 9000 views ( incidentally I withdrew posting my blog from R- Bloggers and Analyticbridge blogs – due to SEO keyword reasons and some spam I was getting see (below))

http://r-bloggers.com is still the CAT’s whiskers and I read it a lot.

I still dont know who linked my blog to a free sex movie site with 400 views but I have a few suspects.

2009-12-31 to Today

Referrer	Views
r-bloggers.com	9,131
Reddit	3,829
rattle.togaware.com	1,500
Twitter	1,254
Google Reader	1,215
linkedin.com	717
freesexmovie.irwanaf.com	422
analyticbridge.com	341
Google	327
coolavenues.com	322
Facebook	317
kdnuggets.com	298
dataminingblog.com	278
en.wordpress.com	185
google.co.in	151
xianblog.wordpress.com	130
inside-r.org	124
decisionstats.com	119
ifreestores.com	117
bits.blogs.nytimes.com	108

–

Still reading this post- gosh let me sell you some advertising. It is only $100 a month (yes its a recession)

Advertisers are treated on First in -Last out (FILO)

I have been told I am obsessed with SEO , but I dont care much for search engines apart from Google, and yes SEO is an interesting science (they should really re name it GEO or Google Engine Optimization)

Apparently Hadley Wickham and Donald Farmer are big keywords for me so I should be more respectful I guess.

Search Terms for 365 days ending 2010-12-31 (Summarized)

2009-12-31 to Today

Search	Views
libre office	925
facebook analytics	798
test drive a chrome notebook	467
test drive a chrome notebook.	215
r gui	203
data mining	163
wps sas lawsuit	158
wordle.net	133
wps sas	123
google maps jet ski	123
test drive chrome notebook	96
sas wps	89
sas wps lawsuit	85
chrome notebook test drive	83
decision stats	83
best statistics software	74
hadley wickham	72
google maps jetski	72
libreoffice	70
doug savage	65
hive tutorial	58
funny india	56
spss certification	52
donald farmer microsoft	51
best statistical software	49

What about outgoing links? Apparently I need to find a way to ask Google to pay me for the free advertising I gave their chrome notebook launch. But since their search engine and browser is free to me, guess we are even steven.

Clicks for 365 days ending 2010-12-31 (Summarized)

2009-12-31 to Today

URL	Clicks
rattle.togaware.com	378
facebook.com/Decisionstats	355
rapid-i.com/content/view/182/196	319
services.google.com/fb/forms/cr48basic	313
red-r.org	228
decisionstats.wordpress.com/2010/05/07/the-top-statistical-softwares-gui	199
teamwpc.co.uk/products/wps	162
r4stats.com/popularity	148
r-statistics.com/2010/04/r-and-the-google-summer-of-code-2010-accepted-students-and-projects	138
socserv.mcmaster.ca/jfox/Misc/Rcmdr	138
spss.com/certification	116
learnr.wordpress.com	114
dudeofdata.com/decisionstats	108
r-project.org	107
documentfoundation.org/faq	104
goo.gl/maps/UISY	100
inside-r.org/download	96
en.wikibooks.org/wiki/R_Programming	92
nytimes.com/external/readwriteweb/2010/12/07/07readwriteweb-report-google-offering-chrome-notebook-test-11919.html	92
sourceforge.net/apps/mediawiki/rkward/index.php?title=Main_Page	92
analyticdroid.togaware.com	88
yeroon.net/ggplot2	87

so in 2010,

SAS remained top daddy in business analytics,

R made revolutionary strides in terms of new packages,

JMP launched a new version,

SPSS got integrated with Cognos,

Oracle sued Google and did build a great Data Mining GUI,

Libre Office gave you a non Oracle Open office ( or open even more office)

2011 looks like a fun year. Have safe partying .

IBM SPSS 19 Now Available to the Global Academic Community via e-academy’s OnTheHub eStore (prweb.com)
ACM Data Mining Camp 3 (revolutionanalytics.com)
Accessing R from Python using RPy2 (r-bloggers.com)
Mining of Massive Data Sets (kinlane.com)
5 FeedBurner Alternatives You Should Know About (techie-buzz.com)
Uncertainty, Risk, Statistics and Data Mining (zyxo.wordpress.com)
‘Data Mining’ Gains Traction in Education (edreformer.com)
If you cut your RSS short I will ignore your post (chrisabraham.com)
Solar trends for 2011 (cleanbreak.ca)

Top Cartoonists:Updated

Here is a list of cartoonists I follow- I sometimes think they make more sense than all the news media combined.

1) Mike Luckovich He is a Pulitzer Prize winning cartoonist for AJC at http://blogs.ajc.com/mike-luckovich/

I love his political satire-sometimes not his politics- though he is a liberal (surprisingly most people from creative arts tend to be liberal- guess because they support and need welfare more, 🙂 ) Since I am in India- I call myself a conservative (when filing taxes) or liberal (when drinking er tea)

2) Hugh Mcleod- of Gaping Void is very different from Mike above, in the way an abstract painter would be from a classical

artist. I like his satire on internet, technology and personal favorite – social media consultants. Hugh casts a critical eye on the world of tech and is an immensely successful artist- probably the Andy Warhol of this genre in a generation.

3) Doug Savage of Savage Chickens http://www.savagechickens.com/ has a great series of funny cartoons based on chickens drawn on Post it notes. While his drawing is less abstract than Hugh’s above, he sometimes touches an irreverent note more like Hugh than anyone else.

4) Professor Jorge Cham of Phd Comics http://www.phdcomics.com/comics.php is probably the most read comic in grad school – and probably the only cartoonist with a Phd I know of.

5) Scott Adams of Dilbert http://www.dilbert.com/ is probably the first “non kid stuff” cartoonist I started reading-in fact I once wrote to him asking for advice on my poetry to his credit- he replied with a single ” Best of Luck email”

They named our email server in Lucknow, UP, India for him (in my business school at http://iiml.ac.in ) Probably the best of corporate toon humor. Maybe they should make the Dilbert movie yet.

6) Randall Munroe of xkcd.com

XKCD is geek cartooning at its best.

For catching up with the best toons in a week, the best is Time.com ‘s weekly list at http://www.time.com/time/cartoonsoftheweek

It is the best collection of political cartoons.

An Obama Presidency May Be Rough Going for Political Cartoonists [Obama Era] (gawker.com)
Palling Around With Monuments [This Thing Looks Like That Thing] (gawker.com)
the microaudience: the mot likely way to make money on the internet (gapingvoid.com)
Cartoon(ist) of the Week – Joel Pett (underthelobsterscope.wordpress.com)
Sweden suicide bombings: I’m a constant target, says cartoonist – Telegraph.co.uk (news.google.com)
Indy cartoonist elated to find torrents of his work (boingboing.net)
Six Cartoonists Tour Afghanistan w/USO (waronterrornews.typepad.com)
Dilbert & Medicine (ivor-kovic.com)
Nigerian Cartoonist Tayo Fatunla (theworld.org)

Why do bloggers blog ?

Xbox (revision 1.0) internal layout. Including... — Image via Wikipedia

Step 1 is to create internal motivation to create a blog in the first place

Step 2 is to find what to write

Reasons Bloggers Blog-

Basic -Ranting

Examples- I hate Facebook Platform team treats me badly with waits, and breaks my code.

SAS Marketing wont give me a big discount to make me look good in front of my boss.

Companies wont give me their software for free- even though I will use it to make money (and not play X Box)

I want my vendors to be FOSS but my customers to switch to SaaS.

Google wont do this- Apple wont do that- Microsoft wont do those.

Revolution would give me 4 great packages but not the open source for RevoScaler (which only 300 people would understand in the first place)

Safety-

I better kiss the Professor and give a Turkey for dinner, as he sits on my thesis committee.

I will recommend Prof X’s lousy book in the hope he recommends my lousy book as a textbook too.

It is safe to laugh when the boss is making a joke-I should comment on her corporate blog, and retweet her.

Belonging-

I belong to this great online community of smart people. Let me agree to what they say.

I really believe in EVERYTHING that ALL the 2 MILLION members of the community have to say ALL the TIME.

I belong to this online community because all my friends are on my computer.

4 Egositic

My blog page rank is now X plus delta tau because of sugary key words (2004)

My technorati numbers rise (2005)

I was once on Digg (2007)

I have Z * exp N followers on Twitter and even more on Facebook (2008)

My Klout is increasing on twitter, My stack overflow reputation ‘s cup floweth over. (2009)

My Karma on Reddit is more important than my Karma in real life (2010)

Self Actualization-

I got time to kill- and I think I may learn more, meet intersting people and discover something wandering on the internet.

All those who wonder are not lost- Wikiquote

I got a story to tell, poems to write, code to give away. A free Blog is something a Chinese , an Iranian and a North korean really really know what the value is.

But after all that, WHY Do Bloggers Blog?

Because we are still waiting for Facebook to create the Blog Killer.
Its better than saying I am unemployed and a social loner
Reddit Karma feels good. Any Karma of any kind.

View this document on Scribd

Calling BS on Klout and the Concept of Influencers (mizzinformation.com)
Overheard on #Blogchat: The Next Level (@tc_geeks) (blogworld.com)
Why Facebook and Twitter Are Not Replacing Blogging (dannybrown.me)
Report: Blogging Falls to Facebook and Twitter (socialtimes.com)
Facebook and Twitter have become indispensable to bloggers (venturebeat.com)
Whaddya Mean-Blogging is Dead? (janetfouts.com)
(VIDEO) Microblogging vs. Blogging: Is There a Battle? (blogher.com)

Amazon goes HPC and GPU: Dirk E to revise his R HPC book

Amazon just did a cluster Christmas present for us tech geek lizards- before Google could out doogle them with end of the Betas (cough- its on NDA)

Clusters used by Academic Departments now have a great chance to reduce cost without downsizing- but only if the CIO gets the email.

While Professor Goodnight of SAS / North Carolina University is still playing time sharing versus mind sharing games with analytical birdies – his 70 mill server farm set in Feb last is about to get ready

( I heard they got public subsidies for environment- but thats historic for SAS– taking public things private -right Prof as SAS itself began as a publicly funded project. and that was in the 1960s and they didnt even have no lobbyists as well. )

In realted R news, Dirk E has been thinking of a R HPC book without paying attention to Amazon but would now have to include Amazon

(he has been thinking of writing that book for 5 years, but hey he’s got a day job, consulting gigs with revo, photo ops at Google, a blog, packages to maintain without binaries, Dirk E we await thy book with bated holes.

Whos Dirk E – well http://dirk.eddelbuettel.com/ is like the Terminator of R project (in terms of unpronounceable surnames)

Back to the cause du jeure-

From http://aws.amazon.com/ec2/hpc-applications/ but minus corporate buzz words.

Unique to Cluster Compute and Cluster GPU instances is the ability to group them into clusters of instances for use with HPC

applications. This is particularly valuable for those applications that rely on protocols like Message Passing Interface (MPI) for tightly coupled inter-node communication.

Cluster Compute and Cluster GPU instances function just like other Amazon EC2 instances but also offer the following features for optimal performance with HPC applications:

When run as a cluster of instances, they provide low latency, full bisection 10 Gbps bandwidth between instances. Cluster sizes up through and above 128 instances are supported.
Cluster Compute and Cluster GPU instances include the specific processor architecture in their definition to allow developers to tune their applications by compiling applications for that specific processor architecture in order to achieve optimal performance.

The Cluster Compute instance family currently contains a single instance type, the Cluster Compute Quadruple Extra Large with the following specifications:

23 GB of memory
33.5 EC2 Compute Units (2 x Intel Xeon X5570, quad-core “Nehalem” architecture)
1690 GB of instance storage
64-bit platform
I/O Performance: Very High (10 Gigabit Ethernet)
API name: cc1.4xlarge

The Cluster GPU instance family currently contains a single instance type, the Cluster GPU Quadruple Extra Large with the following specifications:

22 GB of memory
33.5 EC2 Compute Units (2 x Intel Xeon X5570, quad-core “Nehalem” architecture)
2 x NVIDIA Tesla “Fermi” M2050 GPUs
1690 GB of instance storage
64-bit platform
I/O Performance: Very High (10 Gigabit Ethernet)
API name: cg1.4xlarge

Amazon Announces Cluster GPU Instance for Cloud Computing (insidehpc.com)
New EC2 Instance Type – The Cluster GPU Instance (aws.typepad.com)
Expanding the Cloud – Adding the Incredible Power of the Amazon EC2 Cluster GPU Instances (allthingsdistributed.com)

Larry Page Co-Founder and President, Products

Related Articles

Please share:

Related Articles

Please share:

Related Articles

Please share:

2009-12-31 to Today

2009-12-31 to Today

Search Terms for 365 days ending 2010-12-31 (Summarized)

2009-12-31 to Today

Clicks for 365 days ending 2010-12-31 (Summarized)

2009-12-31 to Today

Related Articles

Please share:

Related Articles

Please share:

Related Articles

Please share:

Related Articles

Please share:

Larry Page
Co-Founder and President, Products