SAS Program for Students

Here is a good program for students who learn or are learning SAS software products to take part in a big, fully paid up conference, the SAS Global Users Conference next year. The last date for submitting papers is October 26, so there is plenty of time if you know someone who would be better off going there.

Citation-
http://support.sas.com/learn/ap/student/amb.html

Select students will be named SAS Student Ambassadors and earn the opportunity to present their research at the 2010 SAS Global Forum in Seattle, Washington on April 11-14.

What are benefits of being a SAS Student Ambassador?
Have your travel expenses and registration fees paid to attend SAS Global Forum 2010.
Be featured and prominently recognized before an international audience by presenting at SAS Global Forum 2010.
Interact with a global audience of SAS users from every industry and sector.
Gain the unique title of SAS Student Ambassador – a true resume or CV differentiator!

Am I eligible for the SAS Student Ambassador Program?
Graduate and undergraduate students are eligible to apply to this program. Recent graduates also may be eligible to apply. The submitted project must have been conducted by the student within 12 months of the submission deadline.

Understanding Map/Reduce

if you think Map/Reduce was another buzz word that some boys cooked up while in their cheerless lab in Stanford, you need to take another deeper, hard look at the technology that will promise to overthrow current industry dynamics in the Big Data category.

Here is a white paper by Aster Data alumni- Aster has the same pedigree as Google and is on it’s way to make other Big Data companies into Kilo Data and Mega Data.

One of the best things of this paper is it actually helps answer the question – which all other databases are there and how does M/R compare with them.

Citation-

http://www.asterdata.com/resources/downloads/whitepapers/sqlmr.pdf

Copyright-

VLDB ’09 2009, Lyon, France. Copyright 2009 VLDB Endowment, ACM 0000000000000/00/00.

Sqlmr

View more documents from ajayohri.

A special thanks to Tasso CTO, Aster Data for pointing this paper to me. SQL/ MR turned 1 year old on August 25 this year.

Movie Review- Inglorious Basterds

When a Knoxville born director creates a movie with Brad Pitt speaking in an East Tennessee accent, Orange country citizens cant help but catch the movie. If you need escapist fare to charge them ole grey cells with good ol fashioned American movie- this is the one.

I could wax eloquent on the direction and deft artistry of Quentin Tarantino, or the acting of Hans Landau- or even the breakthrough perfromance of Brad Pitt who plays the leader of a Nazi hunting commando unit. For the first time- you see Brad’s character overshadowing his adorable mannerisms. Watch how he rolls those Southern R’s.

But there is plenty of that- and you need to watch it yourself.

As the director himself puts it- A Basterd’s work is never done.

Inglourious-Bastards-1819

This is just a loong weekend post..

If you liked the concept of data mining and entertainment- well I actually got inspired by the entertaining writing of the prominent R blogger J D Long of Cerebral Mastication

http://www.cerebralmastication.com/

Interview Merv Adrian IT Market Strategy

An interview with renowned technology analyst Merv Adrian is as follows. Mr Adrian has spent three decades in IT industry and has also served as SVP at Forrester Research, and now is founder of IT Market Strategy. Merv talks on his views on technology and how he sees the next big tech trends coming.

untitled

Ajay- Describe your career in science and technology. What do you think is the best thing that science careers offer to people.

Merv- I wouldn’t characterize myself as having worked in science – even computer science implies a direction quite different from my own. I began as a statistician, and after getting the opportunity to learn some computer skills, spent a number of years coding a variety of decision support and data integration programs. I joined the software industry as a technical journal editor, and held a variety of marketing, strategy and analyst relations positions before beoming an industry analyst.

Ajay-  With your background in finance, how do you think the next generation of financial reporting systems should be built for early warning signals of crisis. Due to think predictive analytics can play a bigger role than just traditional reporting and metric aggregation.

Merv- We’re already seeing greater specialization in financial applications that provide context: an understanding of the industry the firm is in, and its special requirements for industry standards, compliance, etc. This added specific depth and breadth, combined with increasing sophistication in pre-built models for predictive analytics and access to hitherto unavailable volumes of historical data, will extend the reach of applications used by financial professionals. Guided analysis and advanced visualization will make more sophisticated tools available and understandable, promoting more awareness of impending problems and recommending courses of action.

Ajay- What tips would you like to give to aspiring analysts or science journalists and bloggers. What are the top 5 things do’s /don’ts that you refer to while writing a report or analysis.

Merv-

* Never stop learning, and never assume you know enough. Be humble enough to acknowledge that the person you talk to may have something to teach you – and ask about anything you hear that you don’t understand.

* Remember your audience. If you don’t know who you’re writing for, how can you decide what matters to them?

* Explain why what you’re saying matters, and to whom. Don’t assume your readers know.

* Be clear about what is new or changed. If it’s business as usual, there should be a good reason to write about it – maybe change was expected but is slow in coming.

* Acknowledge your sources and collaborators. Be generous with credit.

Ajay- Recently some BI Analyst firms saw the departure of star analysts to found their own firm. How do you think tech companies can manage the retention of talented people especially those who become a bigger brand than they were originally supposed to.

Merv- I’m assuming you want the response to refer to analyst firms. “Bigger than they were originally supposed to” is not an accurate way to describe the sitution.

Analyst firms don’t put a ceiling on their employees’ brand building; quite the contrary. They train them, give them a platform, and sustain them as they do so. Nonetheless, the firm’s brand, not the individual analyst’s brand, is what matters to them.

In general, the big ones don’t care about retaining people. They believe they can easily replace them, and history shows that they recover well from such departures.

Ajay- What incentives apart from the usual financial ones can help build a culture of intrapreneurship in which employees help build startups within the parent firm.

Merv- You can’t leave finances out of it. The business model that has been shown to work for intrapreneurs is partnership, where the partners share significantly in successful practices they build and deliver.

In consulting firms, if you build a practice, you benefit from its success. Analyst firms are increasingly making consulting part of the analyst job description, but the big ones have not made any moves to institute a partner-style model. So to your point, those who build their own brand successfully are likely to leave.

They become entrepreneurs, not intrapreneurs.

Ajay- What are your views on the next 12 months in terms of technology and BI industry dynamics. What is your wishlist- the top three things that you wish happen in the field of technology.

Merv-

* We’re entering a period of great ferment in data management as a set of upstarts has had early success with specialized analytic database platforms. As spending rebounds, most will see their momentum continue. Several have new funding, strong management teams, and early successes to build on.

* The rest of BI will see similar expansion. BI is a perennial growth market and it’s not about to slow down – predictive analytics, advanced visualization, more spohisticated and widespread use of text analytics, and the movement to SaaS models will play a role.

* True analytic applications, as described above in a financial context, will also continue their momentum. You’ll see them in other places, from manufacturing to retail, as micro-verticalization and the proliferation of templated best business practice models roll out form the largest players.

Those are both predictions and a wish list – they move us closer to delivering on the promise of having computing power help us improve business results. There are many more changes coming in packaging, licensing, the emergence of dramatically more powerful hardware platforms – but that’s business as usual. The aging server population will need replacement, and the rebound will be substantial, hastening the generational shift. And a skills shortage will re-accelerate the growth of offshoring. The next decade will be transformational in many ways.

Ajay- How does Merv Adrian balance his work and home life? How important is work life balance in this profession and do you think younger analysts sometimes dont pay attention to it.

Merv- As a manager, I always did my best to remind analysts working for me to leave time for the things that define us a humans – family, friends and faith. What we leave behind will be there, not in our reports, no matter how good they are. We all find ourselves consumed by our work, and social media exacerbates the situation unless we build in ways to be human too.

Youth will always be in a hurry, but the natural maturation process usually works out fine. Where I see balance begin to reassert itself is usually in the family – when your kids arrive, you must stop and remember to be there for them. You mustn’t delegate that one – nobody has ever looked back at their life and said “I wish I had spent less time with my kids.”

Thanks for asking, Ajay. My final thought is that all this guidance is aspirational; I struggle for balance every day. Sometimes I do pretty well at it,and often I fall short. I try to keep my values firmly in mind and strive to live up to them, as we all do.

Analyst and consultant Merv Adrian founded IT Market Strategy after three decades in the IT industry. During his tenure as Senior Vice President at Forrester Research, he was responsible for all of Forrester’s technology research, and covered the software industry. Earlier, as Vice President at Giga Information Group, Merv focused on facilitating collaborative research and covered data management and middleware. Prior to becoming an analyst, Merv was Senior Director, Strategic Marketing at Sybase, where he also held director positions in data warehouse marketing and analyst relations

For more on Merv’s views you can go here

Merv’s B-Eye Network channel at http://www.b-eye-network.com/channels/5097/
Blog: http://mervadrian.wordpress.com
Twitter: merv

Here comes the Cloud- VMWare

Cloud

From http://vcloudexpress.terremark.com

From the fascinating umbrella group at http://twitter.com/cloudcomp_group

If you are a Desktop software expert who thinks the Cloud is nothing to worry, remember what happened to Punching Tapes, Mainframes, Vaccuum tubes.

With the current pricing, even smaller software package creators can offer Statistics as a Service ( STaS).

Great move by VMware !

The world of Predictive Analytics: It's back

PAWS 2009 is back with a slam dunk line up of sponsors, and keynote speakers.

The deadline for early bord registration ends on Sept 4.

What’s holding you back?

https://www.eiseverywhere.com/ereg/index.php?eventid=5215&PHPSESSID=36nq7po4hjoasvkcsv0tm5ppj3&

Pricing
Predictive Analytics World Fall 2009

Includes breakfasts, lunches, priceless networking during coffee breaks, the PAW Reception, and full access to program sessions and sponsor expositions.

Early Bird Price
(July 1 – Sept 4)
Regular     Price

Two Day Pass
(Oct 20-21)

$1390 $1590

Predictive Modeling Methods Workshop
(Oct 22)

$795 $895

Putting Predictive Analytics to Work
(Oct 19)

$795 $895

Hands-On Predictive Analytics
(Oct 19)

$795 $895

paws

Disclaimer- I have no monetary transactions with PAW conference but as a blog partner get access to interviews , book review or content.

Interview Charlie Berger Oracle Data Mining

Here is an interview with Charlie Berger, Oracle Data Mining Product Management. Oracle is a company much respected for its ability to handle and manage data, and with it’s recent acquisition of Sun- has now considerable software and financial muscle to take the world of data mining to the next generation.

Ajay- Describe your career in data mining so far from college, jobs, assignments and projects. How would you convince high school students to take up science careers?

Charlie- In my family, we were all encouraged to pursue science and technical fields. My Dad was a Mechanical Engineer and all my siblings are in scientific and medical fields. Early on, I had narrowed my career choices to engineering or medicine; the question when I left for college was which kind. My Freshman Engineering exposed students to 6 weeks of the curriculum for each of the engineering disciplines. I found myself drawn to the field of Operations Research and Industrial Engineering. I liked the applied math and problem solving aspects. While not everyone has an aptitude or an interest in Math or the Sciences, if you do, it can be a fascinating field.

Ajay- Please tell us some technical stuff about Oracle Data Mining and Oracle Data Miner products. How do they compare with other products notably from SAS and SPSS? What is unique in Oracle’s suite of data mining products- and some market share numbers to back these please?

Charlie- Oracle doesn’t share product level revenue numbers. I can say that Oracle is changing the analytics industry. Ten years ago, when Oracle acquired the assets of Thinking Machines, we shared a vision that over time, as the volumes of data expand, at some point, you reach a point where you have to ask whether it makes more sense to “move the data to the algorithms” or to “move the algorithms to the data”. Obviously, you can see the direction that Oracle pursued. Now after 10 years of investing in in-database analytics, we have 50+ statistical techniques and 12 machine learning algorithms running natively inside the kernel of the Oracle Database. Essentially, we have transformed the database to become an analytical database. Today, you now see the traditional statistical software vendors announcing partnering initiatives for in-database processing or in the case of IBM, acquiring SPSS. Oracle pioneered the concept of using a relational database to not only store data, but to analyze it too. Moving forward, I think that we are close to the tipping point where in-database analytics are accepted as the winning IT architecture.

This trend towards moving the analytics to where the data are stored makes a lot of sense for many reasons. First, you don’t have to move the data. You don’t have to have copies of the data in external analytical sandboxes where it open to security risks and over time, becomes more aged and irrelevant.

I know of one major e-tailor who constantly experiments by randomly showing web visitors either offers “A” or a new experimental offer “B”. They would export massive amounts of data to SAS afterwards to perform simple statistical analyses. First, they would calculate the median purchase amounts for the duration of the experiment for customers who were shown both offers. Then, they would perform a t-test hypothesis test to determine whether a statistically valid monetary advantage could be gained. If offer “B” were outperforming offer “A”, the e-tailor would Continue reading “Interview Charlie Berger Oracle Data Mining”