While R packagers have a lot to be proud of in the graphics packages of R, the truth of the matter is that the lack of GUI even for Graphical Analysis hinders the ease of usage in adopting R’s powerful graphics for statistical analysis. As a contrast , SAS and JMP have been combined together in the SAS Visual Data Discovery Environment
I really liked the GUI of JMP ( which is very rich in stats testing) and with the powerful data handling capabilities on the desktop of SAS, this is clearly an outstanding effort to create terrific graphics ( see below)
Note the combination of the two- Great Graphics WITH a GUI. in R the GUI that comes closest to matching JMP is R Commander, but it’s graphical capabilities are kept basic as it is not meant for replacement of the beloved Kommand prompt
( maybe an expanded plugin for graphics or hexabin would help)
It would be interesting to see an on demand Ec2 cloud hosted version of visual data discovery by SAS (with JMP as the front end) even for a limited pilot of six months and targeted at the SMB segment. Or a Salesforce.com application that integrates Salesforce.com data with the tests and standard procedures in SAS and JMP.
Note of Discontent- The JMP Website is terrible. It has a different font from the SAS Website ( they could atleast use the same CSS ) and overall is the worst part of the otherwise excellently elegant JMP. Hope they upgrade their website soon ( they havent done it this year atleast).
An interview with Carole Jesse, an experienced Analytics professional in SAS, JMP , analytics and Risk Management.
Ajay- Describe your career in science from school to now.
Carole- Truthfully, my career in science started in 7th grade. Hey, I know this is further back in time than you intended the question to go! However, something significant happened that year that pretty much set me on the path that I am still on today. I discovered Algebra. Up to that point in time, I was an average student in ‘arithmetic’. Algebra introduced LETTERS into the mix with numbers, in the simplest of ways that we have all seen: ‘Solve for x in the equation x+2=5’. That was something I could get behind, AND I excelled at it immediately. Without mathematical excellence, efforts in learning science can fall apart. Mathematics is everywhere!
I spent the rest of my secondary education consuming all the math and science that I could get. By the time I entered college I had already been exposed to pre-calculus and physics and was actually surprised by those in my college Freshman courses who had not seen anti-derivatives, memorized the quotient rule, or worked an inclined plane friction problem before.
My goal as an undergraduate was to become a Veterinarian. The beauty of a pre-Vet curriculum is that it is pretty much like pre-Med, rigorous and broad in the sciences. In my first two years of undergraduate work, I was exposed to more Chemistry, more Mathematics, more Physics, along with things like Genetics, Biology, even the Plant and Animal Sciences. Although I did not stick with my pursuit of Veterinary Medicine, it laid a solid foundation that has served me very well in the strangest of places.
I consider myself a Mathematician/Statistician due to my academic degrees in those areas, first a BS in Mathematics/Physics at the University of Wisconsin followed by a MS in Statistics at Montana State University. In between the BS and MS I also dabbled briefly in Electrical Engineering at the University of Minnesota.
Since academia, it is my breadth in ALL sciences which has allowed me to be very fluid in straddling diverse industries: from High Volume Manufacturing of Consumer Products, to Nuclear Energy, to Semiconductor Manufacturing/Packaging, to Financial Services, to Health Care. I succeed at business problem solving in these industries by applying my Statistical Methods knowledge, coupled with business acumen and peripheral understanding of the technologies used. I have worked closely with scientists and engineers, and could enter THEIR world speaking THEIR language, which was an aid in getting to these solutions quickly.
I can not place enough emphasis on the importance of exposure to a broad range of sciences, and as early as possible, for anyone who wants to be involved in Advanced Analytics and Business Intelligence. As a manager, I look closely at candidates for these diverse sorts of backgrounds.
Ajay- I find the number of computer scientists and analysts to be overwhelmingly male despite this being a lucrative profession. Do you think that BI and Analytics are male dominated? How can the trend be re-shaped?
Carole- Welcome to my world! All kidding aside, yes that has been my observation as well. While I am not versed in the specifics of actual gender statistics in Computer Science and Advanced Analytics versus other fields, based on my years in and around these fields, there does appear to be a bias.
This is not due to a lack of capability or interest in these fields on the part of women. I believe it is more due to the long history of cultural norms and negative social messages that perhaps push woman away from these fields. The messages can be subtle, but if you pay close attention, you will see them. Being one of 10 females in an undergraduate engineering class of 150 students has a message right there. Even though these 10 women were able to make entry to the class, the pressure of being a minority, whether gender based or otherwise, can be a powerful influencer in remaining there.
In my own experience, I have encountered frequent judgments where I was made to feel “good at math” was an unacceptable trait for a woman to have. It is important to note that these judgments have been delivered equally by men AND women. So I think until both genders develop higher expectations of women in the hard science areas, the trends will continue. It has been decades since my 7th grade introduction to algebra, but it appears the negative social messages regarding girls in math and science are still present today. Otherwise there would be no need (i.e. no market) for books like Danica McKellar’s “Math Doesn’t Suck,” and the follow-up “Kiss My Math,” both aimed at battling these negative messages at the middle school level.
As to how I have battled these cultural expectations, I developed a thick skin. I have also learned to expect excellence from myself even when a teacher, or a peer, or a boss may have had lower expectations for me than for a male counterpart. Sort of a John Mayer “Who Says” type of attitude. Who says I can’t do Math and Science. Watch me.
Ajay- How would you explain Risk Management using software to a class of graduate students in mathematics and statistics?
Carole- There are many areas of Risk Management. My specific experience has been on the Credit Risk Management and Fraud Risk Management sides in a couple of industries. For credit risk in financial services, typically there is a specific department whose role is to quantify and predict credit risk. Not just for the current portfolio, but for new products as well. Various methodologies are utilized, ranging from summarization of portfolio characteristics that have a known relationship to default to using historical data to build out predictive models for production implementation.
Key skills needed here are good understanding of the business, solid statistical methods knowledge, and computing skills. As far as the computing /software skills needed, there are three main categories 1) query and preparation of data, 2) model building and validation, and 3) model implementation. The actual tools will likely differ across these categories.
For example, 1) might be tackled with SAS®, Business Objects, or straight SQL;
2) requires a true modeling package or coding language like SAS®, SPSS, R, etc; and lastly
3) is the trickiest, as implementation can have many system limitations, but SAS® or C++ are often seen at implementation.
Ajay- Describe some of your most challenging and most exciting projects over the years.
Carole- I have been very fortunate to have many challenges and good projects in every role I have been in, but as I look back today, some things that stand out the most were in ‘high tech’. By virtue of being high tech, there is no fear of technology, and it is fast-paced and ever evolving to the next generation of product.
I spent seven years in the Semiconductor industry during the 90’s at Micron Technology, Intel, and Motorola. At the beginning of that window, we left the “486 processor” world, and during that window we spanned the realm of “Pentium processors.” Moore’s Law dominated all of this. To stay competitive all of these companies embraced statistical methods to help speed up development time.
At one point, I supported a group of about 10 R&D engineers in the Design and Analysis of their process improvement and simplification experiments. This afforded me exposure to much of the leading edge research the team was working on.
I recall one project with the goal of optimizing capacitance via surface roughness of the capacitor structures. In addition to all the science involved at the manufacturing step, what made this so interesting was the difficulty in measuring capacitance at the point in the process where film roughness was introduced. All we had were surface images after this step. The semiconductor wafers had to pass through several more process steps to get to the point where capacitance could actually be measured. All of this provided challenges around the design of the experiment and the data handling and analysis.
By working closely with both the process engineer and the process technician I was able to gather the image files off the image tool that were taken from the experimental runs. I used SAS® (yes, another shameless plug for my favorite software) to process the images using Fast Fourier Transforms. Subsequently, the transformed data was correlated to the capacitance in the analysis of the experimental results. Finding the “sweet spot” for capacitance, as driven by surface roughness, provided a huge leap for this process technology team.
The challenges of today are much different than they were in the 90s. In the more recent years, I have been working with transactional data related to financial services or health care claims. The challenges manifest themselves in the sheer volume of the data. In the last decade in particular most industries have been able to put the infrastructures in place to gather and store massive amounts of data related to their businesses. The challenge of turning this data into meaningful actionable information has been equally exciting as using Fast Fourier Transforms on image processing to optimize capacitance!
Currently I am working with an Oracle database where one table in the schema has 250 million records and a couple hundred fields. I refer to this as a “Pushing Tera” situation, since this one table is close to a Terabyte in size. As far as storing the data, that is not a big deal, but working with data this large or larger is the challenge.
Different skill sets are needed here beyond those of just an analyst, data miner, or statistician. These VLDB situations have morphed me into a bit of an IT person.
- How do you efficiently query such large databases? An inefficient SQL query will not be a bother in a situation where the database is small. But when the database is large, SQL efficiency is key. Many skills needed for industry are not necessarily taught in academia, but rather get picked up along the way, like Unix and SQL. I now write efficient SQL code, but many poorly written jobs gave their lives so that I could learn these efficiencies!
- Eventually I will need to organize this data into an application specific format and put data security controls around the process. Again, is this Advanced Analytics? Not really, it is more of an MIS role. The newness in these challenges keeps me excited about my work.
Ajay- How important do you think work life balance is for people in this profession? What do you do to chill out?
Carole- I don’t think the work-life balance is any more or less important to the decision science professionals than it is to any other profession really. I have friends in many other professions like Law, Nursing, Financial Planning, etc. with the same work-life balance struggles.
We live in a busy culture that includes more and more demands placed on us professionally. Let’s face it, most of us are care-takers to someone besides ourselves. It might be a spouse, or a child, or a dog, or even an elderly parent. Therefore, a total focus on work is bound to upset the work-life balance for most of us.
My biggest struggle comes in the form of balancing the two sides of my brain. That may sound weird, but one thing you have to agree with is that all of this is pretty “Left Brained”: mathematics, statistics, business intelligence, computing, etc.
To balance this out, and tap into my Right Brain, I like to dabble in the arts to some extent. Don’t get me wrong, I am not an artist! But that doesn’t mean I can’t draw on creativity in the artistic sense. For example, this past summer I took a course on Adobe Photoshop and Illustrator at Minneapolis College of Art and Design. This provided the best of both worlds, combining software and art! In addition to learning how to remove Cindy Crawford’s mole (yes, we did this), there were some very useful projects. One of my course projects was creating my customized Twitter background. An endeavor like this provides me a ‘chilling out’ factor from the normal work world. I know of many other Left Brain leaners that do similar things, like playing a musical instrument, or painting, etc. This is another reason why I took up digital photography: more visual arts.
Volunteer work has a balancing effect too. I try to give back to the community when I can. Swinging a hammer at Habitat for Humanity, or doing record keeping for an Animal Rescue organization, are things I have participated in.
And if none of this works, I enjoy cooking for my family and friends, and plying them with wine!
Ajay- What are you views on:
Carole- Data Quality
I’d have to say I am for data Quality! Who isn’t? But the reality is that data is dirty. That “Pushing Tera” Oracle table I mentioned earlier, well it turns out it has some issues. And it is incumbent upon me to determine the quality of that data before attempting to do anything analytical with it. One place in industry where value enhancement are needed: database administrators with business knowledge. It seems that more times than not, even if there was a business savvy DBA they may have moved on, leaving the consumers of that data (that would be me) to fend for themselves. There is some debate over which philosopher said “Know thyself.” Today’s job challenge is to “Know thy data” or perhaps “Value those that know thy data.”
B) Predictive Analytics for Fraud Monitoring
There is a huge market for analytics in fraud detection and prevention. But it is not for the faint of heart. Insiders, at least in Mortgage and Health Care, are the typical perpetrators of lucrative fraud. These insiders know how the industry processes work and they exploit this. As soon as one loophole is discovered and patched, fraudsters are looking for another loophole to exploit. This makes the task of predictive analytics different for Fraud than other areas where underlying patterns are probably more stable. Any methodology used here must have “turn on a dime” features built in, if possible. With economic conditions as they are, fraud detection/monitoring will remain important and challenging field.
Carole Jesse has been applying statistical methods and advanced analytics in a variety of industries for the last 20 years. Her career spans High Volume Manufacturing of Consumer Products, Nuclear Energy, Semiconductor Manufacturing/Packaging, Financial Services, and Health Care. Applications have ranged from Design and Analysis of Experiments to Credit Risk Prediction to Fraud Pattern Recognition. Carole holds a B.S. in Mathematics from the University of Wisconsin and a M.S. in Statistics from Montana State University, as well as several professional certifications. All the opinions expressed here are her own, and not those of her employers: past, present, or future. (Although her dog Angie may have had some influence.) Ms. Jesse currently lives and works in Minneapolis, Minnesota.
Over the past month or so, I have really begun to appreciate the GUI of JMP. It is very clean and intutively designed. And excellent for a SAS Environment .
Best of all you can easily download a 30 day trial and pricing for this software is quite reasonable.
The worst part of JMP- the droll website. In fact on website, I can deduce something of an Ohri’s Law on Websites.
The better the software, the worse off is the website.
Corollary- The worse off the software, the better is the website in terms of glitz.
JMP is definitely worth a trial for 30 days if you
a) Want to learn a new stats software skill fast
2) Unhappy with visual data analysis of current softwares.
Integrating JMP ‘s functionality with a BI reporting tool is a formidable data decisionmaking tool and it works nicely for me in data analysis I do.
Here is an interview with John Sall, inventor of SAS and JMP and co-founder and co-owner of SAS Institute, the largest independent business intelligence and analytics software firm. In a free wheeling and exclusive interview, John talks of the long journey within SAS and his experiences in helping make JMP the data visualization software of choice.
JMP is perfect for anyone who wants to do exploratory data analysis and modeling in a visual and interactive way – John Sall
Ajay- Describe your early science career. How would you encourage today’s generation to take up science and math careers?
John- I was a history major in college, but I graduated into a weak job market. So I went to graduate school and discovered statistics and computer science to be very captivating. Of course, I grew up in the moon-race science generation and was always a science enthusiast.
Ajay- Archimedes leapt out the bath shouting “Eureka” when he discovered his principle. Could you describe a “Eureka” moment while creating the SAS language when you and Jim Goodnight were working on it?
John- I think that the moments of discovery were more like “Oh, we were idiots” as we kept having to rewrite much of the product to handle emerging environments, like CMS, minicomputers, bitmap workstations, personal computers, Windows, client-server, and now the cloud. Several of the rewrites were even changing the language we implemented it in. But making the commitment to evolve led to an amazing sequence of growth that is still going on after 35 years.
Ajay- Describe the origins of JMP. What specific market segments does the latest release of JMP target?
John- JMP emerged from a recognition of two things: size and GUI. SAS’ enterprise footprint was too big a commitment for some potential users, and we needed a product to really take advantage of graphical interactivity. It was a little later that JMP started being dedicated more to the needs of engineering and science users, who are most of our current customers.
Ajay- What other non-SAS Institute software do you admire or have you worked with? Which areas is JMP best suited for? For which areas would you recommend software other than JMP to customers?
John- My favorite software was the Metrowerks CodeWarrior development environment. Sadly, it was abandoned among various Macintosh transitions, and now we are stuck with the open-source GCC and Xcode. It’s free, but it’s not as good.
JMP is perfect for anyone who wants to do exploratory data analysis and modeling in a visual and interactive way. This is something organizations of all kinds want to do. For analytics beyond what JMP can do, I recommend SAS, which has unparalleled breadth, depth and power in its analytic methods.
Ajay- I have yet to hear of a big academic push for JMP distribution in Asia. Are there any plans to distribute JMP for free or at very discounted prices in academic institutions in countries like India, China or even the rest of the USA?
John- We are increasing our investment in supporting academic institutions, but it has not been an area of strength for us. Professors seem to want the package they learned long ago, the language that is free or the spreadsheet program their business students already have. JMP’s customers do tell us that they wish the universities would train their prospective future employees in JMP, but the universities haven’t been hearing them. Fortunately, JMP is easy enough to pick up after you enter the work world. JMP does substantially discount prices for academic users.
Ajay- What are your views on tech offshoring, given the recession in the United States?
John- As you know, our products are mostly made in the USA, but we do have growing R&D operations in Pune and Beijing that have been performing very well. Even when the software is authored in the US, considerable work happens in each country to localize, customize and support our local users, and this will only increase as we become more service-oriented. In this recession, JMP has still been growing steadily.
Ajay- What advice would you give to young graduates in this recession? How does learning JMP enhance their prospect of getting a job?
John- Quantitative fields have been fairly resistant to the recession. North Carolina State University, near the SAS campus, even has a Master of Science in Analytics < http://analytics.ncsu.edu/ > to get people job-ready. JMP experience certainly helps get jobs at our major customers.
Ajay- What does John Sall do in his free time, when not creating world-class companies or groovy statistical discovery software?
John- I lead the JMP division, which has been a fairly small part of a large software company (SAS), but JMP is becoming bigger than the whole company was when JMP was started. In my spare time, I go to meetings and travel with the Nature Conservancy <http://www.nature.org/ >, North Carolina State University <http:// http://ncsu.edu/ >, WWF <http://wwf.org/ >, CARE <http://www.care.org/ > and several other nonprofit organizations that my wife or I work with.
John Sall is a co-founder and Executive Vice President of SAS, the world’s largest privately held software company. He also leads the JMP business division, which creates interactive and highly visual data analysis software for the desktop.
Sall joined Jim Goodnight and two others in 1976 to establish SAS. He designed, developed and documented many of the earliest analytical procedures for Base SAS® software and was the initial author of SAS/ETS® software and SAS/IML®. He also led the R&D effort that produced SAS/OR®, SAS/QC® and Version 6 of Base SAS.
Sall was elected a Fellow of the American Statistical Association in 1998 and has held several positions in the association’s Statistical Computing section. He serves on the board of The Nature Conservancy, reflecting his strong interest in international conservation and environmental issues. He also is a member of the North Carolina State University (NCSU) Board of Trustees. In 1997, Sall and his wife, Ginger, contributed to the founding of Cary Academy, an independent college preparatory day school for students in grades 6 through 12.
Sall received a bachelor’s degree in history from Beloit College in Beloit, WI, and a master’s degree in economics from Northern Illinois University in DeKalb, IL. He studied graduate-level statistics at NCSU, which awarded him an honorary doctorate in 2003.
Originally nicknamed as John’s Macintosh Program, JMP is a leading software program in data visualization for statistical software. Researchers and engineers – whose jobs didn’t revolve solely around statistical analysis – needed an easy-to-use and affordable stats program. A new software product, today known as JMP®, was launched in 1989 to dynamically link statistical analysis with the graphical capabilities of Macintosh computers. Now running on all platforms, JMP continues to play an important role in modeling processes across industries as a desktop data visualization tool. It also provides a visual interface to SAS in an expanding line of solutions that includes SAS Visual BI and SAS Visual Data Discovery. Sall remains the lead architect for JMP.
Ajay- I am thankful to John and his marketing communication specialist Arati for this interview.With an increasing focus on data to drive more rational decision making, SAS remains an interesting company to watch for in the era of mega- vendors and any SAS Institute deal and alliance will be making potential investment bankers as well as newer customers drool. For previous interviews and coverage of SAS please use www.decisionstats.com/tag/sas