Interview Sarah Blow – Girly Geekdom Founder

Here is an interview with Sarah Blow, community manager of the famous twitter startup TweetMeMe which is very popular to bloggers and founder of Girly Geek Dinners – a community effort to promote women in areas of technology and sciences.

Sarah tweets under the name Girly Geek while I tweet under the name Dude of Data, so I met her by chance on the Twitter.

Here is the interview-

1) Describe your career in science from high school to your present position.

That could take a while…. High School for me was split into Middle School for 2 years where Science was dull but practical and Secondary School where Science was a lot of fun and I set the table on fire in the chemistry lesson… My Chemistry teacher always reminds me how incendiary I am! and High School was up north for my A levels where I didn’t choose science subjects as I really wasn’t sure about the science teachers there. However at the last school I did an AS in computer science and it was my teacher there that recommended I considered a career in the technology industry. Originally I was considering law. As a young child I wanted to study law and go to Cambridge. As I grew up I guess things changed, I loved playing with my Commadore 64 and was good with databases etc so my natural progression was to Computer Science.

I didn’t study A Level maths so my options were somewhat limited however I got my first choice University placement at Manchester University (UMIST as it was then). Whilst there I won a scholarship to do my Masters of Enterprise in Computer Science and then went onto my first job as a Software Engineer at Cardinal Health. Then I started the Girl Geek Dinners and decided a change was in order in terms of my career as I found I was good at the community aspect of engaging people with technology. So I looked around for a while and then moved to my current position as Community Manager at TweetMeme.

B) What are the challenges and complexities in managing the community for Tweetmeme

TweetMeme has over 150 million buttons across hundreds of thousands of websites around the world crossing language, location, content management systems and server farms. As such it is my role to ensure those buttons are installed and working as the users require. That’s a LOT of users and a LOT of buttons to look after. I also support the developers that help to create the plugins for the different content management platforms and those using our API. The complexities of all this are the different languages, implementations, levels of understanding of code and template editing as well as the conversational language translations. In my case I speak and can understand French, some German, some Spanish and some Italian. However Google Translate is my friend!

I also communicate with the press and news services, put announcements up on our blog site, and create the support documentation found in our help area and on our forums. When users feedback comments and suggestions I also represent them and their views within technical meetings and in the design decision process. So really my role is incredibly varied and covers a real range of things.

2) Why are there so few women in science compared to other fields- even though it is quite a lucrative profession.

I think there are many barriers from when you grow up and what your parents expect you to do as a career, through to career advice at schools through to what options you choose at GCSE and what maths paper you do (higher or lower) as these do have a big impact on what doors you leave open or close. I also believe personal choice and interest areas have a lot to do with what you consider as a potential career option. Many people just don’t consider computing as a career these days as computers are fundamental to all jobs.

When you look at what jobs you considered as a young child did you aspire to be the next Bill Gates or was it more likely a fighter pilot, fireman or something similarly heroic. Many females look to nursing/ doctor roles as their heroic roles or law where they can put baddies behind bars. Many look to vetinary sciences or forensic science too.

What you aren’t told as a child is where there are heroic jobs in the real world that can lead you to do wonderful things and yet still be able to make money and have fun!

3) Describe your work at GirlyGeekdom on promoting women geeks. ( or women in science careers)

This question mentions specifically the GirlyGeekdom site http://girlygeekdom.com which was a blog site that I created a few years ago after starting Girl Geek Dinners where I could create and bring together interesting geeky content to inspire others to use, play with and enjoy. I wanted to create a fun and energetic environment where anyone male or female could feel like they were in a little geeky world. Which is where the name of the site GirlyGeekdom came from. The promotion of women geeks is only part of what we do on the site but it does bring together issues from around the world and hopefully move beyond that to bring sensible conclusions and a route forward. One thing I didn’t want the site to be was a list of complaints and issues with no attempt at finding solutions.

To help encourage more females into the industry we let them know about awards and intiatives that identify great female role models. We interview interesting people from the tech industry when we come across them and place them into our inspire series of video’s. We also have regular competitions supported by industry sponsors to get young people interacting with our site. We have both serious and non-serious content and we have a range of volunteer writers from around the world submitting great inspirational articles.

4) What are some tools you can recommend for getting un interested students interested in science careers.

One of the great recent tools to get young people interested in science based careers is to mix some of the things they already love doing with science. So for example recently I was introduced to the Manga Guide series which is basically a merge of manga stories with scientific based content in a fun non-science based story approach. This sort of thing is great for getting those who haven’t considered science as fun to look at it in a different way but still with the opportunity to learn more about it!

Other tools include advice on how to work your way through the University Clearing process, including all the links to useful sites recommended by the UK govornment etc. If you don’t get your first choices for uni, then why you should consider computer science or similar subjects as a suitable alternative!

5) How important is work life balance for you? What do you do to de stress.

Work life balance is very important to me and I get a LOT of requests on my time regarding both GirlyGeekdom, Girl Geek Dinners, my day job, friends, family and my hobbies. As such I have to tread a very fine balancing act to ensure that I meet expectations in all of those areas. A large part of doing that is actually to set reasonable expectations with each group of people with regard to my time and availability. I’m actually very lucky as my work isn’t too far from home and as such I do get to spend time there.

I work for a start up company called TweetMeme as their Community Manager so I’m on the internet daily looking after their community. I also do a lot of things outside of that. I tend to rest at lunchtime and take the breaks that I need. I don’t tend to work through every break I get as I’ve tried that in the past and that just tires me out. Instead I tend to time box things. So work is generally my standard office hours. I use my phone for emails on the go and tend to keep up with those then and when I’m at home cooking my tea! (Multi tasking works well!) I keep weekends free for friends and family as much as I can and evenings are a combination of GirlyGeekdom, Girl Geek Dinners, social events for work and spending time with family or relaxing.

In terms of what I do to de-stress… I do a range of things. I’m a member of a really nice gym which has some beautiful swimming pools which I love! So you’ll find me in the gym or the pool if it’s been a particularly crazy week. Or alternatively enjoying a good film at home or a good book and some relaxing music. Then at the weekends you’ll find me doing the more fun stuff that takes time to do! So I’m into rock climbing, white water kayaking, kite surfing and diving. In the summer I also get back into my roller blading!

6) Can we expect a Girly Geekdom in United States. What about a book?

In terms of a GirlyGeekdom in the US… well if someone from the US wants to write on the site they are always welcome, they just need to ask. We already have Girl Geek Dinners out there in 9 different locations, so there’s nothing to stop more of them happening. I’d love to do a Girl Geek conference which may well be called GirlyGeekdom but I don’t think that will be 2010… but it could be a 2011 possibility! As for a book! That’s an interesting question. I’ve considered it but right now I don’t have the time to write one, so if I did then it would probably be a combination of blog posts and ideas or the how to guide on GIrl Geek Dinners.

About SARAH-

Contacts who are into the new media space can contact her through Twitter or via LinkedIn. For those who are into the more traditional channels of communication then you can contact Sarah via e-mail. A more detailed perspective is given on her blog here.

Interview Donald Farmer Microsoft

Here is an Interview with Donald Farmer of Microsoft talking about the passion for the exciting business intelligence projects at MS.

Q Describe your career from high school to your current job responsibilities at Microsoft. How can technology companies in America work together to grow the home pool of American science students ( irrespective of market share battles)

A My background is relatively unusual for a technology professional, although at Microsoft one meets people with a very wide range of backgrounds. I had little interest in studying Computer Science formally. For me, software was always a means to an end: a way of solving what were, for me, “more interesting” problems. Of course, I cannot deny that computer science is a compelling subject in itself, just not for me. Yet, from my early teens in Scotland, I had computers to try (starting with the justly famous Sinclair range) and I used them to store, classify and analyze the data I needed for my other work. So, as I studied philosophy and languages, and as I worked in history, archaeology, forestry, fish-farming and so on (through many variations) before I became more completely involved in Business Intelligence, I used database techniques extensively.

I spent some years as a consultant, building all sorts of applications, My first predictive application enabled fish-farmers with private water supplies to balance the needs of fish production and hydro-electric generation based on past, present and predicted rainfall. I believe that application is still in use today, 15 years later!

Later, I joined an excellent group of developers and analysts at AppsMart, building a data mart rapid-development application. That brought me into the Microsoft sphere, as we built on the SQL Server platform and were actively involved in the SQL Server Data Warehouse ecosystem.

With the dot-com bust of 2000, I happily found an opportunity to work with Microsoft. There I started working on Analysis Services, later leading a team of program managers in Integration Services. In that time, we did some really interesting work along with Zhaohui Tang’s team, integrating Data Mining capabilities with our ETL tool, to enable predictive analytics in the flow of data. The implications of this technique are still only being realized: we have used it for imputing missing data, and have an interesting patent on how to use this technique for detecting outliers in streaming data. In addition, we included fuzzy matching techniques from Surajit Chaudhuri’s team, to give even more flexibility.

More recently I have been working in Data Mining, with a marvelous and energetic team under Jamie MacLennan, and then in the last couple of years I have been managing a super team of Program Managers building the client interfaces for our new PowerPivot application.

My current role is not focused on a single product, but rather I look across all the business intelligence products to see how we can engage our engineering knowledge ever more effectively with customers, partners, analysts and, of course, with other teams across Microsoft.

So, as you can see my background is very varied. In some ways, that means that I am not well placed to speak to how the USA can better grow a pool of science students, as I was never one myself. Yet, I do think there are some lessons I can share. Firstly, we should not make the mistake of focusing only on science and technology as an end in itself. We do need to encourage the use of information science techniques in all appropriate fields, including liberal arts, and also “power professions” such as medicine and law. The USA provides wonderful educational opportunities in these fields, but all too often young people have to choose between science and arts. Many of the best talents I have met in the world of analytics have backgrounds which are very diverse.

Q) Describe the current status of SQL Server and Microsoft Data Mining. What are the areas in Business Intelligence we can see much more excitement and innovation in the coming few months from you guys.

A) Data Mining remains one of the most popular technologies in the SQL Server stack. I have presented recently in China, Germany, The Netherlands and the UK, and at every conference the data mining sessions were among the most popular and the most successful. This speaks volumes about the interest in this field. it also reflects how successfully Microsoft has broadened our user base by shipping the Excel Data Mining Add-ins.

Q) How is Microsoft’s cloud computing venture Azure going? How is Sharepoint doing? What do you personally feel on the remote sharing and computing model.

A) Azure and Sharepoint are, of course, very different beasts. Windows Azure, and especially SQL Azure which we launched at PDC in November, are proving to be very popular. In particular SQL Server Azure is really succeeding with it’s strong development and management story – you design and manage cloud databases with the same tools and techniques as you do for on-premise databases. There has been a fabntastic response to this, especially from emerging economies where the idea of having Microsoft manage your data infrastructure at any scale is very attractive. At TechEd South Africa, for example, David Robinson from the SQL Azure team got a tremendous reception. However, there are difficulties in emerging economies because of poor bandwidth. Shortly after David and I were in South Africa, local businesses held a race: they tied a usb stick with files to the leg of a carrier pigeon and set it off home from Pietermaritzburg to Durban, simultaneously trying to download the same files between the same locations online. The pigeon won!

So, I do think the cloud offers tremendous opportunities for business to scale and manage their resources effectively, but it’s early days.

Q And when can I start do data mining from within my Excel workbook- I remember working on a SQL Server Analysis Plugin for an cloud Excel prototype last year.

A You should be using Excel for data mining right now. Just go to http://www.sqlserverdatamining.com and look for the links, on the right hand side of the page. These are released products. You can also go to http://www.sqlserverdatamining.com/cloud to try an experimental cloud service – but it is only experimental and could be up or down at any time.

For more conventional, OLAP-like, analytics you should also try out PowerPivot in beta. See http://www.powerpivot.com . PowerPivot is an application that plugs into Excel and enables business users to build quite complex models, over basically unlimited data volumes, quickly and easily. It’s proving to be hugely popular already. I am sure it will dominate much of the BI news in 2010.

Q) What are the risks, and challenges in creating new technology when working for an Industry leader like Microsoft where the spotlight is on every step you take and the competition is brutal.

A) I simply don’t think about brutal competition. Even in nature I see far more symbiosis than competition. I personally think competition is a very negative mindset although the term “competitor” is the common shorthand for another vendor in the space and I do use it that way myself – but more from habit than conviction.

In the database world, you might say Oracle are our competitors. Yet most of the Oracle customers I know (and I was an Oracle customer myself once) are also SQL Server customers. Often they use Reporting Services, or Analysis Services. Integration Services had to ship a fast-loading Oracle destination, because so many customers want to use SQL Server tools to load Oracle databases. I see far more cases like that, where the picture is complex and symbiotic, than I do of outright competition.

In the analytic space, almost every tool out there has one feature in common – one feature which everyone uses. Export to Excel.

I genuinely love working with our partners, and I am lucky to have good friends throughout the industry: at SAP, Oracle, IBM, SAS … you name it. We all benefit from empowering businesses with better tools. As the old saying goes, “the rising tide lifts all boats.”

Q) In terms of Lines of Code, Microsoft may have given the maximum number of shared libraries and code away- yet sometimes comes from a perception problem because of vintage. Do you think all cool tech companies become not so cool after some years, even if they dont fundamentally change.

A) I think the idea of a company being “cool” is itself just a phase we’re going through as an industry as we’re growing up. As the tech industry matures, you’ll see more emphasis on value, and net contributution. In many ways, Microsoft, and IBM I think, are ahead of the curve, as companies which are valued for their stability, resources and our ability to continually provide compelling new solutions and services. I travel a lot, and I see classrooms in western China, and emerging businesses in Africa, and women starting to work in new careers in the Middle East, and I don’t see them prioritizing cool. But I do see them doing amazing things with Microsoft technology.

Q) Describe your blogging style and what best tips would you give to technology bloggers.

A) I don’t blog enough, sadly, although I do try.

I have two blogs. One, at http://blogs.technet.com/sqlserverexperts/ is a shared “SQL Server Experts” blog. It’s very focussed on Microsoft technologies, of course. I especially like to blog about trends that I am seeing in my work with customers. My other blog, at http://beyeblogs.com/donaldfarmer/ is more personal, and includes gleanings from my other interests. I especially like doing my first blog of April there – that’s always fun.

My advice to bloggers should probably be “do what I say, not what I do.” However, most important I think, is to be authentic in your voice. My business intelligence bloggers are Jill Dyche, Evan Levy, David Loshin, William McKnight and Neil Raden – all of them blog quite regularly and are always great to read. There are others out there who are just as interesting, but don’t quite have the same rhythm to their blogging. I admire, but sadly fail to emulate, those who blog regularly and effectively.

Q) What do you do when not at work.

A) My wife is an artist, and she keeps me busy helping out with events and projects. We live on a wild couple of acres in Washington and caring for that is a lot of fun too. Otherwise, I mostly read, cook and play the piano. I love cooking, although I’m not sure how good I am – my son is now a professional chef, so perhaps I had some influence. I play the piano badly, but I can lose myself in that. I read very well. I love to read poetry – and I struggle to read Chinese poetry in the original. It’s such a fascinating language, and the poetry is so complex and yet so simple. That will be a lifetime study.

Biography-

Donald Farmer is the Principal Program Manager, SQL Server Data Mining, at Microsoft Corp.

M2009 Interview Peter Pawlowski AsterData

Here is an interview with Peter Pawlowski, who is the MTS for Data Mining at Aster Data. I ran into Peter at his booth at AsterData during M2009, and followed up with an email interview. Also included is a presentation by him of which he was a co-author.

[tweetmeme source=”decisionstats”]

Ajay- Describe your career in Science leading up till today.

Peter- Went to Stanford, where I got a BS & MS in Computer Science. I did some work on automated bug-finding tools while at Stanford.
( Note- that sums up the career of almost 60 % of CS scientists)

Ajay- How is life working at Aster Data- what are the challenges and the great stuff

Peter- Working at Aster is great fun, due to the sheer breadth and variety of the technical challenges. We have problems to solve in the optimization, languages, networking, databases, operating systems, etc. It’s been great to think about problems end-to-end & consider the impact of a change on all aspects of the system. I worked on SQL/MR in particular, which had lots of interesting challenges: how do you define the API? how do you integrate with SQL? how do you make it run fast? how do you make it scale?

Ajay- Do you think Universities offer adequate preparation for in demand skills like Mapreduce, Hadoop and Business Intelligence

Peter-   Probably not BI–I learned everything I know about BI while at Aster. In terms of M/R, it’d be useful to have more hands-on experience with distributed system which at school. We read the MapReduce paper but didn’t get a chance to actually play with M/R. I think that sort of exposure would be useful. We recently made our software available to some students taking a data mining class at Stanford, and they came up with some fascinating use cases for our system, esp. around the Netflix challenge dataset.

Ajay- Describe some of the recent engineering products that you have worked with at Aster

Peter-  SQL/MR is the main aspects of nCluster that i’ve worked with–interesting challenged described in #2.

Ajay- All BI companies claim to crunch data the fastest at the lowest price at highest quality as per their marketing brochure- How would you validate your product’s performance scientifically and transparently.

Peter- I’ve found that the hardest part of judging performance is to come up with a realistic workload. There are public benchmarks out there, but they may or may not reflect the kinds of workloads that our customers want to run. Our goal is to make our customers’ experience as good as possible, so we focus on speeding up the sorts of workloads they ask about.
And here is a presentation at Slideshare.net on more of what Peter works on.

Data Mining 2009 Interviews- Terry Whitlock, BlueCross BlueShield of TN

Terry Whitlock, Health Care Expert

Terry Whitlock is an employee of BlueCross BlueShield of Tennessee. He has more than 15 years of analytical experience in managed care. He has utilized SAS for nearly 13 of his 15 years over a wide variety of research in the field of managed care. Over his career he has been published in Managed Care Interface and presented research at Best of Blues as well as Academy Health Research

simply download the Interview file at https://decisionstats.com/wp-content/uploads/2009/11/20091027-112844.m4a

Audio Interview Anne Milley , Part 1

Here is an interview I did at M2009 with Anne Milley, senior SAS Strategist – This is an audio interview so listen on-

n1105016319_322

or if bandwidth is too slow, you can download the interview here itself

https://decisionstats.com/wp-content/uploads/2009/11/memo1.m4a

 

 

 

 

 

Faye Meredith of SAS was kind enough to speak aloud the questions

This was a nice follow up on the March interview

<https://decisionstats.wordpress.com/2009/03/04/interview-anne-milley-sas-part-1/>.

Biography-

Anne H. Milley is senior director of SAS’ technology product marketing in worldwide marketing, Anne Milley oversees the marketing of SAS technologies.  Her ties to SAS began with her thesis on bank failure prediction models and the term structure of interest rates.  She completed this at The Federal Home Loan Bank of Dallas and became a manager in the credit group.  She continued her use of SAS at 7-Eleven, Inc. as a senior business consultant performing sales analysis and designing and conducting tests to aid in strategic decision-making, e.g., price sensitivity studies, advertising and promotion analysis.

Milley has authored various papers, articles and an award-winning report for the 1999 KDD Contest: http://www-cse.ucsd.edu/users/elkan/kdresults.html.  She co-chaired the SAS Data Mining Technology Conferences, M2001 and M2002 as well as SAS’ inaugural forecasting conference, F2006.  She has served on web mining committees for KDD and SIAM and on the Scientific Advisory Committee for Data Mining 2002.  In 2008 she completed a 5-month working sabbatical at a major financial services company in the United Kingdom.

Disclaimer- Travel and Stay Expenses to Data Mining 2009 were paid for me by SAS Institute. This is as per FCC regulations to bloggers. All opinions expressed are personal and not representing any organization.

Interview Carole Jesse Experienced Analytics Professional

An interview with Carole Jesse, an experienced Analytics professional in SAS, JMP , analytics and Risk Management.

CAJphoto_20091019(2)

Ajay- Describe your career in science from school to now.

Carole- Truthfully, my career in science started in 7th grade. Hey, I know this is further back in time than you intended the question to go!  However, something significant happened that year that pretty much set me on the path that I am still on today.  I discovered Algebra.  Up to that point in time, I was an average student in ‘arithmetic’. Algebra introduced LETTERS into the mix with numbers, in the simplest of ways that we have all seen: ‘Solve for x in the equation x+2=5’.  That was something I could get behind, AND I excelled at it immediately. Without mathematical excellence, efforts in learning science can fall apart.  Mathematics is everywhere!

I spent the rest of my secondary education consuming all the math and science that I could get. By the time I entered college I had already been exposed to pre-calculus and physics and was actually surprised by those in my college Freshman courses who had not seen anti-derivatives, memorized the quotient rule, or worked an inclined plane friction problem before.

My goal as an undergraduate was to become a Veterinarian.  The beauty of a pre-Vet curriculum is that it is pretty much like pre-Med, rigorous and broad in the sciences.  In my first two years of undergraduate work, I was exposed to more Chemistry, more Mathematics, more Physics, along with things like Genetics, Biology, even the Plant and Animal Sciences.  Although I did not stick with my pursuit of Veterinary Medicine, it laid a solid foundation that has served me very well in the strangest of places.

I consider myself a Mathematician/Statistician due to my academic degrees in those areas, first a BS in Mathematics/Physics at the University of Wisconsin followed by a MS in Statistics at Montana State University. In between the BS and MS I also dabbled briefly in Electrical Engineering at the University of Minnesota.

Since academia, it is my breadth in ALL sciences which has allowed me to be very fluid in straddling diverse industries: from High Volume Manufacturing of Consumer Products, to Nuclear Energy, to Semiconductor Manufacturing/Packaging, to Financial Services, to Health Care. I succeed at business problem solving in these industries by applying my Statistical Methods knowledge, coupled with business acumen and peripheral understanding of the technologies used. I have worked closely with scientists and engineers, and could enter THEIR world speaking THEIR language, which was an aid in getting to these solutions quickly.

I can not place enough emphasis on the importance of exposure to a broad range of sciences, and as early as possible, for anyone who wants to be involved in Advanced Analytics and Business Intelligence. As a manager, I look closely at candidates for these diverse sorts of backgrounds.

Ajay- I find the number of computer scientists and analysts to be overwhelmingly male despite this being a lucrative profession. Do you think that BI and Analytics are male dominated?  How can the trend be re-shaped?

Carole- Welcome to my world!  All kidding aside, yes that has been my observation as well. While I am not versed in the specifics of actual gender statistics in Computer Science and Advanced Analytics versus other fields, based on my years in and around these fields, there does appear to be a bias.

This is not due to a lack of capability or interest in these fields on the part of women. I believe it is more due to the long history of cultural norms and negative social messages that perhaps push woman away from these fields.  The messages can be subtle, but if you pay close attention, you will see them.  Being one of 10 females in an undergraduate engineering class of 150 students has a message right there.  Even though these 10 women were able to make entry to the class, the pressure of being a minority, whether gender based or otherwise, can be a powerful influencer in remaining there.

In my own experience, I have encountered frequent judgments where I was made to feel “good at math” was an unacceptable trait for a woman to have.  It is important to note that these judgments have been delivered equally by men AND women. So I think until both genders develop higher expectations of women in the hard science areas, the trends will continue.  It has been decades since my 7th grade introduction to algebra, but it appears the negative social messages regarding girls in math and science are still present today. Otherwise there would be no need (i.e. no market) for books like Danica McKellar’s “Math Doesn’t Suck,” and the follow-up “Kiss My Math,” both aimed at battling these negative messages at the middle school level.

As to how I have battled these cultural expectations, I developed a thick skin. I have also learned to expect excellence from myself even when a teacher, or a peer, or a boss may have had lower expectations for me than for a male counterpart. Sort of a John Mayer “Who Says” type of attitude.  Who says I can’t do Math and Science. Watch me.

Ajay- How would you explain Risk Management using software to a class of graduate students in mathematics and statistics?

Carole- There are many areas of Risk Management.  My specific experience has been on the Credit Risk Management and Fraud Risk Management sides in a couple of industries.  For credit risk in financial services, typically there is a specific department whose role is to quantify and predict credit risk.  Not just for the current portfolio, but for new products as well.  Various methodologies are utilized, ranging from summarization of portfolio characteristics that have a known relationship to default to using historical data to build out predictive models for production implementation.

Key skills needed here are good understanding of the business, solid statistical methods knowledge, and computing skills.  As far as the computing /software skills needed, there are three main categories 1) query and preparation of data, 2) model building and validation, and 3) model implementation.  The actual tools will likely differ across these categories.

For example, 1) might be tackled with SAS®, Business Objects, or straight SQL;

2) requires a true modeling package or coding language like SAS®, SPSS, R, etc; and lastly

3) is the trickiest, as implementation can have many system limitations, but SAS® or C++ are often seen at implementation.

Ajay- Describe some of your most challenging and most exciting projects over the years.

Carole- I have been very fortunate to have many challenges and good projects in every role I have been in, but as I look back today, some things that stand out the most were in ‘high tech’.  By virtue of being high tech, there is no fear of technology, and it is fast-paced and ever evolving to the next generation of product.

I spent seven years in the Semiconductor industry during the 90’s at Micron Technology, Intel, and Motorola. At the beginning of that window, we left the 486 processor world, and during that window we spanned the realm of Pentium processors.” Moore’s Law dominated all of this. To stay competitive all of these companies embraced statistical methods to help speed up development time.

At one point, I supported a group of about 10 R&D engineers in the Design and Analysis of their process improvement and simplification experiments.  This afforded me exposure to much of the leading edge research the team was working on.

I recall one project with the goal of optimizing capacitance via surface roughness of the capacitor structures.  In addition to all the science involved at the manufacturing step, what made this so interesting was the difficulty in measuring capacitance at the point in the process where film roughness was introduced. All we had were surface images after this step.  The semiconductor wafers had to pass through several more process steps to get to the point where capacitance could actually be measured. All of this provided challenges around the design of the experiment and the data handling and analysis.

By working closely with both the process engineer and the process technician I was able to gather the image files off the image tool that were taken from the experimental runs. I used SAS® (yes, another shameless plug for my favorite software) to process the images using Fast Fourier Transforms. Subsequently, the transformed data was correlated to the capacitance in the analysis of the experimental results.  Finding the sweet spot for capacitance, as driven by surface roughness, provided a huge leap for this process technology team.

The challenges of today are much different than they were in the 90s.  In the more recent years, I have been working with transactional data related to financial services or health care claims.  The challenges manifest themselves in the sheer volume of the data. In the last decade in particular most industries have been able to put the infrastructures in place to gather and store massive amounts of data related to their businesses.  The challenge of turning this data into meaningful actionable information has been equally exciting as using Fast Fourier Transforms on image processing to optimize capacitance!

Currently I am working with an Oracle database where one table in the schema has 250 million records and a couple hundred fields.  I refer to this as a “Pushing Tera” situation, since this one table is close to a Terabyte in size. As far as storing the data, that is not a big deal, but working with data this large or larger is the challenge.

Different skill sets are needed here beyond those of just an analyst, data miner, or statistician.  These VLDB situations have morphed me into a bit of an IT person.

  • How do you efficiently query such large databases? An inefficient SQL query will not be a bother in a situation where the database is small. But when the database is large, SQL efficiency is key. Many skills needed for industry are not necessarily taught in academia, but rather get picked up along the way, like Unix and SQL. I now write efficient SQL code, but many poorly written jobs gave their lives so that I could learn these efficiencies!
  • Eventually I will need to organize this data into an application specific format and put data security controls around the process.  Again, is this Advanced Analytics?  Not really, it is more of an MIS role. The newness in these challenges keeps me excited about my work.

Ajay- How important do you think work life balance is for people in this profession? What do you do to chill out?

Carole- I don’t think the work-life balance is any more or less important to the decision science professionals than it is to any other profession really.  I have friends in many other professions like Law, Nursing, Financial Planning, etc. with the same work-life balance struggles.

We live in a busy culture that includes more and more demands placed on us professionally.  Let’s face it, most of us are care-takers to someone besides ourselves.  It might be a spouse, or a child, or a dog, or even an elderly parent. Therefore, a total focus on work is bound to upset the work-life balance for most of us.

My biggest struggle comes in the form of balancing the two sides of my brain.  That may sound weird, but one thing you have to agree with is that all of this is pretty Left Brained:  mathematics, statistics, business intelligence, computing, etc.

To balance this out, and tap into my Right Brain, I like to dabble in the arts to some extent.  Don’t get me wrong, I am not an artist!  But that doesn’t mean I can’t draw on creativity in the artistic sense. For example, this past summer I took a course on Adobe Photoshop and Illustrator at Minneapolis College of Art and Design. This provided the best of both worlds, combining software and art! In addition to learning how to remove Cindy Crawford’s mole (yes, we did this), there were some very useful projects.  One of my course projects was creating my customized Twitter background. An endeavor like this provides me a ‘chilling out’ factor from the normal work world. I know of many other Left Brain leaners that do similar things, like playing a musical instrument, or painting, etc. This is another reason why I took up digital photography: more visual arts.

Volunteer work has a balancing effect too. I try to give back to the community when I can. Swinging a hammer at Habitat for Humanity, or doing record keeping for an Animal Rescue organization, are things I have participated in.

And if none of this works, I enjoy cooking for my family and friends, and plying them with wine!

Ajay- What are you views on:

Carole- Data Quality

I’d have to say I am for data Quality! Who isn’t? But the reality is that data is dirty.  That “Pushing Tera” Oracle table I mentioned earlier, well it turns out it has some issues.  And it is incumbent upon me to determine the quality of that data before attempting to do anything analytical with it.  One place in industry where value enhancement are needed:  database administrators with business knowledge.  It seems that more times than not, even if there was a business savvy DBA they may have moved on, leaving the consumers of that data (that would be me) to fend for themselves. There is some debate over which philosopher said “Know thyself.”  Today’s job challenge is to “Know thy data” or perhaps “Value those that know thy data.”

B) Predictive Analytics for Fraud Monitoring

There is a huge market for analytics in fraud detection and prevention.  But it is not for the faint of heart. Insiders, at least in Mortgage and Health Care, are the typical perpetrators of lucrative fraud. These insiders know how the industry processes work and they exploit this.  As soon as one loophole is discovered and patched, fraudsters are looking for another loophole to exploit.  This makes the task of predictive analytics different for Fraud than other areas where underlying patterns are probably more stable.  Any methodology used here must have “turn on a dime” features built in, if possible.  With economic conditions as they are, fraud detection/monitoring will remain important and challenging field.

Biography

Carole Jesse has been applying statistical methods and advanced analytics in a variety of industries for the last 20 years.  Her career spans High Volume Manufacturing of Consumer Products, Nuclear Energy, Semiconductor Manufacturing/Packaging, Financial Services, and Health Care.  Applications have ranged from Design and Analysis of Experiments to Credit Risk Prediction to Fraud Pattern Recognition.  Carole holds a B.S. in Mathematics from the University of Wisconsin and a M.S. in Statistics from Montana State University, as well as several professional certifications.  All the opinions expressed here are her own, and not those of her employers: past, present, or future.  (Although her dog Angie may have had some influence.)  Ms. Jesse currently lives and works in Minneapolis, Minnesota.

You can find Carole on Twitter as @CaroleJesse and at LinkedIn http://www.linkedin.com/in/CaroleJesse



1) Describe your career in science from school to now.

Truthfully, my career in science started in 7th grade. Hey, I know this is further back in time than you intended the question to go!  However, something significant happened that year that pretty much set me on the path that I am still on today.  I discovered Algebra.  Up to that point in time, I was an average student in ‘arithmetic’. Algebra introduced LETTERS into the mix with numbers, in the simplest of ways that we have all seen: ‘Solve for x in the equation x+2=5’.  That was something I could get behind, AND I excelled at it immediately. Without mathematical excellence, efforts in learning science can fall apart.  Mathematics is everywhere!

I spent the rest of my secondary education consuming all the math and science that I could get. By the time I entered college I had already been exposed to pre-calculus and physics and was actually surprised by those in my college Freshman courses who had not seen anti-derivatives, memorized the quotient rule, or worked an inclined plane friction problem before.

My goal as an undergraduate was to become a Veterinarian.  The beauty of a pre-Vet curriculum is that it is pretty much like pre-Med, rigorous and broad in the sciences.  In my first two years of undergraduate work, I was exposed to more Chemistry, more Mathematics, more Physics, along with things like Genetics, Biology, even the Plant and Animal Sciences.  Although I did not stick with my pursuit of Veterinary Medicine, it laid a solid foundation that has served me very well in the strangest of places.

I consider myself a Mathematician/Statistician due to my academic degrees in those areas, first a BS in Mathematics/Physics at the University of Wisconsin followed by a MS in Statistics at Montana State University. In between the BS and MS I also dabbled briefly in Electrical Engineering at the University of Minnesota.

Since academia, it is my breadth in ALL sciences which has allowed me to be very fluid in straddling diverse industries: from High Volume Manufacturing of Consumer Products, to Nuclear Energy, to Semiconductor Manufacturing/Packaging, to Financial Services, to Health Care. I succeed at business problem solving in these industries by applying my Statistical Methods knowledge, coupled with business acumen and peripheral understanding of the technologies used. I have worked closely with scientists and engineers, and could enter THEIR world speaking THEIR language, which was an aid in getting to these solutions quickly.

I can not place enough emphasis on the importance of exposure to a broad range of sciences, and as early as possible, for anyone who wants to be involved in Advanced Analytics and Business Intelligence. As a manager, I look closely at candidates for these diverse sorts of backgrounds.

2) I find the number of computer scientists and analysts to be overwhelmingly male despite this being a lucrative profession. Do you think that BI and Analytics are male dominated?  How can the trend be re-shaped?

Welcome to my world!  All kidding aside, yes that has been my observation as well. While I am not versed in the specifics of actual gender statistics in Computer Science and Advanced Analytics versus other fields, based on my years in and around these fields, there does appear to be a bias.

This is not due to a lack of capability or interest in these fields on the part of women. I believe it is more due to the long history of cultural norms and negative social messages that perhaps push woman away from these fields.  The messages can be subtle, but if you pay close attention, you will see them.  Being one of 10 females in an undergraduate engineering class of 150 students has a message right there.  Even though these 10 women were able to make entry to the class, the pressure of being a minority, whether gender based or otherwise, can be a powerful influencer in remaining there.

In my own experience, I have encountered frequent judgments where I was made to feel “good at math” was an unacceptable trait for a woman to have.  It is important to note that these judgments have been delivered equally by men AND women. So I think until both genders develop higher expectations of women in the hard science areas, the trends will continue.  It has been decades since my 7th grade introduction to algebra, but it appears the negative social messages regarding girls in math and science are still present today. Otherwise there would be no need (i.e. no market) for books like Danica McKellar’s “Math Doesn’t Suck,” and the follow-up “Kiss My Math,” both aimed at battling these negative messages at the middle school level.

As to how I have battled these cultural expectations, I developed a thick skin. I have also learned to expect excellence from myself even when a teacher, or a peer, or a boss may have had lower expectations for me than for a male counterpart. Sort of a John Mayer “Who Says” type of attitude.  Who says I can’t do Math and Science. Watch me.

3) How would you explain Risk Management using software to a class of graduate students in mathematics and statistics?

There are many areas of Risk Management.  My specific experience has been on the Credit Risk Management and Fraud Risk Management sides in a couple of industries.  For credit risk in financial services, typically there is a specific department whose role is to quantify and predict credit risk.  Not just for the current portfolio, but for new products as well.  Various methodologies are utilized, ranging from summarization of portfolio characteristics that have a known relationship to default to using historical data to build out predictive models for production implementation.  Key skills needed here are good understanding of the business, solid statistical methods knowledge, and computing skills.  As far as the computing /software skills needed, there are three main categories 1) query and preparation of data, 2) model building and validation, and 3) model implementation.  The actual tools will likely differ across these categories.  For example, 1) might be tackled with SAS®, Business Objects, or straight SQL; 2) requires a true modeling package or coding language like SAS®, SPSS, R, etc; and lastly 3) is the trickiest, as implementation can have many system limitations, but SAS® or C++ are often seen at implementation.

4) Describe some of your most challenging and most exciting projects over the years.

I have been very fortunate to have many challenges and good projects in every role I have been in, but as I look back today, some things that stand out the most were in ‘high tech’.  By virtue of being high tech, there is no fear of technology, and it is fast-paced and ever evolving to the next generation of product.

I spent seven years in the Semiconductor industry during the 90’s at Micron Technology, Intel, and Motorola. At the beginning of that window, we left the 486 processor world, and during that window we spanned the realm of Pentium processors.” Moore’s Law dominated all of this. To stay competitive all of these companies embraced statistical methods to help speed up development time.

At one point, I supported a group of about 10 R&D engineers in the Design and Analysis of their process improvement and simplification experiments.  This afforded me exposure to much of the leading edge research the team was working on.

I recall one project with the goal of optimizing capacitance via surface roughness of the capacitor structures.  In addition to all the science involved at the manufacturing step, what made this so interesting was the difficulty in measuring capacitance at the point in the process where film roughness was introduced. All we had were surface images after this step.  The semiconductor wafers had to pass through several more process steps to get to the point where capacitance could actually be measured. All of this provided challenges around the design of the experiment and the data handling and analysis.

By working closely with both the process engineer and the process technician I was able to gather the image files off the image tool that were taken from the experimental runs. I used SAS® (yes, another shameless plug for my favorite software) to process the images using Fast Fourier Transforms. Subsequently, the transformed data was correlated to the capacitance in the analysis of the experimental results.  Finding the sweet spot for capacitance, as driven by surface roughness, provided a huge leap for this process technology team.

The challenges of today are much different than they were in the 90s.  In the more recent years, I have been working with transactional data related to financial services or health care claims.  The challenges manifest themselves in the sheer volume of the data. In the last decade in particular most industries have been able to put the infrastructures in place to gather and store massive amounts of data related to their businesses.  The challenge of turning this data into meaningful actionable information has been equally exciting as using Fast Fourier Transforms on image processing to optimize capacitance!

Currently I am working with an Oracle database where one table in the schema has 250 million records and a couple hundred fields.  I refer to this as a “Pushing Tera” situation, since this one table is close to a Terabyte in size. As far as storing the data, that is not a big deal, but working with data this large or larger is the challenge.

Different skill sets are needed here beyond those of just an analyst, data miner, or statistician.  These VLDB situations have morphed me into a bit of an IT person.

  • How do you efficiently query such large databases? An inefficient SQL query will not be a bother in a situation where the database is small. But when the database is large, SQL efficiency is key. Many skills needed for industry are not necessarily taught in academia, but rather get picked up along the way, like Unix and SQL. I now write efficient SQL code, but many poorly written jobs gave their lives so that I could learn these efficiencies!
  • Eventually I will need to organize this data into an application specific format and put data security controls around the process.  Again, is this Advanced Analytics?  Not really, it is more of an MIS role. The newness in these challenges keeps me excited about my work.

5)  How important do you think work life balance is for people in this profession? What do you do to chill out?

I don’t think the work-life balance is any more or less important to the decision science professionals than it is to any other profession really.  I have friends in many other professions like Law, Nursing, Financial Planning, etc. with the same work-life balance struggles.

We live in a busy culture that includes more and more demands placed on us professionally.  Let’s face it, most of us are care-takers to someone besides ourselves.  It might be a spouse, or a child, or a dog, or even an elderly parent. Therefore, a total focus on work is bound to upset the work-life balance for most of us.

My biggest struggle comes in the form of balancing the two sides of my brain.  That may sound weird, but one thing you have to agree with is that all of this is pretty Left Brained:  mathematics, statistics, business intelligence, computing, etc.

To balance this out, and tap into my Right Brain, I like to dabble in the arts to some extent.  Don’t get me wrong, I am not an artist!  But that doesn’t mean I can’t draw on creativity in the artistic sense. For example, this past summer I took a course on Adobe Photoshop and Illustrator at Minneapolis College of Art and Design. This provided the best of both worlds, combining software and art! In addition to learning how to remove Cindy Crawford’s mole (yes, we did this), there were some very useful projects.  One of my course projects was creating my customized Twitter background. An endeavor like this provides me a ‘chilling out’ factor from the normal work world. I know of many other Left Brain leaners that do similar things, like playing a musical instrument, or painting, etc. This is another reason why I took up digital photography: more visual arts.

Volunteer work has a balancing effect too. I try to give back to the community when I can. Swinging a hammer at Habitat for Humanity, or doing record keeping for an Animal Rescue organization, are things I have participated in.

And if none of this works, I enjoy cooking for my family and friends, and plying them with wine!

6) What are you views on:

A)  Data Quality

I’d have to say I am for data Quality! Who isn’t? But the reality is that data is dirty.  That “Pushing Tera” Oracle table I mentioned earlier, well it turns out it has some issues.  And it is incumbent upon me to determine the quality of that data before attempting to do anything analytical with it.  One place in industry where value enhancement are needed:  database administrators with business knowledge.  It seems that more times than not, even if there was a business savvy DBA they may have moved on, leaving the consumers of that data (that would be me) to fend for themselves. There is some debate over which philosopher said “Know thyself.”  Today’s job challenge is to “Know thy data” or perhaps “Value those that know thy data.”

B) Predictive Analytics for Fraud Monitoring

There is a huge market for analytics in fraud detection and prevention.  But it is not for the faint of heart. Insiders, at least in Mortgage and Health Care, are the typical perpetrators of lucrative fraud. These insiders know how the industry processes work and they exploit this.  As soon as one loophole is discovered and patched, fraudsters are looking for another loophole to exploit.  This makes the task of predictive analytics different for Fraud than other areas where underlying patterns are probably more stable.  Any methodology used here must have “turn on a dime” features built in, if possible.  With economic conditions as they are, fraud detection/monitoring will remain important and challenging field.