The Big Data Event- Why am I here?

I am here braving New York’s cold weather, as I prepare for this evening’s events. If you follow this blog closely ( including the poems) ,it is a welcome change— New York is a nice city people are friendly if you ask them nicely and the bus is a great way to watch the city – best of all I like the crowds which I have grown used while living in India.

Why Am I here?

Because the topics that are discussed here are cutting edge to the point that I cannot find anyone willing to teach me Hadoop and Map-Reduce while in University and at the same time teach me statistics on them as well ( as in how do we do a K Means clustering on a 1 terabyte dataset).

I asked the organizers on what makes the event special ( every event promises special Mojo after all).

This is what they said-

What is the unique value proposition of the event that will help developers and both current and potential customers-

The essence of the event is to explore new innovations in massively-parallel processing data warehousing technology and how it can help companies gain more insight from their data.  Applications include fraud detection, behavioral targeting, social network analysis, better predictions/forecasting, bioinformatics, etc.  We are exploring how MapReduce and Hadoop can be integrated into the enterprise IT system to help evolve data warehousing/BI/data mining

and to put it even more nicely’

The industry’s first big data event, Big Data Summit ‘09, being held this evening in New York City, will showcase Hadoop’s fit with MPP data warehouses. Aster Data will be presenting alongside Colin White, President and Founder of BI Research, Mike Brown of comScore Inc., and Jonathan Goldman, who represents LinkedIn.”

That’s good enough for me to drop into Roosevelt Hotel on East 45th Street at around 6 pm for some reluctant networking ( read: beers). 5 years ago whie working for GE , I used to run queries using SAS on a 147 million row database (the size of the DB) and wait 3 hours for it to come back. Today that much data fits very snugly in my laptop. How soon will we have Terabyte level personal computing, and Petabyte level business computing and the challenges it poses to standard statistical assumptions and synching of hardware and software- Big Big Data is an interesting area to watch.

Interview Shawn Kung Sr Director Aster Data

Here is an interview with Shawn Kung, Senior Director of Product Management at Aster Data. Shawn explains the difference between the various database technologies, Aster’s rising appeal to its unique technological approach and touches upon topics of various other interests as well to people in the BI and technology space.

image001

Ajay -Describe your career journey from a high school student of science till today .Do you think science is a more lucrative career?

Shawn: My career journey has spanned over a decade in several Silicon Valley technology companies.  In both high school and my college studies at Princeton, I had a fervent interest in math and quantitative economics.  Silicon Valley drew me to companies like upstart procurement software maker Ariba and database giant Oracle.  I continued my studies by returning to get a Master’s in Management Science at Stanford before going on to lead core storage systems for nearly 5 years at NetApp and subsequently Aster.

Science (whether it is math, physics, economics, or the hard engineering sciences) provides a solid foundation.  It teaches you to think and test your assumptions – those are valuable skills that can lead to a both a financially lucrative and personally inspiring career.

Ajay- How would you describe the difference between Map Reduce and Hadoop and Oracle and SAS, DBMS and Teradata and Aster Data products to a class of undergraduate engineers ?

Shawn: Let’s start with the database guys – Oracle and Teradata.  They focus on structured data – data that has a logical schema and is manipulated via a standards-based structured query language (SQL).  Oracle tries to be everything to everyone – it does OLTP (low-latency transactions like credit card or stock trade execution apps) and some data warehousing (typically summary reporting).  Oracle’s data warehouse is not known for large-scale data warehousing and is more often used for back-office reporting.

Teradata is focused on data warehousing and scales very well, but is extremely expensive – it runs on high-end custom hardware and takes a mainframe approach to data processing.  This approach makes less sense as commodity hardware becomes more compute-rich and better software comes along to support large-scale MPP data warehousing.

SAS is very different – it’s not a relational database. It really offers an application platform for data analysis, specifically data mining.  Unlike Oracle and Teradata which is used by SQL developers and managed by DBAs, SAS is typically run in business units by data analysts – for example a quantitative marketing analyst, a statistician/mathematician, or a savvy engineer with a data mining/math background.  SAS is used to try to find patterns, understand behaviors, and offer predictive analytics that enable businesses to identify trends and make smarter decisions than their competitors.

Hadoop offers an open-source framework for large-scale data processing.  MapReduce is a component of Hadoop, which also contains multiple other modules including a distributed filesystem (HDFS).  MapReduce offers a programming paradigm for distributed computing (a parallel data flow processing framework).

Both Hadoop and MapReduce are catered toward the application developer or programmer.  It’s not catered for enterprise data centers or IT.  If you have a finite project in a line of business and want to get it done, Hadoop offers a low-cost way to do this.  For example, if you want to do large-scale data munging like aggregations, transformations, manipulations of unstructured data – Hadoop offers a solution for this without compromising on the performance of your main data warehouse.  Once the data munging is finished, the post-processed data set can be loaded into a database for interactive analysis or analytics. It is a great combination of big data technologies for certain use-cases.

Aster takes a very unique approach.  Our Aster nCluster software offers the best of all worlds – we offer the potential for deep analytics of SAS, the low-cost scalability and parallel processing of Hadoop/MapReduce, and the structured data advantages (schema, SQL, ACID compliance and transactional integrity, indexes, etc) of a relational database like Teradata and Oracle.  Often, we find complementary approaches and therefore view SAS and Hadoop/MapReduce as synergistic to a complete solution.  Data warehouses like Teradata and Oracle tend to be more competitive.

Ajay- What exciting products have you launched so far and what makes them unique both from a technical developer perspective and a business owner perspective

Shawn: Aster was the first-to-market to offer In-Database MapReduce, which provides the standards and familiarity of SQL and databases with the analytic power of MapReduce.  This is very unique as it offers technical developers and application programmers to write embedded procedural algorithms once, upload it, and allow business analysts or IT folks (SQL developers, DBAs, etc) to invoke these SQL-MapReduce functions forever.

It is highly polymorphic (re-usable), highly fault-tolerant, highly flexible (any language – Java, Python, Ruby, Perl, R statistical language, C# in the .NET world, etc) and natively massively parallel – all of which differentiate these SQL extensions from traditional dumb user-defined functions (UDFs).

Ajay- “I am happy with my databases and I don’t need too much diversity or experimentation in my systems”, says a CEO to you.

How do you convince him using quantitative numbers and not marketing adjectives?

Shawn: Aster has dozens of production customers including big-names like MySpace, LinkedIn, Akamai, Full Tilt Poker, comScore, and several yet-to-be-named retail and financial service accounts.  We have quantified proof points that show orders of magnitude improvements in scalability, performance, and analytic insights compared to incumbent or competitor solutions.  Our highly referenceable customers would be happy to discuss their positive experiences with the CEO.

But taking a step back, there’s a fundamental concept that this CEO needs to first understand.  The world is changing – data growth is proliferating due to the digitization of so many applications and the emergence of unstructured data and new data types.  Like the book “Competing on Analytics”, the world is shifting to a paradigm where companies that don’t take risks and push the limits on analytics will die like the dinosaurs.

IDC is projecting 10x+ growth in data over the next few years to zetabytes of aggregate data driven by digitization (Internet, digital television, RFID, etc).  The data is there and in order to compete effectively and understand your customers more intimately, you need a large-scale analytics solution like the one Aster nCluster offers.  If you hold off on experimentation and innovation, it will be too late by the time you realize you have a problem at hand.

Ajay- How important is work life balance for you?

Shawn: Very important.  I hang out with my wife most weekends – we do a lot of outdoors activities like hiking and gardening.  In Silicon Valley, it’s all too easy to get caught up in the rush of things.  Taking breaks, especially during the weekend, is important to recharge and re-energize to be as productive as possible.

Ajay- Are you looking for college interns and new hires what makes aster exciting for you so you are pumped up every day to go to work?

Shawn: We’re always looking for smart, innovative, and entrepreneurial new college grads and interns, especially on the technical side.  So if you are a computer science major or recent grad or graduate student, feel free to contact us for opportunities.

What makes Aster exciting is 2 things –

first, the people.  Everyone is very smart and innovative so you learn a tremendous amount, which is personally gratifying and professionally useful long-term.

Second, Aster is changing the world!

Distributed systems computing focused on big data processing and analytics – these are massive game-changers that will fundamentally change the landscape in data warehousing and analytics.  Traditional databases have been a oligopoly for over a generation – they haven’t been challenged and so the 1970’s based technology has stuck around.  The emergence of big data and low-cost commodity hardware has created a unique opportunity to carve out a brand new market…

what gets me pumped every day is I have the ability to contribute to a pioneer that is quickly becoming Silicon Valley’s next great success story!

Biography-

Over the past decade, Shawn has led product management for some of Silicon Valley’s most successful and innovative technology companies.  Most recently, he spent nearly 5 years at Network Appliance leading Core Systems storage product management, where he oversaw the development of high availability software and Storage Systems hardware products that grew in annual revenue from $200M to nearly $800M.  Prior to NetApp, Shawn held senior product management and corporate strategy roles at Oracle Corporation and Ariba Inc.

Shawn holds an M.S. in Management Science and engineering from Stanford University, where he was awarded the Valentine Fellowship (endowed by Don Valentine of Sequoia Capital).  He also received a B.A. with high honors from Princeton University.

About Aster

Aster Data Systems is a proven leader in high-performance database systems for data warehousing and analytics – the first DBMS to tightly integrate SQL with MapReduce – providing deep insights on data analyzed on clusters of low-cost commodity hardware. The AsternCluster database cost-effectively powers frontline analytic applications for companies such as MySpace, aCerno (an Akamai company), and ShareThis.

Running on low-cost off-the-shelf hardware, and providing ‘hands-free’ administration, Aster enables enterprises to meet their data warehousing needs within their budget. Aster is headquartered in San Carlos, California and is backed by Sequoia Capital, JAFCO Ventures, IVP, Cambrian Ventures, and First-Round Capital, as well as industry visionaries including David Cheriton and Ron Conway.

KXEN Conference :How to get the Best ROI..

Citation : http://www.kxen.com/index.php?option=com_content&task=view&id=560&Itemid=1005

kxen

How to get the Best ROI from your Advanced Analytics Investment?
MonDAY, NovemBER 2, 2009
09:30-10:30 AM Registration & Breakfast
10:30-10:45 AM Welcome & Opening Remarks,
by Roger Haddad, CEO KXEN
10:45-11:30 AM Keynote Session:
Customer Focus and Momentum Strategy,
by Marc Beauvois-Coladon, on the original work of
Professor Jean-Claude Larréché, INSEAD Alfred
H. Heineken Chair of Marketing
11:30-12:30 AM Technology Directions,
by Erik Marcade, CTO KXEN
12:30-01:45 PM Lunch
01:45-02:30 PM Customer Case Study: Groupama – Insurance
02:30-03:15 PM Boosting Lift by integrating Web Browsing Behavior in Affinity Modeling, by Edouard Servan-Schreiber, Assistant Director Advanced Analytics Teradata EMEA
03:15-04:00 PM Customer Case Study: Leveraging Social Network Analysis in Customer Retention with KXEN
by Jaroslaw Kosinski, Project Manager Planning and Strategic Analysis TPSA (Telco)
04:00-04:15 PM
Break

04:15-05:00 PM Business Track 1: Towards Predictive BI: Manage your strategic KPI’s (workshop part 1)

Technical Track 1: How to build a Data Factory

05:00-05:45 PM Business Track 2: Towards Predictive BI: Manage your Strategic KPI’s (workshop – part 2)

Technical Track 2: How to build a Modeling Factory

05:45-06:30 PM Customer Case Study: Industry
08:00-00:00 PM Gala Dinner @ the Eiffel Tower
TuesDAY, NovemBER 3, 2009
09:00-09:30 AM Welcome Coffee
09:30-10:15 AM How to optimise Performance and Effectiveness of Relationship Management Programs, while Taking Advantage of just-in-time Marketing,
by Alexandre Losson, Head of CRM, Business & Decision
10:15-11:00 AM Customer Case Study: Wolters Kluwer – Publishing & Media
11:00-11:15 AM Break
11:15-12:00 AM Business Track 3: Automate the Production of Hyper Targeted Campaigns, by Emmanuel Duhesme, CEO Probance

Technical Track 3: How to implement Social Networks Analysis

12:00-12:45 AM Business Track 4: How to present your Data Mining Results to get Visibility, by Frederic Binon,
Marketing Expert Manager at Belgacom

Technical Track 4: KXEN New Versions Tips & Tricks

12:45-01:45 PM Lunch
01:45-02:15 PM Partner Presentation: Geoconcept
02:15-03:00 PM Customer Case Study: The Co-operative Financial Services – Bank
03:00-03:15 PM
Break
03:15-04:15 PM Ask your Questions to KXEN Executives!
04:15-05:00PM Closing Remarks by Roger Haddad, CEO KXEN
Option: NOVEMBER 4, 2009
Complimentary Post-Conference Technical Workshop

Registrations on a First Come / First Served Basis – Places are limited

09:00-11:30 AM ADM: KXEN Data Manipulation Training & Exercices
11:30-01:00 PM KMF: Tour of KXEN Modeling Factory
01:00-02:00 PM Lunch
02:00-05:00 PM KSN: KXEN Social Network Analysis Training & Exercices
Registration requests for this workshop should be sent by e-mail to event-eur@kxen.com once your Conference Registration has been proceeded.

KXEN EMEA & Asia user CONFERENCE
NovembER 2 & 3, 2009 – Paris
Agenda

And now- The Business Analytics Summit

Citation-http://www.businessanalyticsnews.com/usa/speakers.shtml

Here is another conference but focused on Business Analytics ( which is an under hyped term compared to Business Intelligence which some feel is over hyped)

It seems ( jokes apart) Business Analytics is really the Business to be in this season and with the economy picking up steam- more power to the Numerati is all I can say.

One must really say- this conference does have a very long list of speakers ( maybe the most exhaustive I have seen this year) and almost like they went around with an encyclopedia of who’s who. It is also partnered by KD Nuggets and good ol Gregory sure knows how to partner and promote ( I am still a kid at that 😉

Surprisingly No R ,SPSS and No SAS in this one – so what exact software are they going to use in the case studies ( machine learning ( an oxymoron)  or SQL ?

Speaker Job Title Company
Michael Berry Founder & Principal Data-Miners Inc
Richard Boire Partner Boire Filler Group
Chris Bowman Former School Superintendent Schools in the City of Baker School System
John Elder Chief Scientist Elder Research Inc
Usama Fayyad CEO Open Insights
Regina Gray Vice President of Strategic Insight Services, Experian Marketing Analytics Experian
Seth Grimes President Alta Plana
Mitch Hendricks System Analyst ConAgra Foods
Duncan Houldsworth Global Head of Analytics Hall and Partners
Jean-Paul Isson VP Business Intelligence & Predictive Analytics Monster Worldwide
Evan Sandhaus Semantic Technologist New York Times Company
Mark Smith CEO & EVP Research Ventana Research
David Stodder Editorial Director, Writer and Principle Analyst Perceptive Information Strategies
James Taylor CEO Decision Management Solutions
Bill Tuohig Senior Director, Web Intelligence J.D. Power & Associates
Dave Wells Director of Education eLearningCurve
David White Senior Research Analyst Aberdeen Group
Michael Berry
A founder and principal of Data Miners with twenty years of experience applying data mining techniques to business problems in marketing and CRM. With his colleague, Gordon Linoff, Michael has authored three of the most widely read and respected books on data mining, Data Mining Techniques, Mastering Data Mining, and Mining the Web (all published by John Wiley & Sons). These best sellers in the field have been translated into many languages.

A data mining educator as well as a consultant, Michael is in demand as a keynote speaker and seminar leader in the area of data mining generally and the application of data mining to customer relationship management in particular.

Prior to founding Data Miners, Michael spent 8 years at Thinking Machines Corporation. There he specialized in the application of massively parallel supercomputing techniques to business and marketing applications, including one of the largest database marketing systems of the time. He later ran a data mining laboratory for Naviant Technology Solutions.

Recent projects include clustering towns to create localized content editions of a major metropolitan newspaper and modeling anniversary churn for a wireless telephone company.

Richard Boire
Richard Boire’s experience in database marketing and data mining  dates back to 1983, when he received an MBA from Concordia University in Finance and Statistics. Immediately after, he joined Reader’s Digest where he developed segmentation and modeling routines for all direct marketing programs.

Richard’s progressive career path led him to American Express, where he pioneered predictive modelling technology for all direct marketing programs. He was quickly promoted to senior manager, responsible for credit risk and market behaviour segmentation. As a result of this expanded role, he pioneered the development of models which targeted the acquisition of new customers based on return on investment.

With this experience, Richard formed his own consulting company back in 1994 which is now called the Boire Filler Group , a Canadian leader in offering  analytical and database services to companies seeking solutions  to their existing data mining or database marketing challenges.

Richard is a recognized authority on data mining and is among a very few, select top five experts in this field in Canada, with expertise and knowledge that is difficult, if not impossible to replicate in Canada. He gives seminars on segmentation and data mining for such organizations as Canadian Marketing Association (CMA), Direct Marketing News,Direct Marketing Association Toronto and AARM. His articles have appeared in Direct Marketing News, Strategy Magazine, and Marketing Magazine. He has taught applied statistics and database marketing at Concordia University, Vanier College in Montreal, and  Humber College in Toronto and is currently a part-time lecturer at both George Brown College and Seneca College   in Toronto. Richard is currently  Chair at the CMA’s Database and Technology Committee  where he  has chaired  sub-committees  responsible for  both the 2000 Database and Technology Seminar as well as the  2002 Database and Technology Seminar and has currently chaired the first-ever Customer Profitability Conference  which was  held on Nov.17/2005. He has co-authored  white papers on the following topics: ‘Best Practices in Data Mining’ as well as   ‘Profitability:  The State of Evolution among Canadian Companies’

Chris Bowman
Chris Bowman has been in education since 1969 as a teacher, an assistant principal, a principal, and a supervisor of transportation, technology, federal programs, and middle schools.  He is now retired after serving as the superintendent of schools in the City of Baker School System in Baton Rouge, Louisiana.   I have a picture for you if you choose to use it.  Please correct the fact that I am not the retired superintendent of The Lafourche Parish School System.
John Elder
Dr. John Elder heads a data mining consulting team with offices in Charlottesville, Virginia and Washington DC (www.datamininglab.com).  Founded in 1995, Elder Research, Inc. focuses on Federal, commercial, investment, and security applications of advanced analytics, including text mining, stock selection, image recognition, biometrics, process optimization, cross-selling, drug efficacy, credit scoring, market timing, and fraud detection.

John obtained a BS and MEE in Electrical Engineering from Rice University, and a PhD in Systems Engineering from the University of Virginia, where he’s an adjunct professor teaching Optimization or Data Mining.  Prior to 14 years at ERI, he spent 5 years in aerospace defense consulting, 4 heading research at an investment management firm, and 2 in Rice’s Computational & Applied Mathematics department.

Dr. Elder has authored innovative data mining tools, is a frequent keynote speaker, and was co-chair of the 2009 Knowledge Discovery and Data Mining conference, in Paris.  John’s courses on analysis techniques — taught at dozens of universities, companies, and government labs — are noted for their clarity and effectiveness.  Dr. Elder was honored to serve for 5 years on a panel appointed by the President to guide technology for National Security.  His book on Practical Data Mining, with Bob Nisbet and Gary Miner, was published in May 2009. John is a follower of Christ and the proud father of 5.

Regina Gray
Regina Gray is the Vice President of the Strategic Insight Services team for Experian Marketing Services. With proven results and expertise, Regina leads her team in providing consulting and analytical services to help facilitate clients in their efforts to acquire and retain customers, increase profitability and return-on-investment from marketing campaigns, and communicate effectively.  Regina oversees the teams delivering strategy and planning, customer insights and targeting, and campaign management services.  In addition, she supervises all phases of professional services and analytics projects including providing up-front client consultation, assessing business needs, analyzing complex internal and external data, model delivery, and post-campaign assessment of model performance.

As a nationally recognized speaker at industry conferences and events, Regina informs and enlightens her audiences to the realm of possibilities in marketing strategy. Regina  has executed and supervised consulting and analytical projects for clients across all vertical markets, including, retail, catalog, publishing, financial, travel, fundraising, insurance, consumer packaging goods, telecommunications, cable and health care industries.

Graduating with honors from both, Rutgers University, where she earned her Bachelors Degree in Mathematics, and Montclair State University, where she earned her Masters Degree in Statistics, Regina brings a richly diverse analytical and mathematical view as Vice President of Strategic Insight Services.  Her previous experience teaching mathematics and statistics, both at the high school and undergraduate college level, and as a statistical consultant analyzing and reporting survey results for large university-based programs, has enabled Regina to apply innovative strategies and techniques in obtaining her goals and objectives.  Regina’s commitment and vision make her an admirable advocate with valued Experian clients in their pursuit of individual success.

Duncan Houldsworth
Duncan is the leader of brand and communications research Hall & Partner’s Analytics Practice which is a leading in unlocking the analytical promise of brand research. Previously he held leadership positions in analytics at Information Resources, Inc. (IRI), Experian and Simmons Market Research. He began his career in the UK working for Experian, working on marketing analytics for financial services and retail customers like Barclays Bank, HSBC and Discover Card.

Experian moved him to the US in 2000, where he led analytical product development and consulting, working with companies like Sears, Best Buy, American Express and Citibank. After a spell leading Simmons Marketing Research efforts to develop new analytical products based on their syndicated research data, Duncan took a senior position at IRI leading their analytical technology practice and delivering solutions built around marketing mix modeling and econometric analysis for the major names in the CPG industry such as Kraft, Campbell’s soup and Sara Lee. A graduate of the University of Warwick, UK, with a masters degree in Marketing Research from Nottingham Business School and an Executive MBA from The Kellogg School of Management at Northwestern University.

Jean-Paul Isson
Jean-Paul Isson heads up the Global Business Intelligence & Predictive Analytics group at Monster Worldwide. His team is responsible for Global Customer Scoring/Segmentation, Predictive Modeling and Sales Support Web mining Applications building. Mr. Isson built the Global Business Intelligence team from ground up .The Global BI team has been a pioneer in delivering innovative solutions to Monster and its customers throughout:The Job Posting Benchmarking tool (Optimizer tool) and the Global BI Score segmentation that are now used worldwide to prioritize sales and CRM activities across Monster. JP Isson has more than 16 years in data mining focusing on customer behavior modeling and segmentation. Prior to joining Monster more than 4 years ago, Mr. Isson was Leading the Global Customer Behavior Modeling team at Rogers Wireless (# 1 Canadian Telco Company) where he worked for over 5 years in developing and implementing actionable Customer Life Time Value Models, Churn Predictive Models, Acquisition & Propensity to buy models, Market segmentation, and marketing research solutions for Price plan optimization.Mr. Isson started his professional career as a Statistician at Credit scoring department team at Credit Lyonnais Paris; A France leading Bank Company. When moved to Canada 11 years ago Mr. Isson worked as principal statistician at Universite du Quebec at Hull. Where he worked conjointly with several Professors for various publications including Youth Smoking Behavior Study utilizing the Theory of Planned Behavior (Ajzen Fishbein).

Mr. Isson has a Master Degree in Mathematics and Applied statistics from Universite Pierre & Marie Curie Paris 6 France. He has several articles related to his successful implementation of Business Intelligence And Predictive Analytics solutions; (1to1 media, ItWorld Canada, SAS Magazine, CRM Magazine Scott directories, Direction informatique, It Business Canada): He gave several interviews on Canadian local business news magazines and TV to cover Business Intelligence and Monster Employment Index.

He has been speaker for SAS users groups and at IDC events like the most recent 2009 CIO of the year event in Denmark. He is member of conference board of Canada, and Canadian Statisticians Association interviews on Canadian local business news magazines and TV to cover Business Intelligence and Monster Employment Index. He is member of conference board of Canada, and Canadian Statisticians Association.

Bill Tuohig
Bill Tuohig draws on nearly a decade of market research experience in his role as senior director of product management for J.D. Power and Associates’ Web Intelligence division. His expertise involves mining marketing intelligence from social media, which, in turn, is used by Fortune 1000 companies to drive their marketing programs and product innovation. Bill serves as the bridge between the business and technology sides of the business, contributing to both long-term product strategy and near-term development roadmaps, and providing guidance to the development and packaging of the company’s technological advances in machine learning and natural language processing.

Bill formerly served as the vice president of product management for Umbria, a company that was acquired by J.D. Power and Associates in 2008. Prior to Umbria, he oversaw voice-of-the-customer (VoC) solutions for Informative Inc., helping customers such as Bose, Sprint and Mars with product development, marketing and branding initiatives. Bill also spent several years consulting for prominent firms such as Deloitte Consulting, Coopers & Lybrand and Sapient.

Bill received an MBA from the University of Pennsylvania’s Wharton School of Business and a bachelor’s of science degree in systems engineering from the University of Virginia. He is the co-author of a patent on “Tribe or Group-Based Analysis of Social Media Including Generating Intelligence from a Tribe’s Weblogs or Blogs.”

Dave Wells
Dave Wells is a consultant, mentor, teacher, and author in the field of Business Intelligence (BI). He brings to every endeavor a unique and balanced perspective about the relationships of business and technology. This perspective – refined through a career of more than thirty-five years that encompassed both business and technical roles – helps to align business and information technology in the most effective ways. Dave focuses on strategic, functional, and organizational alignment as the keys to building and sustaining valuable and high-impact BI cultures and systems.

Dave writes a regular column at B-Eye-Network (b-eye-network.com/wells) and is a contributor to other trade publications. He is a co-author of the book BI Strategy: How to Create and Document. Dave is a frequent speaker at industry events ranging from international conferences to local group meetings such as DAMA and CIPS.  His consulting practice focuses on areas such as BI program management, BI program assessment and revitalization, BI architecture, and applied business analytics

Speaker Job Title Company
Michael Berry Founder & Principal Data-Miners Inc
Richard Boire Partner Boire Filler Group
Chris Bowman Former School Superintendent Schools in the City of Baker School System
John Elder Chief Scientist Elder Research Inc
Usama Fayyad CEO Open Insights
Regina Gray Vice President of Strategic Insight Services, Experian Marketing Analytics Experian
Seth Grimes President Alta Plana
Mitch Hendricks System Analyst ConAgra Foods
Duncan Houldsworth Global Head of Analytics Hall and Partners
Jean-Paul Isson VP Business Intelligence & Predictive Analytics Monster Worldwide
Evan Sandhaus Semantic Technologist New York Times Company
Mark Smith CEO & EVP Research Ventana Research
David Stodder Editorial Director, Writer and Principle Analyst Perceptive Information Strategies
James Taylor CEO Decision Management Solutions
Bill Tuohig Senior Director, Web Intelligence J.D. Power & Associates
Dave Wells Director of Education eLearningCurve
David White Senior Research Analyst Aberdeen Group
Michael Berry
A founder and principal of Data Miners with twenty years of experience applying data mining techniques to business problems in marketing and CRM. With his colleague, Gordon Linoff, Michael has authored three of the most widely read and respected books on data mining, Data Mining Techniques, Mastering Data Mining, and Mining the Web (all published by John Wiley & Sons). These best sellers in the field have been translated into many languages.

A data mining educator as well as a consultant, Michael is in demand as a keynote speaker and seminar leader in the area of data mining generally and the application of data mining to customer relationship management in particular.

Prior to founding Data Miners, Michael spent 8 years at Thinking Machines Corporation. There he specialized in the application of massively parallel supercomputing techniques to business and marketing applications, including one of the largest database marketing systems of the time. He later ran a data mining laboratory for Naviant Technology Solutions.

Recent projects include clustering towns to create localized content editions of a major metropolitan newspaper and modeling anniversary churn for a wireless telephone company.

Richard Boire
Richard Boire’s experience in database marketing and data mining  dates back to 1983, when he received an MBA from Concordia University in Finance and Statistics. Immediately after, he joined Reader’s Digest where he developed segmentation and modeling routines for all direct marketing programs.

Richard’s progressive career path led him to American Express, where he pioneered predictive modelling technology for all direct marketing programs. He was quickly promoted to senior manager, responsible for credit risk and market behaviour segmentation. As a result of this expanded role, he pioneered the development of models which targeted the acquisition of new customers based on return on investment.

With this experience, Richard formed his own consulting company back in 1994 which is now called the Boire Filler Group , a Canadian leader in offering  analytical and database services to companies seeking solutions  to their existing data mining or database marketing challenges.

Richard is a recognized authority on data mining and is among a very few, select top five experts in this field in Canada, with expertise and knowledge that is difficult, if not impossible to replicate in Canada. He gives seminars on segmentation and data mining for such organizations as Canadian Marketing Association (CMA), Direct Marketing News,Direct Marketing Association Toronto and AARM. His articles have appeared in Direct Marketing News, Strategy Magazine, and Marketing Magazine. He has taught applied statistics and database marketing at Concordia University, Vanier College in Montreal, and  Humber College in Toronto and is currently a part-time lecturer at both George Brown College and Seneca College   in Toronto. Richard is currently  Chair at the CMA’s Database and Technology Committee  where he  has chaired  sub-committees  responsible for  both the 2000 Database and Technology Seminar as well as the  2002 Database and Technology Seminar and has currently chaired the first-ever Customer Profitability Conference  which was  held on Nov.17/2005. He has co-authored  white papers on the following topics: ‘Best Practices in Data Mining’ as well as   ‘Profitability:  The State of Evolution among Canadian Companies’

Chris Bowman
Chris Bowman has been in education since 1969 as a teacher, an assistant principal, a principal, and a supervisor of transportation, technology, federal programs, and middle schools.  He is now retired after serving as the superintendent of schools in the City of Baker School System in Baton Rouge, Louisiana.   I have a picture for you if you choose to use it.  Please correct the fact that I am not the retired superintendent of The Lafourche Parish School System.
John Elder
Dr. John Elder heads a data mining consulting team with offices in Charlottesville, Virginia and Washington DC (www.datamininglab.com).  Founded in 1995, Elder Research, Inc. focuses on Federal, commercial, investment, and security applications of advanced analytics, including text mining, stock selection, image recognition, biometrics, process optimization, cross-selling, drug efficacy, credit scoring, market timing, and fraud detection.

John obtained a BS and MEE in Electrical Engineering from Rice University, and a PhD in Systems Engineering from the University of Virginia, where he’s an adjunct professor teaching Optimization or Data Mining.  Prior to 14 years at ERI, he spent 5 years in aerospace defense consulting, 4 heading research at an investment management firm, and 2 in Rice’s Computational & Applied Mathematics department.

Dr. Elder has authored innovative data mining tools, is a frequent keynote speaker, and was co-chair of the 2009 Knowledge Discovery and Data Mining conference, in Paris.  John’s courses on analysis techniques — taught at dozens of universities, companies, and government labs — are noted for their clarity and effectiveness.  Dr. Elder was honored to serve for 5 years on a panel appointed by the President to guide technology for National Security.  His book on Practical Data Mining, with Bob Nisbet and Gary Miner, was published in May 2009. John is a follower of Christ and the proud father of 5.

Regina Gray
Regina Gray is the Vice President of the Strategic Insight Services team for Experian Marketing Services. With proven results and expertise, Regina leads her team in providing consulting and analytical services to help facilitate clients in their efforts to acquire and retain customers, increase profitability and return-on-investment from marketing campaigns, and communicate effectively.  Regina oversees the teams delivering strategy and planning, customer insights and targeting, and campaign management services.  In addition, she supervises all phases of professional services and analytics projects including providing up-front client consultation, assessing business needs, analyzing complex internal and external data, model delivery, and post-campaign assessment of model performance.

As a nationally recognized speaker at industry conferences and events, Regina informs and enlightens her audiences to the realm of possibilities in marketing strategy. Regina  has executed and supervised consulting and analytical projects for clients across all vertical markets, including, retail, catalog, publishing, financial, travel, fundraising, insurance, consumer packaging goods, telecommunications, cable and health care industries.

Graduating with honors from both, Rutgers University, where she earned her Bachelors Degree in Mathematics, and Montclair State University, where she earned her Masters Degree in Statistics, Regina brings a richly diverse analytical and mathematical view as Vice President of Strategic Insight Services.  Her previous experience teaching mathematics and statistics, both at the high school and undergraduate college level, and as a statistical consultant analyzing and reporting survey results for large university-based programs, has enabled Regina to apply innovative strategies and techniques in obtaining her goals and objectives.  Regina’s commitment and vision make her an admirable advocate with valued Experian clients in their pursuit of individual success.

Duncan Houldsworth
Duncan is the leader of brand and communications research Hall & Partner’s Analytics Practice which is a leading in unlocking the analytical promise of brand research. Previously he held leadership positions in analytics at Information Resources, Inc. (IRI), Experian and Simmons Market Research. He began his career in the UK working for Experian, working on marketing analytics for financial services and retail customers like Barclays Bank, HSBC and Discover Card.

Experian moved him to the US in 2000, where he led analytical product development and consulting, working with companies like Sears, Best Buy, American Express and Citibank. After a spell leading Simmons Marketing Research efforts to develop new analytical products based on their syndicated research data, Duncan took a senior position at IRI leading their analytical technology practice and delivering solutions built around marketing mix modeling and econometric analysis for the major names in the CPG industry such as Kraft, Campbell’s soup and Sara Lee. A graduate of the University of Warwick, UK, with a masters degree in Marketing Research from Nottingham Business School and an Executive MBA from The Kellogg School of Management at Northwestern University.

Jean-Paul Isson
Jean-Paul Isson heads up the Global Business Intelligence & Predictive Analytics group at Monster Worldwide. His team is responsible for Global Customer Scoring/Segmentation, Predictive Modeling and Sales Support Web mining Applications building. Mr. Isson built the Global Business Intelligence team from ground up .The Global BI team has been a pioneer in delivering innovative solutions to Monster and its customers throughout:The Job Posting Benchmarking tool (Optimizer tool) and the Global BI Score segmentation that are now used worldwide to prioritize sales and CRM activities across Monster. JP Isson has more than 16 years in data mining focusing on customer behavior modeling and segmentation. Prior to joining Monster more than 4 years ago, Mr. Isson was Leading the Global Customer Behavior Modeling team at Rogers Wireless (# 1 Canadian Telco Company) where he worked for over 5 years in developing and implementing actionable Customer Life Time Value Models, Churn Predictive Models, Acquisition & Propensity to buy models, Market segmentation, and marketing research solutions for Price plan optimization.Mr. Isson started his professional career as a Statistician at Credit scoring department team at Credit Lyonnais Paris; A France leading Bank Company. When moved to Canada 11 years ago Mr. Isson worked as principal statistician at Universite du Quebec at Hull. Where he worked conjointly with several Professors for various publications including Youth Smoking Behavior Study utilizing the Theory of Planned Behavior (Ajzen Fishbein).

Mr. Isson has a Master Degree in Mathematics and Applied statistics from Universite Pierre & Marie Curie Paris 6 France. He has several articles related to his successful implementation of Business Intelligence And Predictive Analytics solutions; (1to1 media, ItWorld Canada, SAS Magazine, CRM Magazine Scott directories, Direction informatique, It Business Canada): He gave several interviews on Canadian local business news magazines and TV to cover Business Intelligence and Monster Employment Index.

He has been speaker for SAS users groups and at IDC events like the most recent 2009 CIO of the year event in Denmark. He is member of conference board of Canada, and Canadian Statisticians Association interviews on Canadian local business news magazines and TV to cover Business Intelligence and Monster Employment Index. He is member of conference board of Canada, and Canadian Statisticians Association.

Bill Tuohig
Bill Tuohig draws on nearly a decade of market research experience in his role as senior director of product management for J.D. Power and Associates’ Web Intelligence division. His expertise involves mining marketing intelligence from social media, which, in turn, is used by Fortune 1000 companies to drive their marketing programs and product innovation. Bill serves as the bridge between the business and technology sides of the business, contributing to both long-term product strategy and near-term development roadmaps, and providing guidance to the development and packaging of the company’s technological advances in machine learning and natural language processing.

Bill formerly served as the vice president of product management for Umbria, a company that was acquired by J.D. Power and Associates in 2008. Prior to Umbria, he oversaw voice-of-the-customer (VoC) solutions for Informative Inc., helping customers such as Bose, Sprint and Mars with product development, marketing and branding initiatives. Bill also spent several years consulting for prominent firms such as Deloitte Consulting, Coopers & Lybrand and Sapient.

Bill received an MBA from the University of Pennsylvania’s Wharton School of Business and a bachelor’s of science degree in systems engineering from the University of Virginia. He is the co-author of a patent on “Tribe or Group-Based Analysis of Social Media Including Generating Intelligence from a Tribe’s Weblogs or Blogs.”

Dave Wells
Dave Wells is a consultant, mentor, teacher, and author in the field of Business Intelligence (BI). He brings to every endeavor a unique and balanced perspective about the relationships of business and technology. This perspective – refined through a career of more than thirty-five years that encompassed both business and technical roles – helps to align business and information technology in the most effective ways. Dave focuses on strategic, functional, and organizational alignment as the keys to building and sustaining valuable and high-impact BI cultures and systems.

Dave writes a regular column at B-Eye-Network (b-eye-network.com/wells) and is a contributor to other trade publications. He is a co-author of the book BI Strategy: How to Create and Document. Dave is a frequent speaker at industry events ranging from international conferences to local group meetings such as DAMA and CIPS.  His consulting practice focuses on areas such as BI program management, BI program assessment and revitalization, BI architecture, and applied business analytics

The Big Data Summit Agenda

Here is the agenda for the Big Data Summit in NY, which I am planning to attend. Registration is free so feel …er free to drop in.

the agenda for the evening. the event will begin promptly at 6 p.m

  • Welcome Reception: Snacks and drinks
  • A Plan for Large Scale Data Analytics: How to Utilize Aster nCluster and Hadoop in a Symbiotic Relationship to Support Processing in Excess of 100 Billion Rows Per Month
    – Michael Brown,
    EVP, Software Engineering, and Will Duckworth, Director, Software Engineering, comScore, Inc.
  • Making Sense of Hadoop – Its Fit With Data Warehouses
    – Colin White, President and Founder of BI Research
  • MapReduce Inside a Database System – When and How
    Case Studies from ShareThis, Specific Media, and Others
    – Tasso Argyros, Chief Technology Officer and Co-Founder of Aster Data
  • Large-Scale Analytics at LinkedIn
    – Jonathan Goldman, Former Principal Scientist at LinkedIn
  • Networking Mixer: Beer, wine, hot hors d’oeuvres

Location:
The Roosevelt Hotel
45 East 45th Street
New York, NY 10017

For more information click here: Big Data Summit

For Additional Info contact

Ryan
ryan.garrett@asterdata.com
(cell) 415-609-4745


Interview Thomas C. Redman Author Data Driven

Here is an interview with Tom Redman, author of Data Driven. Among the first to recognize the need for high-quality data in the information age, Dr. Redman established the AT&T Bell Laboratories Data Quality Lab in 1987 and led it until 1995. He is the author of four books, two patents and leads his own consulting group. In many respects the “Data Doc’ as his nickname is- is also the father of Data Quality Evangelism.

tom redman

Ajay- Describe your career as a science student to an author of science and strategy books.
Redman: I took the usual biology, chemistry, and physics classes in college.  And I worked closely with oceanographers in graduate school.  More importantly, I learned directly from two masters.  First, was Dr. Basu, who was at Florida State when I was.  He thought more deeply and clearly about the nature of data and what we can learn from them than anyone I’ve met since.  And second is people in the Bell Labs’ community who were passionate about making communications better. What I learned there was you don’t always need “scientific proof” to mover forward.


Ajay- What kind of bailout do you think the Government can give to the importance of science education in this country.

Redman: I don’t think the government should bail science education per se. Science departments should compete for students just like the English and anthropology departments do.  At the same time, I do think the government should support some audacious goals, such as slowing global warming or energy independence.  These could well have the effect of increasing demand for scientists and science education.

Ajay- Describe your motivations for writing your book Data Driven-Profiting from your most important business asset.

Redman: Frankly I was frustrated.  I’ve spent the last twenty years on data quality and organizations that improve gain enormous benefit.  But so few do.  I set out to figure out why that was and what to do about it.

Ajay- What can various segments of readers learn from this book-
a college student, a manager, a CTO, a financial investor and a business intelligence vendor.

Redman: I narrowed my focus to the business leader and I want him or her to take away three points.  First, data should be managed as aggressively and professionally as your other assets.  Second, they are unlike other assets in some really important ways and you’ll have to learn how to manage them.  Third, improving quality is a great place to start.

Ajay- Garbage in Garbage out- How much money and time do you believe is given to data quality in data projects.

Redman:   By this I assume you mean data warehouse, BI, and other tech projects.  And the answer is “not near enough.”  And it shows in the low success rate of those projects.

Ajay-Consider a hypothetical scenario- Instead of creating and selling fancy algorithms , a business intelligence vendor uses simple Pareto principle to focus on data quality and design during data projects. How successful do you think that would be?

Redman: I can’t speak to the market, but I do know that if organizations are loaded with problems and opportunities.  They could make great progress on most important ones if could clearly state the problem and bring high-quality data and simple techniques to bear.  But there are a few that require high-powered algorithms.  Unfortunately those require high-quality data as well.

Ajay- How and when did you first earn the nickname “Data Doc”. Who gave it to you and would you rather be known by some other names.

Redman: One of my clients started calling me that about a dozen years ago.  But I felt uncomfortable and didn’t put it on my business card until about five years ago.  I’ve grown to really like it.

Ajay- The pioneering work at AT & T Bell laboratories and at Palo Alto laboratory- who do you think are the 21st century successors of these laboratories. Do you think lab work has become too commercialized even in respected laboratories like Microsoft Research and Google’s research in mathematics.

Redman: I don’t know.  It may be that the circumstances of the 20th century were conducive to such labs and they’ll never happen again.  You have to remember two things about Bell Labs.  First, was the cross-fertilization that stemmed from having leading-edge work in dozens of areas.  Second, the goal is not just invention, but innovation, the end-to-end process which starts with invention and ends with products in the market.  AT&T, Bell Labs’ parent, was quite good at turning invention to product.  These points lead me to think that the commercial aspect of laboratory work is so much the better.

Ajay-What does ” The Data Doc” do to relax and maintain a work life balance. How important do you think is work-life balance for creative people and researchers.

Redman: I think everyone needs a balance, not just creative people.  Two things have made this easier for me.  First, I like what I do.  A lot of days it is hard to distinguish “work” from “play.”  Second is my bride of thirty-three years, Nancy.  She doesn’t let me go overboard too often.

Biography-

Dr. Thomas C. Redman is President of Navesink Consulting Group, based in Little Silver, NJ.  Known by many as “the Data Doc” (though “Tom” works too), Dr. Redman was the first to extend quality principles to data and information.  By advancing the body of knowledge, his innovations have raised the standard of data quality in today’s information-based economy.

Dr. Redman conceived the Data Quality Lab at AT&T Bell Laboratories in 1987 and led it until 1995.  There he and his team developed the first methods for improving data quality and applied them to important business problems, saving AT&T tens of millions of dollars. He started Navesink Consulting Group in 1996 to help other organizations improve their data, while simultaneously lowering operating costs, increasing revenues, and improving customer satisfaction and business relationships.

Since then – armed with proven, repeatable tools, techniques and practical advice – Dr. Redman has helped clients in fields ranging from telecommunications, financial services, and dot coms, to logistics, consumer goods, and government agencies. His work has helped organizations understand the importance of high-quality data, start their data quality programs, and also save millions of dollars per year.

Dr. Redman holds a Ph.D. in statistics from Florida State University.  He is an internationally renowned lecturer and the author of numerous papers, including “Data Quality for Competitive Advantage” (Sloan Management Review, Winter 1995) and “Data as a Resource: Properties, Implications, and Prescriptions” (Sloan Management Review, Fall 1998). He has written four books: Data Driven (Harvard Business School Press, 2008), Data Quality: The Field Guide (Butterworth-Heinemann, 2001), Data Quality for the Information Age (Artech, 1996) and Data Quality: Management and Technology (Bantam, 1992). He was also invited to contribute two chapters to Juran’s Quality Handbook, Fifth Edition (McGraw Hill, 1999). Dr. Redman holds two patents.

About Navesink Consulting Group (http://www.dataqualitysolutions.com/ )

Navesink Consulting Group was formed in 1996 and was the first company to focus on data quality.  Led by Dr. Thomas Redman, “the Data Doc” and former AT&T Bell Labs director, we have helped clients understand the importance of high-quality data, start their data quality programs, and save millions of dollars per year.

Our approach is not a cobbling together of ill-fitting ideas and assertions – it is based on rigorous scientific principles that have been field-tested in many industries, including financial services (see more under “Our clients”).  We offer no silver bullets; we don’t even offer shortcuts. Improving data quality is hard work.

But with a dedicated effort, you should expect order-of-magnitude improvements and, as a direct result, an enormous boost in your ability to manage risk, steer a course through the crisis, and get back on the growth curve.

Ultimately, Navesink Consulting brings tangible, sustainable improvement in your business performance as a result of superior quality data.

SPSS gets Directions

A link to the Predictive Analytic Conference by SPSS ( the first after the Big Blue announcement) at http://www.spss.com/spssdirections/na/index.htm

Should be interesting for existing clients and SPSS watchers.

spss