Tag: Big Data
PMML Plugin for Greenplum now available
From a press release from Zementis.
, the Universal PMML Plug-in for in-database scoring. Available now for the EMC Greenplum Database, a high-performance massively parallel processing (MPP) database, the plug-in leverages the Predictive Model Markup Language (PMML) to execute predictive models directly within EMC Greenplum, for highly optimized in-database scoring.
Developed by the Data Mining Group (DMG), PMML is supported by all major data mining vendors, e.g., IBM SPSS, SAS, Teradata, FICO, STASTICA, Microstrategy, TIBCO and Revolution Analytics as well as open source tools like R, KNIME and RapidMiner. With PMML, models built in any of these data mining tools can now instantly be deployed in the EMC Greenplum database. The net result is the ability to leverage the power of standards-based predictive analytics on a massive scale, right where the data resides.
|
|
|
Related Articles
- Creating New Capabilities With An Analytics Lab (chucksblog.emc.com)
- EMC Greenplum releases Community Edition of MPP database product, big data analysis gets cheaper still (zdnet.com)
- EMC lets go of Greenplum Community Edition (go.theregister.com)
- Greenplum, Big Data, and an Open Source Card (arnoldit.com)
- EMC launches free edition of Greenplum database (zdnet.com)
IBM and Revolution team to create new in-database R
From the Press Release at http://www.revolutionanalytics.com/news-events/news-room/2011/revolution-analytics-netezza-partnership.php
Under the terms of the agreement, the companies will work together to create a version of Revolution’s software that takes advantage of IBM Netezza’s i-class technology so that Revolution R Enterprise can run in-database in an optimal fashion.
About IBM
For information about IBM Netezza, please visit: http://www.netezza.com.
For Information on IBM Information Management, please visit: http://www.ibm.com/software/data/information-on-demand/
For information on IBM Business Analytics, please visit the online press kit: http://www.ibm.com/press/us/en/presskit/27163.wss
Follow IBM and Analytics on Twitter: http://twitter.com/ibmbizanalytics
Follow IBM analytics on Tumblr: http://smarterplanet.tumblr.com/tagged/new_intelligence
IBM YouTube Analytics Channel: http://www.youtube.com/user/ibmbusinessanalytics
For information on IBM Smarter Systems: http://www-03.ibm.com/systems/smarter/
About Revolution Analytics
Revolution Analytics is the leading commercial provider of software and services based on the open source R project for statistical computing. Led by predictive analytics pioneer Norman Nie, the company brings high performance, productivity and enterprise readiness to R, the most powerful statistics language in the world. The company’s flagship Revolution R product is designed to meet the production needs of large organizations in industries such as finance, life sciences, retail, manufacturing and media. Used by over 2 million analysts in academia and at cutting-edge companies such as Google, Bank of America and Acxiom, R has emerged as the standard of innovation in statistical analysis. Revolution Analytics is committed to fostering the continued growth of the R community through sponsorship of the Inside-R.org community site, funding worldwide R user groups and offers free licenses of Revolution R Enterprise to everyone in academia.
Netezza, an IBM Company, is the global leader in data warehouse, analytic and monitoring appliances that dramatically simplify high-performance analytics across an extended enterprise. IBM Netezza’s technology enables organizations to process enormous amounts of captured data at exceptional speed, providing a significant competitive and operational advantage in today’s data-intensive industries, including digital media, energy, financial services, government, health and life sciences, retail and telecommunications.
The IBM Netezza TwinFin® appliance is built specifically to analyze petabytes of detailed data significantly faster than existing data warehouse options, and at a much lower total cost of ownership. It stores, filters and processes terabytes of records within a single unit, analyzing only the relevant information for each query.
Using Revolution R Enterprise & Netezza Together
Revolution Analytics and IBM Netezza have announced a partnership to integrate Revolution R Enterprise and the IBM Netezza TwinFin Data Warehouse Appliance. For the first time, customers seeking to run high performance and full-scale predictive analytics from within a data warehouse platform will be able to directly leverage the power of the open source R statistics language. The companies are working together to create a version of Revolution’s software that takes advantage of IBM Netezza’s i-class technology so that Revolution R Enterprise can run in-database in an optimal fashion.
This partnership integrates Revolution R Enterprise with IBM Netezza’s high performance data warehouse and advanced analytics platform to help organizations combat the challenges that arise as complexity and the scale of data grow. By moving the analytics processing next to the data, this integration will minimize data movement – a significant bottleneck, especially when dealing with “Big Data”. It will deliver high performance on large scale data, while leveraging the latest innovations in analytics.
With Revolution R Enterprise for IBM Netezza, advanced R computations are available for rapid analysis of hundreds of terabyte-class data volumes — and can deliver 10-100x performance improvements at a fraction of the cost compared to traditional analytics vendors.
Additional Resources
- Whitepapers:
- On-Demand Webinar: Revolution R Enterprise: 100% R and More
- Free Downloads: Revolution R Community
- Product Information:
Related Articles
- IBM’s bet: Commerce can be just as big as analytics (zdnet.com)
- Revolution Analytics announces partnership with IBM Netezza (revolutionanalytics.com)
- Netezza Chief Talks About “Formative” PTC Days, IBM Deal History, and the Future of Big Data (xconomy.com)
- Gartner Ranks Data Warehousing Leaders (informationweek.com)
- IBM Acquires Netezza in $1.7 Billion Deal (dailyfinance.com)
- HP To Acquire Analytics Specialist Vertica (consultramy.wordpress.com)
- SAP, IBM Team up on In-memory Analytics (pcworld.com)
TeraData buys AsterData for 260+ million $
This just in! Big party in San Carlos this weekend.
Teradata is acquiring Aster Data‘s business, including its intellectual property and technology product line, through a merger transaction. Teradata plans to support Aster Data’s customers and integrate its employees immediately upon completion of the acquisition, which is expected to occur in the second quarter of 2011. Teradata acquired an 11 percent ownership interest in Aster Data in September 2010, and has agreed to pay an additional $263 million for the remaining ownership interest, net of debt and other expenses. In addition, through this acquisition, Teradata will obtain approximately $21 million of cash which Aster Data is expected to have on its balance sheet at closing.
http://www.asterdata.com/news/110303-Teradata-to-Acquire-Aster-Data.php
Related Articles
- Big Pay Day For Big Data. Teradata Buys Aster Data For $263 Million (techcrunch.com)
- A Story about Aster Data (robklopp.wordpress.com)
- Teradata Buys Aster Data, Boosts ‘big Data’ Wares (pcworld.com)
- Teradata to Acquire Rest of Aster Data (online.wsj.com)
Zementis partners with R Analytics Vendor- Revo
Just got a PR email from Michael Zeller,CEO , Zementis annoucing Zementis (ADAPA) and Revolution Analytics just partnered up.
Is this something substantial or just time-sharing http://bi.cbronline.com/news/sas-ceo-says-cep-open-source-and-cloud-bi-have-limited-appeal or a Barney Partnership (http://www.dbms2.com/2008/05/08/database-blades-are-not-what-they-used-to-be/)
Summary- Thats cloud computing scoring of models on EC2 (Zementis) partnering with the actual modeling software in R (Revolution Analytics RevoDeployR)
See previous interviews with both Dr Zeller at https://decisionstats.com/2009/02/03/interview-michael-zeller-ceozementis/ ,https://decisionstats.com/2009/05/07/interview-ron-ramos-zementis/ and https://decisionstats.com/2009/10/05/interview-michael-zellerceo-zementis-on-pmml/)
and Revolution guys at https://decisionstats.com/2010/08/03/q-a-with-david-smith-revolution-analytics/
and https://decisionstats.com/2009/05/29/interview-david-smith-revolution-computing/
–
strategic partnership with Revolution Analytics, the leading commercial provider of software and support for the popular open source R statistics language. With this partnership, predictive models developed on Revolution R Enterprise are now accessible for real-time scoring through the ADAPA Decisioning Engine by Zementis. ADAPA is an extremely fast and scalable predictive platform. Models deployed in ADAPA are automatically available for execution in real-time and batch-mode as Web Services. ADAPA allows Revolution R Enterprise to leverage the Predictive Model Markup Language (PMML) for better decision management. With PMML, models built in R can be used in a wide variety of real-world scenarios without requiring laborious or expensive proprietary processes to convert them into applications capable of running on an execution system.
“By partnering with Zementis, Revolution Analytics is building an end-to-end solution for moving enterprise-level predictive R models into the execution environment,” said Jeff Erhardt, Revolution Analytics Chief Operation Officer. “With Zementis, we are eliminating the need to take R applications apart and recode, retest and redeploy them in order to obtain desirable results.”
Got demo? Yes, we do! Revolution Analytics and Zementis have put together a demo which combines the building of models in R with automatic deployment and execution in ADAPA. It uses Revolution Analytics’ RevoDeployR, a new Web Services framework that allows for data analysts working in R to publish R scripts to a server-based installation of Revolution R Enterprise.
Action Items:
- Try our INTERACTIVE DEMO
- DOWNLOAD the white paper
- Try the ADAPA FREE TRIAL
RevoDeployR & ADAPA allow for real-time analysis and predictions from R to be effectively used by existing Excel spreadsheets, BI dashboards and Web-based applications, all in real-time.
Predictive analytics with RevoDeployR from Revolution Analytics and ADAPA from Zementis put model building and real-time scoring into a league of their own. Seriously!
Related Articles
- Revolution R Enterprise 4.2 now available (revolutionanalytics.com)
- Enterprise Startup Spotlight: Revolution Analytics, Taking on SAS, SPSS (readwriteweb.com)
- Gartner predicts business intelligence revolution (v3.co.uk)
WPS Version 2.5.1 Released – can still run SAS language/data and R

However this is what Phil Rack the reseller is quoting on http://www.minequest.com/Pricing.html
Windows Desktop Price: $884 on 32-bit Windows and $1,149 on 64-bit Windows.
The Bridge to R is available on the Windows platforms and is available for free to customers who
license WPS through MineQuest,LLC. Companies and organizations outside of North America
may purchase a license for the Bridge to R which starts at $199 per desktop or $599 per serverWindows Server Price: $1,903 per logical CPU for 32-bit and $2,474 for 64-bit.
Note that Linux server versions are available but do not yet support the Eclipse IDE and are
command line only
WPS sure seems going well-but their pricing is no longer fixed and on the home website, you gotta fill a form. Ditt0 for the 30 day free evaluation
http://www.teamwpc.co.uk/products/wps/modules/core
Data File Formats
The table below provides a summary of data formats presently supported by the WPS Core module.
| Data File Format | Un-Compressed Data |
Compressed Data |
||
|---|---|---|---|---|
| Read | Write | Read | Write | |
| SD2 (SAS version 6 data set) | ![]() |
![]() |
||
| SAS7BDAT (SAS version 7 data set) | ![]() |
![]() |
![]() |
|
| SAS7BDAT (SAS version 8 data set) | ![]() |
![]() |
![]() |
|
| SAS7BDAT (SAS version 9 data set) | ![]() |
![]() |
![]() |
|
| SASSEQ (SAS version 8/9 sequential file) | ![]() |
![]() |
![]() |
|
| V8SEQ (SAS version 8 sequential file) | ![]() |
![]() |
![]() |
|
| V9SEQ (SAS version 9 sequential file) | ![]() |
![]() |
![]() |
|
| WPD (WPS native data set) | ![]() |
![]() |
![]() |
![]() |
| WPDSEQ (WPS native sequential file) | ![]() |
![]() |
||
| XPORT (transport format) | ![]() |
![]() |
||
Additional access to EXCEL, SPSS and dBASE files is supported by utilising the WPS Engine for DB Filesmodule.
and they have a new product release on Valentine Day 2011 (oh these Europeans!)
From the press release at http://www.teamwpc.co.uk/press/wps2_5_1_released
WPS Version 2.5.1 Released
New language support, new data engines, larger datasets, improved scalabilityLONDON, UK – 14 February 2011 – World Programming today released version 2.5.1 of their WPS software for workstations, servers and mainframes.
WPS is a competitively priced, high performance, highly scalable data processing and analytics software product that allows users to execute programs written in the language of SAS. WPS is supported on a wide variety of hardware and operating system platforms and can connect to and work with many types of data with ease. The WPS user interface (Workbench) is frequently praised for its ease of use and flexibility, with the option to include numerous third-party extensions.
This latest version of the software has the ability to manipulate even greater volumes of data, removing the previous 2^31 (2 billion) limit on number of observations.
Complimenting extended data processing capabilities, World Programming has worked hard to boost the performance, scalability and reliability of the WPS software to give users the confidence they need to run heavy workloads whilst delivering maximum value from available computer power.
WPS version 2.5.1 offers additional flexibility with the release of two new data engines for accessing Greenplum and SAND databases. WPS now comes with eleven data engines and can access a huge range of commonly used and industry-standard file-formats and databases.
Support in WPS for the language of SAS continues to expand with more statistical procedures, data step functions, graphing controls and many other language items and options.
WPS version 2.5.1 is available as a free upgrade to all licensed users of WPS.
Summary of Main New Features:
- Supporting Even Larger Datasets
WPS is now able to process very large data sets by lifting completely the previous size limit of 2^31 observations.- Performance and Scalability Boosted
Performance and scalability improvements across the board combine to ensure even the most demanding large and concurrent workloads are processed efficiently and reliably.- More Language Support
WPS 2.5.1 continues the expansion of it’s language support with over 70 new language items, including new Procedures, Data Step functions and many other language items and options.- Statistical Analysis
The procedure support in WPS Statistics has been expanded to include PROC CLUSTER and PROC TREE.- Graphical Output
The graphical output from WPS Graphing has been expanded to accommodate more configurable graphics.- Hash Tables
Support is now provided for hash tables.- Greenplum®
A new WPS Engine for Greenplum provides dedicated support for accessing the Greenplum database.- SAND®
A new WPS Engine for SAND provides dedicated support for accessing the SAND database.- Oracle®
Bulk loading support now available in the WPS Engine for Oracle.- SQL Server®
To enhance existing SQL Server database access, a new SQLSERVR (please note spelling) facility in the ODBC engine.More Information:
Existing Users should visit www.teamwpc.co.uk/support/wps/release where you can download a readme file containing more information about all the new features and fixes in WPS 2.5.1.
New Users should visit www.teamwpc.co.uk/products/wps where you can explore in more detail all the features available in WPS or request a free evaluation.
and from http://www.teamwpc.co.uk/products/wps/data it seems they are going on the BIG DATA submarine as well-
Data Support
Extremely Large Data Size Handling
WPS is now able to handle extremely large data sets now that the previous limit of 2^31 observations has been lifted.
Access Standard Databases
Use I/O Features in WPS Core
- CLIPBOARD (Windows only)
- DDE (Windows only)
- EMAIL (via SMTP or MAPI)
- FTP
- HTTP
- PIPE (Windows and UNIX only)
- SOCKET
- STDIO
- URL
Use Standard Data File Formats
- dBase Files
- Flat Files
- Microsoft Access (via OLEDB or ODBC)
- Microsoft Excel
- SAS Data Set Files
- SAS Transport Files
- Sequential Files
- SPSS Data Files
- VSAM Files
- WPD (native WPS data set file)
Related Articles
- Revolution R Enterprise 4.2 now available (revolutionanalytics.com)
- EMC woos developers with free Greenplum Community Edition (v3.co.uk)
- How Vendors Are Lowering Big Data Barriers (nytimes.com)
OK Cupid Data Visualization- Flow Chart to your Heart
Quite appropriate on a V Day, OK Cupid remains quite innovative how they use data (in this questionnaire data)
Related Articles
- OkCupid: Finding your Valentine with R (revolutionanalytics.com)
- OkCupid Demystifies Dating with Big Data (gigaom.com)
- OkCupid’s Love Math Doesn’t Solve The Equation [They Blinded Us With Science] (jezebel.com)
- OK Cupid Finds That It’s Our Differences That Make Us Attractive (Aw) (thegloss.com)
- Match.com Buys OkCupid for $50M (appscout.com)



