Apache MADlib is an open-source, SQL-based library for scalable in-database analytics, and it supports Greenplum Database. The library gives data scientists distributed implementations of mathematical, statistical, and machine learning methods, along with many utilities for data transformation.
New utilities have been added in the recent MADlib 1.9.1 release, including:
Pivot: a data summarization tool that can perform basic OLAP-type operations
Sessionization: time-oriented session reconstruction on a data set comprising a sequence of events
Prediction metrics: a set of metrics for evaluating the quality of a model's predictions
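To make the three utilities concrete, here is a minimal sketch of what each one computes, written in plain Python rather than SQL. It is not MADlib's implementation or API: MADlib runs these operations inside the database, and the function names, argument names, and sample data below are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def pivot_sum(rows, index, pivot_col, value_col):
    """Pivot long-format rows into one dict per index value.

    Values that land in the same (index, pivot) cell are summed,
    which is one of the basic OLAP-style aggregations.
    """
    table = defaultdict(lambda: defaultdict(float))
    for row in rows:
        table[row[index]][row[pivot_col]] += row[value_col]
    return {key: dict(cells) for key, cells in table.items()}


def sessionize(events, max_gap):
    """Label each (user, timestamp) event with a per-user session number.

    A new session starts when the gap since the user's previous event
    exceeds max_gap (same units as the timestamps).
    """
    last_seen = {}
    session_no = {}
    labelled = []
    for user, ts in sorted(events):
        if user not in last_seen or ts - last_seen[user] > max_gap:
            session_no[user] = session_no.get(user, 0) + 1
        last_seen[user] = ts
        labelled.append((user, ts, session_no[user]))
    return labelled


def mean_abs_error(predicted, observed):
    """Mean absolute error, one common prediction-quality metric."""
    return sum(abs(p - o) for p, o in zip(predicted, observed)) / len(predicted)


# Hypothetical sample data.
rows = [
    {"id": 1, "piv": "a", "val": 2.0},
    {"id": 1, "piv": "a", "val": 3.0},
    {"id": 1, "piv": "b", "val": 1.0},
    {"id": 2, "piv": "a", "val": 4.0},
]
events = [("u1", 0), ("u1", 30), ("u1", 200), ("u2", 10)]

pivoted = pivot_sum(rows, "id", "piv", "val")
sessions = sessionize(events, max_gap=60)
mae = mean_abs_error([1.0, 2.0, 4.0], [1.0, 3.0, 2.0])
```

In this sample, the third event for `u1` arrives 170 time units after the second, so with `max_gap=60` it opens a second session for that user.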