## Project SAHDE:

Scalable Adaptive Histogram Density Estimation

This programme is partly supported by:

- databricks academic partners program for distributed cloud computing
- research time for this project was party due to:
- 2009-2015 by industrial consulting revenues of Raazesh Sainudiin
- 2015, 2016 by the project CORCON: Correctness by Construction, Seventh Framework Programme of the European Union, Marie Curie Actions-People, International Research Staff Exchange Scheme with counter-part funding by The Royal Society of New Zealand

Project SAHDE is an effort to create a scalable version of the adaptive histogram density estimators implemented in:

- MRS 2.0, a C++ class library for statistical set processing and computer-aided proofs in statistics.

based on mathematical statistical notions in:

- Data-adaptive histograms through statistical regular pavings, Raazesh Sainudiin, Gloria Teng, Jennifer Harlow and Warwick Tucker, 2016 (PDF 1.8MB)

## LICENSE

The license for mrs2 is GNU General Public License (GPL) and that for SAHDE Project is Apache 2.0.

## Current Sub-Projects of SAHDE

- SparkDensityTree for scalable density estimation using optimally smoothed L2-risk minimizing penalties (
*in progress*) - SparkOnlineLearning has potential for streaming regularly paved tree arithmetic by extending from the Scala trees in SparkDensityTree.

### Blackboard discussion notes at LaMaStEx on 2016-10-08.

We will eventually lua/la/ka-tex mathematically here..