{ "cells": [ { "cell_type": "markdown", "metadata": { "deletable": false }, "source": [ "# [Introduction to Data Science: A Comp-Math-Stat Approach](https://lamastex.github.io/scalable-data-science/as/2019/)\n", "## YOIYUI001, Summer 2019 \n", "©2019 Raazesh Sainudiin. [Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# 11. Non-parametric Estimation and Testing\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Topics\n", "\n", "- Nonparametric Estimation\n", "- Glivenko-Cantelli Theorem\n", "- Dvoretsky-Kiefer-Wolfowitz Inequality\n", "- Plug-in Estimator\n", "- Nonparametric Confidence Intervals via Bootstrap\n", "- Nonparametric Hypothesis Testing \n", "- Permutation Testing\n", "- Permutation Testing with Shells Data\n", "\n", " \n", "\n", "## Inference and Estimation: The Big Picture\n", "\n", "The Big Picture is about inference and estimation, and especially inference and estimation problems where computational techniques are helpful. \n", "\n", "
\n", " | Point estimation | \n", "Set estimation | \n", "Hypothesis Testing | \n", "
\n",
" Parametric \n", "\n", " | \n",
"\n",
" MLE of finitely many parameters | \n",
"\n",
" Asymptotically Normal Confidence Intervals | \n",
"\n",
" Wald Test from Confidence Interval | \n",
"
\n",
" Non-parametric | \n",
"about to see ... | \n", "about to see ... | \n", "about to see ... | \n", "\n", "
Permutation | \n", "$t$ | \n", "$\\mathbf{P}_0(T=t)$ | \n", "
\n", " | \n", " | Probability under Null | \n", "
(52, 54, 58) | \n", "5 | \n", "$\\frac{1}{6}$ | \n", "
(52, 58, 54) | \n", "1 | \n", "$\\frac{1}{6}$ | \n", "
(54, 52, 58) | \n", "5 | \n", "$\\frac{1}{6}$ | \n", "
(54, 58, 52) | \n", "4 | \n", "$\\frac{1}{6}$ | \n", "
(58, 52, 54) | \n", "1 | \n", "$\\frac{1}{6}$ | \n", "
(58, 54, 52) | \n", "4 | \n", "$\\frac{1}{6}$ | \n", "