    Basics

    The Basics and Some Essentials

    Learn to Work with Your Local Laptop

    • Command-line Basics
      • Linux Command-line Basics
      • Windows Command-line Basics
    • How to use Git and GitHub: Version control for code

    • Scala
      • scala cheat sheet
      • scala basics in a hurry
      • Scala School! by Twitter
        • Scala School! build yourself from scratch

    Distributed Computing Infrastructure

    We need infrastructure for big data. See Infrastructure.

    Other Courses for free!

    Beginner Courses (free)

    • Intro to Statistics
    • Intro to Computer Science (with Python)
    • Intro to Descriptive Statistics
    • Intro to Inferential Statistics
    • Intro to Python Programming

    Intermediate to Advanced Courses (free)

    • Mining Massive Datasets: Stanford Online
    • Linear Algebra Refresher Course (with Python)
    • Deploying a Hadoop Cluster
    • Intro to Algorithms
    • Machine Learning
      • Machine Learning: Supervised Learning
      • Machine Learning: Unsupervised Learning
      • Machine Learning: Reinforcement Learning
    • Differential Equations in Action (Python)
    • A/B Testing
    • Intro to Relational Databases
    • Data Visualization and D3.js
    • Intro to Hadoop and MapReduce
    • Real-Time Analytics with Apache Storm
    • Intro to Data Analysis: Using NumPy and Pandas
    • Intro to Data Science
    • Intro to Artificial Intelligence
    • Data Analysis with R by Facebook
    • Intro to Parallel Programming by Nvidia
    • Model Building and Validation by AT&T

    Updated: February 02, 2023

    © 2023 Raazesh Sainudiin. Powered by Jekyll & Minimal Mistakes.