Markdown Cell Tips

To change a non-markdown cell to markdown, add %md to very start of the cell.
After updating the contents of a markdown cell, click out of the cell to update the formatted contents of a markdown cell.
To edit an existing markdown cell, doubleclick the cell.

Learn more about markdown:

https://guides.github.com/features/mastering-markdown/

Note that there are flavours or minor variants and enhancements of markdown, including those specific to databricks, github, pandoc, etc.

It will be future-proof to remain in the syntactic zone of pure markdown (at the intersection of various flavours) as much as possible and go with pandoc-compatible style if choices are necessary.

Run a Scala Cell

Run the following scala cell.
Note: There is no need for any special indicator (such as %md) necessary to create a Scala cell in a Scala notebook.
You know it is a scala notebook because of the (Scala) appended to the name of this notebook.
Make sure the cell contents updates before moving on.
Press Shift+Enter when in the cell to run it and proceed to the next cell.
- The cells contents should update.
- Alternately, press Ctrl+Enter when in a cell to run it, but not proceed to the next cell.
characters following // are comments in scala.

1+1

res0: Int = 2

println(System.currentTimeMillis) // press Ctrl+Enter to evaluate println that prints its argument as a line

1565169638781

Scala Resources

You will not be learning scala systematically and thoroughly in this course. You will learn to use Scala by doing various Spark jobs.

If you are seriously interested in learning scala properly, then there are various resources, including:

scala-lang.org is the core Scala resource.
- tour-of-scala
MOOC
- courseera: Functional Programming Principles in Scala
Books
- Programming in Scala, 1st Edition, Free Online Reading

The main sources for the following content are (you are encouraged to read them for more background):

Introduction to Scala

What is Scala?

"Scala smoothly integrates object-oriented and functional programming. It is designed to express common programming patterns in a concise, elegant, and type-safe way." by Matrin Odersky.

High-level language for the Java Virtual Machine (JVM)
Object oriented + functional programming
Statically typed
Comparable in speed to Java
Type inference saves us from having to write explicit types most of the time Interoperates with Java
Can use any Java class (inherit from, etc.)
Can be called from Java code

Why Scala?

Spark was originally written in Scala, which allows concise function syntax and interactive use
Spark APIs for other languages include:
- Java API for standalone use
- Python API added to reach a wider user community of programmes
- R API added more recently to reach a wider community of data analyststs
- Unfortunately, Python and R APIs are generally behind Spark's native Scala (for eg. GraphX is only available in Scala currently).
See Darren Wilkinson's 11 reasons for scala as a platform for statistical computing and data science. It is embedded in-place below for your convenience.

Show code Show result

Show code

val x : Int = 5 // <Ctrl+Enter> to declare a value x to be integer 5

x: Int = 5

val x = 5    // <Ctrl+Enter> to declare a value x as Int 5 (type automatically inferred)

x: Int = 5

val x = 5.0   // <Ctrl+Enter> to declare a value x as Double 5

x: Double = 5.0

val x :  Double = 5    // <Ctrl+Enter> to declare a value x as Double 5 (type automatically inferred)

x: Double = 5.0

//x = 10    //  uncomment and <Ctrl+Enter> to try to reassign val x to 10

var y = 2    // <Shift+Enter> to declare a variable y to be integer 2 and go to next cell

y: Int = 2

y = 3    // <Shift+Enter> to change the value of y to 3

y: Int = 3

val s  = "hi"    // <Ctrl+Enter> to declare val s to String "hi"

s: String = hi

//s.  // place cursor after the '.' and press Tab to see all available methods for s

s    // <Shift-Enter> recall the value of String s

res3: String = hi

s.contains("f")     // <Shift-Enter> returns Boolean false since s does not contain the string "f"

res5: Boolean = false

s.contains("")    // <Shift-Enter> returns Boolean true since s contains the empty string ""

res6: Boolean = true

s.contains("i")    // <Ctrl+Enter> returns Boolean true since s contains the string "i"

res7: Boolean = true

def square(x: Int): Int = x*x    // <Shitf+Enter> to define a function named square

square: (x: Int)Int

square(5)    // <Shitf+Enter> to call this function on argument 5

res8: Int = 25

y    // <Shitf+Enter> to recall that val y is Int 3

res9: Int = 3

SDS-2.x, Scalable Data Engineering Science

Notebooks

Notebooks can be written in Python, Scala, R, or SQL.

Creating a new Notebook

Cloning a Notebook

Introduction to Scala through Scala Notebook

Clone Or Import This Notebook

Attach the Notebook to a cluster

Cells are units that make up notebooks

Create and Edit a New Markdown Cell in this Notebook

Running a cell in your notebook.

Press Shift+Enter when in the cell to run it and proceed to the next cell.

Alternately, press Ctrl+Enter when in a cell to run it, but not proceed to the next cell.

Markdown Cell Tips

Run a Scala Cell

Scala Resources

Introduction to Scala

What is Scala?

Why Scala?

Let's get our hands dirty in Scala

Assignments

value and variable as `val` and `var`

Methods and Tab-completion

Functions

SDS-2.x, Scalable Data Engineering Science

Notebooks

Notebooks can be written in Python, Scala, R, or SQL.

Creating a new Notebook

Cloning a Notebook

Introduction to Scala through Scala Notebook

Clone Or Import This Notebook

Attach the Notebook to a cluster

Cells are units that make up notebooks

Create and Edit a New Markdown Cell in this Notebook

Running a cell in your notebook.

Press Shift+Enter when in the cell to run it and proceed to the next cell.

Alternately, press Ctrl+Enter when in a cell to run it, but not proceed to the next cell.

Markdown Cell Tips

Run a Scala Cell

Scala Resources

Introduction to Scala

What is Scala?

Why Scala?

Let's get our hands dirty in Scala

Assignments

value and variable as val and var

Methods and Tab-completion

Functions

value and variable as `val` and `var`