Notes from 4 Sep

Some FP concepts

Referential transparency
Side effects
Pure functions/languages

Algebraic reasoning about program expressions and equality. These two ’should’ be equivalent: \(3*e^2+4\) and \(3*e*e+4\). Consider if \(e\) is a C++ function like:

int f1(int y) {
  return y+1;
}

This function f1 is pure (no side effects), so the equivalence works:

   3 * e^2 + 4 where e = f1(5)
== 3 * f1(5)^2 + 4
== 3 * 6^2 + 4
== 3 * 36 + 4
== 108 + 4
== 112

   3 * e * e + 4 where e = f1(5)
== 3 * f1(5) * f1(5) + 4
== 3 * 6 * f1(5) + 4
== 3 * 6 * 6 + 4
== 108 + 4
== 112

But consider instead this function:

int f2(int y) {
  cout << "Hello!";
  return y+1;
}

   3 * e^2 + 4 where e = f2(5)
== 3 * f2(5)^2 + 4
== 3 * 6^2 + 4        // Outputs "Hello!"
== 3 * 36 + 4
== 108 + 4
== 112

   3 * e * e + 4 where e = f2(5)
== 3 * f2(5) * f2(5) + 4
== 3 * 6 * f2(5) + 4   // Outputs "Hello!"
== 3 * 6 * 6 + 4       // Outputs "Hello!" again
== 108 + 4
== 112

The function f2 does have a side effect: its output. Although both expressions produce the same final value, they can be distinguished by how many times “Hello!” appears.

We can do even worse than this. Here’s another type of side effect:

int g = 8;  // global variable
int f3(int y) {
  g++;
  return y+g;
}

   3 * e^2 + 4 where e = f3(5)
== 3 * f3(5)^2 + 4    // g becomes 9
== 3 * 14^2 + 4
== 3 * 196 + 4
== 588 + 4
== 592

// Let's assume g is reset to 8 before proceeding

   3 * e * e + 4 where e = f3(5)
== 3 * f3(5) * f3(5) + 4  // g becomes 9 in first application of f3
== 3 * 14 * f3(5) + 4     // g becomes 10 in next application
== 3 * 14 * 15 + 4
== 630 + 4
== 634

Now we get completely different results.

Another typical violation of referential transparency is any function that returns random numbers. This also amounts to modification of global state.

Haskell and GHC

GHC = Glasgow Haskell Compiler
REPL = Read, Eval, Print, Loop

For now, we can use repl.it – choose “new repl” and then select Haskell. You can create an account, or just save/remember the URL to return to the same environment later.

Type this in the editor box, and push the play button:

main = putStrLn "Hello, world!"

Below are some variations we explored, although I don’t want to get too deep into all the mechanisms here because we’ll explore them later.

You can split the main function onto multiple lines as long as you use indentation to continue them.

main =
  putStrLn "Hello, world!"

Or:

main =
  putStrLn
  "Hello, world!"

If you want to sequence multiple print statements, the >> operator will do that:

main =
  putStrLn "Hello, world!" >>
  putStrLn "Goodbye then," >>
  putStrLn "This has been fun."

In that example, the line-breaking can be about anywhere, because it’s all one big expression:

main =
  putStrLn "Hello, world!"
  >> putStrLn
  "Goodbye then," >>
  putStrLn
  "This has been fun."

Instead of the >> to sequence the distinct actions, you can use “do notation”, but it will be fussier about line breaking. (You can continue one action onto the next line, but it will need further indentation.)

main = do
  putStrLn "Hello, world!"
  putStrLn
    "Goodbye then,"
  putStrLn "This has been fun."

We also did this:

main = do
  let greeting = "Hello"
  putStrLn "Enter your name: "
  name <- getLine
  putStrLn (greeting ++ ", " ++ name)

Comment syntax is double-dash for line comments, and curly-dash for block comments.

{- Block comment
   can span multiple lines -}
-- Line comment

Operator and function syntax

Caveat about negation operator: in Haskell, the negative sign (subtraction operator) sometimes causes syntactic confusion:

λ> 3 * -2

<interactive>:7:1: error:
    Precedence parsing error
        cannot mix ‘*’ [infixl 7] and prefix `-' [infixl 6]
        in the same infix expression

The solution to this is just to use parentheses around negation.

λ> 3 * (-2)
-6

Function application does not require parentheses. Here are some simple examples of defining small functions right in the REPL.

λ> areaCircle r = pi * r**2
λ> volumeSphere r = 4/3 * pi * r**3
λ> areaCircle 5
78.53981633974483
λ> volumeSphere 5
523.5987755982989

Notice that we don’t need parentheses to invoke those functions. You don’t need to say areaCircle(5), just areaCircle 5 is sufficient. If parentheses do appear, they are used to group arguments together. For example,

areaCircle 5 + 2

would be interpreted as

(areaCircle 5) + 2
== 78.53981633974483 + 2
== 80.53981633974483

but if you want to add first, then use explicit parens:

areaCircle (5 + 2)
== areaCircle 7
== 153.93804002589985

Identifiers

Functions in Haskell can be named either with alphanumeric identifiers (examples: sqrt, div, areaCircle) or symbolic identifiers (examples: /, **, <*>, >@>>).

By default, alphanumeric identifiers are defined and invoked using prefix notation – the function name appears before any of its arguments. In contrast, symbolic identifiers are defined and invoked using infix notation – it is assumed that the operation takes two parameters, and the symbol appears between them.

For example, the integer division function (div) is alphanumeric and the floating-point division function (/) is symbolic. They each take two parameters. So the default way to call them is:

div 19 2    -- Prefix because alphanumeric
19 / 2      -- Infix because symbolic

However, you can override this. We can use alphanumeric function names with infix notation (as long as the function takes exactly two parameters) by surrounding them with “back ticks” – on most keyboards this is on the same key as the tilde (~) character:

19 `div` 2   -- Infix notation with alphanumeric

We can also use symbolic function names with prefix notation by surrounding the symbol with parentheses:

(/) 19 2     -- Prefix notation with symbolic

These notation rules also apply to the function definition. Let’s make up an operation of two parameters that adds their squares:

This defines it:

x >@@< y = x*x + y*y

and here’s a sample usage:

3 >@@< 4
== 25