University Notes - Computer Science

Just code and projects for university. All the study-related contents are just a summary of the actual material. For better explanations check other material here, study on books suggested by the teachers and learn by writing your own material 😀

You are encouraged to follow the lectures and do exercises over and over again. The process of learning requires effort

A pottery teacher split her class into two halves

To the first half she said, "You will spend the semester studying pottery, planning, designing, and creating your perfect pot. At the end of the semester, there will be a competition to see whose pot is the best".

To the other half she said, "You will spend your semester making lots of pots. Your grade will be based on the number of completed pots you finish. At the end of the semester, you'll also have the opportunity to enter your best pot into a competition."

The first half of the class threw themselves into their research, planning, and design. Then they set about creating their one, perfect pot for the competition.

The second half of the class immediately grabbed fistfulls of clay and started churning out pots. They made big ones, small ones, simple ones, and intricate ones. Their muscles ached for weeks as they gained the strength needed to throw so many pots.

At the end of class, both halves were invited to enter their most perfect pot into the competition. Once the votes were counted, all of the best pots came from the students that were tasked with quantity. The practice they gained made them significantly better potters than the planners on a quest for a single, perfect pot.

Original Post

Un'occhiata veloce a GitHub Classroom

Sarò molto sintetico, solo per dare un'idea di come funzionano gli esercizi e il feedback agli studenti

Home

GitHub Classroom first page

Creare una "Classroom"

Per creare una "Classroom" bisogna selezionare l'organizzazione nella quale si vogliono mettere i repository (si quelli con le soluzioni degli studenti, sia i template con i test)

Una possibile organizzazione per il corso potrebbe essere "Metodologie di programmazione Sapienza"

GitHub Classroom home

Passiamo direttamente alle informazioni importanti

Amministratori della "Classroom"

Si può usare un link d'invito per decidere i docenti (amministratori) del corso

Link Invito TA e admin

Assegnare esercizi

Una volta creata la "Classroom", questa è l'interfaccia di gestione dei degli esercizi

Specific Classroom home

Individuale, di gruppo, pubblico o privato (scadenza opzionale)

Per assegnare un esercizio bisogna dare alcune informazioni di base: il nome, un'eventuale scadenza (opzionale), se si tratta di un esercizio di gruppo o individuale.

Quando lo studente "accetterà l'esercizio", verrà creato un repository (pubblico o privato) con il nome dell'esercizio e dello studente nell'organizzazione che avevamo scelto nella creazione della "Classroom"

Creazione esercizio

Successivamente, si sceglie il template (il repository con i test) da usare per l'esercizio (i template con i test vengono creati una volta, e si riusano ogni anno)

Template da usare per l'esrecizio

Scelta Template

Test

Qui si può decidere di eseguire un comando unico che fa girare tutti i test (quindi l'esercizio può essere passato / non passato), oppure si può decidere di andare più a grana fine e far girare più comandi che eseguono ciascuno un sottoinsieme dei test.

Test nome

Riusare i test per gli anni futuri

Gli esercizi si possono riusare, quindi non è necessario ogni volta riscrivere i test per gli esercizi dell'anno passato. Si può prendere un esercizio dell'anno passato, cliccare il tasto "riusa" e si può riproporre lo stesso esercizio con gli stessi test in un anno successivo.

Condivisione esercizio

Una volta creato l'esercizio viene generato un link. Se gli studenti cliccano su quel link, possono accettare di fare l'esercizio, caso in cui viene creato un repository per il studente.

Link condivisione esercizio

Svolgere gli esercizi

Accettare un homework

Questa è la schermata che vede uno studente quanto clicca un esercizio (se volete provare questo è il link di un esercizio https://classroom.github.com/a/lWBDk-we)

Schermata studente

Link del repository appena creato

nome

Testare in locale

Lo studente dovrà clonare il repository in locale (lo si può fare da CLI come preferisco io, altrimenti Eclipse ha integrate le funzionalità per lavorare con git e GitHub)

In questo esempio, per far girare i test ho eseguito il comando gradle test, per chi usa Eclipse, è già tutto integrato nell'editor, e possono eseguire i test cliccando sul tasto verder per eseguire il programma.

Qui un test fallisce.

Test Fallito

Lo studente scrive il codice per far funzionare il test. Ora i 2 test passano enrambi.

Codice corretto

Pubblicare il codice

Una volta risolto l'esercizio (o anche parte di esso!) lo studente può fare il commit del codice al repository generato prima (usando git da CLI come nel mio caso, altrimenti Eclipse ha integrate le funzionalità per farlo in modo semplice)

Git commit

GitHub testa in automatico il codice

Quando lo studente fa il commit del codice, GitHub si occupa anche lui di esegure i test, per far vedere al docente quanti e quali test ha superato fino a quel momento lo studente.

GitHub actions

Esercizi dal punto di vista del doecente

Il docente può vedere l'elenco degli studenti che hanno accettato di fare l'esercizio, se hanno passato o meno i test, e possono andare a vedere il codice che hanno scritto (e lasciare un eventuale feedback manuale sul codice)

Studenti che hanno accettato

Feedback del docente

Eventualmente, il docente può vedere il codice che ha scritto lo studente.

Feedback del docente

Computer Architecture

1951 IAS Machine Architecture

This section is made to grasp a basic understand of how a computer architecture works, and it's not meant to be studied thoroughly.

The IAS Machine had a 1000 word memory, with a 40b word (40000b = 5000B ~ 5kB).

Word

Words are CA2 integers

0	000000000000000000000000000000000000000
$\pm 2^{39} \cdot bit$	$value$

Instruction words contain two instructions

00000000	000000000000	00000000	000000000000
$0..7$	$8..19$	$20..27$	$28..39$
opcode	address	opcode	address

CPU

	Name	Description
MBR	Memory Buffer Register	receives & sends data to memory and I/O
MAR	Memory Address Register	current memory address
PC	Program Counter	address of the instruction to execute
IR	Instruction Register	contains instruction to execute
IBR	Instruction Buffer Register	contains the second instruction
AC	Accumulator	for partial calculation results
MQ	Multiplier Quotient	for partial calculation results

Instructions

This isn't the full ISA of the IAS Machine, check it out here.

Transfer Instructions

	Description
LOAD	AC $\leftarrow$ AC `operation` Memory[Address]
LOAD	AC $\leftarrow$ `operation` Memory[Address]
LDMQ	MQ $\leftarrow$ Memory[Address]
ST	Memory[Address] $\leftarrow$ AC
AMODL	Memory[Address][0..11] $\leftarrow$ AC[0..11] (low)
AMODH	Memory[Address][20..31] $\leftarrow$ AC[0..11] (high)

Jumps

Like in modern assembly, jumps can be unconditional, conditional; for the IAS machine you had to specify either a low or high address.

	Description
UBL	PC $\leftarrow$ [Address]
UBH	PC $\leftarrow$ [Address] + 1
CBL	if AC $\ge$ 0 { PC $\leftarrow$ [Address] }
CBH	if AC $\ge$ 0 { PC $\leftarrow$ [Address] + 1 }

Operations

	Description
MUL	AC, MQ $\leftarrow$ AC $\cdot$ Memory[Address]
DIV	AC $\leftarrow$ AC / Memory[Address]
DIV	MQ $\leftarrow$ AC % Memory[Address]
LSHIFT	AC, MQ $\leftarrow$ AC, MQ << X
RSHIFT	AC, MQ $\leftarrow$ AC, MQ >> X
MOVE	AC $\leftarrow$ AC `operation` MQ
IO	Transfer from and to I/O devices

Example Program

LOAD 101
ADD 102
ST 103

How does it work?

Fetch
- MAR $\leftarrow$ PC
- IR, IBR $\leftarrow$ MBR $\leftarrow$ Memory[MAR]
Decode
- MAR $\leftarrow$ IR[8..19] ; address
- CU $\leftarrow$ IR[0..8] ; opcode
Exec
- AC $\leftarrow$ MBR $\leftarrow$ Memory[101]
Decode
- MAR $\leftarrow$ IBR[8..19] ; address
- CU $\leftarrow$ IBR[0..8] ; opcode
Exec
- AC $\leftarrow$ AC + MBR $\leftarrow$ Memory[102]
PC
- PC $\leftarrow$ PC + 1
Fetch
- MAR $\leftarrow$ PC
- IR, IBR $\leftarrow$ MBR $\leftarrow$ Memory[MAR]
Decode
- MAR $\leftarrow$ IR[8..19] ; address
- CU $\leftarrow$ IR[0..8] ; opcode
Exec
- Memory[103] $\leftarrow$ MBR $\leftarrow$ AC

MIPS

RISC vs CISC

Reduced Instruction Set Computer vs Complex Instruction Set Computer

RISC	CISC
fixed size instructions	variable size instructions (requires decode before fetch)
fixed format	variable format (complex decode)
operations only with registers	in-memory operands
many registers	few of registers
single access to memory	multiple accesses to memory
fixed instruction duration	variable instruction duration
simple conflicts	complex conflicts
faster pipeline	complex pipeline

Registers

name	number	use	keep
$zero ∣0∣0 co n s t an t ∣ ? ∣∣$ at	1	reserved for assembler	?
$v 0 -$ v1	2 - 3	expression evaluation and results of functions	no
$a 0 -$ a3	4 - 7	arguments	no
$t 0 -$ t7	8 - 15	temporary	no
$s 0 -$ s7	16 - 23	saved temporary	yes
$t 8 -$ t9	24 - 25	temporary	no
$k 0 -$ k1	26 - 27	reserved for OS Kernel	?
$g p ∣28∣ g l o ba lp o in t er ∣ yes ∣∣$ sp	29	stack pointer	yes
$f p ∣30∣ f r am e p o in t er ∣ yes ∣∣$ ra	31	return address	yes

Special registers

$g p * * p o in t s in t o t h e mi dd l eo f a * * 64 K b l oc k * * o f m e m ory in t h e * * h e a p * * t ha t h o l d sco n s t an t s an d * * g l o ba l v a r iab l es * * - * *$ sp points to the last location in use on the stack
$f p * * p o in t s t o t h es t a r t o f t h es t a c k f r am e an dd oes n o t m o v e f or t h e d u r a t i o n o f t h es u b ro u t in ec a ll, an d t h e p a r am e t ers t ha t a re p a sse d in t o t h es u b ro u t in ere maina t a co n s t an t s p o t re l a t i v e t o t h e f r am e p o in t er - * *$ ra is written with the return address for a call by the jal instruction

Instructions

R-type Instructions

Arithmetic Instruction Format (type to a register)

add $t0, $s1, $s2

add opcode $\to$ 000000
$t 0 in * * r d * * (\to) 01000 -$ s1 in rs, $\to$ 10001
s2 in rt, $\to$ 10010
add funct $\to$ 100000

op	rs	rt	rd	shamt	funct
000000	10001	10010	01000	00000	100000
6b	5b	5b	5b	5b	6b
opcode	first register source	second register source	register destination operand	shift amount	function code

I-type Instructions

Data Transfer Format (conditional jumps)

addi t2, <span class="katex"><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height:0.8889em;vertical-align:-0.1944em;"></span><span class="mord mathnormal">s</span><span class="mord">2</span><span class="mpunct">,</span><span class="mspace" style="margin-right:0.1667em;"></span><span class="mord">4‘‘‘</span><span class="mspace" style="margin-right:0.2222em;"></span><span class="mbin">−</span><span class="mspace" style="margin-right:0.2222em;"></span></span><span class="base"><span class="strut" style="height:0.6944em;"></span><span class="mord mathnormal">a</span><span class="mord mathnormal">dd</span><span class="mord mathnormal">i</span><span class="mspace" style="margin-right:0.2222em;"></span><span class="mbin">∗</span><span class="mspace" style="margin-right:0.2222em;"></span></span><span class="base"><span class="strut" style="height:0.8889em;vertical-align:-0.1944em;"></span><span class="mord">∗</span><span class="mord mathnormal">o</span><span class="mord mathnormal">p</span><span class="mord mathnormal">co</span><span class="mord mathnormal">d</span><span class="mord mathnormal">e</span><span class="mspace" style="margin-right:0.2222em;"></span><span class="mbin">∗</span><span class="mspace" style="margin-right:0.2222em;"></span></span><span class="base"><span class="strut" style="height:0.4653em;"></span><span class="mord">∗</span></span><span class="mspace newline"></span><span class="base"><span class="strut" style="height:1em;vertical-align:-0.25em;"></span><span class="mopen">(</span><span class="mrel">→</span></span><span class="mspace newline"></span><span class="base"><span class="strut" style="height:1em;vertical-align:-0.25em;"></span><span class="mclose">)</span><span class="mord">001000</span><span class="mord">−</span></span></span></span>t2 in **rt** \\(\to\\) 01010
- s2 in **rs** \\(\to\\) 10010

| op | rs | rt | constant |
|:--:|:--:|:--:|:--:|
| 001000 | 10010 | 01010 | 0000000000000100 |
| 6b | 5b | 5b | 16b |
| opcode | first <br/> register | target <br/> register | constant value or <br/> address | 

<!-- | \\(0..5\\) | \\(6..10\\) | \\(11..15\\) | \\(16..31\\) | -->

### J-type Instructions

Unconditional Jumps

```armasm
j label

PC $\leftarrow$ label $\cdot$ 4

op	address
001000	10010010100000000000000100
6 bit	26 bit

FR-type Instructions

MIPS handles floating point instructions like regular 32b instructions. FR-type don't access the memory, and are executed by the FPU.

add.s f0, <span class="katex"><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height:0.8889em;vertical-align:-0.1944em;"></span><span class="mord mathnormal" style="margin-right:0.10764em;">f</span><span class="mord">1</span><span class="mpunct">,</span></span></span></span>f2 ; single precision
div.d <span class="katex"><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height:0.8889em;vertical-align:-0.1944em;"></span><span class="mord mathnormal" style="margin-right:0.10764em;">f</span><span class="mord">0</span><span class="mpunct">,</span></span></span></span>f2, f4 ; double precision

op	rs	rt	rd	shamt	funct
000000	10001	10010	01000	00000	100000
6b	5b	5b	5b	5b	6b
opcode	first register source	second register source	register destination operand	shift amount	function code

FI-type Instructions

FI-type are used for:

load / store
conditional jumps

lwc1 f1, indirizzo
bc1f cc, offset

op	rs	rt	constant
001000	10010	01010	0000000000000100
6b	5b	5b	16b
opcode	first register	target register	constant value or address

Interrupts

TODO: Interrupts

Assembly

wiki

Compilation

TODO: fill this section with instructions on how to compile assembly code

.s -> .o
.o -> .exe linking
windows
linux

Memory Layout of a Program

memory layout

Directives

.globl

The .globl directive is useful when working with multiple files, and we need parts of code to reference labels in other files. If you don't use .globl, during the linking process, it can't find the label and gives an error.

We could have a main.asm file like this:

.globl main
.data
.text
    main: 
        li $a0, 5
        jal fibonacci
        ...

And a second file math.asm with the fibonacci function:

.globl fibonacci
.data
.text
    fibonacci:
       mv $t0, $a0 
       ...

.ent

There is a .ent directive too, which is a debugger pseudo operation that marks the entry of main.

.globl main
.ent main
.text
    main:
        ...

Pseudo-Instructions

Many instructions provided by MIPS, like move, lw etc... are decomposed into multiple instructions when the code is assembled; for more details. For example:

lw

lw $s0, value is mapped to:

lui $1, 0x00001001
lw $16, 0x00000000($1)

In the second case, 0x10010000 is the address of value, to get this address, we need to lui 0x00001001 (load upper immediate, loads the immediate value in the upper 16bits of the $1 register, which is the $at register). Then we can load into $16, which is the $s0 register, whatever value is at the address 0 + $at (0x00000000($1))

move

move $t0, $s0 is mapped to addu $8, $0, $16, where addu is the "add unsigned" operation, and $8 is $t0, $0 is the $zero constant register and $16 the $s0 register.

beq

ble $s1, $t0, label is mapped to:

slt $1, $8, $17
beq $1, $0, 0x00000001

Now, slt (set less than) sets the value in rs to 1 if rt is less than rd (if you don't know what rs, rt and rd are, check R-Type Instructions)

Absolute Jump

MIPS instructions have a fixed 32 bit size... what happens when you need to jump to an address which is a 32 bit constant? Neither I-type or J-type support 32 bit constants. We need to use lui and ori.

Let's suppose we have to jump at the address 0000_0000_1111_1111_0000_1001_0000_0000, which corresponds to 0x00ff0900.

lui $t0, 0x000000ff
ori $t0, 0x00000900
jr $t0

What lui does is moving the lower 16 bits 0x00ff into the upper part of register $t0, that way we have 0x00ff0000 in $t0. Now, we use ori (which does a bitwise or, basically compares with an or for each bit in $t0 with the value 0x00000900 for the lower half of the byte) so we have the full address. Now we can just use jr to jump to the address in the register.

Statements

For each statement, I'll show the C code (I chose C over other languages as the generated assembly is very minimal and easy to understand), and the relative MIPS implementation; under details I'll leave the x_86 assembly generated by cl.exe on Windows and gcc on Linux.

Conditions

If-Else

int main() {
  int x = 0;

  if (x > 0)
    x += 5;
  else
    x += 10;
}

.text
    li $t0, 0 #; x = 0

    blez $t0, else #; if x <= 0, goto else

    if:
        addi $t0, $t0, 5 #; add 5 to x if x > 0
        j end #; don't execute else part
    else: 
        addi $t0, $s1, 10 #; add 10 to x if x <= 0
    end:

Switch

int main() {
  int x = 1;

  switch (x) {
  case 0:
    x += 16;
    break;
  case 1:
    x += 16 * 2;
    break;
  case 2:
    x += 16 * 3;
    break;
  }
}

.data
    dest: .word case0, case1, case2
.text
    #; sll $t0, $t0, 2 #; choose the case

    li $t0, 0 #; first case 
    addi $t0, $t0, 4 #; jump one case
    #; addi $t0, $t0, 8 #; jump two cases

    lw $t1, dest($t0) #; load case address to $t1
    jr $t1 #; Jump to the case address in $t1 (case0, case1 etc...)

    li $t2, 0
    case0:
        addi $t2, $zero, 0x10
        j break 
    case1:
        addi $t2, $zero, 0x20
        j break 
    case2:
        addi $t2, $zero, 0x30
        j break 
    break:

Iterations

Do-While

int main() {
  int x = 0, i = 0;

  do {
    x += 4;
    i += 1;
  } while (i < 10);
}

.text
    li $t0, 0 #; x = 0
    li $t1, 0 #; i = 0

    do:
        addi $t0, $t0, 4 #; x += 4
        addi $t1, $t1, 1 #; i += 1
    blt $t1, 10, do #; if i < 10, repeat the cicle

While

int main() {
  int x = 0, i = 0;

  while (i < 10) {
    x += 4;
    i += 1;
  }
}

.text
    li $t0, 0 #; x = 0
    li $t1, 0 #; i = 0

    while:
        bge $t1, 10, end #; if i >= 10, end the while loop
        addi $t0, $t0, 4 #; x += 4
        addi $t1, $t1, 1 #; i += 1
        j while #; repeat cicle
    end:

For

int main() {
  int x = 0;

  for (int i = 0; i <= 10; i++)
    x += i;
}

.text
    li $t0, 0 #; x = 0
    li $t1, 0 #; i = 0

    for:
        beq $t1, 10, end #; if i == 10, end loop 
        add $t0, $t0, $t1 #; x += i
        addi $t1, $t1, 1 #; i += 1 
        j for
    end:

Vectors & Matrices

Endianness

The MIPS architecture allows both big-endian and little-endian byte ordering, but the little-endian one is most commonly used. Endianness has to do on how bytes are addressed in memory, and it's related to how the access of each individual byte is made.

ASCII

TODO ASCII

Vectors

There are many types of vectors you can handle in MIPS:

.data
    byte: .byte 29, 8, 1, 29, 2, -3
    half: .half 10, -4, 20, -8, 22, 12
    word: .word 2, 29012, 29, 5, -12905, -290125

    # decimal 

    float: .float 2.5, -1.2, 21.90, -5.0
    double: .double 2.5, -1.2, 21.90, -5.0

    # strings

    string: .asciiz "Holy Moly, who ate my Canoli?"

Note that .asciiz stands for "zero terminated string", which means it has a '\0' nullchar at the end.

Vector Iterations

You can iterate vectors and matrices in two ways:

Index

useful if you need the index of each element
the increment of the index doesn't depend on the size of the elements
you have to convert the index each time, according to the size of the elements

.data
    vector: .word 10, 2, 980, 29, 1992, -2, 59, 280, 99
    size: .word 9
.text
    la $s0, vector #; s0 = vector address
    la $t1, size #; t1 = address of size
    lw $t1, ($t1) #; t1 = size

    li $t0, 0 #; t0 is the index i

    for:
        bge $t0, $t1, end #; if i == size, end loop

        sll $t2, $t0, 2 #; offset = i * 4 (shift logic left by 2)
        addi $t2, $t2, $s0 #; t2 = current address = offset + address

        lw $t7, ($t2) #; load value from current address, and maybe use it

        #; rest of the code ...

        addi $t0, $t0, 1 #; i = i + 1
        j for #; loop again
    end:

TODO: check if it works

Pointer

you work directly with the address
less calculations to do in the cycle
you don't have the index of the element
the increment depends on the size of the elements
you must calculate the index after the last element

.data
    vector: .word 10, 2, 980, 29, 1992, -2, 59, 280, 99
    size: .word 9
.text
    la $t0, vector #; t0 = vector address

    la $t1, size #; t1 = address of size
    lw $t1, ($t1) #; t1 = size
    sll $t1, $t1, 2 #; size = size * 4 (we are handling words)
    add $t1, $t0, $t1 #; t1 = end address = vector address + size * 4 (this is the address after the last one in the vector)

    for:
        bge $t0, $t1, end #; if current address == vector end address, end loop

        lw $t7, ($t0) #; load value from current address, and maybe use it

        #; rest of the code ...

        addi $t0, $t0, 4 #; current address = current address + 4 (we move by 4 bytes, because we are using words) 
        j for #; loop again
    end:

TODO: check if it works

Matrices

If you want a 7 (rows) x 13 (columns) matrix, you need enough space for 91 elements:

.data
    matrix: .word 0:91

Matrixes are stored in memory like vectors, each row is laid one after the other. To work with n-sized matrices, you just lay out one matrix after the other in memory (you can have a 3-dimensional matrix, for example, where the z coordinate dictates the layer, or the matrix, you are working with)

Syscalls & Procedures

Syscalls

Syscalls are a powerful tool, which enables interaction with I/O, files, and dynamic allocation of memory. The MARS editor supports 59 different syscalls. Here's a few of useful ones.

To use syscalls, there are some special registers:

$v0 is used for the code of the syscall
$a0 to $a3 are used for parameters
The output is usually saved in $v0

By setting these registers to the desired values, and using the syscall instruction, the OS will run the operation.

Files

service	v0	arguments
open file	13	$a 0 = a dd resso f n u ll - t er mina t e d s t r in g co n t ainin g f i l e nam e < b r / >$ a1 = flags $a 2 = m o d e ∣$ v0 contains file descriptor (negative if error)
read from file	14	$a 0 = f i l e d escr i pt or < b r / >$ a1 = address of input buffer $a 2 = ma x im u mn u mb ero f c ha r a c t ers t ore a d ∣$ v0 contains number of characters read (0 if end-of-file, negative if error)
write to file	15	$a 0 = f i l e d escr i pt or < b r / >$ a1 = address of output buffer $a 2 = n u mb ero f c ha r a c t ers t o w r i t e ∣$ v0 contains number of characters written (negative if error)
close file	16	a0 = file descriptor

Hello World!

.globl main

.data
    string: .asciiz "Hello World!"

.text
    main: 
        li v0, 4
        la a0, string
        syscall

Procedures

Procedures are pieces of code that take parameters, and return a result. They're useful to make the code cleaner and more modular.

.globl main

.text
    main:
        li a0, 5 #; first parameter
        li a1, 6 #; second parameter
        jal function #; call function

        return:
            li v0, 17
            li a0, 0
            syscall #; we have to exit, or the execution will continue

    function:
        subi sp, sp, 12 #; we need 4 bytes * 3 registers 
        sw ra, 8(sp) #; return address
        sw a0, 4(<span class="katex"><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height:1em;vertical-align:-0.25em;"></span><span class="mord mathnormal">s</span><span class="mord mathnormal">p</span><span class="mclose">)</span><span class="mord mathnormal">s</span><span class="mord mathnormal" style="margin-right:0.02691em;">w</span></span></span></span>a1, 0(sp)

        #; function body...
        #; I can use jal, because we have saved in memory ra

        lw <span class="katex"><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height:1em;vertical-align:-0.25em;"></span><span class="mord mathnormal">a</span><span class="mord">1</span><span class="mpunct">,</span><span class="mspace" style="margin-right:0.1667em;"></span><span class="mopen">(</span></span></span></span>sp)
        lw <span class="katex"><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height:1em;vertical-align:-0.25em;"></span><span class="mord mathnormal">a</span><span class="mord">0</span><span class="mpunct">,</span><span class="mspace" style="margin-right:0.1667em;"></span><span class="mord">4</span><span class="mopen">(</span></span></span></span>sp)
        lw <span class="katex"><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height:1em;vertical-align:-0.25em;"></span><span class="mord mathnormal" style="margin-right:0.02778em;">r</span><span class="mord mathnormal">a</span><span class="mpunct">,</span><span class="mspace" style="margin-right:0.1667em;"></span><span class="mord">8</span><span class="mopen">(</span></span></span></span>sp)
        addi <span class="katex"><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height:0.625em;vertical-align:-0.1944em;"></span><span class="mord mathnormal">s</span><span class="mord mathnormal">p</span><span class="mpunct">,</span></span></span></span>sp, 12 #; reset stack pointer
        
        jr <span class="katex"><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height:0.8889em;vertical-align:-0.1944em;"></span><span class="mord mathnormal" style="margin-right:0.02778em;">r</span><span class="mord mathnormal">a</span><span class="mord">‘‘‘</span><span class="mord mathnormal" style="margin-right:0.13889em;">T</span><span class="mord mathnormal">h</span><span class="mord mathnormal">e</span><span class="mord mathnormal">b</span><span class="mord mathnormal" style="margin-right:0.01968em;">l</span><span class="mord mathnormal">oc</span><span class="mord mathnormal" style="margin-right:0.03148em;">k</span><span class="mord mathnormal">t</span><span class="mord mathnormal">h</span><span class="mord mathnormal">e</span><span class="mord mathnormal" style="margin-right:0.10764em;">f</span><span class="mord mathnormal">u</span><span class="mord mathnormal">n</span><span class="mord mathnormal">c</span><span class="mord mathnormal">t</span><span class="mord mathnormal">i</span><span class="mord mathnormal">o</span><span class="mord mathnormal">n</span><span class="mord mathnormal">t</span><span class="mord mathnormal" style="margin-right:0.03148em;">ak</span><span class="mord mathnormal">es</span><span class="mord mathnormal" style="margin-right:0.10764em;">f</span><span class="mord mathnormal">ro</span><span class="mord mathnormal">m</span><span class="mord mathnormal">t</span><span class="mord mathnormal">h</span><span class="mord mathnormal">e</span><span class="mspace" style="margin-right:0.2222em;"></span><span class="mbin">∗</span><span class="mspace" style="margin-right:0.2222em;"></span></span><span class="base"><span class="strut" style="height:0.6944em;"></span><span class="mord">∗</span><span class="mord mathnormal">s</span><span class="mord mathnormal">t</span><span class="mord mathnormal">a</span><span class="mord mathnormal">c</span><span class="mord mathnormal" style="margin-right:0.03148em;">k</span><span class="mspace" style="margin-right:0.2222em;"></span><span class="mbin">∗</span><span class="mspace" style="margin-right:0.2222em;"></span></span><span class="base"><span class="strut" style="height:0.6944em;"></span><span class="mord">∗</span><span class="mord mathnormal">i</span><span class="mord mathnormal">sc</span><span class="mord mathnormal">a</span><span class="mord mathnormal" style="margin-right:0.01968em;">ll</span><span class="mord mathnormal">e</span><span class="mord mathnormal">d</span><span class="mspace" style="margin-right:0.2222em;"></span><span class="mbin">∗</span><span class="mspace" style="margin-right:0.2222em;"></span></span><span class="base"><span class="strut" style="height:0.8889em;vertical-align:-0.1944em;"></span><span class="mord">∗</span><span class="mord mathnormal">s</span><span class="mord mathnormal">t</span><span class="mord mathnormal">a</span><span class="mord mathnormal">c</span><span class="mord mathnormal" style="margin-right:0.03148em;">k</span><span class="mord mathnormal" style="margin-right:0.10764em;">f</span><span class="mord mathnormal" style="margin-right:0.02778em;">r</span><span class="mord mathnormal">am</span><span class="mord mathnormal">e</span><span class="mspace" style="margin-right:0.2222em;"></span><span class="mbin">∗</span><span class="mspace" style="margin-right:0.2222em;"></span></span><span class="base"><span class="strut" style="height:0.4653em;"></span><span class="mord">∗</span><span class="mord mathnormal" style="margin-right:0.02778em;">or</span><span class="mspace" style="margin-right:0.2222em;"></span><span class="mbin">∗</span><span class="mspace" style="margin-right:0.2222em;"></span></span><span class="base"><span class="strut" style="height:0.6944em;"></span><span class="mord">∗</span><span class="mord mathnormal">a</span><span class="mord mathnormal">c</span><span class="mord mathnormal">t</span><span class="mord mathnormal">i</span><span class="mord mathnormal" style="margin-right:0.03588em;">v</span><span class="mord mathnormal">a</span><span class="mord mathnormal">t</span><span class="mord mathnormal">i</span><span class="mord mathnormal">o</span><span class="mord mathnormal">n</span><span class="mord mathnormal" style="margin-right:0.02778em;">recor</span><span class="mord mathnormal">d</span><span class="mspace" style="margin-right:0.2222em;"></span><span class="mbin">∗</span><span class="mspace" style="margin-right:0.2222em;"></span></span><span class="base"><span class="strut" style="height:0.6944em;"></span><span class="mord">∗</span><span class="mord">.</span><span class="mord mathnormal" style="margin-right:0.13889em;">W</span><span class="mord mathnormal">ec</span><span class="mord mathnormal">an</span><span class="mord mathnormal">u</span><span class="mord mathnormal">se</span><span class="mord">‘</span></span></span></span>fp` to point to the start of the **activation record**, it's rendundant and rarely used.

## Recursions

Functions calling themselves!

> TODO: complete factorial

```armasm
.globl main

.text
    main:
        li $a0, 5
        jal factorial

        print:
            move $a0, $v0 #; integer = result of factorial function
            li $v0, 1 #; print integer
            syscall

        return:
            li $v0, 17
            li $a0, 0
            syscall

    factorial:
        #; jump by 1
        #; recursive step
        #; base case
        returnFactorial:
            lw $ra, ($sp)
            addi $sp, $sp
            jr $ra

Exercises

These are the implementations of the exercises presentend in course's slides and notations.

; pdf 3 slide 10

.data

.text
    li $s0, 2 #; a = 2
    li $s1, 5 #; b = 5
    li $s2, 9 #; c = 9
    li $s3, 4 #; d = 4
    li $s4, 12 #; e = 12
    
    #; a = ( b - c ) + ( d - e )

    sub $t0, $s1, $s2 #; t0 = b - c
    sub $t1, $s3, $s4 #; t1 = d - e
    add $s0, $t0, $t1 #; s0 = t0 + t1

; pdf 3 slide 11

.data
    variable: 10
    vector: .word 12, 4, 59, 9, 19, 8, 6, 18, 9, 19, 28, 12, 100

.text
    la $s6, variable #; s6 = address of variable
    la $s5, vector #; s5 = address of vector

    #; vector[12] = vector[6] + variable

    lw $t0, ($s6) #; t0 = *s6 (value at address s6, which is the value of the variable)
    lw $t1, 24($s5) #; t1 = s5[6] ; s5[6] = *(s5 + 6), but 6 rappresents words, not bytes, so 6 * 4 = 24 (which corresponds to vector[6])

    add $t0, $t0, $t1 #; t0 += t1 (which is vector[5])

    sw $t0, 48($s5) #; stores result of sum (from t0) to vector[12], but 12 is words, so 48 is bytes

; pdf 3 slide 17

.data

.text
    li $s0, 1 #; u = 1
    li $s1, 0 #; v = 0

    #; v = u * 256

    sll $s1, $s0, 8 #; s1 = s0 * 256

    #; which is the first value that breaks?

    #; a word ha 32 bits, multiplying by 256 means shifting by 8 bits
    #; this means that as soon as we have the 25 bit set to 1, the 1 is shifted out of 32
    #; 2^24 = 16777216

    li $s2, 16777216 #; 16777215 will work normally
    sll $s0, $s2, 8 #; becomes 0

; pdf 4 slide 34

.data 
    vector: .word 11, 35, 2, 7, 29, 95
    size: .word 6

.text
    #; find max value in vector

    la $t0, vector #; t0 = current address
    la $t1, vector #; t1 = end of vector

    la $t2, size #; t2 = address of vector size
    lw $t2, ($t2) #; t2 = vector size
    sll, $t2, $t2, 2 #; t2 *= 4, to accomodate words

    add $t1, $t1, $t2 #; t1 = end of vector + vector size

    lw $t2, ($t0) #; t2 = max value

    for:
        bgt $t0, $t1, endFor #; if current address > end of vector, end for 

        lw $t3, ($t0) #; load value from current address
        ble $t3, $t2, elseSmaller #; if current value <= max value, continue

        ifBigger:
            move $t2, $t3 #; max value = current value

        elseSmaller:

        addi $t0, $t0, 4 #; current address = next address
        j for #; repeat cycle

    endFor:

; pdf 4 slide 35

.data 
    vector: .word 4, -1, 5, 500, 0, 10000, -256
    size: .word 5
    sums: .word 0, 0

.text
    #; find max value in vector

    la $t0, vector #; t0 = current address
    la $t1, vector #; t1 = end of vector

    la $t2, size #; t2 = address of vector size
    lw $t2, ($t2) #; t2 = vector size
    sll, $t2, $t2, 2 #; t2 *= 4, to accomodate words

    add $t1, $t1, $t2 #; t1 = end of vector + vector size

    li $t2, 0 #; t2 = parity 
    li $t4, 0 #; t4 = even 
    li $t5, 0 #; t5 = odd 

    for:
        bgt $t0, $t1, endFor #; if current address > end of vector, end for 

        lw $t3, ($t0) #; load value from current address
        beq $t2, 1, ifIsOdd #; check if current parity is odd

        ifIsEven:
            add $t4, $t4, $t3 #; even += current value
            li $t2, 1 #; parity = odd
            j nextIteration
        ifIsOdd:
            add $t5, $t5, $t3 #; odd += current value
            li $t2, 0 #; parity = even

        nextIteration:
            addi $t0, $t0, 4 #; current address = next address
            j for #; repeat cycle

    endFor:

    la $t6, sums #; t6 = address of the result
    sw $t4, ($t6) #; t6[0] = even 
    sw $t5, 4($t6) #; t6[1] = odd; 4 is used instead of 1 because a word is 4 bytes long

; pdf 5 slide 10 

.data
    vector: .byte 1, 2, 3, 4 
.text
    #; the vector corresponds to the word 0x04030201
    #; which basically is 4, 3, 2, 1
    #; as it's rappresented in memory using little-endian

; pdf 6 slide 7 

.globl main

.data
    matrix: .word 2, -10, -10, -10, 2, -10, -10, -10, 2 
    length: .word 3 

.text
    main: 
        la $t0, matrix #; t0: matrix_address = matrix
        la $t1, length #; t1: matrix_length_address = length 
        lw $t1, ($t1) #; t1: matrix_length = *length
        move $t2, $t1 #; t2: jumps_to_do = matrix_length (to count how many jumps are needed to reach the end)

        addi $t1, $t1, 1 #; t1: jump_length = matrix_length += 1 (to jump to next diagonal cell)
        sll $t1, $t1, 2 #; t1: jump_length = jump_length * 4 (because words are 4 bytes long)

        li $t3, 0 #; t3: sum = 0 

        while:
            beq $t2, 0, end #; if jumps_to_do == 0 { end loop }

            lw $t4, ($t0) #; t4: value = *matrix_address
            add $t3, $t3, $t4 #; sum += value

            subi $t2, $t2, 1 #; jumps_to_do -= 1
            add $t0, $t0, $t1 #; matrix_address = matrix_address = jump_length
            j while
        end:

        print: 
            li $v0, 1 #; print integer
            move $a0, $t3 #; integer = sum
            syscall

        return: 
            li $v0, 17 #; exit
            li $a0, 0 #; result = 0
            syscall

; pdf 7 slide 22

.globl main

.data

.text
    main:

        li $a0, 5
        li $a1, -3
        li $a2, 9
        li $a3, 2
        jal avgOfSquareAbsSub
        
        print:
            move $a0, $v0 #; integer = formula 
            li $v0, 1 #; print integer
            syscall

        return:
            li $v0, 17 #; exit
            li $a0, 0 #; result = 0
            syscall

    avgOfSquareAbsSub:
        subi $sp, $sp, 8 #; ra, first result
        sw $ra, ($sp) 

        jal squareAbsSub #; x, y 
        sw $v0, 4($sp) #; t0: first = (|x|-|y|)^2

        move $a0, $a2
        move $a1, $a3
        jal squareAbsSub #; w, z
        move $t1, $v0 #; t1: second = (|w|-|z|)^2
        lw $t0, 4($sp) #; t0: first = (|x|-|y|)^2

        add $t0, $t0, $t1 #; first += second
        srl $v0, $t0, 1 #; numerator /= 2
        
        returnAvg:
            lw $ra, ($sp)
            addi $sp, $sp, 8
            jr $ra

    squareAbsSub:
        subi $sp, $sp, 4 #; ra
        sw $ra, ($sp)

        jal abs #; |x|
        move $t0, $v0 #; t0: abs_x = |x|

        move $a0, $a1 #; number = y
        jal abs #; |y|
        move $t1, $v0 #; t1: abs_y = |y|

        sub $t0, $t0, $t1 #; difference = |x| - |y|
        mul $v0, $t0, $t0 #; result = (|x| - |y|)^2
        
        returnSub:
            lw $ra, ($sp)
            addi $sp, $sp, 4
            jr $ra

    abs:
        bge $a0, 0, returnAbs #; if number >= 0, return 

        ca2:
            nor $a0, $a0, $zero #; number = bitwise not of number
            addi $a0, $a0, 1 #; number += 1
        
        returnAbs:
            move $v0, $a0 #; result = |number|
            jr $ra

Single Clock Cycle Architecture

MIPS Architecture

ALU Control

R-type instructions have 6 bits in the funct field to control the ALU. The first two bits are the ALUOp to indicate 1 of 3 selection codes. Based on the selection code, the next 4 bits could have a different meaning.

opcode	ALUOp	funct field	operation	ALUControl
lw / sw	00	don't care always a sum	sum	0010
beq	01	don't care always a subtraction	sub	0110
R-type	10	10_0000	ALUControl decides based on last 4 bits	0010

Based on the instruction type, we have different behaviours for the funct field and the ALUControl.

#	func	function
0	0000	AND
1	0001	OR
2	0010	add
6	0110	subtract
7	0111	slt
12	1100	NOR

Control Unit Signals

signal	on false	on true
RegDst	write register number comes from rt	write register number comes from rd
RegWrite		the data is written in in the write register
ALUSrc	data comes from register 2	data comes from sign extender (immediate part)
PCSrc	next instruction is PC + 4	next instruction is PC + 4 + immediate
MemRead		read from memory and put in read data value at address
MemWrite		data at address calculated from ALU, is overwritten by data in register 2
MemToReg	data to write in register file comes from ALU	data to write in register file comes from memory

Exercise

Based on the following instructions, write the truth table for the Control Unit, having as input 6 bits (opcode) and as output 9 bits (control signals)

op	opcode	RegDst	ALUSrc	MemtoReg	RegWrite	MemRead	MemWrite	Branch	ALUOp
R	000000	1	0	0	1	X	0	0	10
lw	100011	0	1	1	1	1	0	0	00
sw	101011	0	1	0	0	X	1	0	00
beq	000100	0	0	X	0	X	0	1	01

Note: the exercise is correct, but there are some places in which we can use don't cares instead of actual values. From here, we can create a PLA with the necessary functions.

Adding New Instructions

`j`

Let's try to add a j (jump) instruction to the current archtecture.

We must define:

it's encoding
it's behaviour
the functional units we need
the flux of information
necessary control signals
execution time (and wether it impacts the total time)

opcode	immediate value
000010	11011101001001001001100111
$31-26$	$25-0$

The immediate value is the absolute address to which we have to jump to (divided by 4). To get the full address we have to expand the immediate value:

PC + 4 first 4 bits	immediate value	multiplication by 4
0110	11011101001001001001100111	00

We have to shift the immediate value by 2, because the instructions are 4 bytes long, so we have to multiply by 4 the absolute address. Then, we get the missing 4 bits from PC + 4, so we stay within the same 256Mb block (the first 4 bits identify the block, so the size of the block is $2^{28} bit = 2^8 bit * 2^{20} \approx 2^8 * 10^6 \approx 256 Mb $).

$PC \leftarrow (PC + 4)[31..28] \ or \ (instruction[25..0] << 2)$

We also need a jump control signal, to determine wether we are jumping or not, and we have to make sure that we don't write any registers or memory. In pink, the implementation of the j instruction.

Jump Instruction Implementation

`jal`

The jal (jump and link) instruction is a J-type that does the same thing as j, with the difference that it saves in $ra (register number 31) the current value of the PC + 4.

$ $ra \leftarrow PC + 4 $

jal Instruction Implementation

`jr`

It's an I-type instruction, we just have to link whatever value we read from rs and move it into PC.

jr Instruction Implementation

`addi`

Not all instructions require modifications to the circuitery, like addi.

Exercise

Add to the CPU the R-type instruction jrr rs (jump relative to register) , which jumps to the address (relative to the PC) contained in rs. $PC \leftarrow PC + 4 + reg[rs]$

jrr Instruction Implementation

Control Signals

op	Jrr	Jump	RegDst	ALUSrc	MemtoReg	RegWrite	MemRead	MemWrite	Branch	ALUOp
jrr	1	0	X	X	X	0	X	0	X	XX

TODO: execution time and clock

Pipeline

We can divide the execution of an instruction into phases:

Fetch: load instruction from memory
Instruction decode: CU decodes instruction into signals, and values are read from registers
Execution: the ALU does the operation, or the access to memory or the branch
Memory Access: memory is read or written (lw, sw)
Write Back: the result of the ALU operation, or the Memory operation is put in the write register

In each moment in time, in a single clock cycle architecture, 80% of the CPU isn't working (only one operation is executed at a time). With a pipeline we can solve this problem by doing an instruction step by step, so in each phase there's a different instruction.

Register File and Clock

The read and write of the register file happen during the same clock cycle. It's basically a latch that writes (the previous instruction) when the clock is 1, and reads the values in the register is 0.

Pipelined Architecture

Pipelined MIPS architecture

Pipelined Architecture with Branches

Pipelined MIPS architecture

We can group the control signals we've used up until now based on the section they're used in: EXE, MEM, WB.

opcode	ALUSrc	RegDest	ALUOp1	ALUOp2	Branch	MemRead	MemWrite	MemToReg	RegWrite
R-Type	0	1	1	0	0	X	0	0	1
lw	1	0	0	0	0	1	0	1	1
sw	1	X	0	0	0	X	1	X	0
beq	0	X	0	1	1	X	0	X	0

Hazard

There are various hazards that can present on a pipelined architecture, which don't happen on a single clock cycle architecture, because we are executing multiple instructions at the same time, without waiting for the previous ones to finish.

There are three types of hazards:

Structural Hazards: hardware resources aren't enough (if the instruction memory and data memory are the same, there could be collision in the instruction fetch phase, and mem phase); these are solved during design.
Data Hazards: if the required data isn't ready yet.
Control Hazards: a jump changes the flow of the instructions' execution.

Let's look at an example:

add $s0, $t0, $t1
sub $t2, $s0, $t3

Let's see what happens if we use a pipeline!


add	IF	ID	EX	ME	WB
sub		IF	ID	EX	ME	WB

The above alignment doesn't work, because during the ID of the sub instruction, we read an old value of $s0, which hasn't been updated with the WB.


add	IF	ID	EX	ME	WB
sub		$\rightarrow$	IF	ID	EX	ME	WB

The same happens here: while the add instruction is still waiting for the memory access, we try to read the value $s0, which still hasn't been updated with the WB phase.


add	IF	ID	EX	ME	WB
sub		$\rightarrow$	$\rightarrow$	IF	ID	EX	ME	WB

This is a valid configuration! As we can see here, the WB phase happens before the ID phase, during the same clock cycle, so we can run these phases of the two instructions at the same time.

Bypassing/Forwarding

In some cases, like this one, the required value could be in the pipeline before the WB phase: in the example above, the new value in $s0 is available right after the EX phase, after the ALU does the operation, and we don't have to wait for a MEM phase. In this case, we can build a shortcut between the result of the ALU, and one of the parameters of the ALU in the next clock cycle. To do this, we have to first detect when we actually need this behaviour.

With such a shortcut, we don't need to wait for the WB.


add	IF	ID	EX	ME	WB
sub		IF	ID	EX	ME	WB

Bubble

In some cases, shortcuts can't be used, so we have to wait one or two instructions before continuing with the next instructions. To wait 1 cycle, we add a bubble, which is an empty instruction, a nop (all 0 values, doesn't affect the registers and memory; it's a valid instruction)

Instruction Rearrangement

Sometimes, by rearranging the instructions, we can solve some of these hazards. Let's look at an example.

lw $t1, 0($t0)
lw $t2, 4($t0)
add $t3, $t1, $t2
sw $t3, 12($t0)
lw $t4, 8($t0)
add $t5, $t1, $t4
sw $t5, 16($t0)

TODO: complete example

Data Hazards & Forwarding Unit

EXE

Hazards in EXE

#![allow(unused)]
fn main() {
if EX/MM.RegWrite == 1 && EX/MM.MemRead == 0 {
    ID/EX.rs == EX/MM.rd || ID/EX.rt == EX/MM.rd
}

if MM/WB.RegWrite == 1 && MM/WB.MemRead == 0 {
    ID/EX.rs == MM/WB.rd || ID/EX.rt == MM/WB.rd
}
}

MemRead has to be 0, because when MemRead is 1, it's an I-Type instruction, and the rd value doesn't have the wanted meaning (could be detected as an hazard, when it's clearly not). RegWrite has to be 1, or this means the instruction before doesn't modify the data.

Then it's just a metter of checking if one register (rs or rt) of the current instruction ID/EX matches with the destination register of the previous one, or the previous two.

Now we have to determine the precedence of the data hazards in EXE.

#![allow(unused)]
fn main() {
if 
    MM/WB.RegWrite == 1 && 
    !(EX/MM.RegWrite == 1) &&
    EX/MM.rd != ID/EX.rt && 
    MM/WB.rd == ID/EX.rt {
        forwardB = 01
}
}

If the value in MM/WB (instruction 2) is valid and needed, and I'm not forwarding the value in EX/MM (instruction 1), then I can forward instruction 2.

Here's a table describing the behaviour of the Forwarding Unit, which handles the forwarding.

control	source
forwardA = 00	ID/EX (current)
forwardA = 01	EX/MM (previous)
forwardA = 10	MM/WB (before previous)

forwardB = 00	ID/EX (current)
forwardB = 01	EX/MM (previous)
forwardB = 10	MM/WB (before previous)

MEM

It happens only in one case:

lw $t0, offset($t1)
sw $t0, offset($t2)

To detect it it's simple enough: you just have to determine if the previous instruction has MemRead and RegWrite set to 1, and the current one has MemWrite set to 1, and in both the instructions the address of rt is the same.

ID

Required only if beq is calculated in ID.

Control Hazard

beq $t0, $t1, else
    lw $s1, ($s1)
else:
    ori $s1, $s1, 10

In this case, we might need to discard the lw instruction which was loaded before jumping to the ori instruction.To make sure the program executes correctly, we can insert two nop instructions, that way, while we run the branch instruction, and we wait for the EXE to calculate the jump, we load 2 empty instruction which don't change the state of the CPU (these are called "bubbles").

beq $t0, $t1, else
    nop
    nop
    lw $s1, ($s1)
else:
    ori $s1, $s1, 10

If we can predict the jump in the ID phase, we need just 1 nop. It doesn't always work like this, as in some loops, we actually jump just at the end. In that case, the jump is never executed, and we can load the next instruction instead of a nop:

.data
    array: .word 1, 5, 8, 7, 6
    size: .word 5
.text
    xor $t1, $t1, $t1 #; i = 0
    sub $s1, $s1, $s1 #; s = 0
    lw $s7, size #; load size into register
    sll $s7, $s7, 2 #; multiply $s7 by 4 (due to word size)

    while: 
        bge $t1, $s7, whileEnd #; if i >= size { jump to end }
        lw $t2, array($t1) # load array[i]; bge is true only once!
        add $s0, $s0, $t2 # s += array[i]

        addi $t1, $t1, 4 #; i += 1
        j while
    whileEnd:

That's why the CPU tries to predict the branch, and tries to load the next instruction or the nop depending on which is executed most often.

We can move the jump decision in ID instead of EXE; in this case:

we have to put just 1 nop after (in EXE we need 2)
if we have a lw before, we need 2 nop before (instead of 1)
if we have a R-Type data hazard, we need 1 nop before (instead of none)

Anticipating `jump`

The j instruction has OPCode 000010, this means that once we fetch the instruction from memory, we can already jump to the address, just by comparing the OPCode.

Branch Prediction

We can count with a hardware solution how many jumps have been done to decide wether it's more likely the jump will be taken or not. To predict a jump, we need a simple FSM with 4 states, which changes prediction after 2 fasle positives (this way, inside loops with a prevalent choice, we always do the most efficient one).

Exercise

TOOD Full Pipelined architecture

Cache

A cache is divided into blocks

line #	valid	tag	block
multiple blocks can have the same line	to determine wether the data is valid	distinguishes a block from the others	the block of data itself
0	0	101	01001001_11010010
1	1	011	11001011_01001100
2	1	111	11010010_11001011
3	0	000	01001100_00100110
4	1	010	00100110_00100110

Direct Mapping

The following is an exmple of a cache with 4 words, with 2 lines

#	word	byte1	byte2	byte3	byte4
0	0	00000000	00000000	00000000	00000000
0	1	00000000	00000000	00000000	00000000
0	2	00000000	00000000	00000000	00000000
0	3	00000000	00000000	00000000	00000000

1	0	00000000	00000000	00000000	00000000
1	1	00000000	00000000	00000000	00000000
1	2	00000000	00000000	00000000	00000000
1	3	00000000	00000000	00000000	00000000

Now let's see how to determine where to get a word in the cache, based on its value (with 4 word blocks, in a 2 line cache).

tag	line #	word	byte
000000000000000000001000000	0	10	01

To determine a HIT in a cache it's easy (where multiple words are present in a block, there's the need for a mux to determine which word to get data from)

Cache Hit

Cache Size

To determine the size of a direct mapping cache we need some data:

$2^n$ lines
$2^m$ words block size
1 validity bit

tag size = $32 - n - m - 2$

cache size in bits = $2^n \cdot (2^m + 1 + tag\_size) $

Associativity

By making the cache more associative, we reduce the number of conflict misses.

Cache Size pt. 2

TODO: pdf 22, slide 11

Policies

Replacement Policy

LRU (least recently used), it requires a bit to determine how old a block is, to decide which one to replace (the oldest one)
LFU (least frequently used), replace the least frequently used, but it requires a more complex hardware (it requires a counter for each set, the counter is updated at every access)
RANDOM, replace a random block

Writing Policy

It's the policy used to update the RAM when the cache is written.

Write through, at each update, the block is updated in RAM (good for consistency on multi-core systems, very slow)
Write back, the blocks is updated only when replaced, (faster, but the content in the cache isn't in sync with the content in RAM)

By using a DIRTY bit, we can manage to save in RAM only blocks which have been changed.

MISS types

Cold start, when the address is requested for the first time (solved by making bigger blocks)
Conflict, when the block has been replaced due to the associativity of the cache (solved by increasing the associativity)
Capacity, the block has been replaced due to the size of the cache (solved by increasing the size of the cache)

Cache & Parallelism

In multi-processor architectures there are multiple parallel caches, which have fast communication to keep the data coherent. Multiple difference processes can access and modify the same data.

There must be a way to keep consistency and coherence of the data in multiple caches.

To solve this problem, there are two strategies:

distributed protocol which caches use to communicate
centralized manager which handles the interactions

Cache Controller FSA

Finite State Automaton

stateDiagram-v2
    IDLE --> Tag
    Tag --> IDLE
    Allocation --> Tag
    Tag --> Allocation
    Tag --> WriteBack
    WriteBack --> WriteBack
    WriteBack --> Allocation
    Allocation --> Allocation

The writes of different processors must be read in order.

Cache Invalidation Protocol

Coherence is when the value I read is the last one written, consistency means that all data is consistent (calendar - message example)

Virtual Memory

In a multi process system, it's hard to manage the memory for all the processes; there could be a problem with memory not being sufficient for all the processes. The solution is to make the addresses of the processes "virtual", and map them to physical ones when needed.

The memory is divided in pages, which are stored on a slower memory when not needed.

Each process has a page table which takes a virtual page and maps it to a physical page.

valid	dirty	used	physical page address
1	0	0	address in mass memory
1	1	0	address in mass memory
0	0	0	address in mass memory
1	1	1	address in mass memory

When valid is 1, the page is in RAM, but you still need 2 accesses to get the address. When valid is 0, the page is in a mass storage, a page fault exception is launched, and it requires millions of clock cycles to get the data from memory.

Policies

Replacement Policy

LRU
LFU
RANDOM

Writing Policy

Write back (because write through requires too much time)

TLB

A Translation Lookaside Buffer is a special buffer used to access virtual memory addresses faster.

Algorithms & Data Structures I

Implementations and exercises for Algo I 2022/2023 course at Sapienza Universita' di Roma (Computer Science Bachelor's degree) in Rust 🦀, cos it's more fun!

The content in the checked boxes was summarized / implemented / completed. If you need explanations on some content, just open an issue, and I'll be happy to help 😄.

(Completed in Rust 56 out of 149 ~ 38%)

Introduction
Big O notation
1. Big O
2. Omega
3. Teta
4. Formulas
5. Ex 1
6. Ex 2
Cost
1. Formulas
2. Ex 1
3. Ex 2
4. Ex 3
Searching Algorithms
1. Linear Search
2. Binary Search (iterative)
3. Ex 1 (TODO: test again)
Recursion
1. Ex 1
2. Ex 2
3. Ex 3
4. Ex 4
5. Ex 5
6. Ex 6
7. Ex 7 (Hanoi)
8. Linear Search
9. Binary Search
10. Factorial
11. Fibonacci
12. Binomial
13. GCD
Let's just not... pt.1
1. Iterative
2. Substitution
3. Tree
4. Main
Let's just not... pt.2
Naive Sorting
1. Insertion Sort
2. Selection Sort
3. Bubble Sort
4. Ex 1
5. Ex 2
Merge Sort
1. Merge
2. Merge Sort
3. Ex 1
4. Ex 2
5. Ex 3
6. Ex 4
7. Ex 5
Quick Sort
1. Quick Sort
2. Ex 1
3. Ex 2
Heap Sort
1. Heap
2. Ex 1
3. Ex 2
4. Ex 3
Linear Sorting
1. Counting Sort
2. Stable Counting Sort
3. Bucket Sort
4. Ex 1
5. Ex 2
6. Ex 3
Linked List
1. Array Operations
2. Linked List
3. Double Linked List
4. Ex 1
5. Ex 2
6. Ex 3
Queue & Stack
1. Queue
2. Stack
3. Queue on LinkedList
4. Stack on LinkedList
5. Priority Queue
6. Ex 1
7. Ex 2
Linked List exercises
1. Ex 1
2. Ex 2
3. Ex 3
4. Ex 4
5. Ex 5
6. Ex 6
7. Ex 7
8. Ex 8
9. Ex 9
10. Ex 10
Tree
1. Graph Theory
2. TreeNode (Tree built with Records)
3. Positional Binary Tree (Basically a Heap)
4. ParentTree (Tree built with Two Arrays)
5. Operations
6. Ex 1
7. Ex 2
DFS/BFS
1. Preorder
2. Inorder
3. Postorder
4. Ex 1
5. Ex 2
6. Ex 3
Dictionary
1. Insert
2. Search
3. Delete
4. Direct Address Table (GeeksForGeeks)
5. Hash Table
Binary Search Tree
1. Binary Search Tree
2. Min
3. Max
4. Operations
5. Ex 1
6. Ex 2
7. Ex 3
8. Ex 4
Black-Red Tree
1. Rotate
2. Insert
3. Delete
4. Ex 1
Exercises pt.1
1. Ex 1
2. Ex 2
3. Ex 3
4. Ex 4
5. Ex 5
6. Ex 6
7. Ex 7
8. Ex 8
Exercises pt.2
1. Ex 1
2. Ex 2
3. Ex 3
4. Ex 4
5. Ex 5
6. Ex 6
Exercises pt.3
Other
1. Tim Sort
2. Has Duplicates in Merge Sort
3. Merge Sort on Linked List (iterative)
4. Python List (TODO: look info about it's implementation)
[Pytohn Utils](https://twiki.di.uniroma1.it/pub/Intro_algo/AD/Dispense/METODI_UTILI_IN_PYTHON.pdf

Computational Cost

Mathematical Series

Sums

$\sum_{i = 0}^{n} 1 = θ (n)$
$\sum_{i = 0}^{n} i = θ (n^{2})$

$\sum_{i = 0}^{n} 2^{i} = θ (2^{n})$
$\sum_{i = 0}^{n} c^{i} = \frac{c ^{n + 1} - 1}{c - 1}, c > 1$
$\sum_{i = 0}^{n} c^{i} = θ (1), c < 1$
$\sum_{i = 0}^{n} i c^{i} = θ (n c^{n}), c > 1$

$\sum_{i = 0}^{n} lo g i = θ (n lo g n)$
$\sum_{i = 0}^{n} lo g^{c} i = θ (n lo g^{c} n)$

$\sum_{i = 0}^{n} \frac{1}{i} = θ (lo g n)$
$\sum_{i = 0}^{n} \frac{1}{c ^{i}} = θ (1)$

Recurrence Equations

We'll analyzer the computational cost of the following recursive search algorithm.

def search(list, value, index=0):
    if list[index] == value:
        return index;

    if index == len(list) - 1:
        return None

    return search(list, value, index + 1)

The first step, requires writing out a system of the equation and the base case of the algorithm.

${T (n) = T (n - 1) + θ (1) T (1) = θ (1)$

Now let's solve the equation, in four different methods.

Iterative

Idea: - develop the equation and express it as sum of terms depending on $n$ and the base case.

Difficulty: - many algebric calculations to do.

$T (n) = T (n - 1) + θ (1) ⟹ T (n) = [T (n - 2) + θ (1)] + θ (1) ⟹ T (n) = {[T (n - 3) + θ (1)] + θ (1)} + θ (1) ⟹ T (n) = T (n - k) + k θ (1)$

Then we calculate the equation when $n - k \to 1$ , the base case.

$n - k = 1 ⟹ k = n - 1 ⟹ T (n) = T (1) + (n - 1) θ (1) ⟹ T (n) = θ (1) + n θ (1) - θ (1) ⟹ T (n) = θ (n)$

Tree

TODO: draw trees in markdown!

Substitution

Idea: - ipothize a solution for the given recurrence equation - verify (by induction) wether it works

Difficulty: - it's hard to find a solution as close as possible to the real solution - it's used mainly in demonstrations

Let's suppose $T (n) = c n$ , and $T (1) = d$ , where $c$ and $d$ are fixed constants.

$T (n) = c n = c (n - 1) + θ (1) ⟹ c n = c n - c + θ (1) ⟹ c = θ (1)$

This doesn't mean that T(1), which is a $θ (1)$ is the same as $c$ , so we need two constants.

${T (n) = T (n - 1) + c T (1) = d$

Now we have to prove that $kn$ is a $O (n)$ and a $ω (n)$ using induction.

$O (n)$

$T (n) = O (n) ⟹ T (n) \leq kn$ where k is to be determined.

Base Case

First, check for which values the base case is verified.

$T (1) \leq k \cdot 1 ⟹ d \leq k ⟹ k \geq d$

Induction

Then check if the general case is covered by the base case.

$T (n) \leq k (n - 1) + c = kn - k + c \leq kn ⟹ k > c$

We get that $k \geq d \land k \geq c$ , we can always find constants $c$ and $d$ so that $\exists k$ greater than both, so the induction is verified.

$ω (n)$

$T (n) = ω (n) ⟹ T (n) \geq kn$ where k is to be determined.

Base Case

First, check for which values the base case is verified.

$T (1) \geq k \cdot 1 ⟹ d \geq k ⟹ k \leq d$

Induction

Then check if the general case is covered by the base case.

$T (n) = k (n - 1) + c = kn - k + c \geq kn ⟹ k \leq c$

We get that $k \leq d \land k \leq c$ , we can always find constants $c$ and $d$ so that $\exists k$ smaller than both, so the induction is verified.

Main

Idea: - It's a set of formulas to solve a recurrence equation

Difficulty: - works only when the equation is in the form $T (n) = a T (\frac{n}{b}) + f (n)$ with $T (1) = θ (1)$

Theorem

Given

$a \geq 1, b \geq 1$
$f : R \to R$
$n \to + \infty lim f (n) \geq 0$

The equation:

${T (n) = a T (\frac{n}{b}) + f (n) T (1) = θ (1)$

There are three cases that can generate by comparing $f (n)$ with $n^{l o g_{b} a}$ :

$f (n) = O (n^{l o g_{b} a - ϵ}), ϵ > 0 ⟹ T (n) = θ (n^{l o g_{b} a})$
$f (n) = θ (n^{l o g_{b} a}), ⟹ T (n) = θ (n^{l o g_{b} a} lo g n)$
$f (n) = ω (n^{l o g_{b} a + ϵ}), ϵ > 0 \land a \cdot f (\frac{n}{b}) \leq c f (n), c < 1, n >> 1 ⟹ T (n) = θ (f (n))$

The comparison must be polynomial, by an order of $n^{ϵ}$ .

Where not to apply it?

In the following examples, the main method cannot be applied.

Ex 1

${T (n) = 2 T (\frac{n}{2}) + θ (n lo g n) T (1) = θ (1)$

$a = 2, b = 2$
$f (n) = θ (n lo g n)$
$n^{l o g_{b} a} = n^{l o g_{2} 2} = n$

In this case, $f (n)$ is asintotically bigger than n, but not plynomially bigger. In fact $lo g n < n^{ϵ}, ϵ > 0$

Ex 2

${T (n) = 2 T (\frac{n}{2}) + θ (\frac{n}{l o g n}) T (1) = θ (1)$

$a = 2, b = 2$
$f (n) = θ (\frac{n}{l o g n})$
$n^{l o g_{b} a} = n^{l o g_{2} 2} = n$

In this case, $f (n)$ is asintotically smaller than n, but not plynomially smaller. In fact $lo g n < n^{ϵ}, ϵ > 0$

Searching Algorithms

A bunch of searching algorithms on arrays 🔎.

Linear Search

Rust

#![allow(unused)]
fn main() {
pub fn linear_search<T: Eq>(array: &[T], value: T) -> Option<usize> {
    array
        .iter()
        .enumerate()
        .find_map(|(i, v)| if *v == value { Some(i) } else { None })
}
}

Java

    public static <T> Optional<Integer> search(T[] array, T toFind) {
        for (var index = 0; index < array.length; index++)
            if (array[index] == toFind)
                return Optional.of(index);

        return Optional.empty();
    }

Binary Search

In the course, you will study the recursive implementation of the binary search, in my code, I've written an iterative one (I don't "like" recurion)

Rust

#![allow(unused)]
fn main() {
pub fn binary_search<T: Ord>(array: &[T], value: T) -> Option<usize> {
    let mut step = array.len();
    let mut index = 0;

    while step > 0 {
        let next = index + step;

        while next < array.len() {
            let cmp = match array.get(next) {
                Some(v) => v.cmp(&value),
                None => break,
            };

            match cmp {
                Equal => return Some(next),
                Less => index = next,
                Greater => break,
            }
        }

        step /= 2;
    }

    None
}
}

Java

    public static <T extends Comparable<? super T>> Optional<Integer> binarySearch(T[] array, T toFind) {
        int jump = array.length - 1, index = 0;

        while (jump > 0) {
            while (index + jump < array.length) {
                var comparison = array[index + jump].compareTo(toFind);

                if (comparison > 0)
                    break;

                index += jump;

                if (comparison == 0)
                    return Optional.of(index);
            }

            jump /= 2;
        }

        return Optional.empty();
    }

Exercise

Given $A$ , an array of integers, and two values $a$ and $b$ , with $a \leq b$ , count how many elements of $A$ are included in the range $[a, b]$

The simplest way to solve the problem is to implement two functions: lower_bound and upper_bound, which are basically binary searches that don't stop once they find the value in the array. In the case of lower_bound, it finds the index of "the smallest value bigger or equal to $x$ ", and the upper_bound is "the biggest value smaller or equal to $x$ " with $x$ being the value to find.

This way, we can find the upper_bound of $b$ , and the lower_bound of $a$ , and do a subtraction of the two to find the number of elements inbetween. There are a few corner cases to consider both for upper_bound, lower_bound and count_in_range (the function that solves the exercise) for which we return 0 (open an ISSUE if you want me to discuss them).

Rust

#![allow(unused)]
fn main() {
pub fn upper_bound<T: Ord>(array: &Vec<T>, value: T) -> Option<usize> {
    let mut step = array.len();
    let mut index = 0;

    if array.first().unwrap() > &value {
        return None;
    }

    while step > 0 {
        let next = index + step;

        while next < array.len() {
            let cmp = match array.get(next) {
                Some(v) => v.cmp(&value),
                None => break,
            };

            match cmp {
                Greater => break,
                _ => index = next,
            }
        }

        step /= 2;
    }

    Some(index)
}
}

#![allow(unused)]
fn main() {
pub fn lower_bound<T: Ord>(vector: &Vec<T>, value: T) -> Option<usize> {
    let mut step = vector.len();
    let mut index = vector.len() - 1;

    if vector.last().unwrap() < &value {
        return None;
    }

    while step > 0 {
        while step <= index {
            let cmp = match vector.get(index - step) {
                Some(v) => v.cmp(&value),
                None => break,
            };

            match cmp {
                Less => break,
                _ => index -= step,
            }
        }

        step /= 2;
    }

    Some(index)
}
}

#![allow(unused)]
fn main() {
    pub fn count_in_range<T: Ord>(vector: Vec<T>, lower: T, upper: T) -> usize {
        let lower = lower_bound(&vector, lower);
        let upper = upper_bound(&vector, upper);

        if let (Some(l), Some(u)) = (lower, upper) {
            return match l.cmp(&u) {
                Greater => 0,
                _ => u.abs_diff(l) + 1,
            };
        }

        0
    }
}

Java

TODO: make bound functions return Optional, handle corner cases

    public static <T extends Comparable<? super T>> Integer upperBound(List<T> list, T toFind) {
        int jump = list.size() - 1, index = 0;

        while (jump > 0) {
            while (index + jump < list.size() && list.get(index + jump).compareTo(toFind) <= 0)
                index += jump;

            jump /= 2;
        }

        return index;
    }

    public static <T extends Comparable<? super T>> Integer lowerBound(List<T> list, T toFind) {
        int jump = list.size() - 1, index = list.size() - 1;

        while (jump > 0) {
            while (index - jump >= 0 && list.get(index - jump).compareTo(toFind) >= 0)
                index -= jump;

            jump /= 2;
        }

        return index;
    }

TODO: write a countInRange method

Recursion

These are just a bunch of recursive functions and exercises, nothing too special. There should be a faster way to write a recursive fibonacci with doubling, I'll work on it.

#![allow(unused)]
fn main() {
pub fn linear_search<T: Eq>(array: &[T], value: T, index: usize) -> Option<usize> {
    if index == array.len() {
        return None;
    }

    if let Some(v) = array.get(index) {
        if *v == value {
            return Some(index);
        }
    }

    linear_search(array, value, index + 1)
}

pub fn binary_search<T: Ord>(array: &[T], value: T, start: usize, end: usize) -> Option<usize> {
    if start == end {
        return None;
    }

    let mid = (end - start) / 2;

    if let Some(v) = array.get(mid) {
        match value.cmp(v) {
            Equal => Some(mid),
            Greater => binary_search(array, value, mid + 1, end),
            _ => binary_search(array, value, start, mid),
        };
    }

    None
}

pub fn factorial(number: usize) -> usize {
    if number == 0 {
        return 1;
    }

    number * factorial(number - 1)
}

pub fn fibonacci(nth: usize) -> usize {
    if nth == 0 || nth == 1 {
        return 1;
    }

    fibonacci(nth - 1) + fibonacci(nth - 2)
}
}

TODO: fast fibonacci

Guided Exercises

#![allow(unused)]
fn main() {
// Ex 1, kth power of n
pub fn pow(base: usize, exponent: usize) -> usize {
    if exponent == 0 {
        return 1;
    }

    base * pow(base, exponent - 1)
}

// Ex 2, sum of elements
pub fn sum(array: &[usize], index: usize) -> usize {
    if let Some(x) = array.get(index) {
        return x + sum(array, index + 1);
    }

    0
}

// Ex 3, find min
pub fn min<T: Ord>(array: &[T], index: usize) -> Option<&T> {
    if index == array.len() {
        return None;
    }

    std::cmp::min(array.get(index), min(array, index + 1))
}

// Ex 4, palindrome
pub fn is_palindrome<T: Eq>(_array: &[T], _index: usize) -> bool {
    false
}

// Ex 5, reverse print
pub fn reverse_print<T: Debug>(array: &[T], index: usize) {
    if let Some(t) = array.get(index) {
        print!("{:?}", t);
        reverse_print(array, index - 1)
    }
}

// Ex 6, print in order
pub fn print<T: Debug>(array: &[T], index: usize) {
    if let Some(t) = array.get(index) {
        print!("{:?}", t);
        print(array, index + 1)
    }
}

// Ex 7, hanoi
}

TODO: Hanoi

Exercises

TODO: Binomial

#![allow(unused)]
fn main() {
    // Ex 1, binomial

    // Ex 2, GCD
    pub fn gcd(x: usize, y: usize) -> usize {
        if y == 0 {
            return x;
        }

        gcd(y, x % y)
    }
}

Naive Sorting

Insertion Sort

Rust

#![allow(unused)]
fn main() {
pub fn insertion_sort<T: Ord>(array: &mut [T]) {
    for i in 1..array.len() {
        for j in (1..=i).rev() {
            if array[j - 1] < array[j] {
                break;
            }

            array.swap(j - 1, j);
        }
    }
}
}

Java

    static <T extends Comparable<? super T>> void insertionSort(List<T> list, Integer start, Integer end) {
        for (var index = start + 1; index < end; index++) {
            var left = index;
            while (left > start && list.get(left).compareTo(list.get(left - 1)) < 0) {
                // swap
                var temp = list.get(left);
                list.set(left, list.get(left - 1));
                list.set(left - 1, temp);

                left--;
            }
        }
    }

Selection Sort

Rust

#![allow(unused)]
fn main() {
pub fn selection_sort<T: Ord>(vector: &mut [T]) {
    for i in 0..vector.len() - 1 {
        let (j, _) = (&vector[i..])
            .iter()
            .enumerate()
            .min_by(|&(_, x), &(_, y)| x.cmp(y))
            .unwrap();

        vector.swap(i, j + i);
    }
}
}

Java

    public static <T extends Comparable<? super T>> void selectionSort(List<T> list) {
        for (int index = 0; index < list.size(); index++) {
            var minIndex = min(list, index);

            var temp = list.get(index);
            list.set(index, list.get(minIndex));
            list.set(minIndex, temp);
        }
    }

Bubble Sort

Rust

#![allow(unused)]
fn main() {
pub fn bubble_sort<T: Ord>(vector: &mut [T]) {
    for i in 0..vector.len() {
        for j in (i + 1..vector.len()).rev() {
            if vector[j] < vector[j - 1] {
                vector.swap(j, j - 1)
            }
        }
    }
}
}

Java

    public static <T extends Comparable<? super T>> void bubbleSort(List<T> list) {
        for (var left = 0; left < list.size(); left++)
            for (var right = left; right < list.size(); right++)
                if (list.get(left).compareTo(list.get(right)) > 0) {
                    var temp = list.get(left);
                    list.set(left, list.get(right));
                    list.set(right, temp);
                }
    }

Exercises

Just a bunch of exercises related to sorting.

Rust

#![allow(unused)]
fn main() {
// Pdf 8, Slide 35
pub mod exercises {
    use std::ops::Range;

    // Ex 1, pt. 1
    pub fn reversed_bubble_sort<T: Ord>(vector: &mut [T]) {
        for i in (0..vector.len() - 1).rev() {
            for j in 0..=i {
                if vector[j] < vector[j + 1] {
                    vector.swap(j, j + 1)
                }
            }
        }
    }

    // Ex 1, pt. 2, Which are stable?
    // Insertion Sort - stable
    // Selection Sort - stable
    // Bubble Sort - unstable

    // Ex 1, pt. 3, Cost if sorted? Cost if all equal?
    // Insertion Sort - sorted O(n) - equal O(n)
    // Selection Sort - sorted O(n^2) - equal O(n^2)
    // Bubble Sort - sorted O(n^2) - equal O(n^2)

    // Ex 2, pt. 1, Write an insertion_sort using a separate function for min
    pub fn min_in_range<T: Ord>(vector: &[T], r: Range<usize>) -> usize {
        let (index, _) = (&vector[r])
            .iter()
            .enumerate()
            .min_by(|&(_, x), &(_, y)| x.cmp(y))
            .unwrap();

        index
    }

    pub fn min_selection_sort<T: Ord>(vector: &mut [T]) {
        for i in 0..vector.len() - 1 {
            let j = min_in_range(vector, i..vector.len());
            vector.swap(i, i + j);
        }
    }

    // Ex 2, pt. 2, Check if array has_duplicates, based on naive sorting algorithms
    pub fn has_duplicates<T: Eq>(vector: &[T]) -> bool {
        for (index, value) in vector.iter().enumerate() {
            if (vector[index + 1..]).iter().filter(|&x| x == value).count() > 0 {
                return true;
            }
        }

        false
    }
}
}

Merge Sort

Quick Sort

Heap Sort

Heap

To code a heap_sort function, we need to implement the heap data structure on an array.

#![allow(unused)]
fn main() {
pub struct Heap<T> {
    buffer: Box<[T]>,
    size: usize,
}
}

Looks simple enough... now we need a way to create a heap (ideally from a boxed slice) or just create an empty one in which to insert values later.

#![allow(unused)]
fn main() {
impl<T: Copy + Ord> From<Box<[T]>> for Heap<T> {
    fn from(value: Box<[T]>) -> Self {
        let mut heap = Self {
            size: value.len(),
            buffer: value,
        };

        heap.build();
        heap
    }
}
}

We'll look into the specification of the Heap::build method later, to see what does it do, and why it requires Copy and Ord traits.

#![allow(unused)]
fn main() {
impl<T: Default + Copy> Heap<T> {
    pub fn new<const SIZE: usize>() -> Self {
        Self {
            buffer: Box::new([Default::default(); SIZE]),
            size: 0,
        }
    }
}
}

Heap Methods

heapify is the most important method to make a heap work: it basically rearranges in $O(\log{n})$ a Heap in which only the root is out of order.

#![allow(unused)]
fn main() {
impl<T: Ord + Copy> Heap<T> {
    fn heapify(&mut self, node: usize) {
        use std::cmp::max;

        if let Some((v, i)) = max(self.child(node, 1), self.child(node, 2)) {
            if self.buffer.get(node) < v {
                self.buffer.swap(node, i);
                self.heapify(i)
            }
        }
    }
}
}

Now that we have the heapify method, to build the Heap, we just need to run heapify on the left side of the array.

#![allow(unused)]
fn main() {
impl<T: Ord + Copy> Heap<T> {
    fn build(&mut self) {
        (0..self.size / 2).rev().for_each(|n| self.heapify(n));
    }
}
}

Indexing a Heap

In a Heap, to get the children of a node at an index i, we just need a formula:

i * 2 + 1 for the left child
i * 2 + 2 for the right child

Knowing this, we can write a child method to get the children of a node in a Heap, and reuse it in the heapify method.

#![allow(unused)]
fn main() {
impl<T: Ord + Copy> Heap<T> {
    fn child(&self, node: usize, child: usize) -> Option<(Option<&T>, usize)> {
        let node = 2 * node + child;

        if node >= self.size {
            return None;
        }

        Some((self.buffer.get(node), node))
    }
}
}

Iterating a Heap

Now that we have the Heap setup, we just need to implement the Iterator trait to consume the Heap. The next method is very simple: we just swap the last element with the root (in position 0), reduce the size of the Heap, and run heapify again.

#![allow(unused)]
fn main() {
impl<T: Ord + Copy> Iterator for Heap<T> {
    type Item = T;

    fn next(&mut self) -> Option<Self::Item> {
        if self.size == 0 {
            return None;
        }

        self.buffer.swap(0, self.size - 1);
        self.size -= 1;
        self.heapify(0);

        Some(self.buffer[self.size])
    }
}
}

Heap Sort

Now sorting a Heap becomes a very easy task! We just have to run heapify until the there are no more elements, and the unerlying buffer will be sorted.

#![allow(unused)]
fn main() {
pub fn heap_sort<T: Ord + Copy + Default>(buffer: Box<[T]>) {
    Heap::from(buffer).into_iter().for_each(drop);
}
}

Exercises

#![allow(unused)]
fn main() {
pub mod exercises {
    use std::cmp::Reverse;

    use super::*;

    // Ex 1, O(n) for MaxHeap, O(1) for MinHeap

    pub enum BinaryHeap<T> {
        MaxHeap(Heap<T>),
        MinHeap(MinHeap<T>),
    }

    pub fn min<T: Ord + Copy + Default>(heap: &mut BinaryHeap<T>) -> Option<T> {
        match heap {
            BinaryHeap::MaxHeap(heap) => heap.last(),
            BinaryHeap::MinHeap(heap) => heap.next(),
        }
    }

    // Ex 2, Build a MinHeap struct

    pub struct MinHeap<T> {
        heap: Heap<T>,
    }

    impl<T: Ord + Default + Copy> MinHeap<Reverse<T>> {
        pub fn from(buffer: Box<[T]>) -> Self {
            Self {
                heap: Heap::from(
                    buffer
                        .iter()
                        .map(|v| Reverse(*v))
                        .collect::<Vec<Reverse<T>>>()
                        .into_boxed_slice(),
                ),
            }
        }
    }

    impl<T: Ord + Default + Copy> Iterator for MinHeap<T> {
        type Item = T;

        fn next(&mut self) -> Option<Self::Item> {
            self.heap.next()
        }
    }

    // Ex 3, insert in Heap with available space

    impl<T: Ord + Default + Copy> Heap<T> {
        pub fn insert(&mut self, value: T) -> Result<(), &'static str> {
            if self.size >= self.buffer.len() {
                return Err("Array is full");
            }

            self.buffer[self.size] = value;
            self.size += 1;
            self.build();

            Ok(())
        }
    }
}
}

$O(n)$ Sorting Algorithms

Counting Sort

#![allow(unused)]
fn main() {
pub fn counting_sort<'a>(array: &'a mut [usize]) {
    let mut counter: Vec<usize> = vec![0; *array.iter().max().unwrap_or(&mut 0) + 1];

    for n in array.iter() {
        counter[*n] += 1;
    }

    let mut index = 0;
    for (number, &count) in counter.iter().enumerate() {
        for _ in 0..count {
            array[index] = number;
            index += 1;
        }
    }
}
}

Stable Counting Sort

It works on generics too!

#![allow(unused)]
fn main() {
pub trait IntoIndex {
    fn into_index(&self) -> usize;
}

pub fn stable_counting_sort<'a, T: Clone + Copy + IntoIndex + Default>(array: &'a mut [T]) {
    let mut counter: Vec<usize> = vec![0; array.iter().map(T::into_index).max().unwrap_or(0) + 1];
    for n in array.iter().map(T::into_index) {
        counter[n] += 1;
    }

    let mut positions = counter;
    for i in 1..positions.len() {
        positions[i] += positions[i - 1];
    }

    let mut tmp: Vec<T> = vec![T::default(); array.len()];
    for k in array.iter().rev() {
        tmp[positions[k.into_index()] - 1] = *k;
        positions[k.into_index()] -= 1;
    }

    for (i, k) in tmp.iter().enumerate() {
        array[i] = *k;
    }
}
}

Bucket Sort

TODO: bucket sort with LinkedList and Insertion Sort / Counting sort / Olog(n)

Exercises

#![allow(unused)]
fn main() {
mod exercises {
    // Ex 1, Is stable_counting_sort stable? Yes

    // Ex 2, Worst case for bucket_sort? O(n^2) if insertion_sort is used for buckets, O(n) if
    // counting sort is used

    // Ex 3, bucket_sort using counting_sort for buckets, hypotesis on k?
}
}

Linked List

#![allow(unused)]
fn main() {
pub struct LinkedList<T> {
    pub value: T,
    pub next: Option<Rc<LinkedList<T>>>,
}
}

Queue

Linked List Implementation

Rust

Java

import java.util.Optional;

public class Queue<T> {
    Node<T> head, tail;

    public void enqueue(T value) {
        var node = new Node<T>(value);

        if (tail == null) {
            head = node;
            tail = node;
        } else {
            tail.next = node;
            tail = tail.next;
        }
    }

    public Optional<T> dequeue() {
        if (head == null)
            return Optional.empty();

        var result = head.value;
        head = head.next;
        if (head == null)
            tail = null; // Queue is empty

        return Optional.of(result);
    }
}

Array Implementation

Rust

#![allow(unused)]
fn main() {
pub struct Queue<T> {
    buffer: Box<[T]>,
    start: usize,
    end: usize,
    size: usize,
}

impl<T> Queue<T> {
    pub fn from(buffer: Box<[T]>) -> Self {
        Self {
            buffer,
            start: 0,
            end: 0,
            size: 0,
        }
    }

    pub fn len(&self) -> usize {
        self.size
    }

    pub fn enqueue(&mut self, value: T) -> Result<(), &'static str> {
        if self.size == self.buffer.len() {
            return Err("The Queue is full");
        }

        self.buffer[self.end] = value;
        self.size += 1;
        self.end = (self.end + 1) % self.buffer.len();

        Ok(())
    }

    pub fn dequeue(&mut self) -> Option<&T> {
        if self.size == 0 {
            return None;
        }

        let result = self.buffer.get(self.start);

        self.start = (self.start + 1) % self.buffer.len();
        self.size -= 1;

        result
    }
}

impl<T: Copy> Iterator for Queue<T> {
    type Item = T;

    fn next(&mut self) -> Option<Self::Item> {
        self.dequeue().and_then(|&v| Some(v))
    }
}
}

Java

import java.util.Optional;

public class ArrayQueue<T> {
    Object[] queue;
    Integer head = 0, tail = 0, size = 0;

    public ArrayQueue(Integer size) {
        queue = new Object[size];
    }

    public void enqueue(T value) throws IndexOutOfBoundsException {
        if (size == queue.length)
            throw new IndexOutOfBoundsException();

        queue[tail] = value;
        tail++;
        if (tail >= queue.length)
            tail = 0;
    }

    public Optional<Object> pop() {
        if (size == 0)
            return Optional.empty();

        var result = queue[head];
        head++;
        if (head >= queue.length)
            head = 0;
        return Optional.of(result);
    }
}

Stack

Linked List Implementation

Rust

Java

import java.util.Optional;

public class Stack<T> {
    Node<T> top;

    public void push(T value) {
        top = new Node<T>(value, top);
    }

    public Optional<T> pop() {
        if (top == null)
            return Optional.empty();

        var result = top.value;
        top = top.next;
        return Optional.of(result);
    }
}

Array Implementation

Rust

#![allow(unused)]
fn main() {
pub struct Stack<T> {
    buf: Box<[T]>,
    len: usize,
}

impl<T> From<Vec<T>> for Stack<T> {
    fn from(value: Vec<T>) -> Self {
        Self {
            buf: value.into_boxed_slice(),
            len: 0,
        }
    }
}

impl<T> Stack<T> {
    pub fn len(&self) -> usize {
        self.len
    }

    pub fn push(&mut self, value: T) -> Result<(), &'static str> {
        if self.len == self.buf.len() {
            return Err("Pirla, non hai piu' spazio per i piatti!");
        }

        self.buf[self.len] = value;
        self.len += 1;

        Ok(())
    }

    pub fn pop(&mut self) -> Option<&T> {
        if self.len == 0 {
            return None;
        }

        self.len -= 1;
        self.buf.get(self.len)
    }
}

impl<T: Copy> Iterator for Stack<T> {
    type Item = T;

    fn next(&mut self) -> Option<Self::Item> {
        self.pop().and_then(|&v| Some(v))
    }
}
}

Java

import java.util.Optional;

public class ArrayStack<T> {
    Object[] stack;
    Integer top;

    public ArrayStack(Integer size) {
        stack = new Object[size];
        top = 0;
    }

    public Boolean isFull() {
        return top == stack.length - 1;
    }

    public void push(T value) throws IndexOutOfBoundsException {
        if (isFull())
            throw new IndexOutOfBoundsException();

        stack[top] = value;
        top++;
    }

    public Optional<Object> pop() {
        if (top == 0)
            return Optional.empty();

        var result = stack[top];
        top--;
        return Optional.of(result);
    }

Red Black Tree

#include <stdio.h>
#include <stdlib.h>

typedef enum { Red, Black } Color;
typedef struct RBTree RBTree;

struct RBTree {
  int value;
  Color color;
  struct RBTree *left, *right, *parent;
};

RBTree *FAKE_LEAF = &(RBTree){0, Black, NULL, NULL, NULL};

void rotate_left(RBTree *x) {
  RBTree *parent = x->parent, *y = x->right;
  RBTree *alpha = x->left, *beta = y->left, *gamma = x->right;

  if (parent != NULL) {
    if (parent->left == x)
      parent->left = y;
    else
      parent->right = y;
  }
  y->parent = parent;

  y->left = x;
  x->parent = y;
  x->left = alpha;
  x->right = beta;
  alpha->parent = x;
  beta->parent = x;
}

void rotate_right(RBTree *x) {
  RBTree *parent = x->parent, *y = x->left;
  RBTree *alpha = y->left, *beta = y->right, *gamma = x->right;

  if (parent != NULL) {
    if (parent->left == x)
      parent->left = y;
    else
      parent->right = y;
  }
  y->parent = parent;

  y->left = alpha;
  y->right = x;
  alpha->parent = y;
  x->parent = y;
  x->left = beta;
  x->left->parent = y;
}

void fix(RBTree *tree) {
  if (tree->parent == NULL) {
    tree->color = Black;
    return;
  }

  if (tree->parent->parent == NULL || tree->parent->color == Black)
    return;

  RBTree *grandpa = tree->parent->parent, *father = tree->parent;
  RBTree *uncle = grandpa->left == father ? grandpa->right : grandpa->left;

  if (uncle->color == Red) {
    father->color = Black;
    uncle->color = Black;
    grandpa->color = Red;
    fix(grandpa);
  } else if (father->right == tree) {
    rotate_left(father);
    fix(father);
  } else if (father->left == tree) {
    if (grandpa->left == father) {
      rotate_right(grandpa);
      grandpa->color = Red;
      father->color = Black;
    } else {
      rotate_left(grandpa);
      grandpa->color = Red;
      father->color = Black;
      fix(tree);
    }
  }
}

void insert_with_parent(RBTree **tree, RBTree *parent, int value) {
  if (*tree == FAKE_LEAF) {
    *tree = malloc(sizeof(RBTree));
    **tree = (RBTree){value, Red, FAKE_LEAF, FAKE_LEAF, parent};
    fix(*tree);
  } else if (value < (*tree)->value)
    insert_with_parent(&(*tree)->left, *tree, value);
  else
    insert_with_parent(&(*tree)->right, *tree, value);
}

void insert(RBTree **tree, int value) {
  insert_with_parent(tree, NULL, value);
  if ((*tree)->parent != NULL)
    *tree = (*tree)->parent;
}

void visit(RBTree *tree, int layer) {
  for (int _ = 0; _ < layer; _++)
    printf(" ");

  if (tree == FAKE_LEAF) {
    printf("\x1b[1;30m\x1b[1;47m-\x1b[0m\n");
    return;
  }

  if (tree->color == Red)
    printf("\x1b[1;31m");
  else
    printf("\x1b[1;30m\x1b[1;47m");
  printf("%d\x1b[0m\n", tree->value);
  visit(tree->left, layer + 1);
  visit(tree->right, layer + 1);
}

int main() {
  RBTree *tree = FAKE_LEAF;

  int values[] = {11, 14, 15, 2, 1, 7, 5, 8, 4};
  int SIZE = 9;

  for (int *value = values; value < values + SIZE; value++)
    insert(&tree, *value);

  visit(tree, 0);
}

Assimoatica della probabilità

La probabilità si basa sugli assiomi introdotti da Andrey Kolmogorov nel 1933. Uno schema probabilistico, o modello probabilistico, è composto da 3 oggetti $(Ω, A, P)$ .

$Ω = {possibili risultati di un esperimento}$
$A = {E ∣ E \subseteq Ω} = P (Ω)$
- $∣Ω∣ = n ⟹ ∣ A ∣ = 2^{n}$
- $(A, \cap, \cup, ∁)$
$P : A \to [0, 1]$
- $P (\emptyset) = 0$
- $P (Ω) = 1$
- $A, B \in A \land A \cap B = \emptyset ⟹ P (A \cup B) = P (A) + P (B)$

In alternativa al punto 3.

$P : A \to [0, 1]$
- $\forall E \in A, P (E) \in R \land P (E) \geq 0$
- $P (Ω) = 1$
- $E_{1}, E_{2}, ..., E_{n} \in A$
  - $i \neq = j ⟹ E_{i} \cap E_{j} = \emptyset$
  - $P (i = 1 ⋃ n E_{i}) = i = 1 \sum n P (E_{i})$

$Ω$ è chiamato spazio degli eventi elementari (o spazio campionario) $E \subseteq Ω$ è un evento, ovvero, una domanda binaria sull'esito dell'esperimento $A$ è l'algebra degli eventi $P$ è la probabilità

Conseguenze degli assiomi

TODO: dimostrare le proprietà sotto

Monotonicità $A, B \in A, A \subseteq B ⟹ P (A) \leq P (B)$

Probabilità dell'insieme vuoto $P (\emptyset) = 0$

Regola del complemento $E \in A, P (E^{∁}) = 1 - P (E))$

Limite numerico $\forall E \in A, 0 \leq P (E) \leq 1$

Altre conseguenze $A, B \in A, P (A \cup B) = P (A) + P (B) - P (A \cap B)$

Costruzione di $P$

Sia $p : Ω \to [0, 1]$ ottenuta restringendo $P$ agli eventi elementari $\forall ω \in Ω, p (ω) := P ({ω}) :$

$\forall ω \in Ω, 0 \leq p (ω) \leq 1$
$ω \in Ω \sum p (ω) = 1$

Per il terzo assioma $ω \in Ω \sum p (ω) = ω \in Ω \sum P ({ω}) = P (ω \in Ω ⋃ ω) = P (Ω) = 1$

Probabilità uniforme

Si ha probabilità uniforme quando $p (ω) = \frac{1}{∣Ω∣} ⟹ P (A = i = 1 ⋃ k {ω_{i}}, k \leq ∣Ω∣) = \frac{∣ A ∣}{∣Ω∣}$

Indipendenza

Spazio di probabilità prodotto

Siano $(Ω_{1}, P_{1})$ e $(Ω_{2}, P_{2})$ due schemi probabilistici (Es. lancio di una moneta e misura della temperatura a Rio). Ora si considera l'esperimento congiunto

$Ω = Ω_{1} \times Ω_{2} = {(ω_{1}, ω_{2}) ∣ ω_{1} \in Ω_{1}, ω_{2} \in Ω_{2}}$

La scelta naturale per la probabilità $P$ è la probabilità prodotto

$P ({ω_{1}, ω_{2}}) = P_{1} ({ω_{1}}) \cdot P_{2} ({ω_{2}})$

Si può osservare che:

Se $P_{1}, P_{2}$ sono le probabilità uniformi su $Ω_{1}, Ω_{2}$ allora $P$ , probabilità prodotto, è la probabilità uniforme su $Ω$ .

Infatti

$P ({ω_{1}, ω_{2}}) = \frac{1}{∣ Ω _{1} ∣} \cdot \frac{1}{∣ Ω _{2} ∣} = \frac{1}{∣ Ω _{1} \times Ω _{2} ∣} = \frac{1}{∣Ω∣}$

$P$ soddisfa le seguenti condizioni di compatibilità:

Sia $A \subseteq Ω$ un evento che posso decidere osservando l'esperimento descritto solo da $(Ω_{1}, P_{1})$ , osservo che $A = A_{1} \times Ω_{2}$ con $A_{1} \subseteq Ω_{1}$ , allora

$P (A) = (ω_{1}, ω_{2}) \in A_{1} \times Ω_{2} \sum P_{1} ({ω_{1}}) \cdot P_{2} ({ω_{2}}) = = ω_{1} \in A_{1} \sum P ({ω_{1}}) \cdot ω_{2} \in Ω_{2} \sum P ({ω_{2}}) = = ω_{1} \in A_{1} \sum P ({ω_{1}}) \cdot 1 = P_{1} (A_{1})$

Allo stesso modo, se $B$ è un evento deciso osservando solo $Ω_{2}$ , ovvero $B = Ω_{1} \times B_{2}$ con $B_{2} \subseteq Ω_{2}$ , si ha $P (B) = P_{2} (B_{2})$

Nello schema di probabilità prodotto, se $A$ è deciso solo da $Ω_{1}$ e $B$ è deciso solo da $Ω_{2}$ , allora $P (A \cap B) = P (A) \cdot P (B)$

$A_{1} \subseteq Ω_{1}, A = A_{1} \times Ω_{2} B_{2} \subseteq Ω_{2}, B = Ω_{1} \times B_{2} P (A \cap B) = P ((A_{1} \times Ω_{2}) \cap (Ω_{1} \times B_{2})) = = P (A_{1} \times B_{2}) = P_{1} (A_{1}) \cdot P_{2} (B_{2}) = = P (A) \cdot P (B)$

In generale, dato lo schema probabilistico $(Ω, P)$ , due eventi $A, B \subseteq Ω$ sono indipendenti quando $P (A \cap B) = P (A) \cdot P (B)$

Indipendenza di 3 eventi

Dato lo schema probabilistico $(Ω, P)$ , siano $A, B, C \subseteq Ω$ ci sono almeno due modi di definire l'indipendenza di 3 eventi:

Indipendenza a coppie, per cui
- $P (A \cap B) = P (A) \cdot P (B)$
- $P (A \cap C) = P (A) \cdot P (C)$
- $P (B \cap C) = P (B) \cdot P (C)$
Indipendenza a terna, per cui
- $P (A \cap B \cap C) = P (A) \cdot P (B) \cdot P (C)$

Si dimostra che l'indipendenza di tipo $1$ non implica $2$ e l'indipendenza di tipo $2$ non implica $1$ , per questo, quando si darà per assunta l'ipotesi di indipendenza si considerano sia l'indipendenza a coppie sia l'indipendenza a terna come vere.

Indipendenza di $n$ eventi

Sia $(Ω, P)$ uno schema probabilistico, $A_{1}, A_{2}, ..., A_{n} \subseteq Ω$ sono indipendenti quando presa una qualunque sottofamiglia di $A_{1}, ..., A_{n}$ la probabilità $P (A_{1} \cap ... \cap A_{n}) = P (A_{1}) \cdot ... \cdot P (A_{n})$

$\forall k, \forall (1 \leq i_{1} < i_{2} < ... < i_{k} \leq n) P (A_{i_{1}} \cap A_{i_{2}} \cap ... \cap A_{i_{k}}) = P (A_{i_{1}}) \cdot P (A_{i_{2}}) \cdot ... \cdot P (A_{i_{k}})$

Schema di Bernoulli

Lo schema di Bernoulli rappresenta un esperimento binario ripetutto $n$ volte:

tradizionalmente $n$ lanci di moneta non necessariamente equa
la trasmissione di $n$ bit su un cavo con disturbi (1 mi indica che il bit è arrivato corretamente, 0 altrimenti), schema in cui la probabilità di errore (di 0) deve essere bassa

Si codificano i risultati con ${0, 1}$ (per comodità dei calcoli successivamente)

$Ω = {(ω_{1}, ..., ω_{2}) ∣ ω_{i} \in {0, 1}} = = {0, 1} \times {0, 1} \times ... \times {0, 1} = {0, 1}^{n}$

La scelta naturale è la probabilità prodotto su singolo lancio, dove $p$ indica la "truccatura" della moneta

$Ω_{1} = {0, 1} P ({0}) = 1 - p P ({1}) = p$

Lo schema si indica con $(n, p)$

n	$Ω$
1	${0, 1}$
2	${00, 01, 10, 11}$
3	${000, 001, 010, 011, 100, 101, 110, 111}$

$n = 1$

$P ({0}) = 1 - p P ({1}) = p$

$n = 2$

$P ({00}) = (1 - p)^{2} P ({01}) = (1 - p) \cdot p P ({10}) = p \cdot (1 - p) P ({11}) = p^{2}$

$n = 3$

$P ({000}) = (1 - p)^{3} P ({001}) = (1 - p)^{2} \cdot p P ({010}) = (1 - p)^{2} \cdot p P ({011}) = (1 - p) \cdot p^{2} P ({100}) = (1 - p)^{2} \cdot p P ({101}) = (1 - p) \cdot p^{2} P ({110}) = (1 - p) \cdot p^{2} P ({111}) = p^{3}$

In generale

$P ({ω_{1}, ..., ω_{n}}) = p^{i = 1 \sum n ω_{i}} \cdot (p - 1)^{1 - i = 1 \sum n ω_{i}}$

Probabilità condizionata

Sia $(Ω, P)$ uno schema probabilistico e $A, B \subseteq Ω$ eventi, si definisce la probabilità condizionata come:

$P (B ∣ A) := \frac{P ( A \cap B )}{P ( A )}$

Sia $P$ la probabilità uniforme, allora

$P (B ∣ A) = \frac{P ( A \cap B )}{P ( A )} = \frac{\frac{∣ A \cap B ∣}{∣Ω∣}}{\frac{∣ A ∣}{∣Ω∣}} = \frac{∣ A \cap B ∣}{∣ A ∣}$

Probabilità composte

Sia $(Ω, P)$ uno schema probabilistico, e siano $A_{1}, A_{2}, ..., A_{k} \subseteq Ω$ eventi

$P (i = 1 ⋂ k A_{i}) = = P (A_{1}) P (A_{2} ∣ A_{1}) P (A_{3} ∣ A_{2} \cap A_{1}) \dots P (A_{k} ∣ i = 1 ⋂ k - 1 A_{i}) = = P (A_{1}) \frac{P ( A _{2} \cap A _{1} )}{P ( A _{1} )} \frac{P ( A _{3} \cap A _{2} \cap A _{1} )}{P ( A _{2} \cap A _{1} )} \dots \frac{i = 1 ⋂ k A _{i}}{P ( i = 1 ⋂ k - 1 A _{i} )} = = P (i = 1 ⋂ k A_{i})$

Probabilità totali

Sia $(Ω, P)$ uno schema probabilistico, e siano $D_{1}, D_{2}, ..., D_{n} \subseteq Ω ∣ i = 1 ⋃ n D_{i} = Ω \land i \neq = j ⟹ D_{i} \neq = D_{j}$

Formula di Bayes

Variabili aleatorie

Sia $(Ω, P)$ uno schema probabilistico, $X : Ω \to R$ è una variabile aleatoria. Per distribuzione di una variabile aleatoria s'intende l'istogramma che rappresenta le probabilità dei valori in $ℑ (X)$

La probabilità $P$ su $Ω$ induce una probabilità su $ℑ (X)$ tramite $μ_{x}$ , per cui $μ_{x} ({x}) = P ({ω \in Ω ∣ X (ω) = x}), x \in ℑ (X)$ , equivalentemente $μ_{x} ({x}) = P (X^{- 1} ({x})), x \in ℑ (X) ⟹ μ_{x} = P \circ X^{- 1}$ , per cui $μ_{x}$ è la distrubuzione di $X$

Valore di attesa

Linearità

Varianza

Covarianza

Variabile aleatoria geometrica

Usata tipicamente in teoria dell'affidabilità. Si ipotizza un macchinario che opera a cicli, e la trovi anche qui.

Siano

$p$ la probabilità di rottura ad ogni ciclo
$X$ il tempo di rottura (dopo quanti cicli si è rotto)

Per cui $Ω = {0, 1}^{N}$ , quindi la v.a. geometrica è uno schema di Bernoulli con infiniti lanci e $P$ probabilità prodotto, $X \sim Geom (p), p \in (0, 1)$

Distribuzione

$ℑ (X) = {1, 2, 3, ..., n, ...} = N ∖ {0} = N^{*} P (X = k) = P ({(ω_{1}, ω_{2}, ..., ω_{k}) ∣ ω_{1} = ω_{2} = \dots = ω_{k - 1} = 0 \land ω_{k} = 1}) = (1 - p)^{k - 1} p$

Valore di attesa

$E (X) = \frac{1}{p}$

Dimostrazione

$E (X) = k = 1 \sum \infty E (X_{i} = k) = k = 0 \sum \infty E (X_{i} = k) = k = 0 \sum \infty k (1 - p)^{k - 1} p = p k = 0 \sum \infty k (1 - p)^{k - 1} = p k = 0 \sum \infty \frac{d}{d ( 1 - p )} (1 - p)^{k} = = p \frac{d}{d ( 1 - p )} k = 0 \sum \infty (1 - p)^{k} = p \frac{d}{d ( 1 - p )} k = 0 \sum \infty (1 - p)^{k} = = p \frac{d}{d ( 1 - p )} (\frac{1}{1 - ( 1 - p )}) = p (\frac{1}{p})^{2} = \frac{1}{p}$

Varianza

$V (X) = \frac{( 1 - p )}{p ^{2}}$

Perdita di memoria

Sia $G (n) = (1 - p)^{n}$ la funzione di sopravvivenza tale che

$G (n) = P (la macchina sopravvive i primi n cicli) = = P (X > n) = k = n + 1 \sum \infty P (X = k) = k = n + 1 \sum \infty (1 - p)^{k - 1} p ⟹ ⟹ h = k - n - 1 ⟹ h = 0 \sum \infty (1 - p)^{h + n} p = = p (1 - p)^{n} h = 0 \sum \infty p (1 - p)^{h} = (1 - p)^{n}$

Ora possiamo determinare che $P (X = n + l ∣ X > n) = P (X = l)$ , dato che

$P (X = n + l ∣ X > n) = = \frac{P ( X = n + l , X > n )}{P ( X > n )} = \frac{P ( X = n + l )}{P ( X > n )} = \frac{( 1 - p ) ^{n + l - 1} p}{( 1 - p ) ^{n}} = = (1 - p)^{l - 1} p = P (X = l)$

Variabile aleatoria binomiale negativa

Variabile aleatoria di Poisson

Usata spesso in teoria delle code, e la trovi anche qui.

Sia $Δ t$ un intervallo piccolo di tempo, si ha che la probabilità

$P (cliente arrivi nell’intervallo Δ t) = λ Δ t + o (Δ t), λ > 0$

Arrivi in intervalli di tempo disgiunti sono indipendenti. Ora si consideri l'intervallo di tempo unitario $[0, 1] = i = 1 ⋃ n [\frac{i - 1}{n}, \frac{i}{n}]$ , quindi suddiviso in $n$ intervalli

X_i = \text{# di arrivi nell'intervallo } \biggl(\frac{i-1}{n}, \frac{i}{n}\biggl) = \begin{cases} 1 & \text{con probabilita' } & \frac{\lambda}{n} \ 0 & \text{con probabilita' } & 1 - \frac{\lambda}{n} \end{cases} \implies \ \implies X_i \sim \text{Bernoulli}\biggl(\frac{\lambda}{n}\biggl)

Per cui

X^{(n)} = \text{# di arrivi nell'intervallo } [0, 1] = \sum\limits_{i = 1}^n X_i \implies X^{(n)} \sim \text{Bin}\biggl(n, \frac{\lambda}{n}\biggl)

Teorema di Poisson

Siano $λ \in (0, 1), X^{(n)} \sim B in (n, \frac{λ}{n})$ , con $n$ grande abbastanza. Fissando $k = 0, 1, ..., n$ , allora $n \to \infty lim P (X^{(n)} = k) = \frac{e ^{- λ} λ ^{k}}{k !}$

TODO: Dimostrazione

Distribuzione

$P (X^{(n)} = k) = \frac{e ^{- λ} λ ^{k}}{k !}$

TODO: Dimostrazione distribuzione

Valore di attesa

$E (X) = λ$

Dimostrazione

$E (X) = k = 0 \sum \infty k P (X = k) = k = 1 \sum \infty k P (X = k) = k = 1 \sum \infty k \frac{e ^{- λ} λ ^{k}}{k !} = e^{- λ} k = 1 \sum \infty \frac{λ ^{k}}{( k - 1 )!} h = k - 1 ⟹ e^{- λ} h = 0 \sum \infty \frac{λ ^{h + 1}}{h !} = λ e^{- λ} h = 0 \sum \infty \frac{λ ^{h}}{h !} = λ e^{- λ} e^{λ} = λ$

Varianza

$V (X) = λ$

Dimostrazione

$V (X) = n \to \infty lim V (X^{(n)}) = n \to \infty lim n \frac{λ}{n} (1 - \frac{λ}{n}) = n \to \infty lim λ - \frac{λ ^{2}}{n} = λ$

Complementare

TODO

Variabile aleatoria multinomiale

Distribuzione marginale con $k = 3$

Distribuzione di $X_{1}$ condizionata a $X_{2} = n_{2}$

Valore di attesa di $X_{1}$

Varianza di $X_{1}$

Covarianza di $X_{1}$

Dovrebbe venire < 0

Problema degli accoppiamenti

Dato $(Ω, P)$ uno schema probabilistico. Sia $f : {1, 2, ..., n} \to {1, 2, ..., n}$ biettiva, allora $i \in {1, 2, ..., n}$ è un punto fisso se $f (i) = i$ , calcolare la probabilità che una permutazione casuale non abbia punti fissi.

$Ω = {(ω_{1}, ω_{2}, ..., ω_{n}) ∣ ω_{i} \in {1, 2, ..., n} \land i \neq = j ⟹ ω_{i} \neq = ω_{j}} ⟹ ∣Ω∣ = n!$
$P$ è la probabilità uniforme su $Ω$

Codifichiamo l'evento di cui si vuole calcolare la probabilità

$E = {(ω_{1}, ω_{2}, ..., ω_{n}) ∣ ω_{i} \in {1, 2, ..., n} \land i \neq = j ⟹ ω_{i} \neq = ω_{j} \land ω_{i} \neq = i}$

Questo è un problema per cui il passaggio a complemento si pone come una buona soluzione, per cui

$A = E^{∁} = {(ω_{1}, ω_{2}, ..., ω_{n}) ∣ ω_{i} \in {1, 2, ..., n} \land \exists i \in {1, 2, ..., n} : ω_{i} = i}$

Ora è possibile scrivere

$A = k = 1 ⋃ n A_{k} ∣ A_{k} = {(ω_{1}, ω_{2}, ..., ω_{n}) ∣ ω_{i} \in {1, 2, ..., n} \land i \neq = j ⟹ ω_{i} \neq = ω_{j} \land ω_{k} = k}$

Dato che, per simmetria, gli eventi $A_{k}$ hanno la stessa cardinalità, basta calcolare $∣ A_{1} ∣$

$A_{1} = {(ω_{1}, ω_{2}, ..., ω_{n}) ∣ ω_{i} \in {1, 2, ..., n} \land i \neq = j ⟹ ω_{i} \neq = ω_{j} \land ω_{1} = 1} ⟹ ⟹ ∣ A_{1} ∣ = (n - 1)! ⟹ P (A_{1}) = \frac{∣ A _{1} ∣}{∣Ω∣} = \frac{( n - 1 )!}{n !} = \frac{1}{n}$

Il problema è che gli eventi $A_{1}, A_{2}, ..., A_{n}$ non sono disgiunti, quindi bisogna usare il PIE per calcolare la cardinalità dell'unione, per cui servono le intersezioni

$A_{1} \cap A_{2} = = {(ω_{1}, ω_{2}, ..., ω_{n}) ∣ ω_{i} \in {1, 2, ..., n} \land i \neq = j ⟹ ω_{i} \neq = ω_{j} \land ω_{1} = 1 \land ω_{2} = 2} ⟹ ⟹ ∣ A_{1} \cap A_{2} ∣ = (n - 2)! ⟹ ⟹ P (A_{1} \cap A_{2}) = \frac{∣ A _{1} \cap A _{2} ∣}{∣Ω∣} = \frac{( n - 2 )!}{n !} = \frac{1}{n ( n - 1 )}$

È possibile generalizzare per l'intersezione di $A_{1}, A_{2}, ..., A_{μ}$ eventi, con $μ \leq n$

$A_{μ} = 1 \leq l_{1} < l_{2} < \dots < l_{μ} \leq n ⋂ A_{l_{i}} = {(ω_{1}, ω_{2}, ..., ω_{n}) ∣ ω_{i} \in {1, 2, ..., n} \land i \neq = j ⟹ ω_{i} \neq = ω_{j} \land ω_{l_{i}} = l_{i}} ⟹ ⟹ ∣ A_{μ} ∣ = (n - μ)! ⟹ ⟹ P (A_{μ}) = \frac{∣ A _{μ} ∣}{∣Ω∣} = \frac{( n - μ )!}{n !} = \frac{1}{n ( n - 1 ) \dots ( n - μ + 1 )}$

Ora è possibile usare il PIE per ricavare la sommatoria completa

$P (A) =$

TODO: completare

Legge dei grandi numeri

Disuguaglianza di Markov

Disuguaglianza di Čebyšëv

Legge dei grandi numeri

Metodo Montecarlo

Databases

Download the introduction presentation

View the presentation on proofs or download it. The links refer to the slides of prof. Perelli and don't work on the website: you have to download the pdf in the same folder with the course's slides for the links to work.

Schema

An attribute is a $(name, domain)$ pair; we can define the $d o m ()$ function on a set of names, which associates to each name a specific domain (different attributes can have the same domain)

$d o m : {name_{1}, ..., name_{n}} name_{i} \to {domain_{1}, ..., domain_{k}} \mapsto domain_{j}$

PDF 7 slide 2

A relation schema $R = {A_{1}, A_{2}, ..., A_{n}}$ is a set of attributes

Tuples & instances

PDF 7 slide 3 Given a relation schema $R = A_{1} A_{2} ... A_{n}$ , a tuple $t$ on $R$ is a function such that

$t : R A_{i} \to i = 1 ⋃ n d o m (A_{i}) \mapsto a \in d o m (A_{i})$

Given a relation schema $R$ , a subset $X \subseteq R$ and $t$ a tuple on $R$ , the reduction of $t$ on $X$ is defined as

$t [X] = {(A, t [A]) ∣ A \in X}$

PDF 7 slide 4 Given a relation schema $R$ , a subset $X \subseteq R$ and $t_{1}, t_{2}$ tuples on $R$

$t_{1} [X] = t_{2} [X] ⟺ t_{1} [A] = t_{2} [A] \forall A \in X$

PDF 7 slide 5 Given a relation schema $R$ and $t_{1}, t_{2}, ..., t_{k}$ tuples on $R$ , a set $r = {t_{1}, t_{2}, ..., t_{k}}$ is an instance of $R$

Functional dependencies

PDF 7 slide 6

Given a relation schema $R$ and $X, Y \in P (R) ∖ {\emptyset}$ we have that $(X, Y)$ is a functional dependency on $R$ (noted as $X \to Y$ )

PDF 7 slide 7

Given a relation schema $R$ and a functional dependency $X \to Y$ defined on $R$ we say that an instance $r$ of $R$ satisfies the functional dependency $X \to Y$ if

$\forall t_{1}, t_{2} \in r t_{1} [X] = t_{2} [X] ⟹ t_{1} [Y] = t_{2} [Y]$

Instance legality & closure

PDF 7 slide 14

Given a relation schema $R$ and a set $F$ of functional dependencies defined on $R$ , an instance $r$ of $R$ is legal if it satisfies every dependency in $F$

$\forall X \to Y \in F \forall t_{1}, t_{2} \in r t_{1} [X] = t_{2} [X] ⟹ t_{1} [Y] = t_{2} [Y]$

PDF 7 slide 20

Given a relation schema $R$ and a set $F$ of functional dependencies defined on $R$ , the closure of $F$ is the set of functional dependencies that are satisfied by every legal instance of $R$

$F^{+} = {V \to W ∣ \forall legal r of R, r satisfies V \to W}$

$V \to W$ doesn't necessarily have to be in $F$

$F \subseteq F^{+}$

PDF 7 slide 21

$F \subseteq F^{+}$

Proof

$F^{+} = {V \to W ∣ \forall legal r of R, r satisfies V \to W}$

By definition $r$ is legal if it satisfies every dependency $X \to Y \in F ⟹$ given $X \to Y \in F$ , every legal instance of $R$ satisfies $X \to Y ⟹ X \to Y \in F^{+}$

Keys

PDF 7 slide 22

Given a relation schema $R$ and a set $F$ of functional dependencies on $R$ , $K \subseteq R$ is a key of $R$ if

$K \to R \in F^{+}$
$\forall K^{'} \subset K, K^{'} \to R \in / F^{+}$

$" \subset "$ means proper subset, which implies that $K \neq = K^{'}$

Trivial dependencies

PDF 7 slide 26

Given a schema $R$ and $X, Y \in P (R) ∖ {\emptyset} : Y \subseteq X$ , we have that every instance $r$ of $R$ satisfies the dependency $X \to Y$

Proof

Given an instance $r$ of $R, \forall t_{1}, t_{2} \in r$ we have that

$t_{1} [X] = t_{2} [X] ⟹$ by definition $t_{1} [A] = t_{2} [A] \forall A \in X ⟹$ as $Y \subseteq X$ we have that
$t_{1} [A^{'}] = t_{2} [A^{'}] \forall A^{'} \in Y ⟹$ by definition $t_{1} [Y] = t_{2} [Y]$

As $t_{1} [X] = t_{2} [X] ⟹ t_{1} [Y] = t_{2} [Y]$ we have that $r$ satisfies $X \to Y$

Decomposition

PDF 7 slide 27

Given a schema $R$ and a set of functional dependencies $F$ on $R$ , we have that

$X \to Y \in F^{+} ⟺ X \to A \in F^{+} \forall A \in Y$

Proof

$X \to Y \in F^{+} ⟹ \forall legal r of R \forall t_{1}, t_{2} \in r t_{1} [X] = t_{2} [X] ⟹ t_{1} [Y] = t_{2} [Y] ⟹ t_{1} [A] = t_{2} [A] \forall A \in Y ⟹ X \to A \in F^{+} \forall A \in Y$

$X \to A \in F^{+} \forall A \in Y ⟹ \forall legal r of R \forall t_{1}, t_{2} \in R t_{1} [X] = t_{2} [X] ⟹ t_{1} [A] = t_{2} [A] \forall A \in Y ⟹ t_{1} [Y] = t_{2} [Y] ⟹ X \to Y \in F^{+}$

$F^{A}$

PDF 8 slide 3 $F^{A}$ is a set of functional dependencies on $R$ such that

$X \to Y \in F ⟹ X \to Y \in F^{A}$
$Y \subseteq X \in R ⟹ X \to Y \in F^{A}$ (refelxivity)
$\forall Z \in R, X \to Y \in F^{A} ⟹ ZX \to Z Y \in F^{A}$ (augmentation)
$X \to Y, Y \to Z \in F^{A} ⟹ X \to Z \in F^{A}$ (transitivity)

PDF 8 slide 6 derivates

$X \to Y, X \to Z \in F^{A} ⟹ X \to Y Z \in F^{A}$ (union)
$X \to Y \in F^{A} \land Z \subseteq Y ⟹ X \to Z \in F^{A}$ (decomposition)
$X \to Y, WY \to Z \in F^{A} ⟹ W X \to Z \in F^{A}$ (pseudotransitivity)

PDF 8 slide 8 $X \to A_{1} A_{2} ... A_{n} \in F^{A} ⟺ \forall i = 1, ..., n X \to A_{i} \in F^{A}$

Derivates (Proofs)

Union

$(X)_{F}^{+}$ LaTeX

$X \to Y, X \to Z \in F^{A} ⟹$ by augmentation $X \to X Y, X Y \to Y Z \in F^{A} ⟹$ by transitivity $X \to Y Z \in F^{A}$

Decomposition

$X \to Y \in F^{A} \land Z \subseteq Y ⟹ Y \to Z \in F^{A} ⟹$ by transitivity $X \to Z \in F^{A}$

Pseudotransitivity

$X \to Y, WY \to Z \in F^{A} ⟹$ by augmentation $W X \to WY \in F^{A} ⟹$ by transitivity $W X \to Z \in F^{A}$

$(X)_{F}^{+}$

$([P D F 8 s l i d e 9] (08 G i v e na re l a t i o n sc h e ma$ R $, a se t$ F $o fd e p e n d e n c i eso n$ R $an d$ X \subseteq R $. T h e * * c l os u re * * o f$ X $w i t h res p ec tt o$ F $, d e n o t e d$ (X)^+_F $i s d e f in e d a s$ $(X)_{F}^{+} = {A \in R ∣ X \to A \in F^{A}}$ $W e ha v e t ha t$ X \subseteq (X)^+_F

Proof

\forall A \in X $b yre f l e x i v i t y$ X \to A \in F^A \implies $b y d e f ini t i o n$ A \in (X)^+_F \implies X \subseteq (X)^+_F $> W ec an u se A r m s t ro n g^{'} s a x i o m s a s$ (X)^+_F $i s d e f in e d o f$ F^A $> NOTE :$ (X)^+_F $i s * * NOT * * d e f in e d o n$ F^+

Lemma of closure

PDF 8 slide 10

Let R $b e a sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $X \to Y \in F^{A} ⟺ Y \subseteq (X)_{F}^{+}$

Proof

X \to Y \in F^A \implies $b y d eco m p os i t i o n$ X \to A \in F^A ; \forall A \in Y \implies $b y d e f ini t i o n$ A \in (X)^+_F ; \forall A \in Y \implies Y \subseteq (X)^+_F $>$ Y \subseteq (X)^+_F \implies A \in (X)^+_F ; \forall A \in Y \implies $b y d e f ini t i o n$ X \to A \in F^A ; \forall A \in Y \implies $b y u ni o n$ X \to Y \in F^A

F^+ = F^A $[P D F 8 s l i d e 11] (08 L e t$ R $b e a re l a t i o n sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $t h e n$ F^+ = F^A

Proof

Let F_i $b e t h e v a l u eo f$ F $a f t er t h e$ i $- t ha ppl i c a t i o n o f an A r m s t ro n g^{'} s a x i o m, w i t h$ F_0 = F

F^A \subseteq F^+

Base case

F_0 = F \subseteq F^+ \implies F_0 \subseteq F^+

Inductive step

F_i \subseteq F^+ \implies F_{i + 1} \subseteq F^+ $L e t$ X \to Y \in F_{i + 1} $, e i t h er -$ X \to Y \in F_i \implies $b yH P$ X \to Y \in F^+ $-$ X \to Y \in F_{i + 1} \setminus F_i $, w hi c hm e an s t ha t$ X \to Y has been optained through one of the axioms

F^A \subseteq F^+

Reflexivity

Y \subseteq X \implies $g i v e n t ha t$ X \to Y $i ss a t i s f i e d b ye v ery in s t an ce$ X \to Y \in F^+

Augmentation

Z \subseteq R, X = ZV, Y = ZW \land V \to W \in F_i $g i v e n$ t_1, t_2 \in r $l e g a l in s t an ceo f$ R $w e ha v e t ha t$ t_1[X] = t_2[X] \implies (t_1[V] = t_2[V] \implies $b yH P$ t_1[W] = t_2[W]) \land t_1[Z] = t_2[Z] \implies t_1[Y] = t_2[Y]

Transitivity

X \to Z, Z \to Y \in F_i \implies $b yH P$ \forall \text{ legal } r \text{ of } R, \forall t_1, t_2 \in r, t_1[X] = t_2[X] \implies t_1[Z] = t_2[Z] \implies t_1[Y] = t_2[Y] $w e ha v e t ha t$ t_1[X] = t_2[X] \implies t_1[Y] = t_2[Y] \implies X \to Y \in F^+

F^+ \subseteq F^A $_{(} l e g a l in s t an ce)_{G} i v e n$ X \subseteq R $w ec anb u i l d anin s t an ce$ r = \set{t_1, t_2} $o n$ R $s u c h t ha t < t ab l e >< t h e a d >< t r >< t h >$ r $< / t h >< t h co l s p an = "5" >$ (X)^+_F $< / t h >< t h co l s p an = "5" >$ R \setminus (X)^+_F $< / t h >< / t r >< / t h e a d >< t b o d y >< t r >< t d >$ t_1 $< / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< / t r >< t r >< t d >$ t_2 $< / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< t d > 0 < / t d >< t d > 0 < / t d >< t d > 0 < / t d >< t d > ... < / t d >< t d > 0 < / t d >< / t r >< / t b o d y >< / t ab l e > L e t^{'} s v er i f y t ha t$ r $i s a l e g a l in s t an ce . G i v e n$ V \to W \in F $, a s$ V, W \neq \varnothing $b y d e f ini t i o n, w eco u l d ha v e -$ V \nsubseteq (X)^+_F \implies \exists A \in V : A \in R \setminus (X)^+_F \implies t_1[V] \neq t_2[V] \implies r $s a t i s f i es$ V \to W $-$ V \subseteq (X)^+_F $, w eco u l d ha v e t ha t -$ W \subseteq (X)^+_F \implies t_1[V] = t_2[V] \land t_1[W] = t_2[W] \implies r $s a t i s f i es$ V \to W $-$ W \nsubseteq (X)^+_F \implies \exists A \in W : A \in R \setminus (X)^+_F \implies t_1[V] = t_2[V] \land t_1[W] \neq t_2[W]

F^+ \subseteq F^A $_{(} l e g a l in s t an ce)_{I} n t h e l a s t c a se$ r $d oes n^{'} t s a t i s f y$ V \to W $, so w e ha v e t os h o wt ha t i t c a n^{'} t ha pp e n . L e t^{'} ss u pp ose t ha t$ \exists V \to W \in F $s u c h t ha t$ r $d oes n^{'} t s a t i s f y$ V \to W $; b yco n s t r u c t i o n w e ha v e t ha t$ $V \subseteq (X)_{F}^{+} \land \exists A \in W : A \in R ∖ (X)_{F}^{+} ⟹ A \in / (X)_{F}^{+}$ $W e ha v e t ha t -$ V \subseteq (X)^+_F \implies $b y t h e l e mma o f c l os u re$ X \to V \in F^A $-$ A \in W \implies $b y d eco m p os i t i o n$ V \to A \in F^A $B y t r an s i t i v i t y$ X \to A \in F^A \implies $b y t h e l e mma o f c l os u re$ A \in (X)^+_F which is a contraddiction

Legality

In the first 2 cases r $s a t i s f i es$ V \to W \in F $, c a se 3 c a n^{'} t ha pp e n$ \implies r $i s a l e g a l in s t an ceo f$ R

F^+ \subseteq F^A $L e t^{'} sco n s i d er$ X \to Y \in F^+ $B y d e f ini t i o n w e ha v e t ha t$ X \subseteq (X)^+_F \implies $b yco n s t r u c t i o n$ t_1[X] = t_2[X] \implies $b y h y p o t es i s an d g i v e n t ha t$ r $i s a l e g a l in s t an ce$ t_1[Y] = t_2[Y] \implies $b y t h e l e mma$ Y \subseteq (X)^+_F \implies X \to Y \in F^A

F^+ = F^A $G i v e n t ha t$ F_i \subseteq F^+ : \forall i \in \mathbb{N} $an d$ F^+ \subseteq F^A $w e ha v e t ha t$ F^+ = F^A

3NF

PDF 9 slide 14

Given a relation schema R $an d a se t o ff u n c t i o na l d e p e n d e n c i es$ F $o n$ R $.$ R $i s in * * 3 NF * * i f$ \forall X \to A \in F^+ : A \notin X $e i t h er -$ A $i s p r im e_{(} b e l o n g s t o ak ey)_{-}$ X is superkey

3NF pt.2

PDF 10 slide 4

Let R $b e a re l a t i o n sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $A na tt r ib u t e$ A \in R $* * p a r t ia ll y * * d e p e n d so nak ey$ K $i f <! - - TO D O c h ec ki f s u b se t e q ? NOT n ee d e d b ec a u se X i s a p ro p ers u b se t o f K - - > -$ \exists X \subset R : A \notin X \land X \to A \in F \land X \subset K $-$ A $i s n^{'} tp r im e A na tt r ib u t e$ A \in R $* * t r an s i t i v e l y * * d e p e n d so nak ey$ K $i f -$ \exists X \subset R : A \notin X \land X \to A \in F \land K \to X \in F $-$ X $i s n^{'} t ak ey -$ A $i s n^{'} tp r im e >$ X \subset R $m e an s t ha t$ X \neq R $, o t h er w i se Xw o u l d b e a s u p er k ey, a s$ R \to R \in F^A = F^+

3NF pt.3

PDF 10 slide 5

Given a schema R $an d a se t o ff u n c t i o na l d e p e n d e n c i es$ F $o n$ R $, * * TF A E * * -$ R $i s in 3 NF - t h ere a re * * n o a tt r ib u t es t ha tp a r t ia ll yor t r an s i t i v e l y d e p e n d o nak ey * * -$ \forall X \to A \in F^+ : A \notin X $e i t h er : -$ A $i s p r im e_{(} b e l o n g s t o ak ey)_{-}$ X is superkey

Proof

TODO: I have it, I just have to write it out in \LaTeX $- - - <! - - P D F 10 s l i d e 6 - - ><! - - L e t$ R $b e a re l a t i o n sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $. A sc h e ma$ R $i s in 3 NF i f an d o n l y i f n e i t h er p a r t ia l d e p e n d e n c i es n or t r an s i t i v e d e p e n d e n c i ese x i s t in$ F -->

BCNF (Boyce-Codd)

PDF 10 slide 20

A relation schema R $i s in * * B oyce - C o dd N or ma lF or m * * (BCNF) w h e n e v ery d e t er minan t in$ F is a superkey. A relation that respects Boyce-Codd Normal Form is also in 3NF, but the opposite is not true.

(X)^+_F

PDF 11 slie 5

def clousure(R, F, X):
	Z = X
	S = { A ∈ R | Y → V ∈ F ∧ Y ⊆ Z ∧ A ∈ V }

	if S ⊆ Z:
		return Z

	return closure(R, F, Z ∪ S)

(X)^+_F $[P D F 11 s l i d e 8] (11 T h e a l g or i t hm ‘ c l os u re () ‘ correc tl yco m p u t es t h ec l os u reo f a se t o f a tt r ib u t es$ X $res p ec t i v e l y t o a se t$ F $o ff u n c t i o na l d e p e n d e n c i eso n$ R

Proof

Let's consider Z_i, S_i $t h e v a l u eso f$ Z $an d$ S $a tt h e$ i $- t h c a ll o f t h e f u n c t i o nan d$ Z_f, S_f \mid S_f \subseteq Z_f $t h e v a l u eso f$ Z, S $a tt h e l a s t c a ll o f t h e f u n c t i o n . L e t^{'} s p ro v e b y in d u c t i o n t ha t$ Z_i \subseteq (X)^+_F

Z_i \subseteq (X)^+_F

Base case

Inductive step Z_i \subseteq (X)^+F \implies Z{i + 1} \subseteq (X)^+F $<! - - W e ha v e t o p ro v e t ha t$ Z_i \subseteq (X)^+F \implies Z{i + 1} \subseteq (X)^+F $- - ><! - - an d$ S_i = \set{A \in R \mid Y \to V \in F \land Y \subseteq Z_i \land A \in V} $- - > G i v e n t ha t$ Z{i + 1} = Z_i \cup S_i $t h e ni f$ A \in Z{i + 1} $e i t h er -$ A \in Z_i \implies $b yH P$ A \in (X)^+_F $-$ A \in S_i \implies $b yco n s t r u c t i o n$ \exists Y \to V \in F \mid Y \subseteq Z_i \land A \in V \implies $b yH P$ Y \subseteq (X)^+_F \implies $b y t h e l e mma o f c l os u re$ X \to Y \in F^A $an d b y d eco m p os i t i o n$ Y \to A \in F^A \implies $b y t r an s i t i v i t y$ X \to A \in F^A \implies $b y d e f ini t i o n$ A \in (X)^+_F

(X)^+_F \subseteq Z_f $_{(} l e g a l in s t an ce)_{G} i v e n$ Z_f $w ec anb u i l d anin s t an ce$ r = \set{t_1, t_2} $o n$ R $s u c h t ha t < t ab l e >< t h e a d >< t r >< t h >$ r $< / t h >< t h co l s p an = "5" >$ Z_f $< / t h >< t h co l s p an = "5" >$ R \setminus Z_f $< / t h >< / t r >< / t h e a d >< t b o d y >< t r >< t d >$ t_1 $< / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< / t r >< t r >< t d >$ t_2 $< / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< t d > 0 < / t d >< t d > 0 < / t d >< t d > 0 < / t d >< t d > ... < / t d >< t d > 0 < / t d >< / t r >< / t b o d y >< / t ab l e > L e t^{'} s v er i f y t ha t$ r $i s a l e g a l in s t an ce . G i v e n$ V \to W \in F $a s$ V, W \neq \varnothing $w eco u l d ha v ee i t h er -$ V \nsubseteq Z_f \implies \exists A \in V : A \in R \setminus Z_f \implies t_1[V] \neq t_2[V] \implies r $s a t i s f i es$ V \to W $-$ V \subseteq Z_f $-$ W \subseteq Z_f \implies $b yco n s t r u c t i o n$ t_1[V] = t_2[V] \land t_1[W] = t_1[W] \impliesr $s a t i s f i es$ V \to W $-$ W \nsubseteq Z_f \implies $b yco n s t r u c t i o n$ t_1[V] = t_2[V] \land t_1[W] \neq t_2[W]

(X)^+_F \subseteq Z_f $_{(} l e g a l in s t an ce)_{L} e t^{'} ss u pp ose t ha t$ \exists V \to W \in F : r $d oes n^{'} t s a t i s f y$ V \to W \implies $b yco n s t r u c t i o n$ $V \subseteq Z_{f} \land \exists A \in W : A \in R ∖ Z_{f} ⟹ A \in / Z_{f}$ $G i v e n t ha t$ V \subseteq Z_f \land V \to W \in F \land A \in W \implies $b yco n s t r u c t i o n o f$ S_f, : A \in Z_f which is a contraddiction

Legality

In the first 2 cases r $s a t i s f i es$ V \to W \in F $c a se 3 c a n^{'} t ha pp e n$ \implies r $i s a l e g a l in s t an ceo f$ R

(X)^+_F \subseteq Z_f $L e t^{'} sco n s i d er$ A \in (X)^+_F $G i v e n t ha t$ X \to A \in F^A = F^+ $an d$ r $i s a l e g a l in s t an ce$ \implies r $s a t i s f i es$ X \to Y $, an d g i v e n t ha t b yco n s t r u c t i o n$ X \subseteq Z_f \implies t_1[X] = t_2[X] \implies $b y d e f ini t i o n$ t_1[A] = t_2[A] \implies $b yco n s t r u c t i o n$ A \in Z_f

Z_f = (X)^+_F $G i v e n t ha t$ Z_i \subseteq (X)^+_F ; \forall i \in \mathbb{N} $an d$ (X)^+_F \subseteq Z_f $, w e ha v e t ha t$ Z_f = (X)^+_F

Intersection Rule

PDF 12 slide 19

Given a relation scheme R $an d a se t o ff u n c t i o na l d e p e n d e n c i es$ F $o n$ R $X := V \to W \in F ⋂ R - (W - V)$ $<! - - I f t h e in t ersec t i o n o f t h esese t s d e t er min es$ R $, t h e n t h e in t ersec t i o ni s t h eo n l y k ey t o$ R $e l se t h ere a re m u lt i pl e k eys, an d A LL o f t h e mm u s t b e i d e n t i f i e df orc h ec kin g 3 NF - - > I f$ X \to R \in F^+ $t h e n t h e in t ersec t i o ni s t h eo n l y k ey t o$ R $o t h er w i se t h ere a re m u lt i pl e k eys, an d * * A LL * * o f t h e mm u s t b e i d e n t i f i e d t oc h ec ki f$ R is in 3NF

Decomposition

PDF 13 slide 8

Let R $b e a re l a t i o n sc h e m e, a d eco m p os i t i o n$ \rho $o f$ R $i ss u c h t ha t$ $ρ = {R_{1}, R_{2}, ..., R_{k}} \subseteq P (R) : i = 1 ⋃ k R_{i} = R$

Equivalence

PDF 13 slide 12

Let F $an d$ G $b e tw ose t so ff u n c t i o na l d e p e n d e n c i es, w ec an d e f in e an e q u i v a l e n cere l a t i o n$ $F \equiv G ⟺ F^{+} = G^{+}$ $- re f l e x i v i t y$ F \implies F^+ = F^+ \implies F \equiv F $- s imm e t ry$ F \equiv G \implies F^+ = G^+ \implies G^+ = F^+ \implies G \equiv F $- t r an s i t i v i t y$ F \equiv G \land G \equiv H \implies F^+ = G^+ \land G^+ = H^+ \implies F^+ = H^+ \implies F \equiv H $[P D F 13 s l i d e 14] (13 L e t$ F $an d$ G $b e tw ose t so ff u n c t i o na l d e p e n d e n c i es$ $F \subseteq G^{+} ⟹ F^{+} \subseteq G^{+}$

F \subseteq G^+ \implies F^+ \subseteq G^+

Base case

F_0 = F \subseteq G^+ \implies F_0 \subseteq G^+

Inductive Step

F_i \subseteq G^+ \implies F_{i + 1} \subseteq G^+X \to Y \in F_{i + 1} \implies X \to Y $ha s b ee n o pt ain e d t h ro ug h - re f l e x i v i t y$ Y \subseteq X \implies $g i v e n t ha t$ X \to Y $i ss a t i s f i e d b ye v ery in s t an ce$ X \to Y \in G^+ $- a ug m e n t a t i o n$ \exists Z \subseteq R, V \to W \in F_i \mid X = ZV, Y = ZW

transitivity

TODO

Preserving F

PDF 13 slide 15

Let R $b e a re l a t i o n sc h e m e,$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $an d$ \rho = \set{R_1, R_2, ..., R_k} $a d eco m p os i t i o n o f$ R $, w es a y t ha t$ \rho $p rese v es$ F $i f$ $F \equiv G = i = 1 ⋃ k π_{R_{i}} (F)$ $Wh ere$ $π_{R_{i}} (F) = {X \to Y \in F^{+} ∣ X Y \subseteq R_{i}}$ $[P D F 13 s l i d e 16] (13 G i v e n t h e d e f ini t i o n o f$ G $, i tw i ll a lw a ys b e t ha t$ G \subseteq F^+ \implies G^+ \subseteq F^+ $so i t i se n o ug h t o v er i f y t ha t$ F \subseteq G^+

Dependency preservation

PDF 13 slide 17

def preserves_dependencies(R, F, ρ):
	for X → Y ∈ F:
		if Y ⊈ closure_G(R, F, ρ, X):
			return false

	return true

This algorithm is enough as we just have to check wether F \subseteq G^+ $G i v e n$ X \to Y \in F $w e ha v e t ha t$ X \to Y \in G^+ = G^A \iff $b y t h e l e mma o f c l os u re$ Y \subseteq (X)^+_G

(X)^+_G $‘‘‘ p y t h o n d e f c l o u s u r e_{G} (R, F, X, ρ) : Z = XS = \emptyset f or P \in ρ : S = S \cup (c l os u re (R, F, Z \cap P) \cap P) i f S \subseteq Z re t u r n Z re t u r n c l os u r e_{G} (R, F, Z \cup S) ‘‘‘ <! - - re t u r n X i f S \subseteq X e l sec l os u r e_{G} (R, F, X \cup S) - - > [P D F 13 s l i d e 23] (13$ R $b e a re l a t i o n sc h e ma,$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $an d$ \rho = \set{R_1, R_2, ..., R_k} $a d eco m p os i t i o n o f$ R $an d$ X \subseteq R $t h e a l g or i t hm ‘ c l os u r e_{G} () ‘ correc tl yco m p u t es$ (X)^+_G

Z_f \subseteq (X)^+_G $L e t$ Z_i, S_i $t h e v a l u eso f$ Z $an d$ S $a tt h e$ i $- t h c a ll o f t h e f u n c t i o n, w i t h$ Z_0 = X $, an d$ S_f \subseteq Z_f $<! - - t h e f ina l v a l u eso f$ Z $an d$ S -->

Base case

Z_0 = X \subseteq (X)^+_G \implies Z_0 \subseteq (X)^+_G by HP

Inductive step

Z_i \subseteq (X)^+G \implies Z{i+1} \subseteq (X)^+G $, g i v e n t ha t$ S_i = \bigcup\limits{j = 1}^k (Z_i \cap R_j)^+F \cap R_j $L e t$ A \in Z{i + 1} = Z_i \cup S_i \implies \exists j : A \in (Z_i \cap R_j) \cap R_j \implies Z_i \cap R_j \to A \in G^A $B yH Pw e ha v e t ha t$ Z_i \subseteq (X)^+_G \implies X \to Z_i \in G^A $, l e t$ Z_i = (Z_i \cap R_j) \cup V $b y d eco m p os i t i o n w e ha v e t ha t$ X \to Z_i \cap R_j \in G^A \implies $b y t r an s i t i v i t y$ X \to A \in G^A

X \subseteq Y \implies (X)^+_F \subseteq (Y)^+_FX \subseteq Y \implies Y \to X \in F^A $b yre f l e x i v i t y G i v e n$ A \in (X)^+_F \implies $b y t h e l e mma o f c l os u re$ X \to A \in F^A \implies $b y t r an s i t i v i t y$ Y \to A \in F^A \ \implies $b y t h e l e mma o f c l os u re$ A \in (Y)^+_F

(X)^+_G \subseteq Z_fX \subseteq Z_f \implies (X)^+_G \subseteq (Z_f)^+_G $, w e ha v e t o p ro v e t ha t$ Z_f = (Z_f)^+_G $L e t^{'} sco n s i d er$ A \in S' = \set{A \in R \mid V \to W \in G \land V \subseteq Z_f \land A \in W} \implies \exists V \to W \in G : V \subseteq Z_f \land A \in W \implies \exists R_j \in \rho : VW \subseteq R_j \implies V \subseteq Z_f \cap R_j \land A \in R_j \implies A \in (Z_f \cap R_j)^+_F \cap R_j \implies A \in S_f \implies A \in Z_f

Loseless join

r \subseteq m_{\rho}(r)t \in r \implies t[R_i] \in \pi_{R_i}(r) ; \forall R_i \in \rho $b y d e f ini t i o n$ \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) = \set {\bigcup\limits_{i = 1}^k p_i[R_i] \mid p_i[R_i] \in \pi_{R_i}(r) \land \bigcup\limits_{i = 1}^k p_i[R_i] \text{ is a function}}\forall t \in r, ; t = \bigcup\limits_{i = 1}^k t[R_i] $a s b y d e f ini t i o n o f$ \rho $w e ha v e t ha t$ R = \bigcup\limits_{i = 1}^k R_it \in r \implies t $i s a f u n c t i o nb y d e f ini t i o n$ t = \bigcup\limits_{i = 1}^k t[R_i] \in \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) = m_{\rho}(r) \implies t \in m_{\rho}(r) $<! - -$ \forall t \in r, ; t = \bigcup\limits_{i = 1}^k t[R_i] = \mathop{\bowtie}\limits_{i = 1}^k \set{ t[R_i] } $- - ><! - -$ t \in r \implies $b y d e f ini t i o n$ t[R_i] \in \pi_{R_i}(r) ; \forall R_i \in \rho \implies \set{ t[R_i] } \subseteq \pi_{R_i}(r) ; \forall R_i \in \rho $- - ><! - -$ t = \mathop{\bowtie}\limits_{i = 1}^k \set{ t[R_i] } \subseteq \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) = m_{\rho}(r) \implies t \in m_{\rho}(r) $- - ><! - -$ \forall t \in r $w eco n s i d er$ t[R_i], : R_i \in \rho $, w e ha v e t ha t$ t \in \set{ t[R_1] } \bowtie ... \bowtie \set{ t[R_k] } \subseteq \pi_{R_1}(r) \bowtie ... \bowtie \pi_{R_k}(r) = m_{\rho}(r) -->

\pi_{R_i}(m_{\rho}(r)) = \pi_{R_i}(r) $<! - - L e t^{'} sco n s i d er$ t_{R_i} \in \pi_{R_i}(m_{\rho}(r)) $, w e ha v e t o p ro v e t ha t$ t_{R_i} \in \pi_{R_i}(r) $- - >$ t \in r \implies $b y d e f ini t i o n$ t[R_i] \in \pi_{R_i}(r) ; \forall R_i \in \rho\pi_{R_i}(m_{\rho}(r)) = \set{q[R_i] \mid q \in \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r)}

\pi_{R_i}(r) \subseteq \pi_{R_i}(m_{\rho}(r))t \in r \implies t \in m_{\rho}(r) \implies t[R_i] \in \pi_{R_i}(m_{\rho}(r))

\pi_{R_i}(m_{\rho}(r)) \subseteq \pi_{R_i}(r)q \in \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) \implies $b y d e f ini t i o n o f j o in$ q = \mathop{\bowtie}\limits_{i = 1}^k \set{ p_i[R_i] } \mid p_i \in r \implies $g i v e n t ha t$ q $i s a f u n c t i o n$ q[R_i] = p_i[R_i] $an d$ p_i \in r \implies p_i[R_i] \in \pi_{R_i}(r) $w e ha v e t ha t$ q[R_i] \in \pi_{R_i}(r) $<! - -$ \exists p_1, p_2, ..., p_k \in r \mid p_i[R_i] \in \pi_{R_i}(r) ; \forall i = 1,..., k $- - ><! - -$ t_{R_i} \in \pi_{R_i}(m_{\rho}(r)) \iff \exists t' \in m_{\rho}(r) : t'[R_i] = t_{R_i} \iff\iff \exists t_1, ..., t_k \in r : t'[R_j] = t_j[R_j] \quad \forall R_j \in \rho $b u t$ t_{R_i} = t[R_i] \in \pi_{R_i}(r) -->

m_{\rho}(m_{\rho}(r)) = m_{\rho}(r)m_{\rho}(m_{\rho}(r)) = \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(m_{\rho}(r)) = \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) = m_{\rho}(r) $<! - -$ m_{\rho}(m_{\rho}(r)) = \pi_{R_1}(m_{\rho}(r)) \bowtie ... \bowtie \pi_{R_k}(m_{\rho}(r)) = \pi_{R_1}(r) \bowtie ... \bowtie \pi_{R_k}(r) = m_{\rho}(r) $- - ><! - -$ m_{\rho}(m_{\rho}(r)) = \pi_{R_1}(m_{\rho}(r)) \bowtie ... \bowtie \pi_{R_k}(m_{\rho}(r)) = \pi_{R_1}(r) \bowtie ... \bowtie \pi_{R_k}(r) = m_{\rho}(r) $- - ><! - - L e t^{'} sco n s i d er$ t_{R_i} \in \pi_{R_i}(r) $, w e ha v e t o p ro v e t ha t$ t_{R_i} \in \pi_{R_i}(m_{\rho}(r)) $- - ><! - - L e t^{'} sco n s i d er$ t_{R_i} \in \pi_{R_i}(r) \land t' \in r $w i t h$ t'[R_i] $, w e ha v e t ha t$ t'[R_i] \in t_{R_i} -->

Loseless join pt.2

PDF 15 slide 15 Given \rho = \set{R_1, R_2, ..., R_k} $, b u i l d a t ab l e$ r $w i t h$ |R| $co l u mn s an d$ k $ro w s . A tt h e$ i $- t h ro w an d$ j $- t h co l u mn p u t$ a_j $i f$ A \in R_{i} $e l se$ b_{ij}

def has_looseless_join(R, F, ρ):
	while !(∃ t ∈ r | ∀ A ∈ R, t[A] = a) and r changed:
		for X → Y ∈ F:
			for t1 ∈ r:
				for t2 ∈ r:
					if t1[X] = t2[X] and t1[Y] != t2[Y]:
						for A ∈ Y:
							if t1[A] = a:
								t2[A] = t1[A]
							else:
								t1[A] = t2[A]

	return ∃ t ∈ r | ∀ A ∈ R, t[A] = a

Correctness

PDF 15 slide 19

Let R $b e a re l a t i o n sc h e m e,$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $an d l e t$ \rho = \set{R_1, R_2, ..., R_k} $b e a d eco m p os i t i o n o f$ R $; t h e a l g or i t hm correc tl y d ec i d es w h e t h er$ \rho $ha s a l oss l ess j o in$ r = m_{\rho}(r) \iff r $ha s a t u pl e w i t ha ll$ a $w h e n t h e a l g or i t hm t er min t es > TO D O : I c an p ro v e$ r = m_{\rho}(r) \implies r $ha s a t u pl e w i t ha ll$ a $w h e n t h e a l g or i t hm t er mina t es, I j u s t ha v e t o w r i t e i t in$ \LaTeX

Minimal cover

PDF 17 slide 7

Let R $b e a sc h e maan d$ F $b e a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $. A * * minima l co v er * * o f$ F $i s a se t o ff u n c t i o na l d e p e n d e n c i es$ G \equiv F $s u c h t ha t : -$ \forall X \to Y \in G, |Y| = 1 $-$ \forall X \to A \in G, \nexists X' \subset X \mid G \equiv (G - \set{X \to A}) \cup \set{X' \to A} $-$ \nexists X \to A \in G \mid G \equiv G - \set{X \to A}

Minimal cover (step 1)

F_1 = \set{X \to A \mid X \to Y \in F \land A \in Y}F \overset{A}{\to} F_1 $b y d eco m p os i t i o n$ F_1 \overset{A}{\to} F_1^A \implies F \subseteq F_1^AF_1 \overset{A}{\to} F $b y u ni o n$ F \overset{A}{\to} F^A \implies F_1 \subseteq F^AF \equiv F_1

Minimal cover (step 2)

Given X \to A \in F_1, X' \subset X \land X' \to A \in F_1^+ \implies F_2 = (F_1 \setminus \set{X \to A}) \cup \set{X' \to A}X' \subseteq X \implies X \to X' \in F_1^+ \land X \to X' \in F_2^+ $b yre f l e x i v i t y$ X \to A \in F_1 $-$ X \to A \in F_2 \implies X \to A \in F_2^+ $-$ X \to A \notin F_2 \implies X \to X' \in F_2^+ \land X' \to A \in F_2^+ \implies X \to A \in F_2^+ $b y t r an s i t i v i t y$ X \to A \in F_2 $-$ X \to A \in F_1 \implies X \to A \in F_1^+ $-$ X \to A \notin F_1 \implies X \to A \in F_1^+ $b yH P$ F_2 \equiv F_1 \implies F \equiv F_2 $b y t r an s i t i v i t yo f t h e$ \equiv relationship

Minimal cover (step 3)

X \to A \in F_2, ; A \in (X)^+{F_2 \setminus \set{X \to A}} \implies F_3 = F_2 \setminus \set{X \to A}X \to A \in F_2 $-$ X \to A \in F_3 \implies X \to A \in F_3^+ $-$ X \to A \notin F_3 \implies X \to A \in F_3^+ $b yH P a s$ A \in (X)^+{F_3}X \to A \in F_3 $-$ X \to A \in F_2 \implies X \to A \in F_2^+ $-$ X \to A \notin F_2 $i s a co n t r a dd i c t i o na s$ F_3 = F_2 \setminus \set{X \to A} $b y d e f ini t i o n$ F_2 \equiv F_3 \implies F \equiv F_3

Decomposition

def decomposition(R, F: minimal cover):
	S = ∅
	ρ = ∅

	for A ∈ R | ∄ X → Y ∈ F : A ∈ XY:
		S = S ∪ {A}

	if S != ∅:
		R = R - S
		ρ = ρ ∪ {S}

	if ∃ X → Y ∈ F | XY = R:
		ρ = ρ ∪ {R}
	else:
		for X → A ∈ F:
			ρ = ρ ∪ {XA}

Decomposition pt.2

PDF 19 slide 5

Let R $b e a re l a t i o na l sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $, w hi c hi s aminima l co v er; t h e a l g or i t hm ‘ d eco m p os i t i o n () ‘ co m p u t e s_{(} in p o l y n o mia lt im e)_{a} d eco m p os i t i o n$ \rho $o f$ R $s u c h t ha t : - e a c h re l a t i o na l sc h e main$ \rho $i s in 3 NF -$ \rho $p reser v es$ F

Azienda 1

Si vuole sviluppare un sistema informativo per la gestione dei dati sul personale di una certa azienda costituita da diversi dipartimenti. Durante la fase di raccolta dei requisiti è stata prodotta la specifica dei requisiti mostrata di seguito. Si chiede di iniziare la fase di Analisi dei requisiti ed in particolare di:

raffinare la specifica dei requisiti eliminando inconsistenze, omissioni o ridondanze e produrre un elenco numerato di requisiti il meno ambiguo possibile
produrre un diagramma UML delle classi concettuale che modelli i dati di interesse, utilizzando solo i costrutti di classe, associazione, attributo

Requisiti

I dati di interesse per il sistema sono impiegati, dipartimenti, direttori dei dipartimenti e progetti aziendali.

Di ogni impiegato interessa conoscere il nome, il cognome, la data di nascita e lo stipendio attuale, il dipartimento al quale afferisce (esattamente uno, con la rispettiva data di afferenza).
Di ogni dipartimento interessa conoscere il nome, il numero di telefono del centralino

Di ogni dipartimento interessa conoscere inoltre il direttore, che è uno degli impiegati dell’azienda.

Il sistema deve permettere di rappresentare i progetti aziendali nei quali sono coinvolti i diversi impiegati.
Di ogni progetto interessa il nome ed il budget.
Ogni impiegato può partecipare ad un numero qualsiasi di progetti.

UML (opzione 1)

Osservazioni

Questa è l'opzione più "complessa"

Un dipartimento non deve avere per forza un dirigente assegnato (quindi 0..1)
Un impiegato può dirigere più dipartimenti (non afferisce al dipartimento che dirige, ci potrebbe essere un dipartimento apposta per i dirigenti, quindi 0..*)
La classe Afferenza ci serve per tenere uno storico dei dipartimenti in cui ha lavorato un impiegato
Usiamo la classe Partecipazione per i progetti perché un impiegato può lavorare ad un progetto più volte in periodi diversi di tempo (N.B. è fondamentale avere come attributi data_inizio e data_fine, altrimenti l'utilizzo della classe non regge in questo contesto)
Usare il tipo Stringa per l'attributo telefono non va bene, ma la soluzione originale lo includeva e preferisco attenermi a quella

classDiagram
    class Impiegato {
        nome: Stringa
        cognome: Stringa
        data_di_nascita: Data
        stipendio: Razionale >= 0
    }
    class Impiegato["fa:fa-users Impiegato"]

    class Dipartimento {
        nome: Stringa
        telefono: Stringa
    }
    class Dipartimento["fa:fa-building Dipartimento"]

    class Afferenza {
        data: Data
    }

    class Progetto {
        nome: Stringa
        budget: Razionale >= 0
    }
    class Progetto["fa:fa-wrench Progetto"]

    class Partecipazione {
        data_inizio: Data
        data_fine: Data
    }


	Impiegato "1..1" -- "0..*" Partecipazione : impiegato_partecipante 
    Partecipazione "0..*" -- "1..1" Progetto : progetto_partecipazione
	Impiegato "0..1" -- "0..*" Dipartimento : dirige
	Afferenza "1..*" -- "1..1" Impiegato : impiegato_afferente
	Afferenza "0..*" -- "1..1" Dipartimento : dipartimento_afferenza

UML (opzione 2)

Osservazioni

Questa è l'opzione più "minimale"

Un dipartimento non deve avere per forza un dirigente assegnato (quindi 0..1)
Un impiegato può dirigere al più un dipartimento
L'impiegato afferisce ad un solo dipartimento, quindi possiamo mettere la data_di_afferenza come attributo dell'impiegato
Un dipendente o partecipa ad un determinato progetto o non vi partecipa, la relazione è binaria e non serve una classe
Usare il tipo Stringa per l'attributo telefono non va bene, ma la soluzione originale lo includeva e preferisco attenermi a quella

classDiagram
    class Impiegato {
        nome: Stringa
        cognome: Stringa
        data_di_nascita: Data
        stipendio: Razionale >= 0
        data_di_afferenza: Data
    }
    class Impiegato["fa:fa-users Impiegato"]

    class Dipartimento {
        nome: Stringa
        telefono: Stringa
    }
    class Dipartimento["fa:fa-building Dipartimento"]

    class Progetto {
        nome: Stringa
        budget: Razionale >= 0
    }
    class Progetto["fa:fa-wrench Progetto"]


	Impiegato "0..*" -- "0..*" Progetto : partecipa a 
	Impiegato "0..1" -- "0..1" Dipartimento : dirige
    Impiegato "0..*" -- "1..1" Dipartimento : afferisce a

Voli Aerei 1

Requisiti

I dati di interesse per il sistema sono voli, compagnie aeree ed aeroporti.

Dei voli interessa rappresentare codice, durata, compagnia aerea ed aeroporti di partenza e arrivo.
Degli aeroporti interessa rappresentare codice, nome, città (con nome e numero di abitanti) e nazione.
Delle compagnie aeree interessa rappresentare nome, anno di fondazione, e la città in cui ha sede la direzione.

TODO: durata minima di un volo durata: [(ore: intero >= 1, minuti: intero >= 0), (ore: intero >= 0, minuti: intero >= 1)] compagnia aerea più vecchia? 1912 tecnicamente, ma mettiamo 1906, se gli aerei non esistono, non possono esistere le compagnie aere

Volo

codice, composto dalle 2 lettere della compagnia, dal numero di volo (questo numero può variare da 1 a 4 cifre e serve a distinguere ogni volo gestito dalla compagnia aerea)
duarata (vedere tabella sotto)
compagnia aerea
aeroporto di partenza
aeroporto di arrivo
non interessano i posti disponibili, dato che non è una delle metriche di cui la IATA tiene traccia
potrebbe interessare il ritardo

Una classificazione arbitraria della durata di un volo secondo Wikipedia

tipo di volo	durata
corto raggio	meno di 3 ore (internazionale)
medio raggio	da 3 a 6 - 8 ore
lungo raggio	da 6 a 12 ore
ultra-lungo raggio	più di 12 ore

Il volo più lungo è stato di circa 20 ore, ma non è importante dare un limite massimo alla durata di un volo

Aeroporto

nome ("Paperopoli" pare lecito qui)
codice, qui useremo il Il codice aeroportuale IATA ("Paperopoli" qui non va decisamente bene), una stringa lunga 3 di lettere maiuscole
città, perché sì
nazione, perché sì

Compagnia aerea

nome ("Paperopoli" pare lecito qui)
codice IATA di 2 lettere!
anno di fondazione dopo il 1903, se vogliamo essere pedanti 1909
sede (tecnicamente può cambiare, e vogliamo tenere lo storico)

Città

nome ("Paperopoli" pare lecito qui)
numero di abitanti (si spera più di 0? Immagina avere -10 abitanti, LOL)
possibilmente una nazione?

Nazione

nome
codice ISO 3166-1

Osservazioni

Non sono pedante sulla durata minima del volo, perché si suppone che la imposti un utente autorizzato (e autenticato!)
Una città può avere 0 aeroporti, perché magari è la sede di una qualche compagnia
Ho messo lo storico delle sedi legali delle compagnie per essere mettere più funzionalità possibili

UML

classDiagram
    class Volo {
        codice: (codice_IATA_compagnia: Stringa [A-Z]2, numero_di_volo: 1..9999) 
        durata: (ore: intero >= 0, minuti: intero 0..60)
        ritardo: (ore: intero >= 0, minuti: intero 0..60)
    }

    class Aeroporto {
        codice: (codice_IATA_aerporto: Stringa [A-Z]3)
        nome: Stringa
    } 

    class Compagnia {
        codice: (codice_IATA_compagnia: Stringa [A-Z]2)
        nome: Stringa
        anno_di_fondazione: Intero >= 1903
    }
    class Compagnia["Compagnia aerea"]

    class Citta {
        nome: Stringa
        numero_di_abitanti: Intero >= 0
    }
    class Citta["Città"]

    class Nazione {
        codice: (codice_ISO_3166-1: Stringa [A-Z]2)
        nome: Stringa
    }

    class Sede {
        data_inizio: Data
        data_fine: Data
    }

    Volo "0..*" -- "1..1" Compagnia
    Volo "0..*" -- "1..1" Aeroporto : parte_da
    Volo "0..*" -- "1..1" Aeroporto : arriva_a 
    Aeroporto "0..*" -- "1..1" Citta : sta_in
    Citta "0..*" -- "1..1" Nazione : sta_in
    Compagnia "1..1" -- "1..*" Sede : compagnia_sede 
    Sede "1..1" -- "1..*" Citta : città_sede

Università 1

Requisiti

I dati di interesse per il sistema sono studenti, facoltà, professori e corsi.

Di ogni studente interessa conoscere il nome, il codice fiscale, il numero di matricola, la data di nascita, il luogo di nascita (città e regione), il corso di laurea a cui è iscritto (con l’anno di iscrizione), e gli insegnamenti di cui ha superato l’esame.
Dei professori interessa il nome, la data di nascita, il codice fiscale, il luogo di nascita e gli insegnamenti erogati.
Dei corsi di laurea interessa il nome e la o le facoltà di appartenenza. Di queste ultime interessa il nome.
Di ogni insegnamento interessa il codice, il nome, il numero di ore di lezione, e i corsi di laurea a cui appartiene.

Studente

nome ("Paperopoli" pare lecito qui)
codice fiscale (nome [A-Z]{3}, cognome [A-Z]{3}, anno [0-9]{2}, mese [A-Z]{1}, giorno [0-9]{2}, comune [A-Z][0-9]{3}, codice_controllo [A-Z]) per pedanti
matricola (Stringa [0-9]{7})
data di nascita (all my fellas be born after 1400?)
luogo di nascita
corso di laurea a cui è iscritto con anno di iscrizione (all my fellas be iscritti a più di un corso di laurea, ovvero 2 lo dice il Gov)
insegnamenti superati con voto

(prima account su infostud, e solo dopo iscrizione ad un corso)

Insegnamento

codice 100000..999999
nome
corsi di appartenenza
ore di lezione intero > 0

Corso

nome
facoltà di appartenenza

Facoltà

nome

Professore

nome ("X Æ A-12" pare lecito qui)
codice fiscale (stessa pappardella di sopra)
data di nascita
luogo di nascita
insegnamenti erogati

UML

classDiagram
    class Studente {
        nome: Stringa
        codice_fiscale: [A-Z]3[A-Z]3[0-9]2[A-Z][0-9]2[A-Z][0-9]3[A-Z]
        matricola: Stringa [0-9]7
        data_di_nascita: Data
    }

    class Citta {
        nome: Stringa
    }
    class Citta["Città"]

    class Regione {
        nome: Stringa
        codice: [A-Z]2 
    }

    class Corso {
        nome: Stringa
    }

    class Insegnamento {
        codice: (codice_insegnamento: 100000..999999)
        nome: Stringa
        ore_di_lezione: intero >= 1
    }

    class Facolta {
        nome: Stringa
    }
    class Facolta["Facoltà"]

    class Professore {
        nome: Stringa
        data_di_nascita: Data
        codice_fiscale: [A-Z]3[A-Z]3[0-9]2[A-Z][0-9]2[A-Z][0-9]3[A-Z]
    }

    Citta "0..*" -- "1..1" Regione : sta in
    Studente "0..*" -- "1..1" Citta : nato a
    Studente "0..*" -- "0..2" Corso : iscritto a
    Studente "" -- "" Insegnamento : superato (voto 18..30, e la lode?)
    Insegnamento "0..*" -- "0..*" Corso : in (potrei considerare 21..* a sinistra? Nah, la magistrale ne avra' di meno)
    Corso "0..*" -- "1..1" Facolta : appartiene a
    Professore "1..1" -- "0..*" Insegnamento : insegna

Voli Aerei 2

Requisiti

I dati di interesse per il sistema sono voli, compagnie aeree ed aeroporti.

Dei voli interessa rappresentare codice, durata, compagnia aerea ed aeroporti di partenza e arrivo.

Degli aeroporti interessa rappresentare codice, nome, città (con nome e numero di abitanti) e nazione.

Delle compagnie aeree interessa rappresentare nome, anno di fondazione, e la città in cui ha sede la direzione.

Un tipo particolare di voli sono voli charter.

Questi possono prevedere tappe intermedie in aeroporti.

Delle tappe intermedie di un volo charter interessa mantenere l’ordine con cui esse si susseguono (ad esempio, un certo volo che parte da “Milano Linate” e arriva a “Palermo Punta Raisi”, prevede tappe intermedie prima nell’aeroporto di Bologna e poi in quello di Napoli).

Dei voli charter interessa rappresentare anche il modello di velivolo usato.

Volo

codice, composto dalle 2 lettere della compagnia, dal numero di volo (questo numero può variare da 1 a 4 cifre e serve a distinguere ogni volo gestito dalla compagnia aerea) {id}
duarata
compagnia aerea
aeroporto di partenza
aeroporto di arrivo

Aeroporto

nome ("Paperopoli" pare lecito qui)
codice, qui useremo il Il codice aeroportuale IATA [A-Z]{3} {id}
città
nazione

Compagnia aerea

nome ("Paperopoli" pare lecito qui)
codice IATA di 2 lettere {id}
anno di fondazione dopo il 1903, se vogliamo essere pedanti 1909
sede (città, non teniamo lo storico)

Voli

Città

nome
numero di abitanti >= 0
nazione

Nazione

nome
codice ISO 3166-1, stringa lunga 2

Voli Charter

tappe intermedie in aeroporti
modello di velivolo

Tappe

mantenere l'ordine con cui si susseguono
“Milano Linate” e arriva a “Palermo Punta Raisi”, prevede tappe intermedie prima nell’aeroporto di Bologna e poi in quello di Napoli

UML

classDiagram
    note "TIPI\nCodiceIATACompagnia = Stringa [A-Z]{2}\nCodiceIATAAeroporto = Stringa [A-Z]{3}\nCodiceIATAVolo = (codice_IATA: CodiceIATACompagnia, numero_di_volo: 1..9999)\nCodiceISO_3166-1_alpha-2: Stringa [A-Z]{2}"

    class Volo {
        durata: Durata 
    }
    Volo : codice&#58 CodiceIATAVolo {id}

    class Charter {
        modello: Stringa
    }

    class Aeroporto {
        nome: Stringa
    } 
    Aeroporto : codice&#58 CodiceIATAAeroporto {id}

    class Compagnia {
        nome: Stringa
        anno_di_fondazione: Intero
    }
    Compagnia : codice&#58 CodiceIATACompagnia 
    class Compagnia["Compagnia aerea"]


    class Citta {
        nome: Stringa
        numero_di_abitanti: Intero >= 0
    }
    class Citta["Città"]

    class Nazione {
        nome: Stringa
    }
    Nazione: codice&#58 CodiceISO_3166-1_alpha-2 {id}

    Volo "0..*" -- "1..1" Compagnia : gestito_da
    Volo "0..*" -- "1..1" Aeroporto : parte_da
    Volo "0..*" -- "1..1" Aeroporto : arriva_a 
    Aeroporto "0..*" -- "1..1" Citta : situato_a
    Citta "0..*" -- "1..1" Nazione : si_trova_in
    Compagnia "0..*" -- "1..1" Citta : ha_sede_a 
    Charter --|> Volo
    Charter "0..*" -- "0..*" Aeroporto : tappa (numero > 0, univoco per ogni tappa del volo charter?)

Accademia 1

Requisiti

I dati di interesse per il sistema sono i docenti universitari, i progetti di ricerca e le attività dei docenti.

Di ogni docente interessa conoscere il nome, il cognome, la data di nascita, la matricola, la posizione universitaria (ricercatore, professore associato, professore ordinario) e i progetti ai quali partecipa.

Dei progetti interessa il nome, un acronimo, la data di inizio, la data di fine e i docenti che vi partecipano.

Un progetto è composto da molti Work Package (WP). Oltre al progetto a cui fa riferimento, del WP interessa sapere il nome, la data di inizio e la data di fine.

Il sistema deve permettere ai docenti di registrare impegni di diverso tipo.

Degli impegni interessa sapere il giorno in cui avvengono, la durata in ore e la tipologia di impegno con relativa motivazione.

Devo mettere gli ID!

Docenti

nome
cognome
data di nascita
matricola {id}
posizione universitaria
- ricercatore
- professore associato
- professore ordinario
progetti a cui partecipa (più di 1)
impegni (più di 1?)

Progetti di ricerca

nome
acronimo [A-Z]{10}? {id?}
data inizio
data fine (opzionale? [0..1])
i docenti che vi partecipano (più di 1)

Work Package

nome
progetto di cui fanno parte
data inizio
data fine (opzionale? [0..1])

Attività

impegni di diverso tipo?
giorno
durata in ore
motivazione

Tipologia

Mi aspetto che il professore scelga la tipologia da un elenco prefissato dall'università, quindi lo tengo in una classe separata

UML

classDiagram
    class Docente {
        nome: Stringa
        cognome: Stringa
        data_di_nascita: Data
        posizione: ["ricercatore", "professore associato",  "professore ordinario"]
    }
    Docente : matricola&#58 [A-Z0-9]{10} {id}
    class Docente["Docente 👨‍🏫"]

    class Progetto {
        acronimo: [A-Z]+
    }
    class Progetto["Progetto di ricerca 🏗️"]

    class WP {
        nome: Stringa
        data_inizio: Data
        data_fine: Data [0..1]
    }
    class WP["Work Package 📦"]

    class Attivita {
        giorno: Data,
        durata_ore: intero > 0,
    }
    class Attivita["Attività ⌚"]

    class Tipologia {
        nome: Stringa
    }

    Docente "0..*" -- "0..*" Progetto : partecipa_a
    Docente "1..1" -- "0..*" Attivita : ha_un
    WP "0..*" -- "1..1" Progetto : fa_parte_di
    WP <|-- Progetto
    Attivita "0..*" -- "1..1" Tipologia : di_tipo `(motivazione Stringa)`

Accademia 2

I dati di interesse per il sistema sono i docenti universitari, i progetti di ricerca e le attività dei docenti. Di ogni docente interessa conoscere il nome, il cognome, il luogo e la data di nasci- ta, la matricola e la posizione universitaria (una tra: ricercatore, professore associato, professore ordinario). Dei progetti interessa il nome, un acronimo, la data di inizio, la data di fine e i docenti che vi partecipano. Un progetto è composto da molti Work Package (WP). Oltre al progetto a cui fa riferimento, del WP interessa sapere il nome, la data di inizio e la data di fine. Il sistema deve permettere ai docenti di registrare impegni di diverso tipo tra: as- senza (per chiusura universitaria, malattia, gravidanza, etc.), attività legate a impegni istituzionali (didattica, ricerca, missione, consiglio di dipartimento, consiglio di area di- dattica, etc.), attività legate a impegni progettuali (Ricerca e Sviluppo, Dimostrazione, Management, etc.). Degli impegni interessa sapere il giorno in cui avvengono, la durata e la tipologia di impegno con relativa motivazione. Si noti che alcuni impegni (ad es., impegno per didattica oppure alcuni tipi di assenza) possono occupare solo parte di una giornata lavorativa, pertanto la loro durata va misurata in giorni; altri impegni ed altre tipologie di assenza (ad es., assenza per malattia) occupano intere giornate, e la loro durata, dunque, va misurata in giorni.

Officine 1

I dati di interesse per il sistema sono quelli relativi alle officine della catena, i relativi dipendenti e direttori, e quelli relativi alle riparazioni dei veicoli.

Di ogni officina della catena interessano il nome, l’indirizzo, il numero di dipendenti, i dipendenti con il relativo numero di anni di servizio ed il direttore.

Dei dipendenti e dei direttori interessano il nome, il codice fiscale, l’indirizzo e il numero di telefono; inoltre dei direttori interessa anche la data di nascita.

Per quanto riguarda le riparazioni dei veicoli, sono dati di interesse il codice, il vei- colo (modello, tipo, targa, anno di immatricolazione e proprietario), la data ed ora di accettazione e quella di riconsegna (per le riparazioni terminate).

Infine, dei proprietari dei veicoli interessano nome, codice fiscale, indirizzo e telefono.

Travel to the Moon

Requisiti

Requisiti sulle crociere: 1.1. codice 1.2. data di inizio 1.3. data di fine 1.4. nave utilizzata (v. req. 2) 1.5. itinerario (v. req. 4) 1.6. il tipo, uno tra: 1.6.1. luna di miele, di cui interessa un sottotipo tra: 1.6.1.1. tradizionali (# dest. romantiche >= # dest. divertenti) (v. req. 3) 1.6.1.2. alternative (altrimenti) 1.6.2. per famiglie, di cui interessa: 1.6.3. se adatte ai bambini (booleano)
Requisiti sulle navi 2.1. nome 2.2. comfort (3..5) 2.3. capienza (max)
Requisiti sui porti: 3.1. nome 3.2. continente ({"Africa", "Asia", "Europa", etc...}) 3.3. posti da vedere (v. req. 5) 3.4. tipo, almeno uno tra: 3.4.1. romantico 3.4.2. divertente 3.5 un insieme di posti da vedere (v. req. 5)
Requisiti sugli itinerari: di ogni itinerario interessa 4.1. nome 4.2. sequenza ordinata di elementi (tappe), di cui interessa: 4.2.1. porto (v. req. 3) 4.2.2. arrivo: 4.2.2.1. il numero d'ordine del giorno (rispetto alla data di inizio della crociera) 4.2.2.2. ora 4.2.3. ripartenza
4.2.3.1. il numero d'ordine del giorno (rispetto alla data di inizio della crociera) 4.2.3.2. ora
Requisiti sui posti da vedere: 5.1. nome 5.2. descrizione 5.3. fascia consigliata 5.3.1. giorno 5.3.2. ora inizio 5.3.3. ora fine

Requisiti clienti 6.1. nome 6.2. cognome 6.3. età 6.3. indirizzo: (nome: Stringa, civico: numero, Città: Stringa, Regione: Stringa, Paese: Stringa) 6.4. può prenotare crociere (v. req. 7)
Requisiti prenotazioni 6.1. istante di prenotazione (DataOra?) 6.2. crociera prenotatata (v. req. 1) 6.3. posti prenotati (Intero > 0)

UML

classDiagram
    class Nave {
        nome: Stringa
        conformt: 3..5
        capienza: Intero > 0
    }

    class Destinazione {
        nome: Stringa
        tipo: [Romantico, Divertente] [1..2]
    }

    class Continente {
        nome: Stringa
    }

    class PostoDaVedere {
        nome: Stringa
    }

    Destinazione "0..*" -- "1..1" Continente : "porto_cont"
    Destinazione "0..*" -- "0..*" PostoDaVedere : "porto_post"

Requisiti 1 e 4 (crociere e itinerari)

Algoritmi II

Un grafo $G$ è una coppia $(V, E)$ dove $V$ è un insieme di vertici ed $E$ un insieme di nodi; un grafo si dice

semplice se ogni coppia di nodi è collegata con al massimo un arco, e non ci sono cappi
diretto se gli archi sono orientati
connesso se ogni coppia di nodi è collegata da una passeggiata
fortemente connesso se per ogni coppia di vertici $x, y$ esiste un cammino da $x$ a $y$ e viceversa

Due nodi $x, y$ si dicono adiacenti ( $x \sim y$ ) se sono collegati da un arco, e l'arco si dice incidente rispetto ai due nodi

Rappresentazione

Un grafo $G = (V, E)$ si può rappresentare tramite matrice di adiacenza o lista di adiacenza

Matrice di adiacenza

Siano $V_{1}, V_{2}, ..., V_{n}$ i vertici del grafo, possiamo rappresentare il grafo come una matrice $v$ tale che

$(v)_{ij} = {10 se V_{i} \overset{e}{ˋ} adiacente a V_{j} altrimenti$


	$V_{1}$	$V_{2}$	...	$V_{n}$
$V_{1}$	0	1	...	0
$V_{2}$	1	0	...	1
...	...	...	...	...
$V_{n}$	0	1	...	0

Per verificare se $V_{i} \sim V_{j}$ il costo è $O (1)$ la dimensione della matrice è $O (n^{2})$ con

$n = ∣ V (G) ∣$

Lista di adiacenza

Siano $V_{1}, V_{2}, ..., V_{n}$ i vertici del grafo, possiamo indicare per ogni nodo l'insieme dei suoi vicini


$V_{1}$	${V_{3}, V_{4}, ...}$
$V_{2}$	${V_{5}, V_{n}, ...}$
...	${...}$
$V_{n}$	${V_{2}, V_{5}, ...}$

Per verificare se $V_{i} \sim V_{j}$ il costo è $O (n)$ la dimensione della matrice è $O (n + m)$ , con

$n = ∣ V (G) ∣$
$m = v \in V (G) \sum de g (v) = 2∣ E (G) ∣$

Teoremi

Definizione

Una passeggiata in un grafo $(V, E)$ è definita come una sequenza $V_{0} e_{1} V_{1} e_{2} V_{2} e_{3} V_{3}$ dove $e_{i}$ collega $V_{i - 1}$ a $V_{i}$

Definizione

Una passeggiata si dice euleriana se attraversa ogni arco del grafo esattamente una volta

Teorema di Eulero

$\exists$ una passeggiata euleriana $⟺$ il grafo è connesso ed esistono al massimo 2 vertici di grado dispari

Definizione

Un cammino è una passeggiata che non ripete vertici (quindi neanche archi)

Definizione

Un ciclo in un grafo è un sottografo connesso con ogni vertice di grado 2

Osservazione

Se $\exists$ una passeggiata da $x$ a $y ⟹ \exists$ un cammino da $x$ a $y$

si dimostra con l'algoritmo

DFS (depth first search)

#![allow(unused)]
fn main() {
            dfs(graph, y, visited)
        }
    }
}

fn dfs_iterative(graph: &[Vec<usize>], x: usize) -> Vec<bool> {
    let mut stack = Vec::from([x]);
    let mut visited = vec![false; graph.len()];
    let mut adjacent = vec![0; graph.len()];

    while let Some(&x) = stack.last() {
        if let Some(&y) = graph[x].get(adjacent[x]) {
            if !visited[y] {
                stack.push(y);
                visited[y] = true;
            }

            adjacent[x] += 1;
        } else {
            stack.pop();
}

Dimostrazione correttezza: bisogna dimostrare che

se $\exists$ un cammino da $x$ a $y ⟹ y \in visited$

per assurdo, supponiamo che $\exists y ∣ x \to y \land y \in / visited$


$V_{0}$	$V_{1}$	$V_{2}$	...	$V_{n}$
$x$	...	...	...	$y$

Sia $i$ un indice $V_{i} \in visited, V_{i + 1} \in / visited$

$V_{i} \in visited ⟹ V_{i}$ è stato inserito nello stack $⟹$ ma se $V_{i} \in$ stack e $V_{i + 1} \in /$ stack $⟹$ l'algoritmo non è stato eseguito correttamente (contraddizione!)

TODO: complessità $O (n + m)$

Definizione

L'albero di visita è un sottografo composto dagli archi che usiamo per raggiungere i vertici nuovi non ancora visitati

è connesso

è aciclico

Definizione

L'albero di visita di un grafo diretto è detto arborescenza

diretto

ogni arco orientato dalla radice alle foglie

Se $G$ è un graffo connesso $⟹ visited$ contiene tutti i nodi del grafo

Se $G$ non è connesso $⟹ visited$ è il componennte che contiene $X$

Ordine topologico

Consideriamo un progetto diviso in $X_{1}, X_{2}, ..., X_{n}$ task, con dipendenze fra i vari task (Es. $X_{1}$ va eseguito dopo $X_{2}$ e $X_{3}$ , e $X_{3}$ dopo $X_{2}$ ; in questo caso l'ordine sarebbe $X_{2}, X_{3}, X_{1}$ )

Indicando i vertici con $X_{1}, X_{2}, ..., X_{n}$ e gli archi con $(X_{i}, X_{j})$ se $X_{i}$ dipende da $X_{j}$ , una programmazione dei task corrisponde ad un ordine dei vertici con tutti gli archi da destra verso sinistra

Se il grafo ha un ciclo (diretto), non è possibile dare un ordine topologico

Dimostrazione

Suppponiamo per assurdo che esiste un tale ordine, allora uno dei vertici deve essere per forza l'ultimo nell'ordine; ma essendo ciclico esiste un arco che va da sinistra verso destra da uno dei vertici "centrali" della sequenza all'ultimo vertice, quindi tale ordine non esiste.

Definizione

Un grafo diretto ha un ordine topologico se $\exists$ un ordine deivertici con tutti gli ordini degli archi da destra verso sinistra

Proposizione

Se un grafo diretto ha la proprietà che ogni vertice ha almeno un arco uscente $⟹ \exists$ un ciclo

stessa dimostrazione dell'algoritmo del primo giorno: se ogni vertice ha un arco uscente e i vertici sono finiti, alora prima o poi dovrà tornare... (creare un ciclo)

L'implicazione $⟸$ non è vera!

Corollario

Se \notexists un ciclo $⟹ \exists$ un nodo senza archi uscenti

Soluzione naive

TODO: algoritmo $O (n (n + m))$ per trovare l'ordine topologico!

Soluzione con DFS

Esercizi

Ciclo in un grafo

input un grafo $G ∣ \forall v \in V (G), de g (v) \geq 2$
output un ciclo un grafo (come elenco di vicini)
complessità $O (n + m)$

#![allow(unused)]
fn main() {
use std::collections::VecDeque;

pub mod practice;
use practice::path;

fn find_cycle(graph: &[Vec<usize>], mut x: usize) -> Vec<usize> {
    let mut cycle = vec![];
    let mut visited = vec![false; graph.len()];

    let mut z = x;
    while !visited[x] {
        cycle.push(x);
        visited[x] = true;

        let next = if graph[x][0] == z { 1 } else { 0 };
        z = x;
        x = graph[x][next];
    }

    cycle.into_iter().skip_while(|&y| y != x).collect()
}

fn does_path_exist(graph: &[Vec<usize>], x: usize, y: usize) -> bool {
}

Cammino da $x$ a $y$

input un grafo $G$ e $x, y \in V (G)$
output $x \sim y ?$
complessità $O (n + m)$

#![allow(unused)]
fn main() {
    dfs(graph, x, &mut visited);

    visited[y]
}

fn dfs(graph: &[Vec<usize>], x: usize, visited: &mut Vec<bool>) {
    visited[x] = true;

    for &y in &graph[x] {
}

#![allow(unused)]
fn main() {
            dfs(graph, y, visited)
        }
    }
}

fn dfs_iterative(graph: &[Vec<usize>], x: usize) -> Vec<bool> {
    let mut stack = Vec::from([x]);
    let mut visited = vec![false; graph.len()];
    let mut adjacent = vec![0; graph.len()];

    while let Some(&x) = stack.last() {
        if let Some(&y) = graph[x].get(adjacent[x]) {
            if !visited[y] {
                stack.push(y);
                visited[y] = true;
            }

            adjacent[x] += 1;
        } else {
            stack.pop();
}

#![allow(unused)]
fn main() {
fn path(graph: &Vec<Vec<usize>>, x: usize, y: usize) -> bool {
    dfs_iterative(graph, x)[y]
}
}

Componenti di $G$

input un grafo $G$
output l'elenco di archi in avanti, indietro e di attraversamento
complessità $O (n + m)$

#![allow(unused)]
fn main() {
    }

    visited
}

pub fn find_components(graph: &[Vec<usize>]) -> Vec<usize> {
    let mut components = vec![0; graph.len()];
    let mut component = 1;

    for x in 0..graph.len() {
        if components[x] == 0 {
            dfs_components(graph, x, &mut components, component);
            component += 1;
        }
    }

    components
}

fn dfs_components(graph: &[Vec<usize>], x: usize, components: &mut Vec<usize>, component: usize) {
    components[x] = component;

    for &y in &graph[x] {
        if components[y] == 0 {
            dfs_components(graph, y, components, component)
        }
    }
}
}

Classificazione archi

input un grafo $G$
output le componenti di $G$
complessità $O (n + m)$


add	IF	ID	EX	ME	WB
sub		\(\rightarrow\)	\(\rightarrow\)	IF	ID	EX	ME	WB

Computer Science @ Sapienza