Databases

View the presentation on proofs or download it. The links refer to the slides of prof. Perelli and don't work on the website: you have to download the pdf in the same folder with the course's slides for the links to work.

Schema

An attribute is a $(name, domain)$ pair; we can define the $d o m ()$ function on a set of names, which associates to each name a specific domain (different attributes can have the same domain)

$d o m : {name_{1}, ..., name_{n}} name_{i} \to {domain_{1}, ..., domain_{k}} \mapsto domain_{j}$

PDF 7 slide 2

A relation schema $R = {A_{1}, A_{2}, ..., A_{n}}$ is a set of attributes

Tuples & instances

PDF 7 slide 3 Given a relation schema $R = A_{1} A_{2} ... A_{n}$ , a tuple $t$ on $R$ is a function such that

$t : R A_{i} \to i = 1 ⋃ n d o m (A_{i}) \mapsto a \in d o m (A_{i})$

Given a relation schema $R$ , a subset $X \subseteq R$ and $t$ a tuple on $R$ , the reduction of $t$ on $X$ is defined as

$t [X] = {(A, t [A]) ∣ A \in X}$

PDF 7 slide 4 Given a relation schema $R$ , a subset $X \subseteq R$ and $t_{1}, t_{2}$ tuples on $R$

$t_{1} [X] = t_{2} [X] ⟺ t_{1} [A] = t_{2} [A] \forall A \in X$

PDF 7 slide 5 Given a relation schema $R$ and $t_{1}, t_{2}, ..., t_{k}$ tuples on $R$ , a set $r = {t_{1}, t_{2}, ..., t_{k}}$ is an instance of $R$

Functional dependencies

PDF 7 slide 6

Given a relation schema $R$ and $X, Y \in P (R) ∖ {\emptyset}$ we have that $(X, Y)$ is a functional dependency on $R$ (noted as $X \to Y$ )

PDF 7 slide 7

Given a relation schema $R$ and a functional dependency $X \to Y$ defined on $R$ we say that an instance $r$ of $R$ satisfies the functional dependency $X \to Y$ if

$\forall t_{1}, t_{2} \in r t_{1} [X] = t_{2} [X] ⟹ t_{1} [Y] = t_{2} [Y]$

Instance legality & closure

PDF 7 slide 14

Given a relation schema $R$ and a set $F$ of functional dependencies defined on $R$ , an instance $r$ of $R$ is legal if it satisfies every dependency in $F$

$\forall X \to Y \in F \forall t_{1}, t_{2} \in r t_{1} [X] = t_{2} [X] ⟹ t_{1} [Y] = t_{2} [Y]$

PDF 7 slide 20

Given a relation schema $R$ and a set $F$ of functional dependencies defined on $R$ , the closure of $F$ is the set of functional dependencies that are satisfied by every legal instance of $R$

$F^{+} = {V \to W ∣ \forall legal r of R, r satisfies V \to W}$

$V \to W$ doesn't necessarily have to be in $F$

$F \subseteq F^{+}$

PDF 7 slide 21

$F \subseteq F^{+}$

Proof

$F^{+} = {V \to W ∣ \forall legal r of R, r satisfies V \to W}$

By definition $r$ is legal if it satisfies every dependency $X \to Y \in F ⟹$ given $X \to Y \in F$ , every legal instance of $R$ satisfies $X \to Y ⟹ X \to Y \in F^{+}$

Keys

PDF 7 slide 22

Given a relation schema $R$ and a set $F$ of functional dependencies on $R$ , $K \subseteq R$ is a key of $R$ if

$K \to R \in F^{+}$
$\forall K^{'} \subset K, K^{'} \to R \in / F^{+}$

$" \subset "$ means proper subset, which implies that $K \neq = K^{'}$

Trivial dependencies

PDF 7 slide 26

Given a schema $R$ and $X, Y \in P (R) ∖ {\emptyset} : Y \subseteq X$ , we have that every instance $r$ of $R$ satisfies the dependency $X \to Y$

Proof

Given an instance $r$ of $R, \forall t_{1}, t_{2} \in r$ we have that

$t_{1} [X] = t_{2} [X] ⟹$ by definition $t_{1} [A] = t_{2} [A] \forall A \in X ⟹$ as $Y \subseteq X$ we have that
$t_{1} [A^{'}] = t_{2} [A^{'}] \forall A^{'} \in Y ⟹$ by definition $t_{1} [Y] = t_{2} [Y]$

As $t_{1} [X] = t_{2} [X] ⟹ t_{1} [Y] = t_{2} [Y]$ we have that $r$ satisfies $X \to Y$

Decomposition

PDF 7 slide 27

Given a schema $R$ and a set of functional dependencies $F$ on $R$ , we have that

$X \to Y \in F^{+} ⟺ X \to A \in F^{+} \forall A \in Y$

Proof

$X \to Y \in F^{+} ⟹ \forall legal r of R \forall t_{1}, t_{2} \in r t_{1} [X] = t_{2} [X] ⟹ t_{1} [Y] = t_{2} [Y] ⟹ t_{1} [A] = t_{2} [A] \forall A \in Y ⟹ X \to A \in F^{+} \forall A \in Y$

$X \to A \in F^{+} \forall A \in Y ⟹ \forall legal r of R \forall t_{1}, t_{2} \in R t_{1} [X] = t_{2} [X] ⟹ t_{1} [A] = t_{2} [A] \forall A \in Y ⟹ t_{1} [Y] = t_{2} [Y] ⟹ X \to Y \in F^{+}$

$F^{A}$

PDF 8 slide 3 $F^{A}$ is a set of functional dependencies on $R$ such that

$X \to Y \in F ⟹ X \to Y \in F^{A}$
$Y \subseteq X \in R ⟹ X \to Y \in F^{A}$ (refelxivity)
$\forall Z \in R, X \to Y \in F^{A} ⟹ ZX \to Z Y \in F^{A}$ (augmentation)
$X \to Y, Y \to Z \in F^{A} ⟹ X \to Z \in F^{A}$ (transitivity)

PDF 8 slide 6 derivates

$X \to Y, X \to Z \in F^{A} ⟹ X \to Y Z \in F^{A}$ (union)
$X \to Y \in F^{A} \land Z \subseteq Y ⟹ X \to Z \in F^{A}$ (decomposition)
$X \to Y, WY \to Z \in F^{A} ⟹ W X \to Z \in F^{A}$ (pseudotransitivity)

PDF 8 slide 8 $X \to A_{1} A_{2} ... A_{n} \in F^{A} ⟺ \forall i = 1, ..., n X \to A_{i} \in F^{A}$

Derivates (Proofs)

Union

$(X)_{F}^{+}$ LaTeX

$X \to Y, X \to Z \in F^{A} ⟹$ by augmentation $X \to X Y, X Y \to Y Z \in F^{A} ⟹$ by transitivity $X \to Y Z \in F^{A}$

Decomposition

$X \to Y \in F^{A} \land Z \subseteq Y ⟹ Y \to Z \in F^{A} ⟹$ by transitivity $X \to Z \in F^{A}$

Pseudotransitivity

$X \to Y, WY \to Z \in F^{A} ⟹$ by augmentation $W X \to WY \in F^{A} ⟹$ by transitivity $W X \to Z \in F^{A}$

$(X)_{F}^{+}$

$([P D F 8 s l i d e 9] (08 G i v e na re l a t i o n sc h e ma$ R $, a se t$ F $o fd e p e n d e n c i eso n$ R $an d$ X \subseteq R $. T h e * * c l os u re * * o f$ X $w i t h res p ec tt o$ F $, d e n o t e d$ (X)^+_F $i s d e f in e d a s$ $(X)_{F}^{+} = {A \in R ∣ X \to A \in F^{A}}$ $W e ha v e t ha t$ X \subseteq (X)^+_F

Proof

\forall A \in X $b yre f l e x i v i t y$ X \to A \in F^A \implies $b y d e f ini t i o n$ A \in (X)^+_F \implies X \subseteq (X)^+_F $> W ec an u se A r m s t ro n g^{'} s a x i o m s a s$ (X)^+_F $i s d e f in e d o f$ F^A $> NOTE :$ (X)^+_F $i s * * NOT * * d e f in e d o n$ F^+

Lemma of closure

PDF 8 slide 10

Let R $b e a sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $X \to Y \in F^{A} ⟺ Y \subseteq (X)_{F}^{+}$

Proof

X \to Y \in F^A \implies $b y d eco m p os i t i o n$ X \to A \in F^A ; \forall A \in Y \implies $b y d e f ini t i o n$ A \in (X)^+_F ; \forall A \in Y \implies Y \subseteq (X)^+_F $>$ Y \subseteq (X)^+_F \implies A \in (X)^+_F ; \forall A \in Y \implies $b y d e f ini t i o n$ X \to A \in F^A ; \forall A \in Y \implies $b y u ni o n$ X \to Y \in F^A

F^+ = F^A $[P D F 8 s l i d e 11] (08 L e t$ R $b e a re l a t i o n sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $t h e n$ F^+ = F^A

Proof

Let F_i $b e t h e v a l u eo f$ F $a f t er t h e$ i $- t ha ppl i c a t i o n o f an A r m s t ro n g^{'} s a x i o m, w i t h$ F_0 = F

F^A \subseteq F^+

Base case

F_0 = F \subseteq F^+ \implies F_0 \subseteq F^+

Inductive step

F_i \subseteq F^+ \implies F_{i + 1} \subseteq F^+ $L e t$ X \to Y \in F_{i + 1} $, e i t h er -$ X \to Y \in F_i \implies $b yH P$ X \to Y \in F^+ $-$ X \to Y \in F_{i + 1} \setminus F_i $, w hi c hm e an s t ha t$ X \to Y has been optained through one of the axioms

F^A \subseteq F^+

Reflexivity

Y \subseteq X \implies $g i v e n t ha t$ X \to Y $i ss a t i s f i e d b ye v ery in s t an ce$ X \to Y \in F^+

Augmentation

Z \subseteq R, X = ZV, Y = ZW \land V \to W \in F_i $g i v e n$ t_1, t_2 \in r $l e g a l in s t an ceo f$ R $w e ha v e t ha t$ t_1[X] = t_2[X] \implies (t_1[V] = t_2[V] \implies $b yH P$ t_1[W] = t_2[W]) \land t_1[Z] = t_2[Z] \implies t_1[Y] = t_2[Y]

Transitivity

X \to Z, Z \to Y \in F_i \implies $b yH P$ \forall \text{ legal } r \text{ of } R, \forall t_1, t_2 \in r, t_1[X] = t_2[X] \implies t_1[Z] = t_2[Z] \implies t_1[Y] = t_2[Y] $w e ha v e t ha t$ t_1[X] = t_2[X] \implies t_1[Y] = t_2[Y] \implies X \to Y \in F^+

F^+ \subseteq F^A $_{(} l e g a l in s t an ce)_{G} i v e n$ X \subseteq R $w ec anb u i l d anin s t an ce$ r = \set{t_1, t_2} $o n$ R $s u c h t ha t < t ab l e >< t h e a d >< t r >< t h >$ r $< / t h >< t h co l s p an = "5" >$ (X)^+_F $< / t h >< t h co l s p an = "5" >$ R \setminus (X)^+_F $< / t h >< / t r >< / t h e a d >< t b o d y >< t r >< t d >$ t_1 $< / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< / t r >< t r >< t d >$ t_2 $< / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< t d > 0 < / t d >< t d > 0 < / t d >< t d > 0 < / t d >< t d > ... < / t d >< t d > 0 < / t d >< / t r >< / t b o d y >< / t ab l e > L e t^{'} s v er i f y t ha t$ r $i s a l e g a l in s t an ce . G i v e n$ V \to W \in F $, a s$ V, W \neq \varnothing $b y d e f ini t i o n, w eco u l d ha v e -$ V \nsubseteq (X)^+_F \implies \exists A \in V : A \in R \setminus (X)^+_F \implies t_1[V] \neq t_2[V] \implies r $s a t i s f i es$ V \to W $-$ V \subseteq (X)^+_F $, w eco u l d ha v e t ha t -$ W \subseteq (X)^+_F \implies t_1[V] = t_2[V] \land t_1[W] = t_2[W] \implies r $s a t i s f i es$ V \to W $-$ W \nsubseteq (X)^+_F \implies \exists A \in W : A \in R \setminus (X)^+_F \implies t_1[V] = t_2[V] \land t_1[W] \neq t_2[W]

F^+ \subseteq F^A $_{(} l e g a l in s t an ce)_{I} n t h e l a s t c a se$ r $d oes n^{'} t s a t i s f y$ V \to W $, so w e ha v e t os h o wt ha t i t c a n^{'} t ha pp e n . L e t^{'} ss u pp ose t ha t$ \exists V \to W \in F $s u c h t ha t$ r $d oes n^{'} t s a t i s f y$ V \to W $; b yco n s t r u c t i o n w e ha v e t ha t$ $V \subseteq (X)_{F}^{+} \land \exists A \in W : A \in R ∖ (X)_{F}^{+} ⟹ A \in / (X)_{F}^{+}$ $W e ha v e t ha t -$ V \subseteq (X)^+_F \implies $b y t h e l e mma o f c l os u re$ X \to V \in F^A $-$ A \in W \implies $b y d eco m p os i t i o n$ V \to A \in F^A $B y t r an s i t i v i t y$ X \to A \in F^A \implies $b y t h e l e mma o f c l os u re$ A \in (X)^+_F which is a contraddiction

Legality

In the first 2 cases r $s a t i s f i es$ V \to W \in F $, c a se 3 c a n^{'} t ha pp e n$ \implies r $i s a l e g a l in s t an ceo f$ R

F^+ \subseteq F^A $L e t^{'} sco n s i d er$ X \to Y \in F^+ $B y d e f ini t i o n w e ha v e t ha t$ X \subseteq (X)^+_F \implies $b yco n s t r u c t i o n$ t_1[X] = t_2[X] \implies $b y h y p o t es i s an d g i v e n t ha t$ r $i s a l e g a l in s t an ce$ t_1[Y] = t_2[Y] \implies $b y t h e l e mma$ Y \subseteq (X)^+_F \implies X \to Y \in F^A

F^+ = F^A $G i v e n t ha t$ F_i \subseteq F^+ : \forall i \in \mathbb{N} $an d$ F^+ \subseteq F^A $w e ha v e t ha t$ F^+ = F^A

3NF

PDF 9 slide 14

Given a relation schema R $an d a se t o ff u n c t i o na l d e p e n d e n c i es$ F $o n$ R $.$ R $i s in * * 3 NF * * i f$ \forall X \to A \in F^+ : A \notin X $e i t h er -$ A $i s p r im e_{(} b e l o n g s t o ak ey)_{-}$ X is superkey

3NF pt.2

PDF 10 slide 4

Let R $b e a re l a t i o n sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $A na tt r ib u t e$ A \in R $* * p a r t ia ll y * * d e p e n d so nak ey$ K $i f <! - - TO D O c h ec ki f s u b se t e q ? NOT n ee d e d b ec a u se X i s a p ro p ers u b se t o f K - - > -$ \exists X \subset R : A \notin X \land X \to A \in F \land X \subset K $-$ A $i s n^{'} tp r im e A na tt r ib u t e$ A \in R $* * t r an s i t i v e l y * * d e p e n d so nak ey$ K $i f -$ \exists X \subset R : A \notin X \land X \to A \in F \land K \to X \in F $-$ X $i s n^{'} t ak ey -$ A $i s n^{'} tp r im e >$ X \subset R $m e an s t ha t$ X \neq R $, o t h er w i se Xw o u l d b e a s u p er k ey, a s$ R \to R \in F^A = F^+

3NF pt.3

PDF 10 slide 5

Given a schema R $an d a se t o ff u n c t i o na l d e p e n d e n c i es$ F $o n$ R $, * * TF A E * * -$ R $i s in 3 NF - t h ere a re * * n o a tt r ib u t es t ha tp a r t ia ll yor t r an s i t i v e l y d e p e n d o nak ey * * -$ \forall X \to A \in F^+ : A \notin X $e i t h er : -$ A $i s p r im e_{(} b e l o n g s t o ak ey)_{-}$ X is superkey

Proof

TODO: I have it, I just have to write it out in \LaTeX $- - - <! - - P D F 10 s l i d e 6 - - ><! - - L e t$ R $b e a re l a t i o n sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $. A sc h e ma$ R $i s in 3 NF i f an d o n l y i f n e i t h er p a r t ia l d e p e n d e n c i es n or t r an s i t i v e d e p e n d e n c i ese x i s t in$ F -->

BCNF (Boyce-Codd)

PDF 10 slide 20

A relation schema R $i s in * * B oyce - C o dd N or ma lF or m * * (BCNF) w h e n e v ery d e t er minan t in$ F is a superkey. A relation that respects Boyce-Codd Normal Form is also in 3NF, but the opposite is not true.

(X)^+_F

PDF 11 slie 5

def clousure(R, F, X):
	Z = X
	S = { A ∈ R | Y → V ∈ F ∧ Y ⊆ Z ∧ A ∈ V }

	if S ⊆ Z:
		return Z

	return closure(R, F, Z ∪ S)

(X)^+_F $[P D F 11 s l i d e 8] (11 T h e a l g or i t hm ‘ c l os u re () ‘ correc tl yco m p u t es t h ec l os u reo f a se t o f a tt r ib u t es$ X $res p ec t i v e l y t o a se t$ F $o ff u n c t i o na l d e p e n d e n c i eso n$ R

Proof

Let's consider Z_i, S_i $t h e v a l u eso f$ Z $an d$ S $a tt h e$ i $- t h c a ll o f t h e f u n c t i o nan d$ Z_f, S_f \mid S_f \subseteq Z_f $t h e v a l u eso f$ Z, S $a tt h e l a s t c a ll o f t h e f u n c t i o n . L e t^{'} s p ro v e b y in d u c t i o n t ha t$ Z_i \subseteq (X)^+_F

Z_i \subseteq (X)^+_F

Base case

Inductive step Z_i \subseteq (X)^+F \implies Z{i + 1} \subseteq (X)^+F $<! - - W e ha v e t o p ro v e t ha t$ Z_i \subseteq (X)^+F \implies Z{i + 1} \subseteq (X)^+F $- - ><! - - an d$ S_i = \set{A \in R \mid Y \to V \in F \land Y \subseteq Z_i \land A \in V} $- - > G i v e n t ha t$ Z{i + 1} = Z_i \cup S_i $t h e ni f$ A \in Z{i + 1} $e i t h er -$ A \in Z_i \implies $b yH P$ A \in (X)^+_F $-$ A \in S_i \implies $b yco n s t r u c t i o n$ \exists Y \to V \in F \mid Y \subseteq Z_i \land A \in V \implies $b yH P$ Y \subseteq (X)^+_F \implies $b y t h e l e mma o f c l os u re$ X \to Y \in F^A $an d b y d eco m p os i t i o n$ Y \to A \in F^A \implies $b y t r an s i t i v i t y$ X \to A \in F^A \implies $b y d e f ini t i o n$ A \in (X)^+_F

(X)^+_F \subseteq Z_f $_{(} l e g a l in s t an ce)_{G} i v e n$ Z_f $w ec anb u i l d anin s t an ce$ r = \set{t_1, t_2} $o n$ R $s u c h t ha t < t ab l e >< t h e a d >< t r >< t h >$ r $< / t h >< t h co l s p an = "5" >$ Z_f $< / t h >< t h co l s p an = "5" >$ R \setminus Z_f $< / t h >< / t r >< / t h e a d >< t b o d y >< t r >< t d >$ t_1 $< / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< / t r >< t r >< t d >$ t_2 $< / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > 1 < / t d >< t d > ... < / t d >< t d > 1 < / t d >< t d > 0 < / t d >< t d > 0 < / t d >< t d > 0 < / t d >< t d > ... < / t d >< t d > 0 < / t d >< / t r >< / t b o d y >< / t ab l e > L e t^{'} s v er i f y t ha t$ r $i s a l e g a l in s t an ce . G i v e n$ V \to W \in F $a s$ V, W \neq \varnothing $w eco u l d ha v ee i t h er -$ V \nsubseteq Z_f \implies \exists A \in V : A \in R \setminus Z_f \implies t_1[V] \neq t_2[V] \implies r $s a t i s f i es$ V \to W $-$ V \subseteq Z_f $-$ W \subseteq Z_f \implies $b yco n s t r u c t i o n$ t_1[V] = t_2[V] \land t_1[W] = t_1[W] \impliesr $s a t i s f i es$ V \to W $-$ W \nsubseteq Z_f \implies $b yco n s t r u c t i o n$ t_1[V] = t_2[V] \land t_1[W] \neq t_2[W]

(X)^+_F \subseteq Z_f $_{(} l e g a l in s t an ce)_{L} e t^{'} ss u pp ose t ha t$ \exists V \to W \in F : r $d oes n^{'} t s a t i s f y$ V \to W \implies $b yco n s t r u c t i o n$ $V \subseteq Z_{f} \land \exists A \in W : A \in R ∖ Z_{f} ⟹ A \in / Z_{f}$ $G i v e n t ha t$ V \subseteq Z_f \land V \to W \in F \land A \in W \implies $b yco n s t r u c t i o n o f$ S_f, : A \in Z_f which is a contraddiction

Legality

In the first 2 cases r $s a t i s f i es$ V \to W \in F $c a se 3 c a n^{'} t ha pp e n$ \implies r $i s a l e g a l in s t an ceo f$ R

(X)^+_F \subseteq Z_f $L e t^{'} sco n s i d er$ A \in (X)^+_F $G i v e n t ha t$ X \to A \in F^A = F^+ $an d$ r $i s a l e g a l in s t an ce$ \implies r $s a t i s f i es$ X \to Y $, an d g i v e n t ha t b yco n s t r u c t i o n$ X \subseteq Z_f \implies t_1[X] = t_2[X] \implies $b y d e f ini t i o n$ t_1[A] = t_2[A] \implies $b yco n s t r u c t i o n$ A \in Z_f

Z_f = (X)^+_F $G i v e n t ha t$ Z_i \subseteq (X)^+_F ; \forall i \in \mathbb{N} $an d$ (X)^+_F \subseteq Z_f $, w e ha v e t ha t$ Z_f = (X)^+_F

Intersection Rule

PDF 12 slide 19

Given a relation scheme R $an d a se t o ff u n c t i o na l d e p e n d e n c i es$ F $o n$ R $X := V \to W \in F ⋂ R - (W - V)$ $<! - - I f t h e in t ersec t i o n o f t h esese t s d e t er min es$ R $, t h e n t h e in t ersec t i o ni s t h eo n l y k ey t o$ R $e l se t h ere a re m u lt i pl e k eys, an d A LL o f t h e mm u s t b e i d e n t i f i e df orc h ec kin g 3 NF - - > I f$ X \to R \in F^+ $t h e n t h e in t ersec t i o ni s t h eo n l y k ey t o$ R $o t h er w i se t h ere a re m u lt i pl e k eys, an d * * A LL * * o f t h e mm u s t b e i d e n t i f i e d t oc h ec ki f$ R is in 3NF

Decomposition

PDF 13 slide 8

Let R $b e a re l a t i o n sc h e m e, a d eco m p os i t i o n$ \rho $o f$ R $i ss u c h t ha t$ $ρ = {R_{1}, R_{2}, ..., R_{k}} \subseteq P (R) : i = 1 ⋃ k R_{i} = R$

Equivalence

PDF 13 slide 12

Let F $an d$ G $b e tw ose t so ff u n c t i o na l d e p e n d e n c i es, w ec an d e f in e an e q u i v a l e n cere l a t i o n$ $F \equiv G ⟺ F^{+} = G^{+}$ $- re f l e x i v i t y$ F \implies F^+ = F^+ \implies F \equiv F $- s imm e t ry$ F \equiv G \implies F^+ = G^+ \implies G^+ = F^+ \implies G \equiv F $- t r an s i t i v i t y$ F \equiv G \land G \equiv H \implies F^+ = G^+ \land G^+ = H^+ \implies F^+ = H^+ \implies F \equiv H $[P D F 13 s l i d e 14] (13 L e t$ F $an d$ G $b e tw ose t so ff u n c t i o na l d e p e n d e n c i es$ $F \subseteq G^{+} ⟹ F^{+} \subseteq G^{+}$

F \subseteq G^+ \implies F^+ \subseteq G^+

Base case

F_0 = F \subseteq G^+ \implies F_0 \subseteq G^+

Inductive Step

F_i \subseteq G^+ \implies F_{i + 1} \subseteq G^+X \to Y \in F_{i + 1} \implies X \to Y $ha s b ee n o pt ain e d t h ro ug h - re f l e x i v i t y$ Y \subseteq X \implies $g i v e n t ha t$ X \to Y $i ss a t i s f i e d b ye v ery in s t an ce$ X \to Y \in G^+ $- a ug m e n t a t i o n$ \exists Z \subseteq R, V \to W \in F_i \mid X = ZV, Y = ZW

transitivity

TODO

Preserving F

PDF 13 slide 15

Let R $b e a re l a t i o n sc h e m e,$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $an d$ \rho = \set{R_1, R_2, ..., R_k} $a d eco m p os i t i o n o f$ R $, w es a y t ha t$ \rho $p rese v es$ F $i f$ $F \equiv G = i = 1 ⋃ k π_{R_{i}} (F)$ $Wh ere$ $π_{R_{i}} (F) = {X \to Y \in F^{+} ∣ X Y \subseteq R_{i}}$ $[P D F 13 s l i d e 16] (13 G i v e n t h e d e f ini t i o n o f$ G $, i tw i ll a lw a ys b e t ha t$ G \subseteq F^+ \implies G^+ \subseteq F^+ $so i t i se n o ug h t o v er i f y t ha t$ F \subseteq G^+

Dependency preservation

PDF 13 slide 17

def preserves_dependencies(R, F, ρ):
	for X → Y ∈ F:
		if Y ⊈ closure_G(R, F, ρ, X):
			return false

	return true

This algorithm is enough as we just have to check wether F \subseteq G^+ $G i v e n$ X \to Y \in F $w e ha v e t ha t$ X \to Y \in G^+ = G^A \iff $b y t h e l e mma o f c l os u re$ Y \subseteq (X)^+_G

(X)^+_G $‘‘‘ p y t h o n d e f c l o u s u r e_{G} (R, F, X, ρ) : Z = XS = \emptyset f or P \in ρ : S = S \cup (c l os u re (R, F, Z \cap P) \cap P) i f S \subseteq Z re t u r n Z re t u r n c l os u r e_{G} (R, F, Z \cup S) ‘‘‘ <! - - re t u r n X i f S \subseteq X e l sec l os u r e_{G} (R, F, X \cup S) - - > [P D F 13 s l i d e 23] (13$ R $b e a re l a t i o n sc h e ma,$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $an d$ \rho = \set{R_1, R_2, ..., R_k} $a d eco m p os i t i o n o f$ R $an d$ X \subseteq R $t h e a l g or i t hm ‘ c l os u r e_{G} () ‘ correc tl yco m p u t es$ (X)^+_G

Z_f \subseteq (X)^+_G $L e t$ Z_i, S_i $t h e v a l u eso f$ Z $an d$ S $a tt h e$ i $- t h c a ll o f t h e f u n c t i o n, w i t h$ Z_0 = X $, an d$ S_f \subseteq Z_f $<! - - t h e f ina l v a l u eso f$ Z $an d$ S -->

Base case

Z_0 = X \subseteq (X)^+_G \implies Z_0 \subseteq (X)^+_G by HP

Inductive step

Z_i \subseteq (X)^+G \implies Z{i+1} \subseteq (X)^+G $, g i v e n t ha t$ S_i = \bigcup\limits{j = 1}^k (Z_i \cap R_j)^+F \cap R_j $L e t$ A \in Z{i + 1} = Z_i \cup S_i \implies \exists j : A \in (Z_i \cap R_j) \cap R_j \implies Z_i \cap R_j \to A \in G^A $B yH Pw e ha v e t ha t$ Z_i \subseteq (X)^+_G \implies X \to Z_i \in G^A $, l e t$ Z_i = (Z_i \cap R_j) \cup V $b y d eco m p os i t i o n w e ha v e t ha t$ X \to Z_i \cap R_j \in G^A \implies $b y t r an s i t i v i t y$ X \to A \in G^A

X \subseteq Y \implies (X)^+_F \subseteq (Y)^+_FX \subseteq Y \implies Y \to X \in F^A $b yre f l e x i v i t y G i v e n$ A \in (X)^+_F \implies $b y t h e l e mma o f c l os u re$ X \to A \in F^A \implies $b y t r an s i t i v i t y$ Y \to A \in F^A \ \implies $b y t h e l e mma o f c l os u re$ A \in (Y)^+_F

(X)^+_G \subseteq Z_fX \subseteq Z_f \implies (X)^+_G \subseteq (Z_f)^+_G $, w e ha v e t o p ro v e t ha t$ Z_f = (Z_f)^+_G $L e t^{'} sco n s i d er$ A \in S' = \set{A \in R \mid V \to W \in G \land V \subseteq Z_f \land A \in W} \implies \exists V \to W \in G : V \subseteq Z_f \land A \in W \implies \exists R_j \in \rho : VW \subseteq R_j \implies V \subseteq Z_f \cap R_j \land A \in R_j \implies A \in (Z_f \cap R_j)^+_F \cap R_j \implies A \in S_f \implies A \in Z_f

Loseless join

r \subseteq m_{\rho}(r)t \in r \implies t[R_i] \in \pi_{R_i}(r) ; \forall R_i \in \rho $b y d e f ini t i o n$ \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) = \set {\bigcup\limits_{i = 1}^k p_i[R_i] \mid p_i[R_i] \in \pi_{R_i}(r) \land \bigcup\limits_{i = 1}^k p_i[R_i] \text{ is a function}}\forall t \in r, ; t = \bigcup\limits_{i = 1}^k t[R_i] $a s b y d e f ini t i o n o f$ \rho $w e ha v e t ha t$ R = \bigcup\limits_{i = 1}^k R_it \in r \implies t $i s a f u n c t i o nb y d e f ini t i o n$ t = \bigcup\limits_{i = 1}^k t[R_i] \in \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) = m_{\rho}(r) \implies t \in m_{\rho}(r) $<! - -$ \forall t \in r, ; t = \bigcup\limits_{i = 1}^k t[R_i] = \mathop{\bowtie}\limits_{i = 1}^k \set{ t[R_i] } $- - ><! - -$ t \in r \implies $b y d e f ini t i o n$ t[R_i] \in \pi_{R_i}(r) ; \forall R_i \in \rho \implies \set{ t[R_i] } \subseteq \pi_{R_i}(r) ; \forall R_i \in \rho $- - ><! - -$ t = \mathop{\bowtie}\limits_{i = 1}^k \set{ t[R_i] } \subseteq \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) = m_{\rho}(r) \implies t \in m_{\rho}(r) $- - ><! - -$ \forall t \in r $w eco n s i d er$ t[R_i], : R_i \in \rho $, w e ha v e t ha t$ t \in \set{ t[R_1] } \bowtie ... \bowtie \set{ t[R_k] } \subseteq \pi_{R_1}(r) \bowtie ... \bowtie \pi_{R_k}(r) = m_{\rho}(r) -->

\pi_{R_i}(m_{\rho}(r)) = \pi_{R_i}(r) $<! - - L e t^{'} sco n s i d er$ t_{R_i} \in \pi_{R_i}(m_{\rho}(r)) $, w e ha v e t o p ro v e t ha t$ t_{R_i} \in \pi_{R_i}(r) $- - >$ t \in r \implies $b y d e f ini t i o n$ t[R_i] \in \pi_{R_i}(r) ; \forall R_i \in \rho\pi_{R_i}(m_{\rho}(r)) = \set{q[R_i] \mid q \in \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r)}

\pi_{R_i}(r) \subseteq \pi_{R_i}(m_{\rho}(r))t \in r \implies t \in m_{\rho}(r) \implies t[R_i] \in \pi_{R_i}(m_{\rho}(r))

\pi_{R_i}(m_{\rho}(r)) \subseteq \pi_{R_i}(r)q \in \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) \implies $b y d e f ini t i o n o f j o in$ q = \mathop{\bowtie}\limits_{i = 1}^k \set{ p_i[R_i] } \mid p_i \in r \implies $g i v e n t ha t$ q $i s a f u n c t i o n$ q[R_i] = p_i[R_i] $an d$ p_i \in r \implies p_i[R_i] \in \pi_{R_i}(r) $w e ha v e t ha t$ q[R_i] \in \pi_{R_i}(r) $<! - -$ \exists p_1, p_2, ..., p_k \in r \mid p_i[R_i] \in \pi_{R_i}(r) ; \forall i = 1,..., k $- - ><! - -$ t_{R_i} \in \pi_{R_i}(m_{\rho}(r)) \iff \exists t' \in m_{\rho}(r) : t'[R_i] = t_{R_i} \iff\iff \exists t_1, ..., t_k \in r : t'[R_j] = t_j[R_j] \quad \forall R_j \in \rho $b u t$ t_{R_i} = t[R_i] \in \pi_{R_i}(r) -->

m_{\rho}(m_{\rho}(r)) = m_{\rho}(r)m_{\rho}(m_{\rho}(r)) = \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(m_{\rho}(r)) = \mathop{\bowtie}\limits_{i = 1}^k \pi_{R_i}(r) = m_{\rho}(r) $<! - -$ m_{\rho}(m_{\rho}(r)) = \pi_{R_1}(m_{\rho}(r)) \bowtie ... \bowtie \pi_{R_k}(m_{\rho}(r)) = \pi_{R_1}(r) \bowtie ... \bowtie \pi_{R_k}(r) = m_{\rho}(r) $- - ><! - -$ m_{\rho}(m_{\rho}(r)) = \pi_{R_1}(m_{\rho}(r)) \bowtie ... \bowtie \pi_{R_k}(m_{\rho}(r)) = \pi_{R_1}(r) \bowtie ... \bowtie \pi_{R_k}(r) = m_{\rho}(r) $- - ><! - - L e t^{'} sco n s i d er$ t_{R_i} \in \pi_{R_i}(r) $, w e ha v e t o p ro v e t ha t$ t_{R_i} \in \pi_{R_i}(m_{\rho}(r)) $- - ><! - - L e t^{'} sco n s i d er$ t_{R_i} \in \pi_{R_i}(r) \land t' \in r $w i t h$ t'[R_i] $, w e ha v e t ha t$ t'[R_i] \in t_{R_i} -->

Loseless join pt.2

PDF 15 slide 15 Given \rho = \set{R_1, R_2, ..., R_k} $, b u i l d a t ab l e$ r $w i t h$ |R| $co l u mn s an d$ k $ro w s . A tt h e$ i $- t h ro w an d$ j $- t h co l u mn p u t$ a_j $i f$ A \in R_{i} $e l se$ b_{ij}

def has_looseless_join(R, F, ρ):
	while !(∃ t ∈ r | ∀ A ∈ R, t[A] = a) and r changed:
		for X → Y ∈ F:
			for t1 ∈ r:
				for t2 ∈ r:
					if t1[X] = t2[X] and t1[Y] != t2[Y]:
						for A ∈ Y:
							if t1[A] = a:
								t2[A] = t1[A]
							else:
								t1[A] = t2[A]

	return ∃ t ∈ r | ∀ A ∈ R, t[A] = a

Correctness

PDF 15 slide 19

Let R $b e a re l a t i o n sc h e m e,$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $an d l e t$ \rho = \set{R_1, R_2, ..., R_k} $b e a d eco m p os i t i o n o f$ R $; t h e a l g or i t hm correc tl y d ec i d es w h e t h er$ \rho $ha s a l oss l ess j o in$ r = m_{\rho}(r) \iff r $ha s a t u pl e w i t ha ll$ a $w h e n t h e a l g or i t hm t er min t es > TO D O : I c an p ro v e$ r = m_{\rho}(r) \implies r $ha s a t u pl e w i t ha ll$ a $w h e n t h e a l g or i t hm t er mina t es, I j u s t ha v e t o w r i t e i t in$ \LaTeX

Minimal cover

PDF 17 slide 7

Let R $b e a sc h e maan d$ F $b e a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $. A * * minima l co v er * * o f$ F $i s a se t o ff u n c t i o na l d e p e n d e n c i es$ G \equiv F $s u c h t ha t : -$ \forall X \to Y \in G, |Y| = 1 $-$ \forall X \to A \in G, \nexists X' \subset X \mid G \equiv (G - \set{X \to A}) \cup \set{X' \to A} $-$ \nexists X \to A \in G \mid G \equiv G - \set{X \to A}

Minimal cover (step 1)

F_1 = \set{X \to A \mid X \to Y \in F \land A \in Y}F \overset{A}{\to} F_1 $b y d eco m p os i t i o n$ F_1 \overset{A}{\to} F_1^A \implies F \subseteq F_1^AF_1 \overset{A}{\to} F $b y u ni o n$ F \overset{A}{\to} F^A \implies F_1 \subseteq F^AF \equiv F_1

Minimal cover (step 2)

Given X \to A \in F_1, X' \subset X \land X' \to A \in F_1^+ \implies F_2 = (F_1 \setminus \set{X \to A}) \cup \set{X' \to A}X' \subseteq X \implies X \to X' \in F_1^+ \land X \to X' \in F_2^+ $b yre f l e x i v i t y$ X \to A \in F_1 $-$ X \to A \in F_2 \implies X \to A \in F_2^+ $-$ X \to A \notin F_2 \implies X \to X' \in F_2^+ \land X' \to A \in F_2^+ \implies X \to A \in F_2^+ $b y t r an s i t i v i t y$ X \to A \in F_2 $-$ X \to A \in F_1 \implies X \to A \in F_1^+ $-$ X \to A \notin F_1 \implies X \to A \in F_1^+ $b yH P$ F_2 \equiv F_1 \implies F \equiv F_2 $b y t r an s i t i v i t yo f t h e$ \equiv relationship

Minimal cover (step 3)

X \to A \in F_2, ; A \in (X)^+{F_2 \setminus \set{X \to A}} \implies F_3 = F_2 \setminus \set{X \to A}X \to A \in F_2 $-$ X \to A \in F_3 \implies X \to A \in F_3^+ $-$ X \to A \notin F_3 \implies X \to A \in F_3^+ $b yH P a s$ A \in (X)^+{F_3}X \to A \in F_3 $-$ X \to A \in F_2 \implies X \to A \in F_2^+ $-$ X \to A \notin F_2 $i s a co n t r a dd i c t i o na s$ F_3 = F_2 \setminus \set{X \to A} $b y d e f ini t i o n$ F_2 \equiv F_3 \implies F \equiv F_3

Decomposition

def decomposition(R, F: minimal cover):
	S = ∅
	ρ = ∅

	for A ∈ R | ∄ X → Y ∈ F : A ∈ XY:
		S = S ∪ {A}

	if S != ∅:
		R = R - S
		ρ = ρ ∪ {S}

	if ∃ X → Y ∈ F | XY = R:
		ρ = ρ ∪ {R}
	else:
		for X → A ∈ F:
			ρ = ρ ∪ {XA}

Decomposition pt.2

PDF 19 slide 5

Let R $b e a re l a t i o na l sc h e maan d$ F $a se t o ff u n c t i o na l d e p e n d e n c i eso n$ R $, w hi c hi s aminima l co v er; t h e a l g or i t hm ‘ d eco m p os i t i o n () ‘ co m p u t e s_{(} in p o l y n o mia lt im e)_{a} d eco m p os i t i o n$ \rho $o f$ R $s u c h t ha t : - e a c h re l a t i o na l sc h e main$ \rho $i s in 3 NF -$ \rho $p reser v es$ F

Computer Science @ Sapienza