diff --git a/cards/communication/prep/index.html b/cards/communication/prep/index.html index ff12f25db..fc79e8526 100644 --- a/cards/communication/prep/index.html +++ b/cards/communication/prep/index.html @@ -25,8 +25,9 @@ #Soft Skills

PREP

PREP is a framework for making your point.

PREP: Point + Reason + Example + Point
 
  1. Point: Make a point; PREP is a good method.
  2. Reason: Give the reason; Because it has a clear logic.
  3. Example: Show examples; The famous XYZ did ABC then everyone was convinced.
  4. Point: State the point for a conclusion.

Planted: -by ;

Dynamic Backlinks:
Links to:

SCQ-A

SCQ-A: Situation + Conflict + Question + Answer
 

SCA-A is a framework for problem-solving.

  1. Situation: background knowledge, set the stage
  2. Complications: what is happening
  3. Question: propose your hypothesis
  4. Answer: accept or reject the hypothesis

Planted: -by ;

Dynamic Backlinks:
Links to:
PREP +by ;

L Ma (2021). 'SCQ-A', Datumorphism, 01 April. Available at: https://datumorphism.leima.is/cards/communication/scq-a/.

WWH

WWH: What (happened) + Why (this happened) + How (to improve)
 

Planted: -by ;

Links to:

Planted: by ;

References:
Similar Articles:
Dynamic Backlinks to cards/forecasting/gnn-graph-structure-learning:

L Ma (2022). 'Graph Structure Learning in GNN', Datumorphism, 11 April. Available at: https://datumorphism.leima.is/cards/forecasting/gnn-graph-structure-learning/.

Planted: by ;

References:
  1. Wu2020 -Wu Z, Pan S, Long G, Jiang J, Chang X, Zhang C. Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks. arXiv [cs.LG]. 2020. Available: http://arxiv.org/abs/2005.11650
Similar Articles:
Dynamic Backlinks to cards/forecasting/gnn-mix-hop-propagation:

L Ma (2022). 'Mix-hop Propagation in GNN', Datumorphism, 11 April. Available at: https://datumorphism.leima.is/cards/forecasting/gnn-mix-hop-propagation/.

Planted: by ;

References:
  1. Gneiting2014 -Gneiting T, Katzfuss M. Probabilistic Forecasting. Annu Rev Stat Appl. 2014;1: 125–151. doi:10.1146/annurev-statistics-062713-085831
Similar Articles:

L Ma (2022). 'Prediction Space in Forecasting', Datumorphism, 04 April. Available at: https://datumorphism.leima.is/cards/forecasting/prediction-space/.

References:
  1. Wu2020 Wu Z, Pan S, Long G, Jiang J, Chang X, Zhang C. Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks. arXiv [cs.LG]. 2020. Available: http://arxiv.org/abs/2005.11650
  2. Lässig2021 -Lässig F. Temporal Convolutional Networks and Forecasting. In: Unit8 [Internet]. 6 Jul 2021 [cited 28 Nov 2022]. Available: https://unit8.com/resources/temporal-convolutional-networks-and-forecasting/
Similar Articles:
Additional Double Backet Links:
Additional Double Backet Links:
Links to:

Planted: by ;

Links to:

L Ma (2021). 'Betweenness Centrality of a Graph', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/graph/graph-betweenness-centrality/.

Planted: -by ;

Links to:
What is Graph +by ;

Links to:

Planted: by ;

Links to:

L Ma (2021). 'Eigenvector Centrality of a Graph', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/graph/graph-eigenvector-centrality/.

Links to:

Links to:
Dynamic Backlinks to cards/graph/graph-global-overlap-leicht-holme-newman-similarity:

L Ma diff --git a/cards/graph/graph-global-overlap-random-walk-similarity/index.html b/cards/graph/graph-global-overlap-random-walk-similarity/index.html index 20e8cf2e5..cb791bffb 100644 --- a/cards/graph/graph-global-overlap-random-walk-similarity/index.html +++ b/cards/graph/graph-global-overlap-random-walk-similarity/index.html @@ -56,8 +56,8 @@  ↩︎ ↩︎

Planted: by ;

Links to:

L Ma diff --git a/cards/graph/graph-laplacians/index.html b/cards/graph/graph-laplacians/index.html index e0a6ab806..1901fc87a 100644 --- a/cards/graph/graph-laplacians/index.html +++ b/cards/graph/graph-laplacians/index.html @@ -55,9 +55,11 @@ Planted: by ;

Links to:
Links to:
Graph Convolution Operator diff --git a/cards/graph/graph-local-clustering-coefficient/index.html b/cards/graph/graph-local-clustering-coefficient/index.html index e6da855de..cc57ba7cc 100644 --- a/cards/graph/graph-local-clustering-coefficient/index.html +++ b/cards/graph/graph-local-clustering-coefficient/index.html @@ -35,8 +35,8 @@ $$

where $\color{red}{d_n \choose 2}$ means all the possible combinations of neighbor nodes, and $\mathcal N(u)$ is the set of nodes that are neighbor to $u$.

Closed Triangles

Ego Graph

Counting the closed triangles of the ego graph of a node and normalize it by the total possible number of triangles is also a measure of clustering coefficients.

  • If the ego graph of $u$ is fully connected, we have $c_u=1$;
  • If the ego graph of $u$ is a star, we have $c_u=0$.

This concept can be generalized from triangles to any type of motifs.

Planted: by ;

Links to:

L Ma (2021). 'Graph Clustering Coefficient', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/graph/graph-local-clustering-coefficient/.

L Ma (2021). 'Graph Local Overlap Measure: Adamic Adar Index', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/graph/graph-local-overlap-adamic-adar-index/.

Planted: by ;

L Ma (2021). 'Graph Local Overlap Measure: Resource Allocation Index', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/graph/graph-local-overlap-resource-allocation-index/.

Planted: by ;

L Ma (2021). 'Graph Local Overlap Measure: Salton Index', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/graph/graph-local-overlap-salton-index/.

Planted: by ;

L Ma (2021). 'Graph Local Overlap Measure: Sorensen Index', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/graph/graph-local-overlap-sorensen-index/.

Dynamic Backlinks:
Additional Double Backet Links:
Multiset, mset or bag

Heterophily is the tendency to differ from others. Heterophily on a graph is the tendency to connect to nodes that are different from itself, e.g., nodes with different attributes have higher probability of edge.

Planted: by ;

Planted: by ;

Planted: by ;

Links to:

L Ma (2021). 'Node Degree', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/graph/node-degree/.

Structural Equivalence means that nodes with similar neighborhood structures will share similar attributes.

Planted: by ;

L Ma (2021). 'Structural Equivalence on Graph', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/graph/structural-equivalence/.

Planted: -by ;

Links to:

Dynamic Backlinks:

Lei Ma (2021). 'f-Divergence', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/information/f-divergence/.

Planted: -by ;

Links to:
Fraser Information +by ;

Links to:
Links to:
Fisher Information +by ;

Lei Ma (2021). 'Fraser Information', Datumorphism, 05 April. Available at: https://datumorphism.leima.is/cards/information/fraser-information/.

Dynamic Backlinks:

Lei Ma (2021). 'Shannon Entropy', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/information/shannon-entropy/.

Data storage is diverse. For data on smaller scales, we are mostly dealing with some data files.

work_with_data_files

Efficiencies and Compressions

Parquet

Parquet is fast. But

  1. Don’t use json or list of json as columns. Convert them to strings or binary objects if it is really needed.

Planted: -by ;

Links to:
Data Types +by ;

Links to:

LM (2021). 'Data File Formats', Datumorphism, 02 April. Available at: https://datumorphism.leima.is/cards/machine-learning/datatypes/data-file-formats/.

Planted: -by ;

Dynamic Backlinks:

L Ma +by ;

L Ma (2020). 'Data Types', Datumorphism, 01 April. Available at: https://datumorphism.leima.is/cards/machine-learning/datatypes/data-types/.

Here we encode all words presented in the corpus to demostrate the idea of CBOW. In the real world, we might want to remove some certain words such as the.

We use the following quote by Ford in Westworld as an example.

I read a theory once that the human intellect is like peacock feathers. Just an extravagant display intended to attract a mate, just an elaborate mating ritual. But, of course, the peacock can barely fly. It lives in the dirt, pecking insects out of the muck, consoling itself with its great beauty.

The word intended is surrunded by extravagant display in the front and to attract after it. The task is to predict the middle word intended using the ‘history words’ extravagant display and to attract.

  • Input: extravagant, display, to, attract
  • Output: intended

In the bag-of-words model, the order of the words extravagant, display, to, attract doesn’t matter hence bag-of-words. [mikolov2013]

This makes it easier to represent the dataset:

Input (Context)Output (Center Word)
extravagantintended
displayintended
tointended
attractintended

To create a real dataset, we “slide” over all the words.

Input (Context)Output (Center Word)
readI
aI
Iread
aread
theoryread
Ia
reada
theorya
oncea
readtheory
atheory
oncetheory

It is not required to choose two words as history words and two words as future words. The number of words to choose is the window size.

Planted: by ;

L Ma (2020). 'CBOW: Continuous Bag of Words', Datumorphism, 01 April. Available at: https://datumorphism.leima.is/cards/machine-learning/embedding/continuous-bag-of-words/.

We use the following quote by Ford in Westworld as an example.

I read a theory once that the human intellect is like peacock feathers. Just an extravagant display intended to attract a mate, just an elaborate mating ritual. But, of course, the peacock can barely fly. It lives in the dirt, pecking insects out of the muck, consoling itself with its great beauty.

The word intended is surrunded by extravagant display in the front and to attract after it. The task is to predict the probability of words around the middle word intended, which are the ‘history words’ extravagant, display and ‘future words’ to, attract in our case. [mikolov2013]

For this center word intended, we generate the following data.

Input (Center Word)Output (Context)
intendedextravagant
intendeddisplay
intendedto
intendedattract

We will build the following dataset using the sentence.

Input (Center Word)Output (Context)
Iread
Ia
readI
reada
readtheory
aI
aread
atheory
aonce
theoryread
theorya
theoryonce

It is not required to use two in the front and two words after the middle word. The number of words to choose is a hyperparameter to be decided.

Planted: by ;

L Ma (2020). 'skipgram: Continuous skip-gram', Datumorphism, 01 April. Available at: https://datumorphism.leima.is/cards/machine-learning/embedding/continuous-skip-gram/.

Links to:
Dynamic Backlinks to cards/machine-learning/embedding/negative-sampling:

L Ma (2020). 'Negative Sampling', Datumorphism, 01 April. Available at: https://datumorphism.leima.is/cards/machine-learning/embedding/negative-sampling/.

Planted: by ;

Links to:

L Ma (2021). 'Cross Validation', Datumorphism, 05 April. Available at: https://datumorphism.leima.is/cards/machine-learning/learning-theories/cross-validation/.

Planted: -by ;

Links to:
The Learning Problem +by ;

L Ma diff --git a/cards/machine-learning/learning-theories/induction-deduction-transduction/index.html b/cards/machine-learning/learning-theories/induction-deduction-transduction/index.html index ee5bd9fbc..baeb0df97 100644 --- a/cards/machine-learning/learning-theories/induction-deduction-transduction/index.html +++ b/cards/machine-learning/learning-theories/induction-deduction-transduction/index.html @@ -26,13 +26,8 @@ #Deduction #Transduction

Planted: -by ;

L Ma +by ;

Dynamic Backlinks to cards/machine-learning/learning-theories/induction-deduction-transduction:

L Ma (2022). 'Induction, Deduction, and Transduction', Datumorphism, 04 April. Available at: https://datumorphism.leima.is/cards/machine-learning/learning-theories/induction-deduction-transduction/.

Planted: by ;

Links to:
Links to:
Dynamic Backlinks:

L Ma (2021). 'Noise Contrastive Estimation: NCE', Datumorphism, 08 April. Available at: https://datumorphism.leima.is/cards/machine-learning/learning-theories/noise-contrastive-estimation/.

Planted: by ;

Links to:

L Ma (2021). 'Shatter', Datumorphism, 10 April. Available at: https://datumorphism.leima.is/cards/machine-learning/learning-theories/set-shatter/.

Planted: -by ;

Links to:
The Learning Problem +by ;

L Ma diff --git a/cards/machine-learning/measurement/empirical-loss/index.html b/cards/machine-learning/measurement/empirical-loss/index.html index d160ba12d..ff93de1cf 100644 --- a/cards/machine-learning/measurement/empirical-loss/index.html +++ b/cards/machine-learning/measurement/empirical-loss/index.html @@ -28,8 +28,10 @@ \end{align} $$

where $d(y_i, f(x_i))$ is the distance defined between $y_i$ and $f(x_i)$.

Planted: -by ;

Links to:
Measures of Generalizability +by ;

L Ma diff --git a/cards/machine-learning/measurement/gini-impurity/index.html b/cards/machine-learning/measurement/gini-impurity/index.html index 2fa6d37c5..b2b173688 100644 --- a/cards/machine-learning/measurement/gini-impurity/index.html +++ b/cards/machine-learning/measurement/gini-impurity/index.html @@ -34,8 +34,11 @@ G = p(0)(1-p(0)) + p(1)(1-p(1)) = 0.5 * 0.5+ 0.5*0.5 = 0.5. $$

For data with two possible values $\{0,1\}$, the maximum Gini impurity is 0.25. The following chart shows all the possible values of the Gini impurity for two-value dataset.

For data with three possible values, the Gini impurity is also visualized using the same chart given the condition that $p_3 = 1 - p_1 - p_2$.

Planted: -by ;

Supplementary:
Links to:
Decision Tree +by ;

Supplementary:

L Ma diff --git a/cards/machine-learning/measurement/hilbert-schmidt-independence-criterion/index.html b/cards/machine-learning/measurement/hilbert-schmidt-independence-criterion/index.html index 31a865c1e..11747aa4b 100644 --- a/cards/machine-learning/measurement/hilbert-schmidt-independence-criterion/index.html +++ b/cards/machine-learning/measurement/hilbert-schmidt-independence-criterion/index.html @@ -39,9 +39,8 @@ Useful when centering a vector around its mean .

We can choose different kernel functions $k$ and $l$. For example, if $k$ and $l$ are linear kernels, we have $k(x, y) = l(x, y) = x \cdot y$. In this linear case, HSIC is simply $\parallel\operatorname{cov}(x^T,y^T) \parallel^2_{\text{Frobenius}}$.


  1. Gretton A, Bousquet O, Smola A, Schölkopf B. Measuring Statistical Dependence with Hilbert-Schmidt Norms. Algorithmic Learning Theory. Springer Berlin Heidelberg; 2005. pp. 63–77. doi:10.1007/11564089_7 ↩︎

  2. Kornblith S, Norouzi M, Lee H, Hinton G. Similarity of Neural Network Representations Revisited. arXiv [cs.LG]. 2019. Available: http://arxiv.org/abs/1905.00414 ↩︎

Planted: -by ;

Links to:

Planted: -by ;

Supplementary:
Links to:
Decision Tree +by ;

Supplementary:
Links to:
Coding Theory Concepts diff --git a/cards/machine-learning/measurement/population-loss/index.html b/cards/machine-learning/measurement/population-loss/index.html index c6d59ce32..9769fa3b6 100644 --- a/cards/machine-learning/measurement/population-loss/index.html +++ b/cards/machine-learning/measurement/population-loss/index.html @@ -28,8 +28,10 @@ \end{align} $$

where $d(y, f(x))$ is the distance defined between $y$ and $f(x)$.

Planted: -by ;

Links to:
Measures of Generalizability +by ;

L Ma diff --git a/cards/machine-learning/neural-networks/activation-bi-polar-sigmoid/index.html b/cards/machine-learning/neural-networks/activation-bi-polar-sigmoid/index.html index ec3ace8df..d2c7743a4 100644 --- a/cards/machine-learning/neural-networks/activation-bi-polar-sigmoid/index.html +++ b/cards/machine-learning/neural-networks/activation-bi-polar-sigmoid/index.html @@ -28,11 +28,8 @@ \sigma(x) = \frac{1-e^{-x}}{1+e^{-x}}. $$

Visualization

Bipolar Sigmoid

Planted: -by ;

Links to:

Planted: by ;

Links to:

Planted: -by ;

Links to:

Planted: -by ;

Links to:

Planted: -by ;

Links to:
Two unnormalized Gaussian radial basis functions in one input dimension. The basis function centers are located at x1=0.75 and x2=3.25. Source Unnormalized Radial Basis Functions
Hyperbolic tangent

Two unnormalized Gaussian radial basis functions in one input dimension. The basis function centers are located at x1=0.75 and x2=3.25. Source Unnormalized Radial Basis Functions

Planted: -by ;

Links to:

Planted: -by ;

Links to:

Planted: -by ;

Links to:

Planted: -by ;

Links to:

Planted: -by ;

Links to:
Artificial Neural Networks +by ;

Dynamic Backlinks to cards/machine-learning/neural-networks/mcculloch-pitts-model:

LM (2021). 'McCulloch-Pitts Model', Datumorphism, 02 April. Available at: https://datumorphism.leima.is/cards/machine-learning/neural-networks/mcculloch-pitts-model/.

Links to:

L Ma (2021). 'Initialize Artificial Neural Networks', Datumorphism, 09 April. Available at: https://datumorphism.leima.is/cards/machine-learning/neural-networks/neural-networks-initialization/.

L Ma (2022). 'CUDA Memory', Datumorphism, 10 April. Available at: https://datumorphism.leima.is/cards/machine-learning/practice/cuda-memory/.

Dynamic Backlinks to cards/machine-learning/practice/cuda-memory:

L Ma (2022). 'CUDA Memory', Datumorphism, 10 April. Available at: https://datumorphism.leima.is/cards/machine-learning/practice/cuda-memory/.

Links to:

I find this slide from Christoph Freudenthaler very useful.

Canonical decomposition visualized by Christoph Freudenthaler

Canonical decomposition visualized by Christoph Freudenthaler

Planted: -by ;

L Ma +by ;

Dynamic Backlinks to cards/math/canonical-decomposition:

L Ma (2019). 'Canonical Decomposition', Datumorphism, 06 April. Available at: https://datumorphism.leima.is/cards/math/canonical-decomposition/.

Planted: -by ;

L Ma +by ;

L Ma (2019). 'Cholesky Decomposition', Datumorphism, 06 April. Available at: https://datumorphism.leima.is/cards/math/cholesky-decomposition/.

Planted: -by ;

Planted: by ;

Lei Ma (2022). 'Dilated Convolution', Datumorphism, 08 April. Available at: https://datumorphism.leima.is/cards/math/convolution-dilated/.

Lei Ma (2022). 'Dilated Convolution', Datumorphism, 08 April. Available at: https://datumorphism.leima.is/cards/math/convolution-dilated/.

  1. Find the eigenvectors $\mathbf x_i$ of the matrix $\mathbf A$; If we find degerations, the matrix is not diagonalizable.
  2. Construct a matrix $\mathbf S = \begin{pmatrix} \mathbf x_1 & \mathbf x_2 & \cdots & \mathbf x_n \end{pmatrix}$;
  3. The matrix $\mathbf A$ is diagonalize using $\mathbf S^{-1} \mathbf A \mathbf S = \mathbf {A_D}$

Planted: -by ;

Dynamic Backlinks:
Links to:
Eigenvalues and Eigenvectors +by ;

L Ma (2020). 'Diagnolize Matrices', Datumorphism, 03 April. Available at: https://datumorphism.leima.is/cards/math/diagonalize-matrix/.

Planted: -by ;

References:
Dynamic Backlinks:
Links to:

Planted: -by ;

References:
  1. Weisstein. Frobenius Norm. [cited 8 Nov 2021]. Available: https://mathworld.wolfram.com/FrobeniusNorm.html
Dynamic Backlinks:

L Ma +by ;

L Ma (2019). 'Frobenius distance', Datumorphism, 06 April. Available at: https://datumorphism.leima.is/cards/math/frobenius-distance/.

Planted: by ;

References:
  1. Hubbard1959 -Hubbard J. Calculation of Partition Functions. Physical Review Letters. 1959. pp. 77–78. doi:10.1103/physrevlett.3.77
Dynamic Backlinks:

L Ma (2021). 'The Hubbard-Stratonovich Identity', Datumorphism, 06 April. Available at: https://datumorphism.leima.is/cards/math/hubbard-stratonovich-identity/.

Planted: -by ;

References:
  1. Jaccard index
Dynamic Backlinks:

L Ma +by ;

References:

L Ma (2019). 'Jaccard Similarity', Datumorphism, 05 April. Available at: https://datumorphism.leima.is/cards/math/jaccard-similarity/.

Planted: -by ;

References:
  1. Jensen's Inequality @ Wikipedia
Dynamic Backlinks:

Planted: by ;

References:
  1. wiki-ls -Contributors to Wikimedia projects. Level set. In: Wikipedia [Internet]. 4 Nov 2022 [cited 12 Nov 2022]. Available: https://en.wikipedia.org/wiki/Level_set
Similar Articles:
Dynamic Backlinks to cards/math/level-set:

L Ma (2022). 'Level Set', Datumorphism, 11 April. Available at: https://datumorphism.leima.is/cards/math/level-set/.

#Tensor
Modes of a tensor
Slices of a tensor

Planted: -by ;

References:
  1. Tensor Decompositions and Applications by Tamara G. Kolda and Brett W. Bader
Dynamic Backlinks:

L Ma +by ;

Dynamic Backlinks to cards/math/modes-and-slices-of-tensor:

L Ma (2019). 'Modes and Slices of Tensors', Datumorphism, 06 April. Available at: https://datumorphism.leima.is/cards/math/modes-and-slices-of-tensor/.

A bag is a set in which duplicate elements are allowed.

An ordered bag is a list that we use in programming.

Planted: -by ;

References:
  1. Multiset @ Wikipedia
Similar Articles:

LM (2020). 'Multiset, mset or bag', Datumorphism, 12 April. Available at: https://datumorphism.leima.is/cards/math/multiset-mset-bag/.

Awesome Stuff

Summarizations, workflows, experiences, fails, etc
Active

I create workflows/checklists, and write about experiences here.

Active

I create workflows/checklists, and write about experiences here.