Improve intermediate layer extraction explanation by palonso · Pull Request #1338 · MTG/essentia

palonso · 2023-05-26T08:51:29Z

TensorToVectorReal converts tensors to 2D arrays by flattening all axis but the last one into the first dimension.
model-specific prediction algorithms (e.g., TensorflowPredictVGGish) use this algorithm to return 2D arrays since they are primarily intended for time-wise predictions or embeddings. However, it is possible to use these algorithms to extract intermediate layers of the models that may have more than two dimensions. In this case, all dimensions but the last one will be flattened. To address this:

TensorToVectorReal throws a warning in case it flattens a dimension.
We added notes explaining this behavior to the algorithms potentially affected.

Note that it is also possible to retrieve intermediate layers with their original shape using TensorflowPredict as discussed here.

dbogdanov

This looks good! I've left a proposal to improve the description of the algorithms' output in the DOC string.

dbogdanov · 2023-05-29T13:34:32Z

+  "Note: The output of this algorithm is 2D, which is suitable for extracting embeddings or "
+  "class activations (the output shape is, e.g., [time, number of classes]). If the output "
+  "parameter is set to an intermediate layer with more dimensions, the output will be "
+  "flattened to 2D.\n"


Rephrased version (trying to simplify):

Note: The algorithm outputs a time series of class activations or embedding vectors, with a 2D shape [time, feature vector]. Feature vector values will be flattened if the output parameter is set to extract an intermediate layer with multiple dimensions.

dbogdanov · 2023-05-29T13:36:07Z

+  "class activations (the output shape is, e.g., [time, number of classes]). If the output "
+  "parameter is set to an intermediate layer with more dimensions, the output will be "
+  "flattened to 2D.\n"
+  "\n"


Same comments as for TensorflowPredictEffnetDiscogs

dbogdanov · 2023-05-29T13:36:21Z

+  "Note: The output of this algorithm is 2D, which is suitable for extracting embeddings or "
+  "class activations (the output shape is, e.g., [time, number of classes]). If the output "
+  "parameter is set to an intermediate layer with more dimensions, the output will be "
+  "flattened to 2D.\n"


Same comment as for TensorflowPredictEffnetDiscogs

dbogdanov · 2023-05-29T13:36:31Z

+  "Note: The output of this algorithm is 2D, which is suitable for extracting embeddings or "
+  "class activations (the output shape is, e.g., [time, number of classes]). If the output "
+  "parameter is set to an intermediate layer with more dimensions, the output will be "
+  "flattened to 2D.\n"


Same comment as for TensorflowPredictEffnetDiscogs

dbogdanov · 2023-05-29T13:37:22Z

    _featsSize = tensor.dimension(3);

+    if (_channels != 1 && !_warned) {
+        E_WARNING("TensorToVectorReal: The channel axis (dimension 1) of the input tensor has size larger than 1, but the output of this algorithm is 2D. The batch, channel, and time axes (dimensions 0, 1, 2) will be flattened to the first dimension of the output matrix.");


We output a vector of vector of reals, so the "matrix" terminology may be misleading.

palonso added 3 commits May 26, 2023 10:23

Warn if channels>1 when converting tensor to frame

0500f9a

Add note explaining intermediate layer extraction

4af184c

Fix references

3d5cf82

palonso requested a review from dbogdanov May 26, 2023 08:51

dbogdanov requested changes May 29, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve intermediate layer extraction explanation#1338

Improve intermediate layer extraction explanation#1338
palonso wants to merge 3 commits intoMTG:masterfrom
palonso:intermediate-layer-extraction-doc

palonso commented May 26, 2023

Uh oh!

dbogdanov left a comment

Uh oh!

dbogdanov May 29, 2023

Uh oh!

dbogdanov May 29, 2023

Uh oh!

dbogdanov May 29, 2023

Uh oh!

dbogdanov May 29, 2023

Uh oh!

dbogdanov May 29, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

palonso commented May 26, 2023

Uh oh!

dbogdanov left a comment

Choose a reason for hiding this comment

Uh oh!

dbogdanov May 29, 2023

Choose a reason for hiding this comment

Uh oh!

dbogdanov May 29, 2023

Choose a reason for hiding this comment

Uh oh!

dbogdanov May 29, 2023

Choose a reason for hiding this comment

Uh oh!

dbogdanov May 29, 2023

Choose a reason for hiding this comment

Uh oh!

dbogdanov May 29, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants