项目作者: keunwoochoi

项目描述 :
FMA convnet features
高级语言:
项目地址: git://github.com/keunwoochoi/FMA_convnet_features.git
创建时间: 2017-05-24T15:38:03Z
项目社区:https://github.com/keunwoochoi/FMA_convnet_features

开源协议:

下载


FMA_convnet_features

Keunwoo Choi, May 2017

A repo to host convnet features for FMA. Wanted to do more experiment than baseline method, but don’t think I have time to do so, so just releasing it.

What is convnet feature?

A set of features that is computed from transfer learning repo

What is FMA?

Free Music Archieve dataset: A large collection of audio and its genre annotation released in 2017.

How good is the feature?

  • In general, it achieved better performance than some popular audio features as in this paper
  • For FMA dataset, it’s bit better than the provided audio features — showed 63.94% of accuracy. +1% improvement over baseline. Although I think this convnet feature also should be considered as one of baseline features.

How to use?

  1. # Load the features
  2. import numpy as np
  3. feat1 = np.load('fma_large_layer1.npy')
  4. feat2 = np.load('fma_large_layer2.npy')
  5. feat3 = np.load('fma_large_layer3.npy')
  6. feat4 = np.load('fma_large_layer4.npy')
  7. feat5 = np.load('fma_large_layer5.npy')
  8. # concatenate the features
  9. features = np.concatenate((feat1, feat2, feat3, feat4, feat5), axis=1)
  10. features.shape
  11. # (106574, 160)
  12. # This is matched to the order of metadata.csv provided in FMA.
  13. # Now use it for your task!

Classification results (Compare these with the results from provided baselines)

result

T-SNE

on FMA-‘small’

tsne on small

on FMA-‘medium’

tsne on medium

Details

  • This is computed from FMA large, which is 30-second previews clips