项目作者: jerrygood0703

项目描述 :
speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
高级语言: Python
项目地址: git://github.com/jerrygood0703/speech-enhancement-WGAN.git
创建时间: 2017-08-01T09:41:42Z
项目社区:https://github.com/jerrygood0703/speech-enhancement-WGAN

开源协议:

下载


SEGAN with improved wgan

Under construction!! Don’t fork!!

Tensorflow 1.2rc

Using imporved wgan

Enhancement on both waveform data and LPS data

Stage1 training only L1/L2 loss, without adversarial loss

Stage2 joint training

Usage

Preparing data(data_utils.py)

  1. import tensorflow as tf
  2. from data_utils import *
  3. reader = dataPreprocessor(path_to_record_name, path_to_noisy, path_to_clean, use_waveform=True)
  4. reader.write_tfrecord()

Training phase

  1. python main.py stage1

Testing phase

In main.py

change test_path and test_list

  1. python main.py test