Search

3 results

23 - Short-Time Fourier Transform and Introduction to Wavelet Analysis
Chunyan Li, Louisiana State University
Book:

Time Series Data Analysis in Oceanography

Published online:

21 April 2022

Print publication:

05 May 2022, pp 415-426
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter discusses the drawback of Fourier analysis and the methods that can overcome its limitations. In general, Fourier analysis does not include information about time, particularly events. A slight modification of Fourier analysis can allow the addition of a dimension in time: by dividing the time series into smaller segments and doing the Fourier Transform for each segment, a method called short-time Fourier Transform (STFT) is introduced. Wavelet analysis is then discussed as a much better alternative to or replacement for STFT. It involves scaled and translated convolution with a short base function (short in the sense that it is essentially non-zero only in a finite interval). Wavelet analysis uses different base functions than the Fourier Transform. They are limited in time (unlike the infinitely long sinusoidal functions) and can be stretched or compressed to represent different scales (equivalent to frequencies). This method will allow the resolution of events at different times and different scales.

Neutral-to-emotional voice conversion with cross-wavelet transform F0 using generative adversarial networks
Part of
- Affect, Emotion and Behavior Processing in Human-Machine Interaction
Zhaojie Luo, Jinhui Chen, Tetsuya Takiguchi, Yasuo Ariki
Journal:

APSIPA Transactions on Signal and Information Processing / Volume 8 / 2019

Published online by Cambridge University Press:

04 March 2019, e10

Print publication:

2019
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
In this paper, we propose a novel neutral-to-emotional voice conversion (VC) model that can effectively learn a mapping from neutral to emotional speech with limited emotional voice data. Although conventional VC techniques have achieved tremendous success in spectral conversion, the lack of representations in fundamental frequency (F0), which explicitly represents prosody information, is still a major limiting factor for emotional VC. To overcome this limitation, in our proposed model, we outline the practical elements of the cross-wavelet transform (XWT) method, highlighting how such a method is applied in synthesizing diverse representations of F0 features in emotional VC. The idea is (1) to decompose F0 into different temporal level representations using continuous wavelet transform (CWT); (2) to use XWT to combine different CWT-F0 features to synthesize interaction XWT-F0 features; (3) and then use both the CWT-F0 and corresponding XWT-F0 features to train the emotional VC model. Moreover, to better measure similarities between the converted and real F0 features, we applied a VA-GAN training model, which combines a variational autoencoder (VAE) with a generative adversarial network (GAN). In the VA-GAN model, VAE learns the latent representations of high-dimensional features (CWT-F0, XWT-F0), while the discriminator of the GAN can use the learned feature representations as a basis for a VAE reconstruction objective.

A scale-space approach with wavelets to singularity estimation
Jérémie Bigot
Journal:

ESAIM: Probability and Statistics / Volume 9 / February 2005

Published online by Cambridge University Press:

15 November 2005, pp. 143-164

Print publication:

February 2005
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
This paper is concerned with the problem of determining the typical features of a curve when it is observed with noise. It has been shown that one can characterize the Lipschitz singularities of a signal by following the propagation across scales of the modulus maxima of its continuous wavelet transform. A nonparametric approach, based on appropriate thresholding of the empirical wavelet coefficients, is proposed to estimate the wavelet maxima of a signal observed with noise at various scales. In order to identify the singularities of the unknown signal, we introduce a new tool, “the structural intensity”, that computes the “density” of the location of the modulus maxima of a wavelet representation along various scales. This approach is shown to be an effective technique for detecting the significant singularities of a signal corrupted by noise and for removing spurious estimates. The asymptotic properties of the resulting estimators are studied and illustrated by simulations. An application to a real data set is also proposed.

Search Results

Refine search

Refine search

Actions for selected content:

3 results

23 - Short-Time Fourier Transform and Introduction to Wavelet Analysis

Summary

Neutral-to-emotional voice conversion with cross-wavelet transform F0 using generative adversarial networks

A scale-space approach with wavelets to singularity estimation

Search Results

Refine search

Refine search

Actions for selected content:

Save Search

3 results

23 - Short-Time Fourier Transform and Introduction to Wavelet Analysis

Summary

Neutral-to-emotional voice conversion with cross-wavelet transform F0 using generative adversarial networks

A scale-space approach with wavelets to singularity estimation