Search IEEE.tv

Showing 769 - 792 of 23807

Real-Time Speech Enhancement Using Equilibriated Rnn

We propose a speech enhancement method using a causal deep neural network (DNN) for real-time applications. DNN has been widely used for estimating a time-frequency (T-F) mask which enhances a speech…

Submodular Rank Aggregation On Score-Based Permutations For Distributed Automatic Speech Recognition

Distributed automatic speech recognition (ASR) requires to aggregate outputs of distributed deep neural network (DNN)-based models. This work studies the use of submodular functions to design a rank…

Low-Complexity Fixed-Point Convolutional Neural Networks For Automatic Target Recognition

There has been growing interest in developing neural network based automatic target recognition systems for synthetic aperture radar applications. However, these networks are typically complex in…

Truth-To-Estimate Ratio Mask: A Post-Processing Method For Speech Enhancement Direct At Low Signal-To-Noise Ratios

This study proposes a bi-directional recurrent neural network (Bi-RNN) post-processing method for speech enhancement (SE) at low signal-to noise ratios (SNR). Current speech enhancement solutions…

Donald Wunsch

Channel Attention Based Generative Network For Robust Visual Tracking

In recent years, Siamese trackers have achieved great success in visual tracking. Siamese networks can achieve competitive performance in both accuracy and speed. However, they may suffer from the…

Matching Pursuit Based Dynamic Phase-Amplitude Coupling Measure

Long-distance neuronal communication in the brain is enabled by the interactions across various oscillatory frequencies. One interaction that is gaining importance during cognitive brain functions is…

1 views

End-To-End Generation Of Talking Faces From Noisy Speech

Acoustic cues are not the only component in speech communication; if the visual counterpart is present, it is shown to benefit speech comprehension. In this work, we propose an end-to-end (no pre- or…

End-To-End Non-Negative Autoencoders For Sound Source Separation

Discriminative models for source separation have recently been shown to produce impressive results. However, when operating on sources outside of the training set, these models can not perform as…

Vocal Tract Articulatory Contour Detection In Real-Time Magnetic Resonance Images Using Spatio-Temporal Context

Due to its ability to visualize and measure the dynamics of vocal tract shaping during speech production, real-time magnetic resonance imaging (rtMRI) has emerged as one of the prominent research…

An Improved Selective Active Noise Control Algorithm Based On Empirical Wavelet Transform

The gradual adaptation and possibility of divergence have been the two main obstacles in the efficient implementation of conventional adaptive active noise control (ANC) to a wider range of…

Achieving Fully-Digital Performance By Hybrid Analog/Digital Beamforming In Wide-Band Massive-Mimo Systems

In this paper, we study the realization of any given fully-digital precoder (FDP) by hybrid analog/digital precoding (HADP) in wide-band mmWave systems. We first formulate the massive-MIMO OFDM-based…

Clustering 101

…

August 11, 2013

71 views

Improving Efficiency In Large-Scale Decentralized Distributed Training

Decentralized Parallel SGD (D-PSGD) and its asynchronous variant Asynchronous Parallel SGD (AD-PSGD) is a family of distributed learning algorithms that have been demonstrated to perform well for…

Chirping Up The Right Tree: Incorporating Biological Taxonomies Into Deep Bioacoustic Classifiers

Class imbalance in the training data hinders the generalization ability of machine listening systems. In the context of bioacoustics, this issue may be circumvented by aggregating species labels into…

Coupled Training Of Sequence-To-Sequence Models For Accented Speech Recognition

Accented speech poses significant challenges for state-of-the-art automatic speech recognition (ASR) systems. Accent is a property of speech that lasts throughout an utterance in varying degrees of…

Waveffjord: Ffjord-Based Vocoder For Statistical Parametric Speech Synthesis

Free-form Jacobian of Reversible Dynamics(FFJORD) is a flow-based invertible generative model defined by ordinary differential equations (ODE). Inspired by WaveGlow, in this paper, we propose…

Speaker Diarization With Session-Level Speaker Embedding Refinement Using Graph Neural Networks

Deep speaker embedding models have been commonly used as a building block for speaker diarization systems; however, the speaker embedding model is usually trained according to a global loss defined…

Korean Singing Voice Synthesis Based On Auto-Regressive Boundary Equilibrium Gan

Singing voice synthesis is a generative task that involves not only multidimensional controls of a singer model such as phonetic modulation by lyrics and pitch control by music score but also…

Tensorflow Audio Models In Essentia

Essentia is a reference open-source C++/Python library for audio and music analysis. In this work, we present a set of algorithms that employ TensorFlow in Essentia, allow predictions with pre-…

2 views

Tree Of Shapes Cut For Material Segmentation Guided By A Design

In manufacturing, the monitoring of the fabrication process is crucial in order to be sure that objects are compliant. For nano-objects, most of this monitoring is done manually. In this paper, we…

Evolving Fuzzy Systems: A Granular Computing Design Framework

…

June 11, 2013

94 views

A Time-Frequency Network With Channel Attention And Non-Local Modules For Artificial Bandwidth Extension

Convolution neural networks (CNNs) have been achieving increasing attention for the artificial bandwidth extension (ABE) task recently. However, these methods use the flipped low-frequency phase to…

Computational Intelligence: What Can We Learn From the Brain?

IEEE SSCI 2013 Day 3 Prof Wlodzislaw Duch

June 11, 2013

91 views

All Channels page: Communities submenu block

Communities

IEEE.tv Specials

IEEE Awards

IEEE Women in Engineering

IEEE TechEthics™

IEEE Students

All Channels page: Societies submenu block

Societies

IEEE Computer Society

IEEE Society on Social Implications of Technology

IEEE Communications Society

IEEE Nuclear and Plasma Sciences Society

IEEE Signal Processing Society

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

Artificial Neural Networks, Intro

WIRELESS TRANSCEIVER SYSTEM DESIGN FOR MODERN COMMUNICATION STANDARDS

Beyond the Cellular Paradigm: Cell-Free Architectures with Radio Stripes - IEEE Future Networks Webinar

SOC DESIGN METHODOLOGY FOR IMPROVED ROBUSTNESS

The Upcoming Era of Specialization and the Research Needed to Make It Work for our Country - ICRC 2018 Plenary, William Chappell

2020 EAB AWARDS

2020 EAB AWARDS

About IEEE