IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

Showing 1601 - 1650 of 1951

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Processing Convolutional Neural Networks On Cache

00:13:00

0 views

With the advent of Big Data application domains, several Machine Learning (ML) signal-processing algorithms such as Convolutional Neural Networks (CNNs) are required to process progressively larger datasets at a great cost in terms of both compute power a

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Vimo: Vital Sign Monitoring Using Commodity Millimeter Wave Radio

00:14:37

0 views

Accurate monitoring of human vital signs (e.g. breathing and heart rates) is crucial in detecting medical problems. In this paper, we propose ViMo, a calibration-free remote Vital sign Monitoring system that can simultaneously monitor multiple users by le

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Recursive Bayesian Solution For The Excess Over Threshold Distribution With Stochastic Parameters

00:16:28

0 views

In this paper, we propose a new approach for analyzing extreme values that are witnessed in financial markets. Our goal is to compute the predictive distribution of extreme events that are clustered in time and, as opposed to modeling just the maximum of

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Computing Hilbert Transform And Spectral Factorization For Signal Spaces Of Smooth Functions

00:14:51

0 views

Although the Hilbert transform and the spectral factorization are of central importance in signal processing, both operations can generally not be calculated in closed form. Therefore, algorithmic solutions are prevalent which provide an approximation of

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Content Based Singing Voice Extraction From A Musical Mixture

00:12:39

0 views

We present a deep learning based methodology for extracting the singing voice signal from a musical mixture based on the underlying linguistic content. Our model follows an encoder decoder architecture and takes as input the magnitude component of the spe

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Line Spectral Estimation With Palindromic Kernels

00:10:59

0 views

Estimation of line spectra is a classical problem in signal processing and arises in many applications. The problem is to estimate the frequencies and corresponding amplitudes of a sum of (possibly complex-valued) sinusoidal components from noisy measurem

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Confidence Estimation For Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks

00:14:47

0 views

Recently, there has been growth in providers of speech transcription services enabling others to leverage technology they would not normally be able to use. As a result, speech-enabled solutions have become commonplace. Their success critically relies on

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Clutter Identification Based On Sparse Recovery And L1-Type Probabilistic Distance Measures

00:18:01

0 views

Cognitive radar framework has recently been proposed in radar signal processing to develope algorithms for target detection, tracking, and waveform design in the presence of nonstationary environmental (clutter) characteristics. In this framework, there a

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Spoken Document Retrieval Leveraging Bert-Based Modeling And Query Reformulation

00:14:06

0 views

Spoken document retrieval (SDR) has long been deemed a fundamental and important step towards efficient organization of, and access to multimedia associated with spoken content. In this paper, we present a novel study of SDR leveraging the Bidirectional E

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Efficient Image Super Resolution Via Channel Discriminative Deep Neural Network Pruning

00:06:48

0 views

Deep convolutional neural networks (CNN) have demonstrated superior performance in image super-resolution (SR) problem.However, CNNs are known to be heavily over-parameterized, and suffer from abundant redundancy. The growing size ofCNNs may be incompatib

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Attention Driven Fusion For Multi-Modal Emotion Recognition

00:14:40

0 views

Deep learning has emerged as a powerful alternative to hand-crafted methods for emotion recognition on combined acoustic and text modalities. Baseline systems model emotion information in text and acoustic modes independently using Deep Convolutional Neur

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Emet: Embeddings From Multilingual-Encoder Transformer For Fake News Detection

00:13:20

0 views

In the last few years, social media networks have changed human life experience and behavior as it has broken down communication barriers, allowing ordinary people to actively produce multimedia content on a massive scale. On this wise, the information di

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Statistics Pooling Time Delay Neural Network Based On X-Vector For Speaker Verification

00:12:50

0 views

This paper aims to improve speaker embedding representation based on x-vector for extracting more detailed information for speaker verification. We propose a statistics pooling time delay neural network (TDNN), in which the TDNN structure integrates stati

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Large-Scale Fading Precoding For Maximizing The Product Of Sinrs

00:15:01

0 views

This paper considers the large-scale fading precoding design for mitigating the pilot contamination in the downlink of multi-cell massive MIMO (multiple-input multiple-output) systems. Rician fading with spatially correlated channels are considered where

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Adrn: Attention-Based Deep Residual Network For Hyperspectral Image Denoising

00:12:31

0 views

Hyperspectral image (HSI) denoising is of crucial importance for many subsequent applications, such as HSI classification and interpretation. In this paper, we propose an attention-based deep residual network to directly learn a mapping from noisy HSI to

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Vapar Synth - A Variational Parametric Model For Audio Synthesis

00:15:14

0 views

With the advent of data-driven statistical modeling and abundant computing power, researchers are turning increasingly to deep learning for audio synthesis. These methods try to model audio signals directly in the time or frequency domain. In the interest

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Sequential Joint Detection And Estimation With An Application To Joint Symbol Decoding And Noise Power Estimation

00:13:32

0 views

Jointly testing multiple hypotheses and estimating a random parameter of the underlying model is investigated in a sequential setup. The optimal scheme is designed such that it minimizes the expected number of used samples while keeping the probabilities

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Automatic Epileptic Seizure Onset-Offset Detection Based On Cnn In Scalp Eeg

00:14:26

1 view

We establish a deep learning-based method to automatically detect the epileptic seizure onsets and offsets in multi-channel electroencephalography (EEG) signals. A convolutional neural network (CNN) is designed to identify occurrences of seizures in EEG e

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Robust Fundamental Frequency Estimation In Coloured Noise

00:14:23

0 views

Most parametric fundamental frequency estimators make the implicit assumption that any corrupting noise is additive, white Gaussian. Under this assumption, the maximum likelihood (ML) and the least squares estimators are the same, and statistically effici

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Saliency-Based Image Contrast Enhancement With Reversible Data Hiding

00:13:57

0 views

Reversible data hiding (RDH) has become a hot research area in the recent years due to its wide applications such as authentication. Among all the RDH methods proposed, contrast enhancement based reversible data hiding is one that was recently proposed. H

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Spectrum Allocation In Wireless Networks For Crowd Labelling

00:11:57

0 views

The massive sensing data generated by Internet-of-Things will provide fuel for ubiquitous artificial intelligence (AI), while tremendous labels are required for AI model training via supervised learning. To tackle this challenge, a novel framework of wire

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Staged Training Strategy And Multi-Activation For Audio Tagging With Noisy And Sparse Multi-Label Data

00:11:18

0 views

Audio tagging aims to predict whether certain acoustic events occur in the audio clips. Due to the difficulty and huge cost of obtaining manually labeled data with high confidence, researchers begin to focus on audio tagging using a small set of manually-

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Encoding And Decoding Mixed Bandlimited Signals Using Spiking Integrate-And-Fire Neurons

00:12:17

0 views

Conventional sampling focuses on encoding and decoding bandlimited signals by recording signal amplitudes at known time points. Alternately, sampling can be approached using biologically-inspired schemes. Among these are integrate-and-fire time encoding m

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Asr Is All You Need: Cross-Modal Distillation For Lip Reading

00:12:22

0 views

The goal of this work is to train strong models for visual speech recognition without requiring human annotated ground truth data. We achieve this by distilling from an Automatic Speech Recognition (ASR) model that has been trained on a large-scale audio-

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Learning To Rank Music Tracks Using Triplet Loss

00:13:09

0 views

Most music streaming services rely on automatic recommendation algorithms to exploit their large music catalogs. These algorithms aim at retrieving a ranked list of music tracks based on their similarity with a target music track. In this work, we propose

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Probabilistic Filter And Smoother For Variational Inference Of Bayesian Linear Dynamical Systems

00:14:16

0 views

Variational inference of a Bayesian linear dynamical system is a powerful method for estimating latent variable sequences and learning sparse dynamic models in domains ranging from neuroscience to audio processing. The hardest part of the method is inferr

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Deliberation Model Based Two-Pass End-To-End Speech Recognition

00:15:44

0 views

End-to-end (E2E) models have made rapid progress in automatic speech recognition (ASR) and perform competitively relative to conventional models. To further improve the quality, a two-pass model has been proposed to rescore streamed hypotheses using the n

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Regularized Fast Multichannel Nonnegative Matrix Factorization With Ilrma-Based Prior Distribution Of Joint-Diagonalization Process

00:13:00

0 views

In this paper, we address a convolutive blind source separation (BSS) problem and propose a new extended framework of FastMNMF by introducing prior information for joint diagonalization of the spatial covariance matrix model. Recently, FastMNMF has been p

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Epi-Neighborhood Distribution Based Light Field Depth Estimation

00:13:59

0 views

In this paper, a novel depth estimation algorithm tackling foreground occlusion is proposed based on the neighborhood distribution in the sheared epipolar images (EPIs). First, the EPI is sheared to perform refocusing. Next a series of sheared EPI?s neigh

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Multi Image Depth From Defocus Network With Boundary Cue For Dual Aperture Camera

00:12:32

0 views

In this paper, we estimate depth information using two defocused images from dual aperture camera. Recent advances in deep learning techniques have increased the accuracy of depth estimation. Besides, methods of using a defocused image in which an object

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Defense Against Adversarial Attacks On Spoofing Countermeasures Of Asv

00:12:47

0 views

Various forefront countermeasure methods for automatic speaker verification (ASV) with considerable performance in anti-spoofing are proposed in the ASVspoof 2019 challenge. However, previous work has shown that countermeasure models are vulnerable to adv

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improving Device Directedness Classification Of Utterances With Semantic Lexical Features

00:14:58

0 views

User interactions with personal assistants like Alexa, Google Home and Siri are typically initiated by a wake term or wakeword. Several personal assistants feature "follow-up" modes that allow users to make additional interactions without the need of a wa

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Comparison Of Glottal Closure Instants Detection Algorithms For Emotional Speech

00:16:51

0 views

In production of voiced speech, epochs or glottal closure instants (GCIs) refer to the instants of significant excitation of the vocal tract. Extraction of GCIs is used as a pre-processing stage in many areas of speech technology, such as in prosody modif

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Ontology-Aware Framework For Audio Event Classification

00:13:50

0 views

Recent advancements in audio event classification often ignore the structure and relation between the label classes available as prior information. This structure can be defined by ontology and augmented in the classifier as a form of domain knowledge. To

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Model-Free Approach To Distributed Transmit Beamforming

00:14:00

0 views

This paper presents a model-free solution to distributed transmit beamforming using mobile agents. Each agent is equipped with an antenna and the agents represent the individual elements in an antenna array. The agents are tasked to coordinate their relat

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Gaussian Lpcnet For Multisample Speech Synthesis

00:13:45

0 views

LPCNet vocoder has recently been presented to TTS community and is now gaining increasing popularity due to its effectiveness and high quality of the speech synthesized with it. In this work, we present a modification of LPCNet that is 1.5x faster, has tw

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Source Domain Data Selection For Improved Transfer Learning Targeting Dysarthric Speech Recognition

00:13:09

0 views

This paper presents an improved transfer learning framework applied to robust personalised speech recognition models for speakers with dysarthria. As the baseline of transfer learning, a state-of-the-art CNN-TDNN-F ASR acoustic model trained solely on sou

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Semi-Implicit Stochastic Recurrent Neural Networks

00:15:12

0 views

Stochastic recurrent neural networks with latent random variables of complex dependency structures have shown to be more successful in modeling sequential data than deterministic deep models. However, the majority of existing methods have limited expressi

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Decentralized Stochastic Non-Convex Optimization Over Weakly Connected Time-Varying Digraphs

00:14:55

0 views

In this paper, we consider decentralized stochastic non-convex optimization over a class of weakly connected digraphs. First, we quantify the convergence behaviors of the weight matrices of this type of digraphs. By leveraging the perturbed push sum proto

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Time-Frequency Loss For Cnn Based Speech Super-Resolution

00:15:36

0 views

Speech super-resolution (SR), also called speech bandwidth extension (BWE), aims to increase the sampling rate of a given lower resolution speech signal. Recent years have witnessed the successful application of deep neural networks in time or frequency d

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Dynamic Resource Allocation For Wireless Edge Machine Learning With Latency And Accuracy Guarantees

00:15:17

1 view

In this paper, we address the problem of dynamic allocation of communication and computation resources for Edge Machine Learning (EML) exploiting Multi-Access Edge Computing (MEC). In particular, we consider an IoT scenario, where sensor devices collect d

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Sampling Classes Of Non-Bandlimited Signals Using Integrate-And-Fire Devices: Average Case Analysis

00:14:12

0 views

We investigate the use of integrate-and-fire systems to efficiently sample classes of non-bandlimited signals such as bursts of spikes. The sampling in this case is based on storing some timing information about the signal, and no information about its am

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improving Robustness Of Deep Learning Based Monaural Speech Enhancement Against Processing Artifacts

00:15:25

0 views

In voice telecommunication, the intelligibility and quality of speech signals can be severely degraded by background noise if the speaker at the transmitting end talks in a noisy environment. Therefore, a speech enhancement system is typically integrated

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Lookahead Converges To Stationary Points Of Smooth Non-Convex Functions

00:13:47

0 views

The Lookahead optimizer [Zhang et al., 2019] was recently proposed and demonstrated to improve performance of stochastic first-order methods for training deep neural networks. Lookahead can be viewed as a two time-scale algorithm, where the fast dynamics

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Constant-Envelope Precoding For Satellite Systems

00:13:01

0 views

In this paper, Constant-Envelope Precoding techniques are presented for satellite-based communication systems. In the developed transmission technique the signals of the antennas are designed to be of constant amplitude, improving the robustness of the la

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Cost Aware Adversarial Learning

00:14:50

0 views

The problem of making the classifier design resilient to test data falsification is considered. In the literature, a few countermeasures have been proposed to defend machine learning algorithms against test data falsification, but a common assumption empl

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Universal Phone Recognition With A Multilingual Allophone System

00:12:51

0 views

Recently, multilingual speech recognition has achieved tremendous progress by sharing parameters across languages. Multilingual acoustic models, however, generally ignore the difference between phonemes (sounds that can support lexical contrasts in a emp

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Learning Partial Differential Equations From Data Using Neural Networks

00:14:34

0 views

We develop a framework for estimating unknown partial differential equations (PDEs) from noisy data, using a deep learning approach. Given noisy samples of a solution to an unknown PDE, our method interpolates the samples using a neural network, and extra

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Local-Global Feature For Video-Based One-Shot Person Re-Identification

00:12:29

0 views

One-shot video-based re-identification, which uses only one labeled tracklet for each identity, is challenging since the framework usually suffers misalignment and inefficient utilizing of unlabeled data. In this paper we propose a novel local-global prog

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Toward Better Speaker Embeddings: Automated Collection Of Speech Samples From Unknown Distinct Speakers

00:10:56

0 views

The accuracy of speaker verification and diarization models depends on the quality of the speaker embeddings used to separate audio samples from different speakers. With the goal of training better embedding models, we devise an au- tomatic pipeline for l

All Channels page: Communities submenu block

Communities

All Channels page: Societies submenu block

Societies

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

2020 EAB AWARDS

2020 EAB AWARDS

IEEE ICASSP 2020 Virtual Conference May 2020