IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

Showing 501 - 550 of 1951

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Densely Connected Neural Network With Dilated Convolutions For Real-Time Speech Enhancement In The Time Domain

00:14:19

0 views

In this work, we propose a fully convolutional neural network for real-time speech enhancement in the time domain. The proposed network is an encoder-decoder based architecture with skip connections. The layers in the encoder and the decoder are followed

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Person Identification Using Deep Convolutional Neural Networks On Short-Term Signals From Wearable Sensors

00:11:47

2 views

In this work, we explore the discriminating ability of short-term signal patterns (e.g. few minutes long) with respect to the person identification task. We focus on signals recorded by simple wearable devices, such as smartwatches, which can measure move

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Multimodal Learning For Classroom Activity Detection

00:10:24

0 views

Classroom activity detection (CAD) focuses on accurately classifying whether the teacher or student is speaking and recording both the length of individual utterances during a class. A CAD solution helps teachers get instant feedback on their pedagogical

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Attention Guided Region Division For Crowd Counting

00:12:05

0 views

Crowd counting has drawn more and more attention in computer vision. There are two mainstream approaches to deal with crowd counting tasks, regression and detection. Regression-based methods usually overestimate the count in sparse areas, while detection-

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Speaker Independence Of Neural Vocoders And Their Effect On Parametric Resynthesis Speech Enhancement

00:14:05

0 views

Traditional speech enhancement systems produce speech with compromised quality. Here we propose to use the high quality speech generation capability of neural vocoders for better quality speech enhancement. We term this parametric resynthesis (PR). In pre

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Unsupervised Content-Preserved Adaptation Network For Classification Of Pulmonary Textures From Different Ct Scanners

00:12:16

0 views

Deep network based methods have been proposed for accurate classification of pulmonary textures on CT images. However, such methods well-trained on CT data from one scanner cannot perform well when they are directly applied to the data from other scanners

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improving The Scalability Of Deep Reinforcement Learning-Based Routing With Control On Partial Nodes

[2 Videos ]

Machine Learning (ML)-based routing optimization has been proposed to optimize the performance of flow routing for future networks, such as Software-Defined Networks (SDNs). However, existing studies are either hard to converge for large networks or vulne

Show videos in this product

Improving The Scalability Of Deep Reinforcement Learning-Based Routing With Control On Partial Nodes

00:14:26

0 views

Machine Learning (ML)-based routing optimization has been proposed to optimize the performance of flow routing for future networks, such as Software-Defined Networks (SDNs). However, existing studies are either hard to converge for large networks or vulne
Improving The Scalability Of Deep Reinforcement Learning-Based Routing With Control On Partial Nodes

00:00:00

0 views

Machine Learning (ML)-based routing optimization has been proposed to optimize the performance of flow routing for future networks, such as Software-Defined Networks (SDNs). However, existing studies are either hard to converge for large networks or vulne

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Q-Gadmm: Quantized Group Admm For Communication Efficient Decentralized Machine Learning

00:15:39

0 views

In this paper, we propose a communication-efficient decentralized machine learning (ML) algorithm, coined quantized group ADMM (Q-GADMM). Every worker in Q-GADMM communicates only with two neighbors, and updates its model via the group alternating direct

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Speaker Diarization Using Latent Space Clustering In Generative Adversarial Network

00:14:40

0 views

In this work, we propose deep latent space clustering for speaker diarization using generative adversarial network (GAN) back-projection with the help of an encoder network. The proposed diarization system is trained jointly with GAN loss, latent variable

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Adaptive Distributed Stochastic Gradient Descent For Minimizing Delay In The Presence Of Stragglers

00:16:24

0 views

We consider the setting where a master wants to run a distributed stochastic gradient descent (SGD) algorithm on $n$ workers each having a subset of the data. Distributed SGD may suffer from the effect of stragglers, i.e., slow or unresponsive workers who

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Gpu-Accelerated Viterbi Exact Lattice Decoder For Batched Online And Offline Speech Recognition

00:15:38

0 views

We present an optimized weighted finite-state transducer (WFST) decoder capable of online streaming and offline batch processing of audio using Graphics Processing Units (GPUs). The decoder is efficient in memory utilization, input/output (I/O) bandwidth,

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Tree Of Shapes Cut For Material Segmentation Guided By A Design

00:09:18

0 views

In manufacturing, the monitoring of the fabrication process is crucial in order to be sure that objects are compliant. For nano-objects, most of this monitoring is done manually. In this paper, we propose a method to segment different materials in a manuf

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Variational Student: Learning Compact And Sparser Networks In Knowledge Distillation Framework

00:14:59

0 views

The holy grail in deep neural network research is porting the memory- and computation-intensive network models on embedded platforms with a minimal compromise in model accuracy. To this end, we propose Variational Student where we reap the benefits of com

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Rde-Moga: Automatic Selection Of Rate-Distortion-Energy Control Points For Video Encoders Using Muti-Objetive Genetic Algorithm

00:14:24

0 views

Controlling energy consumption of video encoders is acomplex multi-objective optimization problem of great im-portance. In this work we propose the RDE-MOGA, an multi-objective genetic algorithm capable of finding energeticallyefficient configurations for

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Emotional Voice Conversion Using Multitask Learning With Text-To-Speech

00:14:14

1 view

Voice conversion (VC) is a task that alters the voice of a person to suit different styles while conserving the linguistic content. Previous state-of-the-art technology used in VC was based on the sequence-to-sequence (seq2seq) model, which could lose lin

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Novel Approach For Intelligibility Assessment In Dysarthric Subjects

00:14:38

0 views

Dysarthria is a motor speech impairment caused by muscle weakness. Individuals, with this condition, are unable to control rapid movement of the velum leading to reduction in intelligibility, audibility, naturalness and efficiency of vocal communication.

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Sensor Selection For Model-Free Source Localization: Where Less Is More

00:13:12

0 views

The ability for a wireless network to precisely localize the radio nodes composing it is a great tool towards system optimization and is increasingly seen as a basic service requirement. In the past, model-free algorithms such as weighted centroid localiz

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Epigraphical Reformulation For Non-Proximable Mixed Norms

00:14:16

0 views

This paper proposes an epigraphical reformulation (ER) technique for non-proximable mixed norm regularization. Various regularization methods using "mixed norms" have been proposed, where their optimization relies on efficient computation of the proximity

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Study Of Child Speech Extraction Using Joint Speech Enhancement And Separation In Realistic Conditions

00:14:05

0 views

In this paper, we design a novel joint framework of speech enhancement and speech separation for child speech extraction in realistic conditions, targeting the problem of extracting child speech from daily conversations in BabyTrain mega corpus. To the be

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Back-And-Forth Prediction For Deep Tensor Compression

00:14:46

0 views

Recent AI applications such as Collaborative Intelligence with neural networks involve transferring deep feature tensors between various computing devices. This necessitates tensor compression in order to optimize the usage of bandwidth-constrained channe

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Transfer Learning From Youtube Soundtracks To Tag Arctic Ecoacoustic Recordings

00:10:32

0 views

Sound provides a valuable tool for long-term monitoring of sensitive animal habitats at a spatial scale larger than camera traps or field observations, while also providing more details than satellite imagery. Currently, the ability to collect such record

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Preconditioned Ghost Imaging Via Sparsity Constraint

00:13:30

0 views

Ghost imaging via sparsity constraint (GISC) can recover objects from the intensity fluctuation of light fields at a sampling rate far below the Nyquist rate. However, its imaging quality may degrade severely when the coherence of sampling matrices is lar

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Learning-Based Content Caching And User Clustering: A Deep Deterministic Policy Gradient Approach

00:12:02

0 views

The joint design of content caching and user clustering (JCC) in cache-enabled heterogeneous networks is challenging, due to various unknown, possibly time-varying, system parameters which potentially give rise to various design tradeoffs in practice. Thi

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Upgrading Crfs To Jrfs And Its Benefits To Sequence Modeling And Labeling

00:12:30

0 views

Two important sequence tasks are sequence modeling and labeling. Sequence modeling involves determining the probabilities of sequences, e.g. language modeling. It is still difficult to improve language modeling with additional relevant tags, e.g. part-of-

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Accent Estimation Of Japanese Words From Their Surfaces And Romanizations For Building Large Vocabulary Accent Dictionaries

00:14:36

0 views

In Japanese text-to-speech (TTS), it is necessary to add accent information to the input sentence. However, there are a limited number of publicly available accent dictionaries, and those dictionaries e.g. UniDic, do not contain many compound words, prope

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Communication Constrained Learning With Uncertain Models

00:13:18

0 views

We consider the problem of distributed inference of a group of agents in a social network, where the agents construct, share, and update beliefs in a non-Bayesian framework to identify the underlying true state of the world. We build upon the concept of u

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Leveraging Ordinal Regression With Soft Labels For 3D Head Pose Estimation From Point Sets

00:13:31

0 views

Head pose estimation from depth image is a challenging problem, considering its large pose variations, severer occlusions, and low quality of depth data. In contrast to existing approaches that take 2D depth image as input, we propose a novel deep regress

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Optimal Window Design For W-Ofdm

00:13:19

1 view

Windowing is an effective approach to reduce out-of-band radiation (OBR) in multicarrier systems in order to avoid adjacent channel interference. However, commonly used window functions are chosen in an ad hoc manner and fixed. We present an optimal windo

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Multi-Motifgan (Mmgan): Motif-Targeted Graph Generation And Prediction

00:12:22

0 views

Generative graph models create instances of graphs that mimic the properties of real-world networks. Generative models are successful at retaining pairwise associations in the underlying networks but often fail to capture higher-order connectivity pattern

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Study Of Generalization Of Stochastic Mirror Descent Algorithms On Overparameterized Nonlinear Models

00:14:46

0 views

We study the convergence, the implicit regularization and the generalization of stochastic mirror descent (SMD) algorithms in overparameterized nonlinear models, where the number of model parameters exceeds the number of training data points. Due to overp

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Nonlinear Spatial Filtering For Multichannel Speech Enhancement In Inhomogeneous Noise Fields

00:13:53

0 views

A common processing pipeline for multichannel speech enhancement is to combine a linear spatial filter with a single-channel postfilter. In fact, it can be shown that such a combination is optimal in the minimum mean square error (MMSE) sense if the noise

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A New Application Of Ultrasound Signal Processing For Archaeological Ceramic Classification

00:12:41

0 views

Identifying archaeological ceramic pieces is a challenging problem for archaeologists, since fragments of archaeological pottery from the same site might have been made in different distant locations from the site. The pieces look very similar and context

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

One-Bit Doa Estimation Via Sparse Linear Arrays

00:13:48

1041 views

Parameter estimation from noisy and quantized received signals has become an important topic in signal processing, as it offers low cost and low complexity in the implementation. Techniques to achieve high estimation performance in spite of the coarse qua

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Fast Optical System Identification By Numerical Interferometry

00:14:56

0 views

We propose a numerical interferometry method for identification of optical multiply-scattering systems when only intensity can be measured. Our method simplifies the calibration of optical transmission matrices from a quadratic to a linear inverse problem

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Deep Learning Architecture For Epileptic Seizure Classification Based On Object And Action Recognition

00:09:00

0 views

Epilepsy affects approximately 1% of the world’s population. Semiology of epileptic seizures contain major clinical signs to classify epilepsy syndromes currently evaluated by epileptologists by simple visual inspection of video. There is a necessity to c

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Hybrid Neural-Parametric F0 Model For Singing Synthesis

00:13:47

475 views

We propose a novel hybrid neural-parametric fundamental frequency generation model for singing voice synthesis. A recurrent neural network predicts the parameters of a flexible parametric F0 model, conditioned on a given input score. Rather than trying to

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Automatic Classification Of Volumes Of Water Using Swallow Sounds From Cervical Auscultation

00:14:09

0 views

The signatures of swallowing vary depending on the volume of bolus swallowed. Among existing instrumental methods, cervical auscultation (CA) captures the acoustic signatures of the swallow sound. Although many features present in the literature can chara

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Anomalous Sound Detection Based On Interpolation Deep Neural Network

00:12:03

2 views

As the labor force decreases, the demand for labor-saving automatic anomalous sound detection technology that conducts maintenance of industrial equipment has grown. Conventional approaches detect anomalies based on the reconstruction errors of an autoenc

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Localized Linear Regression In Networked Data

00:12:23

0 views

The network Lasso (nLasso) has been proposed recently as an efficient learning algorithm for massive networked data sets (big data over networks). It extends the well-known least absolute shrinkage and selection operator (Lasso) from learning sparse (gene

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Connections Between Spectral Properties Of Asymptotic Mappings And Solutions To Wireless Network Problems

00:17:42

0 views

We establish connections between asymptotic functions and properties of solutions to problems in wireless networks. We start by introducing self-mappings (called asymptotic mappings) constructed with asymptotic functions, and we show that their spectral p

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Super-Resolution Via Image-Adapted Denoising Cnns: Incorporating External And Internal Learning

00:12:08

634 views

While deep neural networks exhibit state-of-the-art results in the task of image super-resolution (SR) with a fixed known acquisition process (e.g., a bicubic downscaling kernel), they experience a huge performance loss when the real observation model mis

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Learning Based Reconfigurable Wideband Non-Contiguous Spectrum Characterization For 5G Applications

00:16:20

603 views

Introduction of spectrum sharing in 3GPP Release 17 demands base-stations (BS) with the capability to characterize the wideband spectrum spanned over licensed, shared and unlicensed non-contiguous frequency bands. Since, multiple-antenna and beam-forming

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Convolutional Beamspace For Array Signal Processing

00:13:37

48 views

A new type of beamspace for array processing is introduced called convolutional beamspace. It enjoys the advantages of traditional beamspace such as lower computational complexity, increased parallelism of subband processing, and improved resolution thres

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Automatic Lyrics Alignment And Transcription In Polyphonic Music: Does Background Music Help?

00:13:52

0 views

Automatic lyrics alignment and transcription in polyphonic music are challenging tasks because the singing vocals are corrupted by the background music. In this work, we propose to learn music genre-specific characteristics to train polyphonic acoustic mo

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Data-Driven Model Set Design For Model Averaged Particle Filter

00:16:18

0 views

This paper is concerned with sequential state filtering in the presence of nonlinearity, non-Gaussianity and model uncertainty. For this problem, the Bayesian model averaged particle filter (BMAPF) is perhaps one of the most efficient solutions. Major adv

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Robust Multi-Channel Speech Recognition Using Frequency Aligned Network

00:18:33

0 views

Conventional speech enhancement technique such as beamforming has known benefits for far-field speech recognition. Our own work in frequency-domain multi-channel acoustic modeling has shown additional improvements by training a spatial filtering layer joi

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

C3Dvqa: Full-Reference Video Quality Assessment With 3D Convolutional Neural Network

00:13:20

0 views

Traditional video quality assessment (VQA) methods evaluate localized picture quality and video score is predicted by temporally aggregating frame scores. However, video quality exhibits different characteristics from static image quality due to the exist

All Channels page: Communities submenu block

Communities

All Channels page: Societies submenu block

Societies

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

2020 EAB AWARDS

2020 EAB AWARDS

IEEE ICASSP 2020 Virtual Conference May 2020