IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

Showing 1651 - 1700 of 1951

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Exploiting Two-Dimensional Symmetry And Unimodality For Model-Free Source Localization In Harsh Environment

00:14:59

0 views

Knowing the location of a transceiver may enable advanced radio resource management strategies in sensing and communication networks. However, there are many scenarios where users operate in a non-cooperative mode with no localization-dedicated signaling

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Multi-Head Attention For Speech Emotion Recognition With Auxiliary Learning Of Gender Recognition

00:15:22

0 views

The paper presents a Multi-Head Attention deep learning network for Speech Emotion Recognition (SER) using Log mel-Filter Bank Energies (LFBE) spectral features as the input. The multi-head attention along with the position embedding jointly attends to in

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Generalized Graph Spectral Sampling With Stochastic Priors

00:13:03

1 view

We consider generalized sampling for stochastic graph signals. The generalized graph sampling framework allows recovery of graph signals beyond the bandlimited setting by placing a correction filter between the sampling and reconstruction operators and as

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

View-Angle Invariant Object Monitoring Without Image Registration

00:14:09

0 views

Object monitoring can be performed by change detection algorithms. However, for the image pair with a large perspective difference, the change detection performance is usually impacted by inaccurate image registration. To address the above difficulties, a

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Spoken Language Acquisition Based On Reinforcement Learning And Word Unit Segmentation

00:14:59

0 views

The process of spoken language acquisition has been one of the topics which attract the greatest interesting from linguists for decades. By utilizing modern machine learning techniques, we simulated this process on computers, which helps to understand the

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Investigation Of Methods To Improve The Recognition Performance Of Tamil-English Code-Switched Data In Transformer Framework

00:11:00

0 views

Code-switching (CS) refers to (inter/intra-word) switching between multiple languages in a single conversation. In multilingual countries like India, CS occurs very often in everyday speech, resulting in a new breed of languages in urban regions like Hing

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Video Deblurring Via 3D Cnn And Fourier Accumulation Learning

00:11:56

0 views

Camera shake and target movement often leads to undesirable image blurring in videos. How to exploit spatial-temporal information of adjacent frames and reduce the processing time of deblurring are two major issues in video deblurring. In this paper, we p

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Hydranet: A Real-Time Waveform Separation Network

00:10:41

0 views

Real-time source separation has become increasingly important, as more and more applications, such as voice recognition and voice commands, require clean audio input in noisy environments. Recent developments in deep learning have allowed models to direct

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Computability Of The Peak Value Of Bandlimited Signals

00:12:19

0 views

In this paper we study the peak value problem, i.e., the task of computing the peak value of a bandlimited signal from its samples. The peak value problem is important, for example, in communications, where the peak value of the transmit signal has to be

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Meta-Learning Extractors For Music Source Separation

00:12:07

0 views

We propose a hierarchical meta-learning-inspired model for music source separation (Meta-TasNet) in which a generator model is used to predict the weights of individual extractor models. This enables efficient parameter-sharing, while still allowing for i

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Unsupervised Variational Bayesian Kalman Filtering For Large-Dimensional Gaussian Systems

00:13:50

0 views

This paper considers the unsupervised filtering problem for large-dimensional linear and Gaussian systems, a setup in which the optimal Kalman filter (KF) might not be usable due to the exorbitant computational cost and storage requirements. For this prob

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Cross-Stained Segmentation From Renal Biopsy Images Using Multi-Level Adversarial Learning

00:12:19

0 views

Segmentation from renal pathological images is a key step in automatic analyzing the renal histological characteristics. However, the performance of models varies significantly in different types of stained datasets due to the appearance variations. In th

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Multi-Conditioning And Data Augmentation Using Generative Noise Model For Speech Emotion Recognition In Noisy Conditions

00:14:41

0 views

Degradation due to additive noise is a significant road block in the real-life deployment of Speech Emotion Recognition (SER) systems. Most of the previous work in this field dealt with the noise degradation either at the signal or at the feature level. I

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

3D Unknown View Tomography Via Rotation Invariants

00:15:12

492 views

In this paper, we study the problem of reconstructing a 3D point source model from a set of 2D projections at unknown view angles. Our method obviates the need to recover the projection angles by extracting a set of rotation-invariant features from the no

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Hierarchical Sequence Representation With Graph Network

00:11:49

0 views

Video classification problem is a challenging task in computer vision. The performance of this task is highly relied on the scale of training data and the effectiveness of video embedding via a robust embedding network. Unsupervised solutions such as feat

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

High-Resolution Attention Network With Acoustic Segment Model For Acoustic Scene Classification

[2 Videos ]

The spectral information of acoustic scenes is diverse and complex, which poses challenges for acoustic scene tasks. To improve the classification performance, a variety of convolutional neural networks (CNNs) are proposed to extract richer semantic infor

Show videos in this product

High-Resolution Attention Network With Acoustic Segment Model For Acoustic Scene Classification

00:12:38

0 views

The spectral information of acoustic scenes is diverse and complex, which poses challenges for acoustic scene tasks. To improve the classification performance, a variety of convolutional neural networks (CNNs) are proposed to extract richer semantic infor
High-Resolution Attention Network With Acoustic Segment Model For Acoustic Scene Classification

00:00:00

703 views

The spectral information of acoustic scenes is diverse and complex, which poses challenges for acoustic scene tasks. To improve the classification performance, a variety of convolutional neural networks (CNNs) are proposed to extract richer semantic infor

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

The Fifthnet Chroma Extractor

00:12:22

583 views

Deep Learning (DL) is now commonly used in music processing such as Automatic Chord Recognition (ACR), with Convolutional Neural Networks (CNN) being popular in such tasks. Compression of CNNs has become a research topic of interest, focussed on post-prun

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Projected Weight Regularization To Improve Neural Network Generalization

00:09:39

0 views

Generalization of a deep neural network (DNN) is one major concern when employing the deep learning approach for solving practical problems. In this paper we propose a new technique, named projected weight regularization (PWR), to improve the generalizati

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Acoustic Modelling Based Remote Error Sensing Approach For Quiet Zone Generation In A Noisy Environment

00:11:43

907 views

Remote error sensing is required in active noise control systems when they are used to create a quiet zone in a noisy environment with the constraint that the error microphones cannot be inside the zone. The challenge in remote error sensing is to estimat

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Performance Analysis And Constellation Optimization Of Star-Qam-Aided Differential Faster-Than-Nyquist Signaling

00:14:34

0 views

In this letter, motivated by the recent differential faster-than-Nyquist (DFTN) signaling concept, we propose an improved 16-point double-ring star quadrature amplitude modulation (QAM)-aided DFTN signaling transmission, which allows us to attain a higher

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Direction Of Arrival Estimation For Reverberant Speech Based On Enhanced Decomposition Of The Direct Sound

00:14:58

0 views

Direction of arrival (DOA) estimation for speech sources is an important task in audio signal processing. This task becomes a challenge in reverberant environments, which are typical to real scenarios. Several DOA estimation methods for speech sources hav

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Realistic Real-Time Voice Swapping From Single Unpaired Sentences

00:04:35

610 views

We demonstrate a system that allows two speakers to swap their voices from any two unpaired sentences such that the result is indistinguishable from real voices and performed in real-time on a laptop. Each of the two speakers takes turns pronouncing any u

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Machine Learning-Based Adaptive Receive Filtering: Proof-Of-Concept On An Sdr Platform

00:11:15

2 views

The constant demand for low latency and high data rates in a modern mobile communications network creates new scientific challenges in each new generation. An accurate reconstruction of transmission data of as many users as possible at the base station is

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Empirical Study On Acoustic Feedback Path Across Hearing Aid Users

00:12:03

0 views

Acoustic feedback is one of the major problems in hearing aid applications. During a fitting session of a modern hearing aid, typically a feedback path prediction or an in situ measurement of feedback path is used as part of the gain and earpiece prescrip

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Fast Block-Sparse Estimation For Vector Networks

00:12:32

0 views

While there is now a significant literature on sparse inverse covariance estimation, all that literature, with only a couple of exceptions, has dealt only with univariate (or scalar) networks where each node carries a univariate signal. However in many, p

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

On Modeling Asr Word Confidence

00:16:58

0 views

We present a new method for computing ASR word confidences that effectively mitigates the effect of ASR errors for diverse downstream applications, improves the word error rate of the 1-best result, and allows better comparison of scores across different

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improving Reverberant Speech Training Using Diffuse Acoustic Simulation

00:09:34

1 view

We present an efficient and realistic geometric acoustic simulation approach for generating and augmenting training data in speech-related machine learning tasks. Our physically-based acoustic simulation method is capable of modeling occlusion, specular a

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Single-Shot Real-Time Multiple-Path Time-Of-Flight Depth Imaging For Multi-Aperture And Macro-Pixel Sensors

00:12:47

0 views

Multiple-Path Interference (MPI) is a major drawback of Time-of-Flight (ToF) sensors. MPI occurs when a ToF pixel receives more than a single light bounce from the scene. Current methods resolving more than a single return per pixel rely on the sequential

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Design-Gan: Cross-Category Fashion Translation Driven By Landmark Attention

00:10:34

0 views

The rise of generative adversarial networks has boosted a vast interest in the field of fashion image-to-image translation. However, previous methods do not perform well in cross-category translation tasks, e.g., translating jeans to skirts in fashion ima

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Near-Optimal Interference Exploitation 1-Bit Massive Mimo Precoding Via Partial Branch-And-Bound

00:16:51

0 views

In this paper, we focus on 1-bit precoding for large-scale antenna systems in the downlink based on the concept of constructive interference (CI). By formulating the optimization problem that aims to maximize the CI effect subject to the 1-bit constraint

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Theoretical Analysis Of Multi-Carrier Agile Phased Array Radar

00:12:18

0 views

Modern radar systems are expected to operate reliably in congested environments under cost and power constraints. A recent technology for realizing such systems is frequency agile radar (FAR), which transmits narrowband pulses in a frequency hopping manne

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Bin Encoding Training Of A Spiking Neural Network Based Voice Activity Detection

00:14:55

1 view

Advances of deep learning for Artificial Neural Networks(ANNs) have led to significant improvements in the performance of digital signal processing systems implemented on digital chips. Although recent progress in low-power chips is remarkable, neuromorph

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Adaptive Knowledge Distillation Based On Entropy

00:14:43

0 views

Knowledge distillation (KD) approach is widely used in the deep learning field mainly for model size reduction. KD utilizes soft labels of teacher model, which contain the dark- knowledge that one-hot ground-truth does not have. This knowledge can improve

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Audio-Based Auto-Tagging With Contextual Tags For Music

00:14:17

0 views

Music listening context such as location or activity has been shown to greatly influence the users' musical tastes. In this work, we study the relationship between user context and audio content in order to enable context-aware music recommendation agnost

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Discrete Wasserstein Autoencoders For Document Retrieval

00:13:17

0 views

Learning to hash via generative models has become a promising paradigm for fast similarity search in document retrieval. The binary hash codes are treated as Bernoulli latent variables when training a variational autoencoder (VAE). However, the prior of d

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

On Harmonic Approximations Of Inharmonic Signals

00:13:35

0 views

In this work, we present the misspecified Gaussian Cram'er-Rao lower bound for the parameters of a harmonic signal, or pitch, when signal measurements are collected from an almost, but not quite, harmonic model. For the asymptotic case of large sample si

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Denoising Of Event-Based Sensors With Spatial-Temporal Correlation

00:12:17

0 views

As a novel asynchronous-driven cameras, event-based sensors are with high sensitivity, fast speed and low data volume, but with abundant noise. Since the output of event-based sensors is in the form of address-event-representation (AER), the traditional f

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Dense Residual Network For Retinal Vessel Segmentation

00:13:28

0 views

Retinal vessel segmentation plays an imaportant role in the field of retinal image analysis because changes in retinal vascular structure can aid in the diagnosis of diseases such as hypertension and diabetes. In recent research, numerous successful segme

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Video Frame Interpolation Via Residue Refinement

00:12:06

0 views

Video frame interpolation achieves temporal super-resolution by generating smooth transitions between frames. Although great success has been achieved by deep neural networks, the synthesized images stills suffer from poor visual appearance and unsatisfie

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Single-Channel Speech Separation Integrating Pitch Information Based On A Multi Task Learning Framework

00:14:58

0 views

Pitch is a critical cue for speech separation in humans? auditory perception. Although the technology of tracking pitch in single-talker speech succeeds in many applications, it?s still a challenging problem to extract pitch information from speech mixtur

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Low Mutual And Average Coherence Dictionary Learning Using Convex Approximation

00:14:05

0 views

In dictionary learning, a desirable property for the dictionary is to be of low mutual and average coherences. Mutual coherence is defined as the maximum absolute correlation between distinct atoms of the dictionary, whereas the average coherence is a mea

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Robust Online Mirror Saddle-Point Method For Constrained Resource Allocation

00:13:34

0 views

Online-learning literature has focused on designing algorithms that ensure sub-linear growth of the cumulative long-term constraint violations. The drawback of this guarantee is that strictly feasible actions may cancel out constraint violations on other

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Adaptive Subspace Detectors For Off-Grid Mismatched Targets

00:15:19

1 view

Abstract In classical detection framework, the parameter space is usually discretized, so that in reality received parameter dependent signals are never perfectly aligned with the signal model under test: it leads to the off-grid signal mismatch. In a Gau

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Corrgan: Sampling Realistic Financial Correlation Matrices Using Generative Adversarial Networks

00:15:08

0 views

We propose a novel approach for sampling realistic financial correlation matrices. This approach is based on generative adversarial networks. Experiments demonstrate that generative adversarial networks are able to recover most of the known stylized facts

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Triggerless Random Interleaved Sampling

00:15:00

0 views

A single short sequence of samples taken at sub-Nyquist rate rarely allows for periodic signal recovery. If there is more than one such sequence and time offsets between these sequences are given, the signal approximation is possible and is known as equiv

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Ava Active Speaker: An Audio-Visual Dataset For Active Speaker Detection

00:14:36

0 views

Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings, speech enhancement, and human-robot interaction. The absence of a large, carefully labeled audio

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Self-Supervised Deep Learning For Fisheye Image Rectification

00:13:31

0 views

To rectify fisheye distortion from a single image, we advance self-supervised learning strategies and propose a unique deep learning model of Fisheye GAN (FE-GAN). Our FEGAN learns pixel-level distortion flow from sets of fisheye distorted images and dist

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Efficient Techniques For In-Band System Information Broadcast In Multi-Cell Massive Mimo

00:14:46

0 views

In this paper we consider joint beamforming of data to scheduled terminals (STs) and broadcast of system information (SI) to idle terminals (ITs) on the same time-frequency resource in multi-cell multi-user massive MIMO systems. We propose two different m

All Channels page: Communities submenu block

Communities

All Channels page: Societies submenu block

Societies

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

2020 EAB AWARDS

2020 EAB AWARDS

IEEE ICASSP 2020 Virtual Conference May 2020