IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

Showing 601 - 650 of 1951

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Using Personalized Speech Synthesis And Neural Language Generator For Rapid Speaker Adaptation

00:23:49

0 views

We propose to use the personalized speech synthesis and the neural language generator to synthesize content relevant personalized speech for rapid speaker adaptation. It has two distinct aspects: First, it relieves the general data sparsity issue in rapid

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Signal Sensing And Reconstruction Paradigms For A Novel Multi-Source Static Computed Tomography System

00:15:52

0 views

Conventional Computed Tomography (CT) systems use a single X-ray source and an arc of detectors mounted on a rotating gantry to acquire a set of projection data. Novel CT systems are now being pioneered in which a complete ring of distributed X-ray source

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Computation Of "Best" Interpolants In The Lp Sense

00:14:40

0 views

We study a variant of the interpolation problem where the continuously defined solution is regularized by minimizing the Lp-norm of its second-order derivative. For this continuous-domain problem, we propose an exact discretization scheme that restricts t

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Fast Intent Classification For Spoken Language Understanding Systems

00:14:11

0 views

Spoken Language Understanding (SLU) systems consist of several machine learning components operating together (e.g. intent classification, named entity resolution and recognition). Deep learning models have obtained state of the art results on several of

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Exploiting Channel Locality For Adaptive Massive Mimo Signal Detection

00:12:03

0 views

We propose MMNet, a deep learning MIMO detection scheme that significantly outperforms existing approaches on realistic channels with the same or lower computational complexity. MMNet?s design builds on the theory of iterative soft-thresholding algorithms

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Hybrid Model For Bipolar Disorder Classification From Visual Information

00:12:35

1 view

Bipolar Disorder (BD) is one of the most prevalent mental illnesses in the world. It has a negative impact on people?s social and personal functions. The principal indicator of BD is the extreme swing in the mood ranging from manic to depressive states. T

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Densely Connected Neural Network With Dilated Convolutions For Real-Time Speech Enhancement In The Time Domain

00:14:19

0 views

In this work, we propose a fully convolutional neural network for real-time speech enhancement in the time domain. The proposed network is an encoder-decoder based architecture with skip connections. The layers in the encoder and the decoder are followed

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Person Identification Using Deep Convolutional Neural Networks On Short-Term Signals From Wearable Sensors

00:11:47

2 views

In this work, we explore the discriminating ability of short-term signal patterns (e.g. few minutes long) with respect to the person identification task. We focus on signals recorded by simple wearable devices, such as smartwatches, which can measure move

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Multimodal Learning For Classroom Activity Detection

00:10:24

0 views

Classroom activity detection (CAD) focuses on accurately classifying whether the teacher or student is speaking and recording both the length of individual utterances during a class. A CAD solution helps teachers get instant feedback on their pedagogical

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Disentangled Speech Embeddings Using Cross-Modal Self-Supervision

00:12:25

0 views

The objective of this paper is to learn representations of speaker identity without access to manually annotated data. To do so, we develop a self-supervised learning objective that exploits the natural cross-modal synchrony between faces and audio in vid

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Unsupervised Retinal Vessel Extraction And Segmentation Method Based On A Tube Marked Point Process Model

00:14:41

0 views

Retinal vessel extraction and segmentation is essential for supporting diagnosis of eye-related diseases. In recent years, deep learning has been applied to vessel segmentation and achieved excellent performance. However, these supervised methods require

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Super-Resolution Of 3D Color Point Clouds Via Fast Graph Total Variation

00:14:01

0 views

3D point clouds acquired by low-cost sensors are often in lower spatial resolutions than desired for rendering images on high-resolution displays. In this paper, we propose a fast super-resolution (SR) algorithm for color 3D point clouds. We first populat

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Concentration-Based Polynomial Calculations On Nicked Dna

00:15:22

0 views

In this paper, we introduce a novel scheme for computing polynomial functions on a substrate of nicked DNA. We first discuss a fractional encoding of data, based on the concentration of nicked double DNA strands. Then we show how to perform multiplication

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Soft-Output Finite Alphabet Equalization For Mmwave Massive Mimo

00:14:48

0 views

Next-generation wireless systems are expected to combine millimeter-wave (mmWave) and massive multi-user multiple-input multiple-output (MU-MIMO) technologies to deliver high data-rates. These technologies require the basestations (BSs) to process high-di

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Age-Based Scheduling Policy For Federated Learning In Mobile Edge Networks

00:28:50

0 views

Federated learning (FL) is a machine learning model that preserves data privacy in the training process. Specifically, FL brings the model directly to the user equipments (UEs) for local training, where an edge server periodically collects the trained par

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Prototypical Networks For Small Footprint Text-Independent Speaker Verification

00:12:49

0 views

Speaker verification aims to recognize target speakers with very few enrollment utterances. Conventional approaches learn a representation model to extract the speaker embeddings for verification. Recently, there are several new approaches in meta-learnin

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

2D-To-2D Mask Estimation For Speech Enhancement Based On Fully Convolutional Neural Network

00:12:55

0 views

In recent years, the deep learning-based approaches are popular in the field of singe-channel speech enhancement. Convolutional neural networks (CNNs) are a standard component of many current speech enhancement system. In this study, we design a new Fully

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Gated Mechanism For Attention Based Multimodal Sentiment Analysis

00:09:10

0 views

Multimodal sentiment analysis has recently gained popularity because of its relevance to social media posts, customer service calls and video blogs. In this paper, we address three aspects of multimodal sentiment analysis; 1. Cross modal interaction learn

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Empirical Study Of Transformer-Based Neural Language Model Adaptation

00:14:41

0 views

We explore two adaptation approaches of deep Transformer based neural language models (LMs) for automatic speech recognition. The first approach is a pretrain-finetune framework, where we first pretrain a Transformer LM on a large-scale text corpus from s

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

One-Shot Parametric Audio Production Style Transfer With Application To Frequency Equalization

00:13:30

0 views

Audio production is a difficult process for many people, and properly manipulating sound to achieve a certain effect is non-trivial. In this paper, we present a method that facilitates this process by inferring appropriate audio effect parameters in order

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Transformer Transducer: A Streamable Speech Recognition Model With Transformer Encoders And Rnn-T Loss

00:14:09

3 views

In this paper we present an end-to-end speech recognition model with Transformer encoders that can be used in a streaming speech recognition system. Transformer computation blocks based on self-attention are used to encode both audio and label sequences i

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Decentralized Min-Max Optimization: Formulations, Algorithms And Applications In Network Poisoning Attack

00:13:50

0 views

This paper discusses formulations and algorithms which allow a number of agents to collectively solve problems involving both (non-convex) minimization and (concave) maximization operations. These problems have a number of interesting applications in info

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Cross-Vae: Towards Disentangling Expression From Identity For Human Faces

00:12:01

0 views

Facial expression and identity are two independent yet intertwined components for representing a face. For facial expression recognition, identity can contaminate the training procedure by providing tangled but irrelevant information. In this paper, we pr

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Toso: Student's-T Distribution Aided One-Stage Orientation Target Detection In Remote Sensing Images

00:12:19

0 views

In this paper, a robust Student?s-T distribution aided One-Stage Orientation detector, namely TOSO, is proposed to address orientation target detection in remote sensing images. A one-stage keypoint based network architecture is used to avoid the complica

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Adversarial Example Detection By Classification For Deep Speech Recognition

00:14:08

0 views

Machine Learning systems are vulnerable to adversarial attacks and will highly likely produce incorrect outputs under these attacks. There are white-box and black-box attacks regarding to adversary?s access level to the victim learning algorithm. To defen

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Aligntts: Efficient Feed-Forward Text-To-Speech System Without Explicit Alignment

00:12:03

0 views

Targeting at both high efficiency and performance, we propose AlignTTS to predict the mel-spectrum in parallel. AlignTTS is based on a Feed-Forward Transformer which generates mel-spectrum from a sequence of characters, and the duration of each character

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Weakly Supervised Segmentation Guided Hand Pose Estimation During Interaction With Unknown Objects

00:11:36

0 views

Hand pose estimation is important for human computer interaction, but the performance is not satisfying when the hand is interacting with objects. To alleviate the influence of unknown objects, we propose a novel weakly supervised segmentation guided sche

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Deep Geometric Knowledge Distillation With Graphs

00:14:59

0 views

In most cases deep learning architectures are trained disregarding the amount of operations and energy consumption. However, some applications, like embedded systems, can be resource-constrained during inference. A popular approach to reduce the size of a

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Audio-Assisted Image Inpainting For Talking Faces

00:13:42

0 views

The goal of our work is to complete missing areas of images of talking faces, exploiting information from both the visual and audio modalities. Existing image inpainting methods rely solely on visual content that doesn?t always provide sufficient informat

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improving Spoken Question Answering Using Contextualized Word Representation

00:15:44

0 views

While question answering (QA) systems have witnessed great breakthroughs in reading comprehension (RC) tasks, spoken question answering (SQA) is still a much less investigated area. Previous work shows that existing SQA systems are limited by catastrophic

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Unsupervised Neural Mask Estimator For Generalized Eigen-Value Beamforming Based Asr

00:13:10

0 views

The state-of-art methods for acoustic beamforming in multi-channel ASR is based on a neural mask estimator that attempts to learn the prediction of speech and noise using a paired corpus of clean and noisy recordings (teacher model). In this paper, we att

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Learning Noise Invariant Features Through Transfer Learning For Robust End-To-End Speech Recognition

00:13:56

0 views

End-to-end models yield impressive speech recognition results on clean datasets while having inferior performance on noisy datasets. To address this, we propose transfer learning from a clean dataset (WSJ) to a noisy dataset (CHiME-4) for connectionist te

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Prediction Of Vessel Trajectories From Ais Data Via Sequence-To-Sequence Recurrent Neural Networks

00:13:30

0 views

In this paper, we address the problem of predicting vessel trajectories based on Automatic Identification System (AIS) data. The goal is to learn the predictive distribution of maritime traffic patterns using historical data during the training phase, in

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Confirmnet: Convolutional Firmnet And Application To Image Denoising And Inpainting

00:13:57

0 views

We address the problem of efficient convolutional sparse coding (CSC) and develop a non-convex-penalty-regularized CSC formulation, namely, minimax-concave CSC (MC2SC). MC2SC leads to an optimal sparse representation than the standard ell_1-penalty based

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

On The Stability Of Polynomial Spectral Graph Filters

00:13:21

0 views

Spectral graph filters are a key component in state-of-the-art machine learning models used for graph-based learning, such as graph neural networks. For certain tasks stability of the spectral graph filters is important for learning suitable representatio

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Method For Millimeter-Wave Imaging Of Concealed Objects Via De-Aliasing

00:12:43

1055 views

We consider the problem of millimeter-wave (MMW) imaging for concealed objects using a transceiver antenna array. In practical implementations, larger array element spacing leads to aliasing in the spectrum of the received echo signals. In this paper, we

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Key Action And Joint Ctc-Attention Based Sign Language Recognition

00:13:19

0 views

Sign Language Recognition (SLR) translates sign language video into natural language. In practice, sign language video, owning a large number of redundant frames, is necessary to be selected the essential. However, unlike common video that describes actio

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Stochastic Ml Estimation For Hyperspectral Unmixing Under Endmember Variability And Nonlinear Models

00:14:57

0 views

Hyperspectral unmixing (HU) is a problem of blindly identifying the underlying materials, in form of spectral signatures, in the captured hyperspectral image. HU has received tremendous interest in remote sensing, and fundamentally the problem can be rega

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Uncertainty Quantification For Remaining Useful Lifetime Prediction With Multi-Channel Sensory Data

00:12:32

457 views

For remaining useful lifetime (RUL) prediction with multi-channel sensory data, long-term prediction has more uncertainty than short-term prediction. In this paper, the ratio of mean to variance was considered to measure the uncertainty propagation rate (

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Limitations Of Weak Labels For Embedding And Tagging

00:14:29

0 views

Many datasets and approaches in ambient sound analysis use weakly labeled data. Weak labels are employed because annotating every data sample with a strong label is too expensive. Yet, their impact on the performance in comparison to strong labels remains

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Graphem: Em Algorithm For Blind Kalman Filtering Under Graphical Sparsity Constraints

00:09:46

2 views

Modeling and inference with multivariate sequences is central in a number of signal processing applications such as acoustics, social network analysis, biomedical, and finance, to name a few. The linear-Gaussian state-space model is a common way to descri

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Deep Neural Network-Driven Feature Learning Method For Polyphonic Acoustic Event Detection From Real-Life Recordings

00:13:00

0 views

In this paper, a Deep Neural Network (DNN)-driven feature learning method for polyphonic Acoustic Event Detection (AED) is proposed. The proposed DNN is a combination of different layers used to characterize multiple overlapped acoustic events in the mixt

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Swift-Link: A Compressive Beam Alignment Algorithm For Practical Mmwave Radios

00:14:46

0 views

Millimeter wave (mmWave) bands offer a large amount of spectrum that can support many high data rate applications. To efficiently use the spectrum at mmWave, the wireless link between the transmitting and receiving radios must be configured properly. Comp

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Sylnet: An Adaptable End-To-End Syllable Count Estimator For Speech

00:13:36

773 views

Automatic syllable count estimation (SCE) is used in a variety of applications ranging from speaking rate estimation to detecting social activity from wearable microphones or developmental research concerned with quantifying speech heard by language-learn

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Curriculum Learning For Speech Emotion Recognition From Crowdsourced Labels

00:14:45

0 views

This study introduces a method to design a curriculum for machine-learning to maximize the efficiency during the training process of deep neural networks (DNNs) for speech emotion recognition. Previous studies in other machine-learning problems have shown

All Channels page: Communities submenu block

Communities

All Channels page: Societies submenu block

Societies

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

2020 EAB AWARDS

2020 EAB AWARDS

IEEE ICASSP 2020 Virtual Conference May 2020