Search IEEE.tv

Showing 769 - 792 of 23823

Self-Supervised Learning For Ecg-Based Emotion Recognition

We present an electrocardiogram (ECG) -based emotion recognition system using self-supervised learning. Our proposed architecture consists of two main networks, a signal transformation recognition…

A Differential Approach For Rain Field Tomographic Reconstruction Using Microwave Signals From Leo Satellites

A differential approach is proposed for tomographic rain field reconstruction using the estimated signal-to-noise ratio of microwave signals from low earth orbit satellites at the ground receivers,…

Spectrum Analysis: RF Boot Camp

What is Spectrum and Signal Analysis? Hear which measurements are available, the theory of operation, modern designs, capabilities and more. An expert from Keysight Technologies defines the 3 main…

August 3, 2015

884 views

Manet: Multi-Scale Aggregated Network For Light Field Depth Estimation

We present a novel end-to-end network, MANet, for light field depth estimation. MANet is a parameter-effective and efficient multi-scale aggregated network, which is about 3 times smaller and 3 times…

Roimix: Proposal-Fusion Among Multiple Images For Underwater Object Detection

Generic object detection algorithms have proven their excellent performance in recent years. However, object detection on underwater datasets is still less explored. In contrast to generic datasets,…

Audio-Based Auto-Tagging With Contextual Tags For Music

Music listening context such as location or activity has been shown to greatly influence the users' musical tastes. In this work, we study the relationship between user context and audio content in…

Snorer Diarisation Based On Deep Neural Network Embeddings

Acoustic analysis of sleep breathing sounds using a smartphone at home provides a much less obtrusive means of screening for sleep-disordered breathing (SDB) than assessment in a sleep clinic.…

Full Reference Video Quality Measures Improvement Using Neural Networks

The accuracy of video quality metrics (VQMs) is an important issue for several applications. In this work, first we observe that the accuracy of several video quality metrics (VQMs) is strongly…

Deep Learning For Robust Power Control For Wireless Networks

Robust optimization is an important task in wireless communications, because due to fading and feedback delay there is inherent uncertainty in channel state information in a wireless environment.…

Mixup-Breakdown: A Consistency Training Method For Improving Generalization Of Speech Separation Models

Deep-learning based speech separation models confront poor generalization problem that even the state-of-the-art models could abruptly fail when evaluating them in mismatch conditions. To address…

An Attention Enhanced Multi-Task Model For Objective Speech Assessment In Real-World Environments

Computational objective metrics that use reference signals have been shown to be effective forms of speech assessment in simulated environments, since they are correlated with subjective listening…

Maximum Likelihood Estimation Of The Interference-Plus-Noise Cross Power Spectral Density Matrix For Own Voice Retrieval

In headset and hearing aid applications, it is of interest to retrieve the user's own voice in a noisy environment, e.g. for telephony applications. To do so, the cross power spectral density (CPSD)…

1 views

A Whiteness Test Based On The Spectral Measure Of Large Non-Hermitian Random Matrices

In the context of multivariate time series, a whiteness test against an MA(1) correlation model is proposed. This test is built on the eigenvalue distribution (spectral measure) of the non-Hermitian…

Neural Coding Strategies For Event-Based Vision Data

Neural coding schemes are powerful tools used within neuroscience. This paper introduces three different neural coding scheme formations for event-based vision data which are designed to emulate the…

Self-Paced Probabilistic Principal Component Analysis For Data With Outliers

Principal Component Analysis (PCA) is a popular tool for dimension reduction and feature extraction in data analysis. Probabilistic PCA (PPCA) extends the standard PCA by using a probabilistic model…

Kazuhiro Hono

Visually Guided Self Supervised Learning Of Speech Representations

Self supervised representation learning has recently attracted a lot of research interest for both the audio and visual modalities. However, most works typically focus on a particular modality or…

Coded Illumination And Multiplexing For Lensless Imaging

Mask-based lensless cameras offer an alternative option to conventional cameras. Compared to conventional cameras, lensless cameras can be extremely thin, flexible, and light-weight. Despite these…

Translation Of A Higher Order Ambisonics Sound Scene Based On Parametric Decomposition

This paper presents a novel 3DoF+ system that allows to navigate, i.e., change position, in scene-based spatial audio content beyond the sweet spot of a Higher Order Ambisonics recording. It is one…

Enhancing The Labelling Of Audio Samples For Automatic Instrument Classification Based On Neural Networks

The polyphonic OpenMIC-2018 dataset is based on weak and incomplete labels. The automatic classification of sound events, based on the VGGish bottleneck layer as proposed before by the AudioSet,…

Triplet Loss Feature Aggregation For Scalable Hash

The increasing demands of high resolution and quality aggravate the status of heavy burden of cluster storage side and restricted bandwidth resources. Hence, video de-duplication in storage and…

Towards A New Understanding Of The Training Of Neural Networks With Mislabeled Training Data

We investigate the problem of machine learning with mislabeled training data. We try to make the effects of mislabeled training better understood through analysis of the basic model and equations…

Tensor Decomposition-Based Beamspace Esprit Algorithm For Multidimensional Harmonic Retrieval

Beamspace processing is an efficient and commonly used approach in harmonic retrieval (HR). In the beamspace, measurements are obtained by linearly transforming the sensing data, thereby achieving a…

Embedded Large–Scale Handwritten Chinese Character Recognition

As handwriting input becomes more prevalent, the large symbol inventory required to support Chinese handwriting recognition poses unique challenges. This paper describes how the Apple deep learning…

All Channels page: Communities submenu block

Communities

IEEE.tv Specials

IEEE Women in Engineering

IEEE Awards

IEEE TechEthics™

IEEE Students

All Channels page: Societies submenu block

Societies

IEEE Future Directions

IEEE Computer Society

IEEE Society on Social Implications of Technology

IEEE Communications Society

IEEE Nuclear and Plasma Sciences Society

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

Quantum Technologies in Europe: The Quantum Flagship Initiative - Applied Superconductivity Conference 2018

Recent Research Activities of Applied Superconductivity in China

NSF's Platforms for Advanced Wireless Research (PAWR) - IEEE Future Networks Webinar

Algorithmic Decision Making: Impacts and Implications - IEEE Internet Initiative Webinar

Technologies Advancing Humanity - What are We Most Passionate About: 2017 Brain Fuel President's Chat

2020 EAB AWARDS

2020 EAB AWARDS

About IEEE