cs.AI - Artificial Intelligence

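    cs.CL - Computation and Language
    cs.CV - Computer Vision and Pattern Recognition
    cs.DB - Databases
    cs.DC - Distributed, Parallel, and Cluster Computing
    cs.DS - Data Structures and Algorithms
    cs.GR - Computer Graphics
    cs.GT - Computer Science and Game Theory
    cs.HC - Human-Computer Interaction
    cs.IR - Information Retrieval
    cs.IT - Information Theory
    cs.LG - Machine Learning
    cs.LO - Logic in Computer Science
    cs.MA - Multiagent Systems
    cs.MM - Multimedia
    cs.NE - Neural and Evolutionary Computing
    cs.NI - Networking and Internet Architecture
    cs.PF - Performance
    cs.PL - Programming Languages
    cs.RO - Robotics
    cs.SI - Social and Information Networks
    econ.TH - Theoretical Economics
    eess.IV - Image and Video Processing
    eess.SP - Signal Processing
    math.AC - Commutative Algebra
    math.CO - Combinatorics
    math.DS - Dynamical Systems
    math.GR - Group Theory
    math.OC - Optimization and Control
    math.ST - Statistics Theory
    physics.flu-dyn - Fluid Dynamics
    physics.med-ph - Medical Physics
    physics.soc-ph - Physics and Society
    q-bio.NC - Neurons and Cognition
    q-bio.QM - Quantitative Methods
    q-fin.PM - Portfolio Management
    quant-ph - Quantum Physics
    stat.AP - Applied Statistics
    stat.ME - Statistical Methodology
    stat.ML - Machine Learning (Statistics)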

    • [cs.AI]A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint
    • [cs.AI]AMP Chain Graphs: Minimal Separators and Structure Learning Algorithms
    • [cs.AI]Counterfactual fairness: removing direct effects through regularization
    • [cs.AI]Declarative Memory-based Structure for the Representation of Text Data
    • [cs.AI]Efficient exploration of zero-sum stochastic games
    • [cs.AI]FairRec: Two-Sided Fairness for Personalized Recommendations in Two-Sided Platforms
    • [cs.AI]Forming Diverse Teams from Sequentially Arriving People
    • [cs.AI]Injecting Domain Knowledge in Neural Networks: a Controlled Experiment on a Constrained Problem
    • [cs.AI]Problems with Shapley-value-based explanations as feature importance measures
    • [cs.AI]Turning 30: New Ideas in Inductive Logic Programming
    • [cs.CL]A more abstractive summarization model
    • [cs.CL]BERT Can See Out of the Box: On the Cross-modal Transferability of Text Representations
    • [cs.CL]Detecting Asks in SE attacks: Impact of Linguistic and Structural Knowledge
    • [cs.CL]Differentiable Reasoning over a Virtual Knowledge Base
    • [cs.CL]End-to-end Emotion-Cause Pair Extraction via Learning to Link
    • [cs.CL]Event Detection with Relation-Aware Graph Convolutional Neural Networks
    • [cs.CL]Exploring BERT Parameter Efficiency on the Stanford Question Answering Dataset v2.0
    • [cs.CL]KEML: A Knowledge-Enriched Meta-Learning Framework for Lexical Relation Classification
    • [cs.CL]Label-guided Learning for Text Classification
    • [cs.CL]Language-Independent Tokenisation Rivals Language-Specific Tokenisation for Word Similarity Prediction
    • [cs.CL]MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers
    • [cs.CL]MuST-Cinema: a Speech-to-Subtitles corpus
    • [cs.CL]Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
    • [cs.CL]Parsing Early Modern English for Linguistic Search
    • [cs.CL]Semantic Relatedness for Keyword Disambiguation: Exploiting Different Embeddings
    • [cs.CL]Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks
    • [cs.CV]3D-MiniNet: Learning a 2D Representation from Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation
    • [cs.CV]A Deep Learning Framework for Simulation and Defect Prediction Applied in Microelectronics
    • [cs.CV]ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
    • [cs.CV]Anatomy-aware 3D Human Pose Estimation in Videos
    • [cs.CV]Circle Loss: A Unified Perspective of Pair Similarity Optimization
    • [cs.CV]Copy and Paste GAN: Face Hallucination from Shaded Thumbnails
    • [cs.CV]Cross-layer Feature Pyramid Network for Salient Object Detection
    • [cs.CV]DDet: Dual-path Dynamic Enhancement Network for Real-World Image Super-Resolution
    • [cs.CV]Deep Representation Learning on Long-tailed Data: A Learnable Embedding Augmentation Perspective
    • [cs.CV]Evaluating Registration Without Ground Truth
    • [cs.CV]Exploring Learning Dynamics of DNNs via Layerwise Conditioning Analysis
    • [cs.CV]FPConv: Learning Local Flattening for Point Convolution
    • [cs.CV]Fast Loop Closure Detection via Binary Content
    • [cs.CV]Fault Diagnosis in Microelectronics Attachment via Deep Learning Analysis of 3D Laser Scans
    • [cs.CV]Freeze Discriminator: A Simple Baseline for Fine-tuning GANs
    • [cs.CV]Globally Optimal Contrast Maximisation for Event-based Motion Estimation
    • [cs.CV]Ground Texture Based Localization Using Compact Binary Descriptors
    • [cs.CV]Hierarchical Conditional Relation Networks for Video Question Answering
    • [cs.CV]MPM: Joint Representation of Motion and Position Map for Cell Tracking
    • [cs.CV]MagnifierNet: Towards Semantic Regularization and Fusion for Person Re-identification
    • [cs.CV]On Pruning Adversarially Robust Neural Networks
    • [cs.CV]Optimal least-squares solution to the hand-eye calibration problem
    • [cs.CV]PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
    • [cs.CV]Random Bundle: Brain Metastases Segmentation Ensembling through Annotation Randomization
    • [cs.CV]Real-time Fusion Network for RGB-D Semantic Segmentation Incorporating Unexpected Obstacle Detection for Road-driving Images
    • [cs.CV]Revisiting Saliency Metrics: Farthest-Neighbor Area Under Curve
    • [cs.CV]ScopeFlow: Dynamic Scene Scoping for Optical Flow
    • [cs.CV]See, Attend and Brake: An Attention-based Saliency Map Prediction Model for End-to-End Driving
    • [cs.CV]Toward fast and accurate human pose estimation via soft-gated skip connections
    • [cs.CV]Towards Better Surgical Instrument Segmentation in Endoscopic Vision: Multi-Angle Feature Aggregation and Contour Supervision
    • [cs.CV]Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training
    • [cs.CV]Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference
    • [cs.CV]Triplet Online Instance Matching Loss for Person Re-identification
    • [cs.CV]When Relation Networks meet GANs: Relation GANs with Triplet Loss
    • [cs.DB]BAD to the Bone: Big Active Data at its Core
    • [cs.DC]A$^3$: Accelerating Attention Mechanisms in Neural Networks with Approximation
    • [cs.DC]Analysis of Amnesiac Flooding
    • [cs.DC]Combining Learning and Optimization for Transprecision Computing
    • [cs.DC]Distributed Edge Coloring in Time Quasi-Polylogarithmic in Delta
    • [cs.DC]Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition
    • [cs.DC]Graph Computing based Distributed State Estimation with PMUs
    • [cs.DC]Near Optimal Task Graph Scheduling with Priced Timed Automata and Priced Timed Markov Decision Processes
    • [cs.DC]Quantized Push-sum for Gossip and Decentralized Optimization over Directed Graphs
    • [cs.DS]Efficient and Simple Algorithms for Fault Tolerant Spanners
    • [cs.GR]Image Stylization: From Predefined to Personalized
    • [cs.GR]PolyGen: An Autoregressive Generative Model of 3D Meshes
    • [cs.GT]Inducing Equilibria in Networked Public Goods Games through Network Structure Modification
    • [cs.HC]ORCSolver: An Efficient Solver for Adaptive GUI Layout with OR-Constraints
    • [cs.IR]Abstractive Snippet Generation
    • [cs.IR]Data Augmentation for Personal Knowledge Graph Population
    • [cs.IR]Leveraging Code Generation to Improve Code Retrieval and Summarization via Dual Learning
    • [cs.IT]Amplitude and Phase Estimation for Absolute Calibration of Massive MIMO Front-Ends
    • [cs.IT]Deep Reinforcement Learning for Intelligent Reflecting Surfaces: Towards Standalone Operation
    • [cs.IT]Enhancing Physical Layer Security of Random Caching in Large-Scale Multi-Antenna Heterogeneous Wireless Networks
    • [cs.IT]Learning Beam Codebooks with Neural Networks: Towards Environment-Aware mmWave MIMO
    • [cs.IT]LoRa beyond ALOHA: An Investigation of Alternative Random Access Protocols
    • [cs.IT]Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning
    • [cs.IT]Study of Coarse Quantization-Aware Block Diagonalization Algorithms for MIMO Systems with Low Resolution
    • [cs.LG](De)Randomized Smoothing for Certifiable Defense against Patch Attacks
    • [cs.LG]A Theory of Usable Information Under Computational Constraints
    • [cs.LG]Adaptive Distributed Stochastic Gradient Descent for Minimizing Delay in the Presence of Stragglers
    • [cs.LG]Batch norm with entropic regularization turns deterministic autoencoders into generative models
    • [cs.LG]Breaking Batch Normalization for better explainability of Deep Neural Networks through Layer-wise Relevance Propagation
    • [cs.LG]Coherent Gradients: An Approach to Understanding Generalization in Gradient Descent-based Optimization
    • [cs.LG]Diversity-Based Generalization for Neural Unsupervised Text Classification under Domain Shift
    • [cs.LG]Efficient Rollout Strategies for Bayesian Optimization
    • [cs.LG]Federated Learning for Resource-Constrained IoT Devices: Panoramas and State-of-the-art
    • [cs.LG]General Framework for Binary Classification on Top Samples
    • [cs.LG]Gödel’s Sentence Is An Adversarial Example But Unsolvable
    • [cs.LG]HarDNN: Feature Map Vulnerability Evaluation in CNNs
    • [cs.LG]Human Apprenticeship Learning via Kernel-based Inverse Reinforcement Learning
    • [cs.LG]I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifiers Adaptively
    • [cs.LG]Interpolating Between Gradient Descent and Exponentiated Gradient Using Reparameterized Gradient Descent
    • [cs.LG]Learning the mapping $\mathbf{x}\mapsto \sum_{i=1}^d x_i^2$: the cost of finding the needle in a haystack
    • [cs.LG]Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements
    • [cs.LG]Modeling Continuous Stochastic Processes with Dynamic Normalizing Flows
    • [cs.LG]Neural Networks are Convex Regularizers: Exact Polynomial-time Convex Optimization Formulations for Two-Layer Networks
    • [cs.LG]Novel Change of Measure Inequalities and PAC-Bayesian Bounds
    • [cs.LG]Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration
    • [cs.LG]On Feature Normalization and Data Augmentation
    • [cs.LG]On Reinforcement Learning for Turn-based Zero-sum Markov Games
    • [cs.LG]Optimal Gradient Quantization Condition for Communication-Efficient Distributed Training
    • [cs.LG]Practical and Bilateral Privacy-preserving Federated Learning
    • [cs.LG]Precise Tradeoffs in Adversarial Training for Linear Regression
    • [cs.LG]Progressive Learning and Disentanglement of Hierarchical Representations
    • [cs.LG]Provable Representation Learning for Imitation Learning via Bi-level Optimization
    • [cs.LG]Relevant-features based Auxiliary Cells for Energy Efficient Detection of Natural Errors
    • [cs.LG]Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
    • [cs.LG]Robust Estimation, Prediction and Control with Linear Dynamics and Generic Costs
    • [cs.LG]Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent
    • [cs.LG]Searching for Winograd-aware Quantized Networks
    • [cs.LG]Sequence-to-Sequence Imputation of Missing Sensor Data
    • [cs.LG]Stochastic-Sign SGD for Federated Learning with Theoretical Guarantees
    • [cs.LG]Subspace Fitting Meets Regression: The Effects of Supervision and Orthonormality Constraints on Double Descent of Generalization Errors
    • [cs.LG]Teaching the Old Dog New Tricks: Supervised Learning with Constraints
    • [cs.LG]The Curious Case of Adversarially Robust Models: More Data Can Help, Double Descend, or Hurt Generalization
    • [cs.LG]Three Approaches for Personalization with Applications to Federated Learning
    • [cs.LG]Towards an Efficient and General Framework of Robust Training for Graph Neural Networks
    • [cs.LG]Training Binary Neural Networks using the Bayesian Learning Rule
    • [cs.LG]Understanding and Mitigating the Tradeoff Between Robustness and Accuracy
    • [cs.LG]Variational Hyper RNN for Sequence Modeling
    • [cs.LG]Variational Wasserstein Barycenters for Geometric Clustering
    • [cs.LO]Facets of the PIE Environment for Proving, Interpolating and Eliminating on the Basis of First-Order Logic
    • [cs.MA]Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
    • [cs.MM]A Comparative Evaluation of Temporal Pooling Methods for Blind Video Quality Assessment
    • [cs.MM]Model Watermarking for Image Processing Networks
    • [cs.NE]An Assignment Problem Formulation for Dominance Move Indicator
    • [cs.NE]Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity
    • [cs.NE]Multi-objective beetle antennae search algorithm
    • [cs.NE]Non-Volatile Memory Array Based Quantization- and Noise-Resilient LSTM Neural Networks
    • [cs.NE]Separating the Effects of Batch Normalization on CNN Training Speed and Stability Using Classical Adaptive Filter Theory
    • [cs.NI]Network-Density-Controlled Decentralized Parallel Stochastic Gradient Descent in Wireless Systems
    • [cs.NI]Personalized Federated Learning for Intelligent IoT Applications: A Cloud-Edge based Framework
    • [cs.PF]Learning Queuing Networks by Recurrent Neural Networks
    • [cs.PL]MLIR: A Compiler Infrastructure for the End of Moore’s Law
    • [cs.RO]Alternating Minimization Based Trajectory Generation for Quadrotor Aggressive Flight
    • [cs.RO]Denoising IMU Gyroscopes with Deep Learning for Open-Loop Attitude Estimation
    • [cs.RO]Estimating Human Teleoperator Posture Using Only a Haptic-Input Device
    • [cs.RO]Feasible Computationally Efficient Path Planning for UAV Collision Avoidance
    • [cs.RO]Human Perception-Optimized Planning for Comfortable VR-Based Telepresence
    • [cs.RO]Learning Machines from Simulation to Real World
    • [cs.RO]Least Squares Optimization: from Theory to Practice
    • [cs.RO]Non-Gaussian Chance-Constrained Trajectory Planning for Autonomous Vehicles in the Presence of Uncertain Agents
    • [cs.RO]Optimisation of Body-ground Contact for Augmenting Whole-Body Loco-manipulation of Quadruped Robots
    • [cs.RO]Safe Optimal Control under Parametric Uncertainties
    • [cs.SI]Automating Discovery of Dominance in Synchronous Computer-Mediated Communication
    • [cs.SI]MIDMod-OSN: A Microscopic-level Information Diffusion Model for Online Social Networks
    • [cs.SI]Migration Networks: Applications of Network Analysis to Large-Scale Human Mobility
    • [econ.TH]A Practical Approach to Social Learning
    • [eess.IV]Co-VeGAN: Complex-Valued Generative Adversarial Network for Compressive Sensing MR Image Reconstruction
    • [eess.IV]Deep learning predicts total knee replacement from magnetic resonance images
    • [eess.IV]Fully-automated Body Composition Analysis in Routine CT Imaging Using 3D Semantic Segmentation Convolutional Neural Networks
    • [eess.IV]Recalibrating 3D ConvNets with Project & Excite
    • [eess.IV]Technical report: Kidney tumor segmentation using a 2D U-Net followed by a statistical post-processing filter
    • [eess.IV]Variational Inference and Bayesian CNNs for Uncertainty Estimation in Multi-Factorial Bone Age Prediction
    • [eess.SP]An Adaptive QRS Detection Algorithm for Ultra-Long-Term ECG Recordings
    • [eess.SP]Design Optimisation of Power-Efficient Submarine Line through Machine Learning
    • [eess.SP]Gesture recognition with 60GHz 802.11 waveforms
    • [eess.SP]Robust Wireless Fingerprinting: Generalizing Across Space and Time
    • [eess.SP]Wireless 2.0: Towards an Intelligent Radio Environment Empowered by Reconfigurable Meta-Surfaces and Artificial Intelligence
    • [math.AC]Second generalized Hamming weight of Projective Toric Code over Hypersimplices
    • [math.CO]New bounds for perfect $k$-hashing
    • [math.DS]Sparsity-promoting algorithms for the discovery of informative Koopman invariant subspaces
    • [math.GR]Commutator subgroups of Sylow 2-subgroups of alternating group and Miller-Moreno groups as bases of new Key Exchange Protocol
    • [math.OC]Biased Stochastic Gradient Descent for Conditional Stochastic Optimization
    • [math.OC]Can speed up the convergence rate of stochastic gradient methods to $\mathcal{O}(1/k^2)$ by a gradient averaging strategy?
    • [math.OC]On the regularity and conditioning of low rank semidefinite programs
    • [math.OC]Statistically Preconditioned Accelerated Gradient Method for Distributed Optimization
    • [math.OC]Stochastic Polyak Step-size for SGD: An Adaptive Learning Rate for Fast Convergence
    • [math.ST]Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms
    • [math.ST]Structural adaptation in the density model
    • [physics.flu-dyn]Physics-informed deep learning for incompressible laminar flows
    • [physics.med-ph]Multifold Acceleration of Diffusion MRI via Slice-Interleaved Diffusion Encoding (SIDE)
    • [physics.soc-ph]How many infections of COVID-19 there will be in the “Diamond Princess”-Predicted by a virus transmission model based on the simulation of crowd flow
    • [q-bio.NC]Stochastic encoding of graphs in deep learning allows for complex analysis of gender classification in resting-state and task functional brain networks from the UK Biobank
    • [q-bio.QM]Uncovering ecological state dynamics with hidden Markov models
    • [q-fin.PM]G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning
    • [quant-ph]Nonbinary Error-Detecting Hybrid Codes
    • [quant-ph]Planning for Compilation of a Quantum Algorithm for Graph Coloring
    • [stat.AP]Continuous-time multi-state capture-recapture models
    • [stat.AP]Gaussian Process Regression for Probabilistic Short-term Solar Output Forecast
    • [stat.AP]Multi Linear Regression applied to Communications systems Analysis
    • [stat.AP]Statistical inference for Axiom A attractors
    • [stat.ME]Bayesian Multi-scale Modeling of Factor Matrix without using Partition Tree
    • [stat.ME]Bayesian analysis of count-valued, binary-valued, and continuous-valued responses using unknown transformations
    • [stat.ME]Causal bounds for outcome-dependent sampling in observational studies
    • [stat.ME]Demystify Lindley’s Paradox by Interpreting P-value as Posterior Probability
    • [stat.ME]MissDeepCausal: Causal Inference from Incomplete Data Using Deep Latent Variable Models
    • [stat.ME]Model-assisted estimation through random forests in finite population sampling
    • [stat.ME]Multivariate time-series modeling with generative neural networks
    • [stat.ME]Probabilistic elicitation of expert knowledge through assessment of computer simulations
    • [stat.ME]The DURATIONS randomised trial design: estimation targets, analysis methods and operating characteristics
    • [stat.ME]Uncertainty estimation in equality-constrained MAP and maximum likelihood estimation with applications to system identification and state estimation
    • [stat.ML]A General Method for Robust Learning from Batches
    • [stat.ML]Causal Inference With Selectively-Deconfounded Data
    • [stat.ML]Gaussian Hierarchical Latent Dirichlet Allocation: Bringing Polysemy Back
    • [stat.ML]Missing Data Imputation for Classification Problems
    • [stat.ML]Neuron Shapley: Discovering the Responsible Neurons
    • [stat.ML]Statistical Adaptive Stochastic Gradient Methods

    ·····································

    • [cs.AI]A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint
    Behzad Khamidehi, Elvino S. Sousa
    http://arxiv.org/abs/2002.10563v1

    • [cs.AI]AMP Chain Graphs: Minimal Separators and Structure Learning Algorithms
    Mohammad Ali Javidian, Marco Valtorta, Pooyan Jamshidi
    http://arxiv.org/abs/2002.10870v1

    • [cs.AI]Counterfactual fairness: removing direct effects through regularization
    Pietro G. Di Stefano, James M. Hickey, Vlasios Vasileiou
    http://arxiv.org/abs/2002.10774v1

    • [cs.AI]Declarative Memory-based Structure for the Representation of Text Data
    Sumant Pushp, Pragya Kashmira, Shyamanta M Hazarika
    http://arxiv.org/abs/2002.10665v1

    • [cs.AI]Efficient exploration of zero-sum stochastic games
    Carlos Martin, Tuomas Sandholm
    http://arxiv.org/abs/2002.10524v1

    • [cs.AI]FairRec: Two-Sided Fairness for Personalized Recommendations in Two-Sided Platforms
    Gourab K. Patro, Arpita Biswas, Niloy Ganguly, Krishna P. Gummadi, Abhijnan Chakraborty
    http://arxiv.org/abs/2002.10764v1

    • [cs.AI]Forming Diverse Teams from Sequentially Arriving People
    Faez Ahmed, John Dickerson, Mark Fuge
    http://arxiv.org/abs/2002.10697v1

    • [cs.AI]Injecting Domain Knowledge in Neural Networks: a Controlled Experiment on a Constrained Problem
    Mattia Silvestri, Michele Lombardi, Michela Milano
    http://arxiv.org/abs/2002.10742v1

    • [cs.AI]Problems with Shapley-value-based explanations as feature importance measures
    I. Elizabeth Kumar, Suresh Venkatasubramanian, Carlos Scheidegger, Sorelle Friedler
    http://arxiv.org/abs/2002.11097v1

    • [cs.AI]Turning 30: New Ideas in Inductive Logic Programming
    Andrew Cropper, Sebastijan Dumančić, Stephen H. Muggleton
    http://arxiv.org/abs/2002.11002v1

    • [cs.CL]A more abstractive summarization model
    Satyaki Chakraborty, Xinya Li, Sayak Chakraborty
    http://arxiv.org/abs/2002.10959v1

    • [cs.CL]BERT Can See Out of the Box: On the Cross-modal Transferability of Text Representations
    Thomas Scialom, Patrick Bordes, Paul-Alexis Dray, Jacopo Staiano, Patrick Gallinari
    http://arxiv.org/abs/2002.10832v1

    • [cs.CL]Detecting Asks in SE attacks: Impact of Linguistic and Structural Knowledge
    Bonnie J. Dorr, Archna Bhatia, Adam Dalton, Brodie Mather, Bryanna Hebenstreit, Sashank Santhanam, Zhuo Cheng, Samira Shaikh, Alan Zemel, Tomek Strzalkowski
    http://arxiv.org/abs/2002.10931v1

    • [cs.CL]Differentiable Reasoning over a Virtual Knowledge Base
    Bhuwan Dhingra, Manzil Zaheer, Vidhisha Balachandran, Graham Neubig, Ruslan Salakhutdinov, William W. Cohen
    http://arxiv.org/abs/2002.10640v1

    • [cs.CL]End-to-end Emotion-Cause Pair Extraction via Learning to Link
    Haolin Song, Chen Zhang, Qiuchi Li, Dawei Song
    http://arxiv.org/abs/2002.10710v1

    • [cs.CL]Event Detection with Relation-Aware Graph Convolutional Neural Networks
    Shiyao Cui, Bowen Yu, Tingwen Liu, Zhenyu Zhang, Xuebin Wang, Jinqiao Shi
    http://arxiv.org/abs/2002.10757v1

    • [cs.CL]Exploring BERT Parameter Efficiency on the Stanford Question Answering Dataset v2.0
    Eric Hulburd
    http://arxiv.org/abs/2002.10670v1

    • [cs.CL]KEML: A Knowledge-Enriched Meta-Learning Framework for Lexical Relation Classification
    Chengyu Wang, Minghui Qiu, Jun Huang, Xiaofeng He
    http://arxiv.org/abs/2002.10903v1

    • [cs.CL]Label-guided Learning for Text Classification
    Xien Liu, Song Wang, Xiao Zhang, Xinxin You, Ji Wu, Dejing Dou
    http://arxiv.org/abs/2002.10772v1

    • [cs.CL]Language-Independent Tokenisation Rivals Language-Specific Tokenisation for Word Similarity Prediction
    Danushka Bollegala, Ryuichi Kiryo, Kosuke Tsujino, Haruki Yukawa
    http://arxiv.org/abs/2002.11004v1

    • [cs.CL]MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers
    Wenhui Wang, Furu Wei, Li Dong, Hangbo Bao, Nan Yang, Ming Zhou
    http://arxiv.org/abs/2002.10957v1

    • [cs.CL]MuST-Cinema: a Speech-to-Subtitles corpus
    Alina Karakanta, Matteo Negri, Marco Turchi
    http://arxiv.org/abs/2002.10829v1

    • [cs.CL]Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
    Hung Le, Nancy F. Chen
    http://arxiv.org/abs/2002.10695v1

    • [cs.CL]Parsing Early Modern English for Linguistic Search
    Seth Kulick, Neville Ryant
    http://arxiv.org/abs/2002.10546v1

    • [cs.CL]Semantic Relatedness for Keyword Disambiguation: Exploiting Different Embeddings
    María G. Buey, Carlos Bobed, Jorge Gracia, Eduardo Mena
    http://arxiv.org/abs/2002.11023v1

    • [cs.CL]Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks
    Théodore Bluche, Maël Primet, Thibault Gisselbrecht
    http://arxiv.org/abs/2002.10851v1

    • [cs.CV]3D-MiniNet: Learning a 2D Representation from Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation
    Iñigo Alonso, Luis Riazuelo, Luis Montesano, Ana C. Murillo
    http://arxiv.org/abs/2002.10893v1

    • [cs.CV]A Deep Learning Framework for Simulation and Defect Prediction Applied in Microelectronics
    Nikolaos Dimitriou, Lampros Leontaris, Thanasis Vafeiadis, Dimosthenis Ioannidis, Tracy Wotherspoon, Gregory Tinker, Dimitrios Tzovaras
    http://arxiv.org/abs/2002.10986v1

    • [cs.CV]ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
    Yuliang Liu, Hao Chen, Chunhua Shen, Tong He, Lianwen Jin, Liangwei Wang
    http://arxiv.org/abs/2002.10200v2

    • [cs.CV]Anatomy-aware 3D Human Pose Estimation in Videos
    Tianlang Chen, Chen Fang, Xiaohui Shen, Yiheng Zhu, Zhili Chen, Jiebo Luo
    http://arxiv.org/abs/2002.10322v2

    • [cs.CV]Circle Loss: A Unified Perspective of Pair Similarity Optimization
    Yifan Sun, Changmao Cheng, Yuhan Zhang, Chi Zhang, Liang Zheng, Zhongdao Wang, Yichen Wei
    http://arxiv.org/abs/2002.10857v1

    • [cs.CV]Copy and Paste GAN: Face Hallucination from Shaded Thumbnails
    Yang Zhang, Ivor Tsang, Yawei Luo, Changhui Hu, Xiaobo Lu, Xin Yu
    http://arxiv.org/abs/2002.10650v1

    • [cs.CV]Cross-layer Feature Pyramid Network for Salient Object Detection
    Zun Li, Congyan Lang, Junhao Liew, Qibin Hou, Yidong Li, Jiashi Feng
    http://arxiv.org/abs/2002.10864v1

    • [cs.CV]DDet: Dual-path Dynamic Enhancement Network for Real-World Image Super-Resolution
    Yukai Shi, Haoyu Zhong, Zhijing Yang, Xiaojun Yang, Liang Lin
    http://arxiv.org/abs/2002.11079v1

    • [cs.CV]Deep Representation Learning on Long-tailed Data: A Learnable Embedding Augmentation Perspective
    Jialun Liu, Yifan Sun, Chuchu Han, Zhaopeng Dou, Wenhui Li
    http://arxiv.org/abs/2002.10826v1

    • [cs.CV]Evaluating Registration Without Ground Truth
    Carole J. Twining, Vladimir S. Petrović, Timothy F. Cootes, Roy S. Schestowitz, William R. Crum, Christopher J. Taylor
    http://arxiv.org/abs/2002.10534v1

    • [cs.CV]Exploring Learning Dynamics of DNNs via Layerwise Conditioning Analysis
    Lei Huang, Jie Qin, Li Liu, Fan Zhu, Ling Shao
    http://arxiv.org/abs/2002.10801v1

    • [cs.CV]FPConv: Learning Local Flattening for Point Convolution
    Yiqun Lin, Zizheng Yan, Haibin Huang, Dong Du, Ligang Liu, Shuguang Cui, Xiaoguang Han
    http://arxiv.org/abs/2002.10701v1

    • [cs.CV]Fast Loop Closure Detection via Binary Content
    Han Wang, Juncheng Li, Maopeng Ran, Lihua Xie
    http://arxiv.org/abs/2002.10622v1

    • [cs.CV]Fault Diagnosis in Microelectronics Attachment via Deep Learning Analysis of 3D Laser Scans
    Nikolaos Dimitriou, Lampros Leontaris, Thanasis Vafeiadis, Dimosthenis Ioannidis, Tracy Wotherspoon, Gregory Tinker, Dimitrios Tzovaras
    http://arxiv.org/abs/2002.10974v1

    • [cs.CV]Freeze Discriminator: A Simple Baseline for Fine-tuning GANs
    Sangwoo Mo, Minsu Cho, Jinwoo Shin
    http://arxiv.org/abs/2002.10964v1

    • [cs.CV]Globally Optimal Contrast Maximisation for Event-based Motion Estimation
    Daqi Liu, Álvaro Parra, Tat-Jun Chin
    http://arxiv.org/abs/2002.10686v1

    • [cs.CV]Ground Texture Based Localization Using Compact Binary Descriptors
    Jan Fabian Schmid, Stephan F. Simon, Rudolf Mester
    http://arxiv.org/abs/2002.11061v1

    • [cs.CV]Hierarchical Conditional Relation Networks for Video Question Answering
    Thao Minh Le, Vuong Le, Svetha Venkatesh, Truyen Tran
    http://arxiv.org/abs/2002.10698v1

    • [cs.CV]MPM: Joint Representation of Motion and Position Map for Cell Tracking
    Junya Hayashida, Kazuya Nishimura, Ryoma Bise
    http://arxiv.org/abs/2002.10749v1

    • [cs.CV]MagnifierNet: Towards Semantic Regularization and Fusion for Person Re-identification
    Yushi Lan, Yuan Liu, Maoqing Tian, Xinchi Zhou, Xuesen Zhang, Shuai Yi, Hongsheng Zhou
    http://arxiv.org/abs/2002.10979v1

    • [cs.CV]On Pruning Adversarially Robust Neural Networks
    Vikash Sehwag, Shiqi Wang, Prateek Mittal, Suman Jana
    http://arxiv.org/abs/2002.10509v1

    • [cs.CV]Optimal least-squares solution to the hand-eye calibration problem
    Amit Dekel, Linus Härenstam-Nielsen, Sergio Caccamo
    http://arxiv.org/abs/2002.10838v1

    • [cs.CV]PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
    Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu
    http://arxiv.org/abs/2002.10876v1

    • [cs.CV]Random Bundle: Brain Metastases Segmentation Ensembling through Annotation Randomization
    Darvin Yi, Endre Gøvik, Michael Iv, Elizabeth Tong, Greg Zaharchuk, Daniel Rubin
    http://arxiv.org/abs/2002.09809v1

    • [cs.CV]Real-time Fusion Network for RGB-D Semantic Segmentation Incorporating Unexpected Obstacle Detection for Road-driving Images
    Lei Sun, Kailun Yang, Xinxin Hu, Weijian Hu, Kaiwei Wang
    http://arxiv.org/abs/2002.10570v1

    • [cs.CV]Revisiting Saliency Metrics: Farthest-Neighbor Area Under Curve
    Sen Jia, Neil D. B. Bruce
    http://arxiv.org/abs/2002.10540v1

    • [cs.CV]ScopeFlow: Dynamic Scene Scoping for Optical Flow
    Aviram Bar-Haim, Lior Wolf
    http://arxiv.org/abs/2002.10770v1

    • [cs.CV]See, Attend and Brake: An Attention-based Saliency Map Prediction Model for End-to-End Driving
    Ekrem Aksoy, Ahmet Yazıcı, Mahmut Kasap
    http://arxiv.org/abs/2002.11020v1

    • [cs.CV]Toward fast and accurate human pose estimation via soft-gated skip connections
    Adrian Bulat, Jean Kossaifi, Georgios Tzimiropoulos, Maja Pantic
    http://arxiv.org/abs/2002.11098v1

    • [cs.CV]Towards Better Surgical Instrument Segmentation in Endoscopic Vision: Multi-Angle Feature Aggregation and Contour Supervision
    Fangbo Qin, Shan Lin, Yangming Li, Randall A. Bly, Kris S. Moe, Blake Hannaford
    http://arxiv.org/abs/2002.10675v1

    • [cs.CV]Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training
    Weituo Hao, Chunyuan Li, Xiujun Li, Lawrence Carin, Jianfeng Gao
    http://arxiv.org/abs/2002.10638v1

    • [cs.CV]Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference
    Ting-Kuei Hu, Tianlong Chen, Haotao Wang, Zhangyang Wang
    http://arxiv.org/abs/2002.10025v2

    • [cs.CV]Triplet Online Instance Matching Loss for Person Re-identification
    Ye Li, Guangqiang Yin, Chunhui Liu, Xiaoyu Yang, Zhiguo Wang
    http://arxiv.org/abs/2002.10560v1

    • [cs.CV]When Relation Networks meet GANs: Relation GANs with Triplet Loss
    Runmin Wu, Kunyao Zhang, Lijun Wang, Yue Wang, Huchuan Lu, Yizhou Yu
    http://arxiv.org/abs/2002.10174v2

    • [cs.DB]BAD to the Bone: Big Active Data at its Core
    Steven Jacobs, Xikui Wang, Michael J. Carey, Vassilis J. Tsotras, Md Yusuf Sarwar Uddin
    http://arxiv.org/abs/2002.09755v1

    • [cs.DC]A$^3$: Accelerating Attention Mechanisms in Neural Networks with Approximation
    Tae Jun Ham, Sung Jun Jung, Seonghak Kim, Young H. Oh, Yeonhong Park, Yoonho Song, Jung-Hun Park, Sanghee Lee, Kyoung Park, Jae W. Lee, Deog-Kyoon Jeong
    http://arxiv.org/abs/2002.10941v1

    • [cs.DC]Analysis of Amnesiac Flooding
    Volker Turau
    http://arxiv.org/abs/2002.10752v1

    • [cs.DC]Combining Learning and Optimization for Transprecision Computing
    Andrea Borghesi, Giuseppe Tagliavini, Michele Lombardi, Luca Benini, Michela Milano
    http://arxiv.org/abs/2002.10890v1

    • [cs.DC]Distributed Edge Coloring in Time Quasi-Polylogarithmic in Delta
    Alkida Balliu, Fabian Kuhn, Dennis Olivetti
    http://arxiv.org/abs/2002.10780v1

    • [cs.DC]Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition
    Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David Kung
    http://arxiv.org/abs/2002.10502v1

    • [cs.DC]Graph Computing based Distributed State Estimation with PMUs
    Yi Lu, Chen Yuan, Xiang Zhang, Hua Huang, Guangyi Liu, Renchang Dai, Zhiwei Wang
    http://arxiv.org/abs/2002.09477v1

    • [cs.DC]Near Optimal Task Graph Scheduling with Priced Timed Automata and Priced Timed Markov Decision Processes
    Anne Ejsing, Martin Jensen, Marco Muñiz, Jacob Nørhave, Lars Rechter
    http://arxiv.org/abs/2002.10783v1

    • [cs.DC]Quantized Push-sum for Gossip and Decentralized Optimization over Directed Graphs
    Hossein Taheri, Aryan Mokhtari, Hamed Hassani, Ramtin Pedarsani
    http://arxiv.org/abs/2002.09964v2

    • [cs.DS]Efficient and Simple Algorithms for Fault Tolerant Spanners
    Michael Dinitz, Caleb Robelle
    http://arxiv.org/abs/2002.10889v1

    • [cs.GR]Image Stylization: From Predefined to Personalized
    Ignacio Garcia-Dorado, Pascal Getreuer, Bartlomiej Wronski, Peyman Milanfar
    http://arxiv.org/abs/2002.10945v1

    • [cs.GR]PolyGen: An Autoregressive Generative Model of 3D Meshes
    Charlie Nash, Yaroslav Ganin, S. M. Ali Eslami, Peter W. Battaglia
    http://arxiv.org/abs/2002.10880v1

    • [cs.GT]Inducing Equilibria in Networked Public Goods Games through Network Structure Modification
    David Kempe, Sixie Yu, Yevgeniy Vorobeychik
    http://arxiv.org/abs/2002.10627v1

    • [cs.HC]ORCSolver: An Efficient Solver for Adaptive GUI Layout with OR-Constraints
    Yue Jiang, Wolfgang Stuerzlinger, Matthias Zwicker, Christof Lutteroth
    http://arxiv.org/abs/2002.09925v1

    • [cs.IR]Abstractive Snippet Generation
    Wei-Fan Chen, Shahbaz Syed, Benno Stein, Matthias Hagen, Martin Potthast
    http://arxiv.org/abs/2002.10782v1

    • [cs.IR]Data Augmentation for Personal Knowledge Graph Population
    Lingraj S Vannur, Lokesh Nagalapatti, Balaji Ganesan, Hima Patel
    http://arxiv.org/abs/2002.10943v1

    • [cs.IR]Leveraging Code Generation to Improve Code Retrieval and Summarization via Dual Learning
    Wei Ye, Rui Xie, Jinglei Zhang, Tianxiang Hu, Xiaoyin Wang, Shikun Zhang
    http://arxiv.org/abs/2002.10198v2

    • [cs.IT]Amplitude and Phase Estimation for Absolute Calibration of Massive MIMO Front-Ends
    Guoda Tian, Harsh Tataria, Fredrik Tufvesson
    http://arxiv.org/abs/2002.10817v1

    • [cs.IT]Deep Reinforcement Learning for Intelligent Reflecting Surfaces: Towards Standalone Operation
    Abdelrahman Taha, Yu Zhang, Faris B. Mismar, Ahmed Alkhateeb
    http://arxiv.org/abs/2002.11101v1

    • [cs.IT]Enhancing Physical Layer Security of Random Caching in Large-Scale Multi-Antenna Heterogeneous Wireless Networks
    Wanli Wen, Chenxi Liu, Yaru Fu, Tony Q. S. Quek, Fu-Chun Zheng, Shi Jin
    http://arxiv.org/abs/2002.10656v1

    • [cs.IT]Learning Beam Codebooks with Neural Networks: Towards Environment-Aware mmWave MIMO
    Yu Zhang, Muhammad Alrabeiah, Ahmed Alkhateeb
    http://arxiv.org/abs/2002.10663v1

    • [cs.IT]LoRa beyond ALOHA: An Investigation of Alternative Random Access Protocols
    Luca Beltramelli, Aamir Mahmood, Patrik Österberg, Mikael Gidlund
    http://arxiv.org/abs/2002.10732v1

    • [cs.IT]Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning
    Qianqian Zhang, Walid Saad, Mehdi Bennis
    http://arxiv.org/abs/2002.10572v1

    • [cs.IT]Study of Coarse Quantization-Aware Block Diagonalization Algorithms for MIMO Systems with Low Resolution
    S. B. Pinto, R. C. de Lamare
    http://arxiv.org/abs/2002.10916v1

    • [cs.LG](De)Randomized Smoothing for Certifiable Defense against Patch Attacks
    Alexander Levine, Soheil Feizi
    http://arxiv.org/abs/2002.10733v1

    • [cs.LG]A Theory of Usable Information Under Computational Constraints
    Yilun Xu, Shengjia Zhao, Jiaming Song, Russell Stewart, Stefano Ermon
    http://arxiv.org/abs/2002.10689v1

    • [cs.LG]Adaptive Distributed Stochastic Gradient Descent for Minimizing Delay in the Presence of Stragglers
    Serge Kas Hanna, Rawad Bitar, Parimal Parag, Venkat Dasari, Salim El Rouayheb
    http://arxiv.org/abs/2002.11005v1

    • [cs.LG]Batch norm with entropic regularization turns deterministic autoencoders into generative models
    Amur Ghose, Abdullah Rashwan, Pascal Poupart
    http://arxiv.org/abs/2002.10631v1

    • [cs.LG]Breaking Batch Normalization for better explainability of Deep Neural Networks through Layer-wise Relevance Propagation
    Mathilde Guillemot, Catherine Heusele, Rodolphe Korichi, Sylvianne Schnebert, Liming Chen
    http://arxiv.org/abs/2002.11018v1

    • [cs.LG]Coherent Gradients: An Approach to Understanding Generalization in Gradient Descent-based Optimization
    Satrajit Chatterjee
    http://arxiv.org/abs/2002.10657v1

    • [cs.LG]Diversity-Based Generalization for Neural Unsupervised Text Classification under Domain Shift
    Jitin Krishnan, Hemant Purohit, Huzefa Rangwala
    http://arxiv.org/abs/2002.10937v1

    • [cs.LG]Efficient Rollout Strategies for Bayesian Optimization
    Eric Hans Lee, David Eriksson, Bolong Cheng, Michael McCourt, David Bindel
    http://arxiv.org/abs/2002.10539v1

    • [cs.LG]Federated Learning for Resource-Constrained IoT Devices: Panoramas and State-of-the-art
    Ahmed Imteaj, Urmish Thakker, Shiqiang Wang, Jian Li, M. Hadi Amini
    http://arxiv.org/abs/2002.10610v1

    • [cs.LG]General Framework for Binary Classification on Top Samples
    Lukáš Adam, Václav Mácha, Václav Šmídl, Tomáš Pevný
    http://arxiv.org/abs/2002.10923v1

    • [cs.LG]Gödel’s Sentence Is An Adversarial Example But Unsolvable
    Xiaodong Qi, Lansheng Han
    http://arxiv.org/abs/2002.10703v1

    • [cs.LG]HarDNN: Feature Map Vulnerability Evaluation in CNNs
    Abdulrahman Mahmoud, Siva Kumar Sastry Hari, Christopher W. Fletcher, Sarita V. Adve, Charbel Sakr, Naresh Shanbhag, Pavlo Molchanov, Michael B. Sullivan, Timothy Tsai, Stephen W. Keckler
    http://arxiv.org/abs/2002.09786v2

    • [cs.LG]Human Apprenticeship Learning via Kernel-based Inverse Reinforcement Learning
    Mark A. Rucker, Layne T. Watson, Laura E. Barnes, Matthew S. Gerber
    http://arxiv.org/abs/2002.10904v1

    • [cs.LG]I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifiers Adaptively
    Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma
    http://arxiv.org/abs/2002.10648v1

    • [cs.LG]Interpolating Between Gradient Descent and Exponentiated Gradient Using Reparameterized Gradient Descent
    Ehsan Amid, Manfred K. Warmuth
    http://arxiv.org/abs/2002.10487v1

    • [cs.LG]Learning the mapping $\mathbf{x}\mapsto \sum_{i=1}^d x_i^2$: the cost of finding the needle in a haystack
    Jiefu Zhang, Leonardo Zepeda-Núñez, Yuan Yao, Lin Lin
    http://arxiv.org/abs/2002.10561v1

    • [cs.LG]Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements
    Alberto Dalla Libera, Diego Romeres, Devesh K. Jha, Bill Yerazunis, Daniel Nikovski
    http://arxiv.org/abs/2002.10621v1

    • [cs.LG]Modeling Continuous Stochastic Processes with Dynamic Normalizing Flows
    Ruizhi Deng, Bo Chang, Marcus A. Brubaker, Greg Mori, Andreas Lehrmann
    http://arxiv.org/abs/2002.10516v1

    • [cs.LG]Neural Networks are Convex Regularizers: Exact Polynomial-time Convex Optimization Formulations for Two-Layer Networks
    Mert Pilanci, Tolga Ergen
    http://arxiv.org/abs/2002.10553v1

    • [cs.LG]Novel Change of Measure Inequalities and PAC-Bayesian Bounds
    Yuki Ohnishi, Jean Honorio
    http://arxiv.org/abs/2002.10678v1

    • [cs.LG]Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration
    Anji Liu, Yitao Liang, Guy Van den Broeck
    http://arxiv.org/abs/2002.10738v1

    • [cs.LG]On Feature Normalization and Data Augmentation
    Boyi Li, Felix Wu, Ser-Nam Lim, Serge Belongie, Kilian Q. Weinberger
    http://arxiv.org/abs/2002.11102v1

    • [cs.LG]On Reinforcement Learning for Turn-based Zero-sum Markov Games
    Devavrat Shah, Varun Somani, Qiaomin Xie, Zhi Xu
    http://arxiv.org/abs/2002.10620v1

    • [cs.LG]Optimal Gradient Quantization Condition for Communication-Efficient Distributed Training
    An Xu, Zhouyuan Huo, Heng Huang
    http://arxiv.org/abs/2002.11082v1

    • [cs.LG]Practical and Bilateral Privacy-preserving Federated Learning
    Yan Feng, Xue Yang, Weijun Fang, Shu-Tao Xia, Xiaohu Tang
    http://arxiv.org/abs/2002.09843v2

    • [cs.LG]Precise Tradeoffs in Adversarial Training for Linear Regression
    Adel Javanmard, Mahdi Soltanolkotabi, Hamed Hassani
    http://arxiv.org/abs/2002.10477v1

    • [cs.LG]Progressive Learning and Disentanglement of Hierarchical Representations
    Zhiyuan Li, Jaideep Vitthal Murkute, Prashnna Kumar Gyawali, Linwei Wang
    http://arxiv.org/abs/2002.10549v1

    • [cs.LG]Provable Representation Learning for Imitation Learning via Bi-level Optimization
    Sanjeev Arora, Simon S. Du, Sham Kakade, Yuping Luo, Nikunj Saunshi
    http://arxiv.org/abs/2002.10544v1

    • [cs.LG]Relevant-features based Auxiliary Cells for Energy Efficient Detection of Natural Errors
    Sai Aparna Aketi, Priyadarshini Panda, Kaushik Roy
    http://arxiv.org/abs/2002.11052v1

    • [cs.LG]Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
    Benjamin Eysenbach, Xinyang Geng, Sergey Levine, Ruslan Salakhutdinov
    http://arxiv.org/abs/2002.11089v1

    • [cs.LG]Robust Estimation, Prediction and Control with Linear Dynamics and Generic Costs
    Edouard Leurent, Denis Efimov, Odalric-Ambrym Maillard
    http://arxiv.org/abs/2002.10816v1

    • [cs.LG]Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent
    Bao Wang, Tan M. Nguyen, Andrea L. Bertozzi, Richard G. Baraniuk, Stanley J. Osher
    http://arxiv.org/abs/2002.10583v1

    • [cs.LG]Searching for Winograd-aware Quantized Networks
    Javier Fernandez-Marques, Paul N. Whatmough, Andrew Mundy, Matthew Mattina
    http://arxiv.org/abs/2002.10711v1

    • [cs.LG]Sequence-to-Sequence Imputation of Missing Sensor Data
    Joel Janek Dabrowski, Ashfaqur Rahman
    http://arxiv.org/abs/2002.10767v1

    • [cs.LG]Stochastic-Sign SGD for Federated Learning with Theoretical Guarantees
    Richeng Jin, Yufan Huang, Xiaofan He, Huaiyu Dai, Tianfu Wu
    http://arxiv.org/abs/2002.10940v1

    • [cs.LG]Subspace Fitting Meets Regression: The Effects of Supervision and Orthonormality Constraints on Double Descent of Generalization Errors
    Yehuda Dar, Paul Mayer, Lorenzo Luzi, Richard G. Baraniuk
    http://arxiv.org/abs/2002.10614v1

    • [cs.LG]Teaching the Old Dog New Tricks: Supervised Learning with Constraints
    Fabrizio Detassis, Michele Lombardi, Michela Milano
    http://arxiv.org/abs/2002.10766v1

    • [cs.LG]The Curious Case of Adversarially Robust Models: More Data Can Help, Double Descend, or Hurt Generalization
    Yifei Min, Lin Chen, Amin Karbasi
    http://arxiv.org/abs/2002.11080v1

    • [cs.LG]Three Approaches for Personalization with Applications to Federated Learning
    Yishay Mansour, Mehryar Mohri, Jae Ro, Ananda Theertha Suresh
    http://arxiv.org/abs/2002.10619v1

    • [cs.LG]Towards an Efficient and General Framework of Robust Training for Graph Neural Networks
    Kaidi Xu, Sijia Liu, Pin-Yu Chen, Mengshu Sun, Caiwen Ding, Bhavya Kailkhura, Xue Lin
    http://arxiv.org/abs/2002.10947v1

    • [cs.LG]Training Binary Neural Networks using the Bayesian Learning Rule
    Xiangming Meng, Roman Bachmann, Mohammad Emtiyaz Khan
    http://arxiv.org/abs/2002.10778v1

    • [cs.LG]Understanding and Mitigating the Tradeoff Between Robustness and Accuracy
    Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John Duchi, Percy Liang
    http://arxiv.org/abs/2002.10716v1

    • [cs.LG]Variational Hyper RNN for Sequence Modeling
    Ruizhi Deng, Yanshuai Cao, Bo Chang, Leonid Sigal, Greg Mori, Marcus A. Brubaker
    http://arxiv.org/abs/2002.10501v1

    • [cs.LG]Variational Wasserstein Barycenters for Geometric Clustering
    Liang Mi, Tianshu Yu, Jose Bento, Wen Zhang, Baoxin Li, Yalin Wang
    http://arxiv.org/abs/2002.10543v1

    • [cs.LO]Facets of the PIE Environment for Proving, Interpolating and Eliminating on the Basis of First-Order Logic
    Christoph Wernhard
    http://arxiv.org/abs/2002.10892v1

    • [cs.MA]Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
    Wonseok Jeon, Paul Barde, Derek Nowrouzezahrai, Joelle Pineau
    http://arxiv.org/abs/2002.10525v1

    • [cs.MM]A Comparative Evaluation of Temporal Pooling Methods for Blind Video Quality Assessment
    Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
    http://arxiv.org/abs/2002.10651v1

    • [cs.MM]Model Watermarking for Image Processing Networks
    Jie Zhang, Dongdong Chen, Jing Liao, Han Fang, Weiming Zhang, Wenbo Zhou, Hao Cui, Nenghai Yu
    http://arxiv.org/abs/2002.11088v1

    • [cs.NE]An Assignment Problem Formulation for Dominance Move Indicator
    Claudio Lucio do Val Lopes, Flávio Vinícius Cruzeiro Martins, Elizabeth F. Wanner
    http://arxiv.org/abs/2002.10842v1

    • [cs.NE]Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity
    Thomas Miconi, Aditya Rawal, Jeff Clune, Kenneth O. Stanley
    http://arxiv.org/abs/2002.10585v1

    • [cs.NE]Multi-objective beetle antennae search algorithm
    Junfei Zhang, Yimiao Huang, Guowei Ma, Brett Nener
    http://arxiv.org/abs/2002.10090v1

    • [cs.NE]Non-Volatile Memory Array Based Quantization- and Noise-Resilient LSTM Neural Networks
    Wen Ma, Pi-Feng Chiu, Won Ho Choi, Minghai Qin, Daniel Bedau, Martin Lueker-Boden
    http://arxiv.org/abs/2002.10636v1

    • [cs.NE]Separating the Effects of Batch Normalization on CNN Training Speed and Stability Using Classical Adaptive Filter Theory
    Elaina Chai, Mert Pilanci, Boris Murmann
    http://arxiv.org/abs/2002.10674v1

    • [cs.NI]Network-Density-Controlled Decentralized Parallel Stochastic Gradient Descent in Wireless Systems
    Koya Sato, Yasuyuki Satoh, Daisuke Sugimura
    http://arxiv.org/abs/2002.10758v1

    • [cs.NI]Personalized Federated Learning for Intelligent IoT Applications: A Cloud-Edge based Framework
    Qiong Wu, Kaiwen He, Xu Chen
    http://arxiv.org/abs/2002.10671v1

    • [cs.PF]Learning Queuing Networks by Recurrent Neural Networks
    Giulio Garbi, Emilio Incerto, Mirco Tribastone
    http://arxiv.org/abs/2002.10788v1

    • [cs.PL]MLIR: A Compiler Infrastructure for the End of Moore’s Law
    Chris Lattner, Jacques Pienaar, Mehdi Amini, Uday Bondhugula, River Riddle, Albert Cohen, Tatiana Shpeisman, Andy Davis, Nicolas Vasilache, Oleksandr Zinenko
    http://arxiv.org/abs/2002.11054v1

    • [cs.RO]Alternating Minimization Based Trajectory Generation for Quadrotor Aggressive Flight
    Zhepei Wang, Xin Zhou, Chao Xu, Jian Chu, Fei Gao
    http://arxiv.org/abs/2002.10629v1

    • [cs.RO]Denoising IMU Gyroscopes with Deep Learning for Open-Loop Attitude Estimation
    Martin Brossard, Silvere Bonnabel, Axel Barrau
    http://arxiv.org/abs/2002.10718v1

    • [cs.RO]Estimating Human Teleoperator Posture Using Only a Haptic-Input Device
    Amir Yazdani, Roya Sabbagh Novin, Andrew Merryweather, Tucker Hermans
    http://arxiv.org/abs/2002.10586v1

    • [cs.RO]Feasible Computationally Efficient Path Planning for UAV Collision Avoidance
    Han Wang, Muqing Cao, Hao Jiang, Lihua Xie
    http://arxiv.org/abs/2002.10623v1

    • [cs.RO]Human Perception-Optimized Planning for Comfortable VR-Based Telepresence
    Israel Becerra, Markku Suomalainen, Eliezer Lozano, Katherine J. Mimnaugh, Rafael Murrieta-Cid, Steven M. LaValle
    http://arxiv.org/abs/2002.10696v1

    • [cs.RO]Learning Machines from Simulation to Real World
    Tomer Iwan, Oktay Kavi, Erkin Yildirim
    http://arxiv.org/abs/2002.10853v1

    • [cs.RO]Least Squares Optimization: from Theory to Practice
    Giorgio Grisetti, Tiziano Guadagnino, Irvin Aloise, Mirco Colosi, Bartolomeo Della Corte, Dominik Schlegel
    http://arxiv.org/abs/2002.11051v1

    • [cs.RO]Non-Gaussian Chance-Constrained Trajectory Planning for Autonomous Vehicles in the Presence of Uncertain Agents
    Allen Wang, Ashkan Jasour, Brian Williams
    http://arxiv.org/abs/2002.10999v1

    • [cs.RO]Optimisation of Body-ground Contact for Augmenting Whole-Body Loco-manipulation of Quadruped Robots
    Wouter Wolfslag, Christopher McGreavy, Guiyang Xin, Carlo Tiseo, Sethu Vijayakumar, Zhibin Li
    http://arxiv.org/abs/2002.10552v1

    • [cs.RO]Safe Optimal Control under Parametric Uncertainties
    Hemanth Sarabu, Venkata Ramana Makkapati, Vinodhini Comandur, Panagiotis Tsiotras, Seth Hutchinson
    http://arxiv.org/abs/2002.11043v1

    • [cs.SI]Automating Discovery of Dominance in Synchronous Computer-Mediated Communication
    Jim Samuel, Richard Holowczak, Raquel Benbunan-Fich, Ilan Levine
    http://arxiv.org/abs/2002.10582v1

    • [cs.SI]MIDMod-OSN: A Microscopic-level Information Diffusion Model for Online Social Networks
    Abiola Osho, Colin Goodman, George Amariucai
    http://arxiv.org/abs/2002.10522v1

    • [cs.SI]Migration Networks: Applications of Network Analysis to Large-Scale Human Mobility
    Valentin Danchev, Mason A. Porter
    http://arxiv.org/abs/2002.10992v1

    • [econ.TH]A Practical Approach to Social Learning
    Amir Ban, Moran Koren
    http://arxiv.org/abs/2002.11017v1

    • [eess.IV]Co-VeGAN: Complex-Valued Generative Adversarial Network for Compressive Sensing MR Image Reconstruction
    Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Pyari Mohan Pradhan
    http://arxiv.org/abs/2002.10523v1

    • [eess.IV]Deep learning predicts total knee replacement from magnetic resonance images
    Aniket A. Tolpadi, Jinhee J. Lee, Valentina Pedoia, Sharmila Majumdar
    http://arxiv.org/abs/2002.10591v1

    • [eess.IV]Fully-automated Body Composition Analysis in Routine CT Imaging Using 3D Semantic Segmentation Convolutional Neural Networks
    Sven Koitka, Lennard Kroll, Eugen Malamutmann, Arzu Oezcelik, Felix Nensa
    http://arxiv.org/abs/2002.10776v1

    • [eess.IV]Recalibrating 3D ConvNets with Project & Excite
    Anne-Marie Rickmann, Abhijit Guha Roy, Ignacio Sarasua, Christian Wachinger
    http://arxiv.org/abs/2002.10994v1

    • [eess.IV]Technical report: Kidney tumor segmentation using a 2D U-Net followed by a statistical post-processing filter
    Iwan Paolucci
    http://arxiv.org/abs/2002.10727v1

    • [eess.IV]Variational Inference and Bayesian CNNs for Uncertainty Estimation in Multi-Factorial Bone Age Prediction
    Stefan Eggenreich, Christian Payer, Martin Urschler, Darko Štern
    http://arxiv.org/abs/2002.10819v1

    • [eess.SP]An Adaptive QRS Detection Algorithm for Ultra-Long-Term ECG Recordings
    John Malik, Elsayed Z Soliman, Hau-Tieng Wu
    http://arxiv.org/abs/2002.10633v1

    • [eess.SP]Design Optimisation of Power-Efficient Submarine Line through Machine Learning
    Maria Ionescu, Amirhossein Ghazisaeidi, Jérémie Renaudier, Pascal Pecci, Olivier Courtois
    http://arxiv.org/abs/2002.11037v1

    • [eess.SP]Gesture recognition with 60GHz 802.11 waveforms
    Eran Hof, Amichai Sanderovich, Evyatar Hemo
    http://arxiv.org/abs/2002.10836v1

    • [eess.SP]Robust Wireless Fingerprinting: Generalizing Across Space and Time
    Metehan Cekic, Soorya Gopalakrishnan, Upamanyu Madhow
    http://arxiv.org/abs/2002.10791v1

    • [eess.SP]Wireless 2.0: Towards an Intelligent Radio Environment Empowered by Reconfigurable Meta-Surfaces and Artificial Intelligence
    Haris Gacanin, Marco Di Renzo
    http://arxiv.org/abs/2002.11040v1

    • [math.AC]Second generalized Hamming weight of Projective Toric Code over Hypersimplices
    Nupur Patanker, Sanjay Kumar Singh
    http://arxiv.org/abs/2002.10920v1

    • [math.CO]New bounds for perfect $k$-hashing
    Simone Costa, Marco Dalai
    http://arxiv.org/abs/2002.11025v1

    • [math.DS]Sparsity-promoting algorithms for the discovery of informative Koopman invariant subspaces
    Shaowu Pan, Nicholas Arnold-Medabalimi, Karthik Duraisamy
    http://arxiv.org/abs/2002.10637v1

    • [math.GR]Commutator subgroups of Sylow 2-subgroups of alternating group and Miller-Moreno groups as bases of new Key Exchange Protocol
    Ruslan V. Skuratovskii, Aled Williams
    http://arxiv.org/abs/2002.10528v1

    • [math.OC]Biased Stochastic Gradient Descent for Conditional Stochastic Optimization
    Yifan Hu, Siqi Zhang, Xin Chen, Niao He
    http://arxiv.org/abs/2002.10790v1

    • [math.OC]Can speed up the convergence rate of stochastic gradient methods to $\mathcal{O}(1/k^2)$ by a gradient averaging strategy?
    Xin Xu, Xiaopeng Luo
    http://arxiv.org/abs/2002.10769v1

    • [math.OC]On the regularity and conditioning of low rank semidefinite programs
    Lijun Ding, Madeleine Udell
    http://arxiv.org/abs/2002.10673v1

    • [math.OC]Statistically Preconditioned Accelerated Gradient Method for Distributed Optimization
    Hadrien Hendrikx, Lin Xiao, Sebastien Bubeck, Francis Bach, Laurent Massoulie
    http://arxiv.org/abs/2002.10726v1

    • [math.OC]Stochastic Polyak Step-size for SGD: An Adaptive Learning Rate for Fast Convergence
    Nicolas Loizou, Sharan Vaswani, Issam Laradji, Simon Lacoste-Julien
    http://arxiv.org/abs/2002.10542v1

    • [math.ST]Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms
    Ping Ma, Xinlian Zhang, Xin Xing, Jingyi Ma, Michael W. Mahoney
    http://arxiv.org/abs/2002.10526v1

    • [math.ST]Structural adaptation in the density model
    O. V. Lepski, G. Rebelles
    http://arxiv.org/abs/2002.10850v1

    • [physics.flu-dyn]Physics-informed deep learning for incompressible laminar flows
    Chengping Rao, Hao Sun, Yang Liu
    http://arxiv.org/abs/2002.10558v1

    • [physics.med-ph]Multifold Acceleration of Diffusion MRI via Slice-Interleaved Diffusion Encoding (SIDE)
    Yoonmi Hong, Wei-Tang Chang, Geng Chen, Ye Wu, Weili Lin, Dinggang Shen, Pew-Thian Yap
    http://arxiv.org/abs/2002.10908v1

    • [physics.soc-ph]How many infections of COVID-19 there will be in the “Diamond Princess”-Predicted by a virus transmission model based on the simulation of crowd flow
    Zhiming Fang, Zhongyi Huang, Xiaolian Li, Jun Zhang, Wei Lv, Lei Zhuang, Xingpeng Xu, Nan Huang
    http://arxiv.org/abs/2002.10616v1

    • [q-bio.NC]Stochastic encoding of graphs in deep learning allows for complex analysis of gender classification in resting-state and task functional brain networks from the UK Biobank
    Matthew Leming, John Suckling
    http://arxiv.org/abs/2002.10936v1

    • [q-bio.QM]Uncovering ecological state dynamics with hidden Markov models
    Brett T. McClintock, Roland Langrock, Olivier Gimenez, Emmanuelle Cam, David L. Borchers, Richard Glennie, Toby A. Patterson
    http://arxiv.org/abs/2002.10497v1

    • [q-fin.PM]G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning
    Matthew Dixon, Igor Halperin
    http://arxiv.org/abs/2002.10990v1

    • [quant-ph]Nonbinary Error-Detecting Hybrid Codes
    Andrew Nemec, Andreas Klappenecker
    http://arxiv.org/abs/2002.11075v1

    • [quant-ph]Planning for Compilation of a Quantum Algorithm for Graph Coloring
    Minh Do, Zhihui Wang, Bryan O’Gorman, Davide Venturelli, Eleanor Rieffel, Jeremy Frank
    http://arxiv.org/abs/2002.10917v1

    • [stat.AP]Continuous-time multi-state capture-recapture models
    Sina Mews, Roland Langrock, Ruth King, Nicola Quick
    http://arxiv.org/abs/2002.10997v1

    • [stat.AP]Gaussian Process Regression for Probabilistic Short-term Solar Output Forecast
    Fatemeh Najibi, Dimitra Apostolopoulou, Eduardo Alonso
    http://arxiv.org/abs/2002.10878v1

    • [stat.AP]Multi Linear Regression applied to Communications systems Analysis
    Federico Rodas Bajaña, Luis Hernan Montoya Lara, Manolo Paredes, Elena Gimenez de Ory
    http://arxiv.org/abs/2002.10573v1

    • [stat.AP]Statistical inference for Axiom A attractors
    Michael LuValle
    http://arxiv.org/abs/2002.10545v1

    • [stat.ME]Bayesian Multi-scale Modeling of Factor Matrix without using Partition Tree
    Maoran Xu, Leo L. Duan
    http://arxiv.org/abs/2002.09606v2

    • [stat.ME]Bayesian analysis of count-valued, binary-valued, and continuous-valued responses using unknown transformations
    Jonathan R. Bradley
    http://arxiv.org/abs/2002.09983v1

    • [stat.ME]Causal bounds for outcome-dependent sampling in observational studies
    Erin E. Gabriel, Michael C. Sachs, Arvid Sjölander
    http://arxiv.org/abs/2002.10519v1

    • [stat.ME]Demystify Lindley’s Paradox by Interpreting P-value as Posterior Probability
    Guosheng Yin, Haolun Shi
    http://arxiv.org/abs/2002.10883v1

    • [stat.ME]MissDeepCausal: Causal Inference from Incomplete Data Using Deep Latent Variable Models
    Imke Mayer, Julie Josse, Félix Raimundo, Jean-Philippe Vert
    http://arxiv.org/abs/2002.10837v1

    • [stat.ME]Model-assisted estimation through random forests in finite population sampling
    Mehdi Dagdoug, Camelia Goga, David Haziza
    http://arxiv.org/abs/2002.09736v2

    • [stat.ME]Multivariate time-series modeling with generative neural networks
    Marius Hofert, Avinash Prasad, Mu Zhu
    http://arxiv.org/abs/2002.10645v1

    • [stat.ME]Probabilistic elicitation of expert knowledge through assessment of computer simulations
    Owen Thomas, Henri Pesonen, Jukka Corander
    http://arxiv.org/abs/2002.10902v1

    • [stat.ME]The DURATIONS randomised trial design: estimation targets, analysis methods and operating characteristics
    Matteo Quartagno, James R. Carpenter, A. Sarah Walker, Michelle Clements, Mahesh K. B. Parmar
    http://arxiv.org/abs/2002.10962v1

    • [stat.ME]Uncertainty estimation in equality-constrained MAP and maximum likelihood estimation with applications to system identification and state estimation
    Dimas Abreu Archanjo Dutra
    http://arxiv.org/abs/2002.10975v1

    • [stat.ML]A General Method for Robust Learning from Batches
    Ayush Jain, Alon Orlitsky
    http://arxiv.org/abs/2002.11099v1

    • [stat.ML]Causal Inference With Selectively-Deconfounded Data
    Kyra Gan, Andrew A. Li, Zachary C. Lipton, Sridhar Tayur
    http://arxiv.org/abs/2002.11096v1

    • [stat.ML]Gaussian Hierarchical Latent Dirichlet Allocation: Bringing Polysemy Back
    Takahiro Yoshida, Ryohei Hisano, Takaaki Ohnishi
    http://arxiv.org/abs/2002.10855v1

    • [stat.ML]Missing Data Imputation for Classification Problems
    Arkopal Choudhury, Michael R. Kosorok
    http://arxiv.org/abs/2002.10709v1

    • [stat.ML]Neuron Shapley: Discovering the Responsible Neurons
    Amirata Ghorbani, James Zou
    http://arxiv.org/abs/2002.09815v2

    • [stat.ML]Statistical Adaptive Stochastic Gradient Methods
    Pengchuan Zhang, Hunter Lang, Qiang Liu, Lin Xiao
    http://arxiv.org/abs/2002.10597v1