Aananthakrishnan, Sriram · more Sriram Aananthakrishnan (Intel) | Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning · view |
Abad, Pablo · more Pablo Abad (University of Cantabria) | SPECcast: A Methodology for Fast Performance Evaluation with SPEC CPU 2017 Multiprogrammed Workloads · view |
Abdelrahman, Tarek · more Tarek Abdelrahman (University of Toronto) | Balancing Graph Processing Workloads Using Work Stealing on Heterogeneous CPU-FPGA Systems · view |
Agostini, Matthew · more Matthew Agostini (University of Toronto) | Balancing Graph Processing Workloads Using Work Stealing on Heterogeneous CPU-FPGA Systems · view |
Akella, Venkatesh · more Venkatesh Akella (University of California, Davis) | HCAPP: Scalable Power Control for Heterogeneous 2.5D Integrated Systems · view |
Akyildiz, Taha Atahan · more Taha Atahan Akyildiz (Sabancı University) | GOSH: Embedding Big Graphs on Small Hardware · view |
Alabsi Aljundi, Amro · more Amro Alabsi Aljundi (Sabancı University) | GOSH: Embedding Big Graphs on Small Hardware · view |
Alibhai, Shakeel · more Shakeel Alibhai (Temple University) | A Rack-aware Pipeline Repair Scheme for Erasure-coded Distributed Storage Systems · view |
Alkabani, Yousra · more Yousra Alkabani (Halmstad University) | DNNARA: A Deep Neural Network Accelerator using Residue Arithmetic and Integrated Photonics · view |
Amarasinghe, Saman · more Saman Amarasinghe (MIT) Saman P. Amarasinghe is a Professor in the Department of Electrical Engineering and Computer Science at Massachusetts Institute of Technology and a member of its Computer Science and Artificial Intelligence Laboratory (CSAIL) where he leads the Commit compiler group. Under Saman's guidance, the Commit group developed the StreamIt, StreamJIT, PetaBricks, Halide, Simit, MILK, Cimple, TACO, GraphIt, Tiramisu, BioStream and Seq programming languages and compilers, DynamoRIO dynamic instrumentation system, Superword level parallelism for SIMD vectorization, Program Shepherding to protect programs against external attacks, the OpenTuner extendable autotuner, and the Kendo deterministic execution system. He was the co-leader of the Raw architecture project. Saman was the founder of Determina Corporation, and a co-founder of Lanka Internet Services Ltd., and Venti Technologies Corporation. Saman received his BS in Electrical Engineering and Computer Science from Cornell University in 1988, and his MSEE and Ph.D. from Stanford University in 1990 and 1997, respectively. He is an ACM Fellow. | How to Make Sparse Fast · view |
Amini Salehi, Mohsen · more Mohsen Amini Salehi (University of Louisiana at Lafayette) | The Art of CPU-Pinning: Evaluating and Improving the Performance of Virtualization and Containerization Platforms · view |
Angerd, Alexandra · more Alexandra Angerd (Chalmers University of Technology) | A GPU Register File using Static Data Compression · view |
Bacik, Josef · more Josef Bacik (Facebook) | The Art of CPU-Pinning: Evaluating and Improving the Performance of Virtualization and Containerization Platforms · view |
Bao, Wei · more Wei Bao (The University of Sydney) | Federated Learning with Proximal Stochastic Variance Reduced Gradient Algorithms · view |
Barker, Kevin · more Kevin Barker (Pacific Northwest National Laboratory) | Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines · view |
Bensaou, Brahim · more Brahim Bensaou (Hong Kong University of Science and Technology) | Reducing Latency in Multi-Tenant Data Centers via Cautious Congestion Watch · view |
Cai, Shangming · more Shangming Cai (Tsinghua University) | CARD: A Congestion-Aware Request Dispatching Scheme for Replicated Metadata Server Cluster · view |
Cai, Wentong · more Wentong Cai (School of Computer Science and Engineering, Nanyang Technological University) | Rendering Server Allocation for MMORPG Players in Cloud Gaming · view |
Cai, Xiaoqing · more Xiaoqing Cai (Shanghai Jiao Tong University) | OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment · view |
Cai, Zhiping · more Zhiping Cai (National University of Defense Technology) | FEEL: A Federated Edge Learning System for Efficient and Privacy-Preserving Mobile Healthcare · view |
Cao, Qiang · more Qiang Cao (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | GraBi: Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs · view SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds · view |
Castro, Fernando · more Fernando Castro (Complutense University of Madrid) | Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors · view |
Chai, Qifei · more Qifei Chai (Tianjin University) | Balancing Fairness and Efficiency for Cache Sharing in Semi-external Memory System · view |
Chau, Sid Chi-Kin · more Sid Chi-Kin Chau (The Australian National University) | Reliability Augmentation of Requests with Service Function Chain Requirements in Mobile Edge-Cloud Networks · view |
Chen, Huaming · more Huaming Chen (Tencent) | Saec: Similarity-Aware Embedding Compression in Recommendation Systems · view |
Chen, Li · more Li Chen (University of Louisiana at Lafayette) | E-LAS: Design and Analysis of Completion-Time Agnostic Scheduling for Distributed Deep Learning Cluster · view FEEL: A Federated Edge Learning System for Efficient and Privacy-Preserving Mobile Healthcare · view |
Chen, Mengqiang · more Mengqiang Chen (Sun Yat-sen University) | Dual-Way Gradient Sparsification for Asynchronous Distributed Deep Learning · view |
Chen, Quan · more Quan Chen (Shanghai Jiao Tong University) | URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds · view OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment · view |
Chen, Wuhui · more Wuhui Chen (Sun Yat-sen University, zhangjt26@mail2.sysu.edu.cn) | SkyChain: A Deep Reinforcement Learning-Empowered Dynamic Blockchain Sharding System · view |
Chen, Yang · more Yang Chen (Temple University) | Optimizing Flow Bandwidth Consumption with Traffic-diminishing Middlebox Placement · view |
Chen, Yu · more Yu Chen (Huazhong University of Science and Technology) | Mass: Workload-Aware Storage Policy for OpenStack Swift · view |
Chen, Zheng · more Zheng Chen (Renmin University of China) | ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs · view |
Cheng, Yuchen · more Yuchen Cheng (Shanghai Jiao Tong University) | OPS: Optimized Shuffle Management System for Apache Spark · view |
Chuah, Mooi Choo · more Mooi Choo Chuah (Lehigh University) | Impact of Memory DoS Attacks on Cloud Applications and Real-Time Detection Schemes · view |
Chung, Jae-Won · more Jae-Won Chung (Seoul National University) | ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference · view |
Curtis-Maury, Matthew · more Matthew Curtis-Maury (NetApp, Inc) | Scalable Coordination of Hierarchical Parallelism · view |
Deng, Fan · more Fan Deng (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds · view |
Deng, Jing · more Jing Deng (University of North Carolina) | Cooperative Game for Multiple Chargers with Dynamic Network Topology · view |
Denninnart, Chavit · more Chavit Denninnart (University of Louisiana at Lafayette) | The Art of CPU-Pinning: Evaluating and Improving the Performance of Virtualization and Containerization Platforms · view |
Devadas, Vinay · more Vinay Devadas (NetApp, Inc) | Scalable Coordination of Hierarchical Parallelism · view |
Dinh, Canh T. · more Canh T. Dinh (The University of Sydney) | Federated Learning with Proximal Stochastic Variance Reduced Gradient Algorithms · view |
Dong, Yuanyuan · more Yuanyuan Dong (Alibaba Inc.) | SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds · view |
Du, Xiaoyong · more Xiaoyong Du (Renmin University of China) | ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs · view CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs · view |
Du, Yishu · more Yishu Du (Tongji University Shanghai, ENS Lyon) | Robustness of the Young/Daly formula for stochastic iterative applications · view |
Duan, Kaiyue · more Kaiyue Duan (College of Computer Science, Nankai University) | Improving Load Balance via Resource Exchange in Large-Scale Search Engines · view |
Duan, Xiaohui · more Xiaohui Duan (Shandong University) | SWMapper: Scalable Read Mapper on SunWay TaihuLight · view |
El-Ghazawi, Tarek · more Tarek El-Ghazawi (The George Washington University) | DNNARA: A Deep Neural Network Accelerator using Residue Arithmetic and Integrated Photonics · view |
Ellingwood, Nathan · more Nathan Ellingwood (Sandia National Labs) | Performance Portable Supernode-based Sparse Triangular Solver for Manycore Architectures · view |
Fan, Pingzhi · more Pingzhi Fan (Southwest Jiaotong University) | Selective Coflow Completion for Time-sensitive Distributed Applications with Poco · view |
Farrens, Matthew · more Matthew Farrens (University of California, Davis) | HCAPP: Scalable Power Control for Heterogeneous 2.5D Integrated Systems · view |
Feng, Dan · more Dan Feng (Huazhong University of Science and Technology) | Mass: Workload-Aware Storage Policy for OpenStack Swift · view CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates · view |
Fröning, Holger · more Holger Fröning (Heidelberg University) | On Network Locality in MPI-Based HPC Applications · view |
Gansterer, Wilfried N. · more Wilfried N. Gansterer (University of Vienna) | Algorithm-Based Checkpoint-Recovery for the Conjugate Gradient Method · view |
Gao, Hongyun · more Hongyun Gao (Tianjin Key Laboratory of Advanced Networking (TANKLab), College of Intelligence and Computing (CIC), Tianjin University) | XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN · view |
Gao, Yiqin · more Yiqin Gao (ENS Lyon) | Energy-aware strategies for reliability-oriented real-time task allocation on heterogeneous platforms · view |
Gavrilovska, Ada · more Ada Gavrilovska (Georgia Institute of Technology) | Generating Robust Parallel Programs via Model Driven Prediction of Compiler Optimizations for Non-determinism · view |
Ge, Rong · more Rong Ge (Clemson University) | Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines · view |
Ghatrehsamani, Davood · more Davood Ghatrehsamani (University of Louisiana at Lafayette) | The Art of CPU-Pinning: Evaluating and Improving the Performance of Virtualization and Containerization Platforms · view |
Gómez Flores, Wilfrido · more Wilfrido Gómez Flores (Centro de Investigación y de Estudios Avanzados, Tamaulipas) | Towards Parallelization of a Texture Description Algorithm for Breast Lesion Classification using OpenMP and CUDA · view |
Gong, Ruihao · more Ruihao Gong (Beihang University, SenseTime Research) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Gong, Xiaoli · more Xiaoli Gong (Nankai University) | DQEMU: A Scalable Emulator with Retargetable DBT on Distributed Platforms · view |
Gong, Yifan · more yifan gong (Tusimple) | EPMA: Efficient Partial Message Access in IoT Era · view Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications · view |
Gregorio, Jose Angel · more Jose Angel Gregorio (University of Cantabria) | SPECcast: A Methodology for Fast Performance Evaluation with SPEC CPU 2017 Multiprogrammed Workloads · view |
Guo, Minyi · more Minyi Guo (Shanghai Jiao Tong University) | URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds · view OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment · view |
Guo, Song · more Song Guo (The Hong Kong Polytechnic University) | SkyChain: A Deep Reinforcement Learning-Empowered Dynamic Blockchain Sharding System · view |
Guo, Yeting · more Yeting Guo (National University of Defense Technology) | FEEL: A Federated Edge Learning System for Efficient and Privacy-Preserving Mobile Healthcare · view |
H. Tran, Nguyen · more Nguyen H. Tran (The University of Sydney) | Federated Learning with Proximal Stochastic Variance Reduced Gradient Algorithms · view |
Han, Li · more Li Han (East China Normal University, ENS Lyon) | Energy-aware strategies for reliability-oriented real-time task allocation on heterogeneous platforms · view |
Han, Qingchang · more Qingchang Han (Beihang University, SenseTime Research) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
He, Bingsheng · more Bingsheng He (National University of Singapore) | CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs · view |
He, Bo · more Bo He (Beijing University of Posts and Telecommunications) | DeepHop on Edge: Hop-by-hop Routing by Distributed Learning with Semantic Attention · view |
He, Ligang · more Ligang He (University of Warwick) | Developing a Loss Prediction-based Asynchronous Stochastic Gradient Descent Algorithm for Distributed Training of Deep Neural Networks · view |
He, Tian · more Tian He (University of Minnesota) | AMRT: Anti-ECN Marking to Improve Utilization of Receiver-driven Transmission in Data Center · view |
He, Xubin · more Xubin He (Temple University) | A Rack-aware Pipeline Repair Scheme for Erasure-coded Distributed Storage Systems · view |
Hedayati, Mohammad · more Mohammad Hedayati (University of Rochester) | Safe, Fast Sharing of memcached as a Protected Library · view |
Helm, Christian · more Christian Helm (The University of Tokyo) | Automatic Identification and Precise Attribution of DRAM Bandwidth Contention · view |
Herrero, Jose Angel · more Jose Angel Herrero (University of Cantabria) | SPECcast: A Methodology for Fast Performance Evaluation with SPEC CPU 2017 Multiprogrammed Workloads · view |
Hinkle, Jacob · more Jacob Hinkle (ORNL) | Toward Large-Scale Image Segmentation on Summit · view |
Hong, Zicong · more Zicong Hong (Sun Yat-sen University) | SkyChain: A Deep Reinforcement Learning-Empowered Dynamic Blockchain Sharding System · view |
Hovland, Paul · more Paul Hovland (Argonne National Laboratory) | Vector Forward Mode Automatic Differentiation on SIMD/SIMT architectures · view |
Hu, Jinbin · more Jinbin Hu (Central South University) | AMRT: Anti-ECN Marking to Improve Utilization of Receiver-driven Transmission in Data Center · view |
Hu, Peng · more Peng Hu (SenseTime Research, Beihang University) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Hu, Yongmin · more Yongmin Hu (Beihang University) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Hu, Zhenbo · more Zhenbo Hu (Harbin Institute of Technology, Shenzhen) | Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity · view |
Hua, Yu · more Yu Hua (Huazhong University of Science & Technology) | An Efficient Wear-level Architecture using Self-adaptive Wear Leveling · view |
Huang, Fangting · more Fangting Huang (Huazhong University of Science & Technology) | An Efficient Wear-level Architecture using Self-adaptive Wear Leveling · view |
Huang, Jianming · more Jianming Huang (Huazhong University of Science & Technology) | An Efficient Wear-level Architecture using Self-adaptive Wear Leveling · view |
Huang, Jiawei · more Jiawei Huang (Central South University) | AMRT: Anti-ECN Marking to Improve Utilization of Receiver-driven Transmission in Data Center · view |
Hueckelheim, Jan · more Jan Hueckelheim (Argonne National Laboratory) | Vector Forward Mode Automatic Differentiation on SIMD/SIMT architectures · view |
Inaba, Yoko · more Yoko Inaba (NTT DATA Corporation) | Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs · view |
Ito, Yasuaki · more Yasuaki Ito (Hiroshima University) | Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs · view Huffman Coding with Gap Arrays for GPU Acceleration · view |
Jaya, Iryanto · more Iryanto Jaya (School of Computer Science and Engineering, Nanyang Technological University) | Rendering Server Allocation for MMORPG Players in Cloud Gaming · view |
Ji, Bo · more Bo Ji (Temple University) | Optimizing Flow Bandwidth Consumption with Traffic-diminishing Middlebox Placement · view |
Jia, Xiaohua · more Xiaohua Jia (City University of Hong Kong) | Reliability Augmentation of Requests with Service Function Chain Requirements in Mobile Edge-Cloud Networks · view |
Jiang, Hong · more Hong Jiang (University of Texas at Arlington, Department of Computer Science and Engineering) | GraBi: Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs · view |
Jiang, Wanchun · more Wanchun Jiang (School of Computer Science and Engineering, Central South University) | PS : Periodic Strategy for the 40-100Gbps Energy Efficient Ethernet · view Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric · view |
Jiang, Zhang · more Zhang Jiang (Nankai University) | DQEMU: A Scalable Emulator with Retargetable DBT on Distributed Platforms · view |
Jiang, Ziyue · more Ziyue Jiang (TuSimple) | EPMA: Efficient Partial Message Access in IoT Era · view |
Jin, Jiangming · more jiangming jin (Tusimple) | EPMA: Efficient Partial Message Access in IoT Era · view Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications · view |
Jin, Sian · more Sian Jin (University of Alabama) | Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity · view |
Kasagi, Akihiko · more Akihiko Kasagi (Fujitsu Laboratories Ltd.) | Huffman Coding with Gap Arrays for GPU Acceleration · view |
Katsuki, Ryota · more Ryota Katsuki (NTT DATA Corporation) | Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs · view |
Kaya, Kamer · more Kamer Kaya (Sabancı University) | GOSH: Embedding Big Graphs on Small Hardware · view |
Kim, Jae-Yun · more Jae-Yun Kim (Seoul National University) | ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference · view |
Kirmani, Shad · more Shad Kirmani (EBay Inc.) | Fast Spectral Graph Layout on Multicore Platforms · view |
Kjellqvist, Chris · more Chris Kjellqvist (Duke University) | Safe, Fast Sharing of memcached as a Protected Library · view |
Kratochvíl, Miroslav · more Miroslav Kratochvíl (Charles University) | Detailed Analysis and Optimization of CUDA K-means Algorithm · view |
Kruliš, Martin · more Martin Kruliš (Charles University) | Detailed Analysis and Optimization of CUDA K-means Algorithm · view |
Leng, Jingwen · more Jingwen Leng (Shanghai Jiao Tong University) | URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds · view OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment · view |
Levonyak, Markus · more Markus Levonyak (University of Vienna) | Algorithm-Based Checkpoint-Recovery for the Conjugate Gradient Method · view |
Li, Ang · more Ang Li (Pacific Northwest National Laboratory) | Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines · view |
Li, Chao · more Chao Li (Shanghai Jiao Tong University) | OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment · view |
Li, Junyu · more Junyu Li (University of Warwick) | Developing a Loss Prediction-based Asynchronous Stochastic Gradient Descent Algorithm for Distributed Training of Deep Neural Networks · view |
Li, Keqiu · more Keqiu Li (Tianjin Key Laboratory of Advanced Networking (TANKLab), College of Intelligence and Computing (CIC), Tianjin University) | XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN · view |
Li, Xin · more Xin Li (Shandong University) | SWMapper: Scalable Read Mapper on SunWay TaihuLight · view |
Li, Xinyuan · more Xinyuan Li (Computer Network Information Center, Chinese Academy of Science; University of Chinese Academy of Science) | Large-scale Simulations of Peridynamics on Sunway Taihulight Supercomputer · view |
Li, Yusen · more Yusen Li (School of Computer Science, Nankai University) | Improving Load Balance via Resource Exchange in Large-Scale Search Engines · view Balancing Fairness and Efficiency for Cache Sharing in Semi-external Memory System · view Rendering Server Allocation for MMORPG Players in Cloud Gaming · view |
Li, Zhaoyi · more Zhaoyi Li (Central South University) | AMRT: Anti-ECN Marking to Improve Utilization of Receiver-driven Transmission in Data Center · view |
Li, Zhuozhao · more Zhuozhao Li (University of Chicago) | Impact of Memory DoS Attacks on Cloud Applications and Real-Time Detection Schemes · view |
Li, Zongpeng · more Zongpeng Li (Huawei, Wuhan University) | An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks · view |
Liang, Weifa · more Weifa Liang (The Australian National University) | Reliability Augmentation of Requests with Service Function Chain Requirements in Mobile Edge-Cloud Networks · view |
Liao, Jianxin · more Jianxin Liao (Beijing University of Posts and Telecommunications) | DeepHop on Edge: Hop-by-hop Routing by Distributed Learning with Semantic Attention · view |
Liao, kaiqin · more kaiqin Liao (School of Computer Science and Engineering, Central South University) | PS : Periodic Strategy for the 40-100Gbps Energy Efficient Ethernet · view |
Lim, Seung-Hwan · more Seung-Hwan Lim (ORNL) | Toward Large-Scale Image Segmentation on Summit · view |
Lin, Chi · more Chi Lin (Dalian University of Technology) | Cooperative Game for Multiple Chargers with Dynamic Network Topology · view |
Lin, Jieyu · more Jieyu Lin (University of Toronto) | Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN · view |
Liu, Bing · more Bing Liu (SenseTime Research) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Liu, Chang · more Chang Liu (Shanghai Jiao Tong University) | OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment · view |
Liu, Cong · more Cong Liu (China Mobile Research Institute) | DeepHop on Edge: Hop-by-hop Routing by Distributed Learning with Semantic Attention · view |
Liu, Fang · more Fang Liu (Sun Yat-Sen University) | FEEL: A Federated Edge Learning System for Efficient and Privacy-Preserving Mobile Healthcare · view |
Liu, Jing · more Jing Liu (East China Normal University) | Energy-aware strategies for reliability-oriented real-time task allocation on heterogeneous platforms · view |
Liu, Jingning · more Jingning Liu (Huazhong University of Science and Technology) | CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates · view |
Liu, Qi · more Qi Liu (University of Virginia) | A Reinforcement Learning Based System for Minimizing Cloud Storage Service Cost · view |
Liu, Shuyang · more Shuyang Liu (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds · view |
Liu, Tong · more Tong Liu (Temple University) | A Rack-aware Pipeline Repair Scheme for Erasure-coded Distributed Storage Systems · view |
Liu, Wei · more wei liu (Tusimple) | EPMA: Efficient Partial Message Access in IoT Era · view Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications · view |
Liu, Weifeng · more Weifeng Liu (China University of Petroleum, Beijing) | CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs · view Efficient Block Algorithms for Parallel Sparse Triangular Solve · view |
Liu, Weiguo · more Weiguo Liu (Shandong University) | SWMapper: Scalable Read Mapper on SunWay TaihuLight · view |
Liu, Xiaoguang · more Xiaoguang Liu (College of Computer Science, Nankai University) | Improving Load Balance via Resource Exchange in Large-Scale Search Engines · view |
Liu, Ximing · more Ximing Liu (Nankai University) | DQEMU: A Scalable Emulator with Retargetable DBT on Distributed Platforms · view |
Liu, Yang · more Yang Liu (Harbin Institute of Technology, Shenzhen) | Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity · view |
Liu, Yanqiang · more Yanqiang Liu (Shanghai Jiao Tong University) | OPS: Optimized Shuffle Management System for Apache Spark · view |
Liu, Zixia · more Zixia Liu (Department of Computer Science, University of Central Florida) | Deep Reinforcement Learning based Elasticity-compatible Heterogeneous Resource Management for Time-critical Computing · view |
Llort, German · more German Llort (Barcelona Supercomputing Center) | Experiences on the characterization of parallel applications in embedded systems with Extrae/Paraver · view |
Lowe-Power, Jason · more Jason Lowe-Power (University of California, Davis) | HCAPP: Scalable Power Control for Heterogeneous 2.5D Integrated Systems · view |
Lu, Youyou · more Youyou Lu (Tsinghua University) | DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training · view |
Lu, Zhengyang · more Zhengyang Lu (China University of Petroleum, Beijing) | Efficient Block Algorithms for Parallel Sparse Triangular Solve · view |
Luan, Zhongzhi · more Zhongzhi Luan (Beihang University) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Lui, John C.S. · more John C.S. Lui (Chinese University of Hong Kong) | An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks · view |
Lunga, Dalton · more Dalton Lunga (ORNL) | Toward Large-Scale Image Segmentation on Summit · view |
Luo, Qiong · more Qiong Luo (Hong Kong University of Science and Technology) | DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training · view |
Luo, Shouxi · more Shouxi Luo (Southwest Jiaotong University) | Selective Coflow Completion for Time-sensitive Distributed Applications with Poco · view |
M. Abdelmoniem, Ahmed · more Ahmed M. Abdelmoniem (Hong Kong University of Science and Technology; Assiut University, Egypt) | Reducing Latency in Multi-Tenant Data Centers via Cautious Congestion Watch · view |
Ma, Tao · more Tao Ma (Alibaba Cloud) | URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds · view |
Ma, Yu · more Yu Ma (The Australian National University) | Reliability Augmentation of Requests with Service Function Chain Requirements in Mobile Edge-Cloud Networks · view |
Madduri, Kamesh · more Kamesh Madduri (Pennsylvania State University) | Fast Spectral Graph Layout on Multicore Platforms · view |
Mao, Rui · more Rui Mao (Shenzhen University) | Developing a Loss Prediction-based Asynchronous Stochastic Gradient Descent Algorithm for Distributed Training of Deep Neural Networks · view |
Marbach, Trent · more Trent Marbach (College of Computer Science, Nankai University) | Improving Load Balance via Resource Exchange in Large-Scale Search Engines · view |
Marchal, Loris · more Loris Marchal (ENS Lyon) | Robustness of the Young/Daly formula for stochastic iterative applications · view |
McCamant, Stephen · more Stephen McCamant (University of Minnesota) | First Time Miss : Low Overhead Mitigation For Shared Memory Cache Side Channels · view |
Meneses Viveros, Amilcar · more Amilcar Meneses Viveros (Centro de Investigación y de Estudios Avanzados, Zacatenco) | Towards Parallelization of a Texture Description Algorithm for Breast Lesion Classification using OpenMP and CUDA · view |
Meng, Xiangxu · more Xiangxu Meng (Shandong University) | SWMapper: Scalable Read Mapper on SunWay TaihuLight · view |
Mercadal, Estanislao · more Estanislao Mercadal (Barcelona Supercomputing Center) | Experiences on the characterization of parallel applications in embedded systems with Extrae/Paraver · view |
Mishra, Ashirbad · more Ashirbad Mishra (Pennsylvania State University) | Fast Spectral Graph Layout on Multicore Platforms · view |
Moon, Soo-Mook · more Soo-Mook Moon (Seoul National University) | ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference · view |
Munera, Adrian · more Adrian Munera (Barcelona Supercomputing Center) | Experiences on the characterization of parallel applications in embedded systems with Extrae/Paraver · view |
Mururu, Girish · more Girish Mururu (Georgia Institute of Technology) | Generating Robust Parallel Programs via Model Driven Prediction of Compiler Optimizations for Non-determinism · view |
Nakano, Koji · more Koji Nakano (Hiroshima University) | Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs · view Huffman Coding with Gap Arrays for GPU Acceleration · view |
Narayanan, Sri Hari Krishna · more Sri Hari Krishna Narayanan (Argonne National Laboratory) | Vector Forward Mode Automatic Differentiation on SIMD/SIMT architectures · view |
Nasre, Rupesh · more Rupesh Nasre (Indian Institute of Technology Madras) | Graffix: Efficient Graph Processing with a Tinge of GPU-Specific Approximations · view |
Nguyen, Tuan Dung · more Tuan Dung Nguyen (The University of Melbourne) | Federated Learning with Proximal Stochastic Variance Reduced Gradient Algorithms · view |
Nie, Lihai · more Lihai Nie (Tianjin Key Laboratory of Advanced Networking (TANKLab), College of Intelligence and Computing (CIC), Tianjin University) | XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN · view |
Nitta, Christopher · more Christopher Nitta (University of California, Davis) | HCAPP: Scalable Power Control for Heterogeneous 2.5D Integrated Systems · view |
Niu, Yuyao · more Yuyao Niu (China University of Petroleum, Beijing) | Efficient Block Algorithms for Parallel Sparse Triangular Solve · view |
O'Brien, Francis · more Francis O'Brien (University of Toronto) | Balancing Graph Processing Workloads Using Work Stealing on Heterogeneous CPU-FPGA Systems · view |
Pachajoa, Carlos · more Carlos Pachajoa (University of Vienna) | Algorithm-Based Checkpoint-Recovery for the Conjugate Gradient Method · view |
Pacher, Christina · more Christina Pacher (University of Vienna) | Algorithm-Based Checkpoint-Recovery for the Conjugate Gradient Method · view |
Pal, Lisa · more Lisa Pal (Centro de Investigación y de Estudios Avanzados, Zacatenco) | Towards Parallelization of a Texture Description Algorithm for Breast Lesion Classification using OpenMP and CUDA · view |
Pallez, Guillaume · more Guillaume Pallez (INRIA Bordeaux) | Robustness of the Young/Daly formula for stochastic iterative applications · view |
Pande, Santosh · more Santosh Pande (Georgia Institute of Technology) | Generating Robust Parallel Programs via Model Driven Prediction of Compiler Optimizations for Non-determinism · view |
Peng, Jiaxin · more Jiaxin Peng (The George Washington University) | DNNARA: A Deep Neural Network Accelerator using Residue Arithmetic and Integrated Photonics · view |
Petrini, Fabrizio · more Fabrizio Petrini (Intel) | Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning · view |
Prajapati, Nirmal · more Nirmal Prajapati (LANL, Colorado State University) | Revisiting Sparse Dynamic Programming for the 0/1 Knapsack Problem · view |
Prieto, Pablo · more Pablo Prieto (University of Cantabria) | SPECcast: A Methodology for Fast Performance Evaluation with SPEC CPU 2017 Multiprogrammed Workloads · view |
Prieto-Matias, Manuel · more Manuel Prieto-Matias (Complutense University of Madrid) | Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors · view |
Puente, Valentin · more Valentin Puente (University of Cantabria) | SPECcast: A Methodology for Fast Performance Evaluation with SPEC CPU 2017 Multiprogrammed Workloads · view |
Qi, Qi · more Qi Qi (Beijing University of Posts and Telecommunications) | DeepHop on Edge: Hop-by-hop Routing by Distributed Learning with Semantic Attention · view |
Qi, Zhengwei · more Zhengwei Qi (Shanghai Jiao Tong University) | OPS: Optimized Shuffle Management System for Apache Spark · view |
Qian, Depei · more Depei Qian (Beihang University) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Qiu, Xiaoyu · more Xiaoyu Qiu (Sun Yat-sen University) | SkyChain: A Deep Reinforcement Learning-Empowered Dynamic Blockchain Sharding System · view |
Quan, Gang · more Gang Quan (Electrical and Computer Engineering Department, Florida International University) | Deep Reinforcement Learning based Elasticity-compatible Heterogeneous Resource Management for Time-critical Computing · view |
Quiñones, Eduardo · more Eduardo Quiñones (Barcelona Supercomputing Center) | Experiences on the characterization of parallel applications in embedded systems with Extrae/Paraver · view |
Rajamanickam, Sivasankaran · more Sivasankaran Rajamanickam (Sandia National Labs) | Performance Portable Supernode-based Sparse Triangular Solver for Manycore Architectures · view |
Rajopadhye, Sanjay · more Sanjay Rajopadhye (Colorado State University) | Revisiting Sparse Dynamic Programming for the 0/1 Knapsack Problem · view |
Ramkrishnan, Kartik · more Kartik Ramkrishnan (University of Minnesota) | First Time Miss : Low Overhead Mitigation For Shared Memory Cache Side Channels · view |
Ravichandran, Kaushik · more Kaushik Ravichandran (Georgia Institute of Technology, Facebook) | Generating Robust Parallel Programs via Model Driven Prediction of Compiler Optimizations for Non-determinism · view |
Ren, Rui · more Rui Ren (Shanghai Jiao Tong University) | OPS: Optimized Shuffle Management System for Apache Spark · view |
Ren, Shenyuan · more Shenyuan Ren (University of Oxford) | Developing a Loss Prediction-based Asynchronous Stochastic Gradient Descent Algorithm for Distributed Training of Deep Neural Networks · view |
Robert, Yves · more Yves Robert (ENS Lyon, University of Tennessee) | Energy-aware strategies for reliability-oriented real-time task allocation on heterogeneous platforms · view Robustness of the Young/Daly formula for stochastic iterative applications · view |
Rodriguez, Matthew A. · more Matthew A. Rodriguez (Lehigh University) | Optimizing Linearizable Bulk Operations on Data Structures · view |
Royuela, Sara · more Sara Royuela (Barcelona Supercomputing Center) | Experiences on the characterization of parallel applications in embedded systems with Extrae/Paraver · view |
Ruan, Chang · more Chang Ruan (Central South University) | Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric · view |
Saez, Juan Carlos · more Juan Carlos Saez (Complutense University of Madrid) | Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors · view |
Schanen, Michel · more Michel Schanen (Argonne National Laboratory) | Vector Forward Mode Automatic Differentiation on SIMD/SIMT architectures · view |
Schmidt, Bertil · more Bertil Schmidt (Johannes Gutenberg University) | SWMapper: Scalable Read Mapper on SunWay TaihuLight · view |
Schulte, Michael · more Michael Schulte (AMD) Michael Schulte is a Senior Fellow with AMD Research, where he leads research, advanced development, and technology transfer activities in high-performance computing, machine learning, heterogeneous systems, and power-efficient processors. He is currently the Chief Engineer for AMD’s Extreme-scale Computing Technologies, which will be deployed in the Frontier and El Capitan supercomputers. Michael was previously a Principal Investigator on AMD’s PathForward, FastForward-2 Node Architecture, and FastForward Extreme-scale Computing projects. Prior to joining AMD, he was a tenured faculty member at the University of Wisconsin-Madison and Lehigh University.
Michael holds Ph.D. and M.S. degrees in Electrical Engineering from the University of Texas at Austin, and a B.S. degree in Electrical Engineering with a second major in Computer Science from the University of Wisconsin-Madison. Michael is an IEEE Fellow for contributions to compute architectures. He is the recipient of an NSF CAREER Award, the Alfred Noble Robinson Award, the AMD Next 5% Award, the AMD Way Award, and the AMD Datacenter and Embedded Solutions Group Award of Excellence. Throughout his career, Michael has had the honor of working with incredibly kind and talented people. | Challenges and Opportunities for Extreme-Scale Computing · view |
Scott, Michael L. · more Michael L. Scott (University of Rochester) | Safe, Fast Sharing of memcached as a Protected Library · view |
Seal, Sudip · more Sudip Seal (Oak Ridge National Laboratory) | Toward Large-Scale Image Segmentation on Summit · view |
Sen, Tanmoy · more Tanmoy Sen (University of Virginia) | Impact of Memory DoS Attacks on Cloud Applications and Real-Time Detection Schemes · view |
Shao, Airan · more Airan Shao (Tsinghua University) | An Adaptive Erasure-Coded Storage Scheme with an Efficient Code-Switching Algorithm · view |
Shen, Haiying · more Haiying Shen (University of Virginia) | A Reinforcement Learning Based System for Minimizing Cloud Storage Service Cost · view Impact of Memory DoS Attacks on Cloud Applications and Real-Time Detection Schemes · view |
Sheng, Feng · more Feng Sheng (Huazhong University of Science and Technology, Wuhan National Laboratory for Optoelectronics) | GraBi: Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs · view |
Shi, Jiuchen · more Jiuchen Shi (Shanghai Jiao Tong University) | OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment · view |
Shi, Yang · more Yang Shi (National University of Defense Technology) | Towards High-Efficiency Data Centers via Job-Aware Network Scheduling · view |
Sifat, Tarequl Islam · more Tarequl Islam Sifat (Corespeq Inc, Colorado State University) | Revisiting Sparse Dynamic Programming for the 0/1 Knapsack Problem · view |
Singh, Somesh · more Somesh Singh (Indian Institute of Technology Madras) | Graffix: Efficient Graph Processing with a Tinge of GPU-Specific Approximations · view |
Sintorn, Erik · more Erik Sintorn (Chalmers University of Technology) | A GPU Register File using Static Data Compression · view |
Song, Zhuo · more Zhuo Song (Alibaba Cloud) | URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds · view |
Sorger, Volker · more Volker Sorger (The George Washington University) | DNNARA: A Deep Neural Network Accelerator using Residue Arithmetic and Integrated Photonics · view |
Spear, Michael F. · more Michael F. Spear (Lehigh University) | Optimizing Linearizable Bulk Operations on Data Structures · view |
Stasiak, Andrzej · more Andrzej Stasiak (Intel) | Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning · view |
Stenström, Per · more Per Stenström (Chalmers University of Technology) | A GPU Register File using Static Data Compression · view |
Straube, Kramer · more Kramer Straube (University of California, Davis) | HCAPP: Scalable Power Control for Heterogeneous 2.5D Integrated Systems · view |
Su, Jiya · more Jiya Su (Renmin University of China) | CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs · view |
Sultana, Abeda · more Abeda Sultana (University of Louisiana at Lafayette) | E-LAS: Design and Analysis of Completion-Time Agnostic Scheduling for Distributed Deep Learning Cluster · view |
Sun, Chao · more Chao Sun (Tianjin University) | Balancing Fairness and Efficiency for Cache Sharing in Semi-external Memory System · view |
Sun, Haifeng · more Haifeng Sun (Beijing University of Posts and Telecommunications) | DeepHop on Edge: Hop-by-hop Routing by Distributed Learning with Semantic Attention · view |
Sun, Shuai · more Shuai Sun (The George Washington University) | DNNARA: A Deep Neural Network Accelerator using Residue Arithmetic and Integrated Photonics · view |
Sun, Yu · more Yu Sun (Dalian University of Technology) | Cooperative Game for Multiple Chargers with Dynamic Network Topology · view |
Susanto, Hengky · more Hengky Susanto (Hong Kong University of Science and Technology) | Reducing Latency in Multi-Tenant Data Centers via Cautious Congestion Watch · view |
Tabaru, Tsuguchika · more Tsuguchika Tabaru (Fujitsu Laboratories Ltd.) | Huffman Coding with Gap Arrays for GPU Acceleration · view |
Takafuji, Daisuke · more Daisuke Takafuji (Hiroshima University) | Huffman Coding with Gap Arrays for GPU Acceleration · view |
Tang, Shanjiang · more Shanjiang Tang (Tianjin University) | Balancing Fairness and Efficiency for Cache Sharing in Semi-external Memory System · view |
Tao, Dingwen · more Dingwen Tao (The University of Alabama, Tuscaloosa, AL, USA) | Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity · view |
Tatekawa, Masaru · more Masaru Tatekawa (NTT DATA Corporation) | Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs · view |
Taura, Kenjiro · more Kenjiro Taura (The University of Tokyo) | Automatic Identification and Precise Attribution of DRAM Bandwidth Contention · view |
Tian, Zhao · more Zhao Tian (Tianjin Key Laboratory of Advanced Networking (TANKLab), College of Intelligence and Computing (CIC), Tianjin University) | XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN · view |
Tithi, Jesmin Jahan · more Jesmin Jahan Tithi (Intel) | Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning · view |
Tong, Wei · more Wei Tong (Huazhong University of Science and Technology) | Mass: Workload-Aware Storage Policy for OpenStack Swift · view CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates · view |
Tsaris, Aristeidis · more Aristeidis Tsaris (ORNL) | Toward Large-Scale Image Segmentation on Summit · view |
Vivien, Frédéric · more Frédéric Vivien (ENS Lyon) | Energy-aware strategies for reliability-oriented real-time task allocation on heterogeneous platforms · view |
Wang, Chengning · more Chengning Wang (Huazhong University of Science and Technology) | CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates · view |
Wang, Dali · more Dali Wang (ORNL) | Toward Large-Scale Image Segmentation on Summit · view |
Wang, Dongsheng · more Dongsheng Wang (Tsinghua University, Peng Cheng Laboratory) | CARD: A Congestion-Aware Request Dispatching Scheme for Replicated Metadata Server Cluster · view An Adaptive Erasure-Coded Storage Scheme with an Efficient Code-Switching Algorithm · view |
Wang, Gang · more Gang Wang (College of Computer Science, Nankai University) | Improving Load Balance via Resource Exchange in Large-Scale Search Engines · view |
Wang, Haixia · more Haixia Wang (Tsinghua University) | CARD: A Congestion-Aware Request Dispatching Scheme for Replicated Metadata Server Cluster · view An Adaptive Erasure-Coded Storage Scheme with an Efficient Code-Switching Algorithm · view |
Wang, Haoyu · more Haoyu Wang (University of Virginia) | A Reinforcement Learning Based System for Minimizing Cloud Storage Service Cost · view |
Wang, Huanbin · more Huanbin Wang (Tianjin Key Laboratory of Advanced Networking (TANKLab), College of Intelligence and Computing (CIC), Tianjin University) | XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN · view |
Wang, Jian · more Jian Wang (Tencent) | Saec: Similarity-Aware Embedding Compression in Recommendation Systems · view |
wang, jianxin · more Jianxin Wang (Central South University) | PS : Periodic Strategy for the 40-100Gbps Energy Efficient Ethernet · view Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric · view AMRT: Anti-ECN Marking to Improve Utilization of Receiver-driven Transmission in Data Center · view |
Wang, Jingyu · more Jingyu Wang (Beijing University of Posts and Telecommunications) | DeepHop on Edge: Hop-by-hop Routing by Distributed Learning with Semantic Attention · view |
Wang, Lei · more Lei Wang (University of North Carolina at Greensboro) | Cooperative Game for Multiple Chargers with Dynamic Network Topology · view |
Wang, Lipeng · more Lipeng Wang (Hong Kong University of Science and Technology) | DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training · view |
Wang, Liqiang · more Liqiang Wang (Department of Computer Science, University of Central Florida) | Deep Reinforcement Learning based Elasticity-compatible Heterogeneous Resource Management for Time-critical Computing · view |
Wang, Rui · more Rui Wang (Beihang University) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Wang, Rujia · more Rujia Wang (Illinois Institute of Technology) | CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs · view |
Wang, Shucheng · more Shucheng Wang (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds · view |
Wang, Wenwen · more Wenwen Wang (University of Georgia) | DQEMU: A Scalable Emulator with Retargetable DBT on Distributed Platforms · view |
Wang, Yanfei · more Yanfei Wang (SenseTime Research) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Wang, Yi · more Yi Wang (Peng Cheng Laboratory) | Jeor: Accelerate Linear Algebra Operation in SSDs · view |
Wang, Zhanye · more Zhanye Wang (Tsinghua University) | CARD: A Congestion-Aware Request Dispatching Scheme for Replicated Metadata Server Cluster · view |
Wang, Zike · more Zike Wang (Huazhong University of Science and Technology) | Mass: Workload-Aware Storage Policy for OpenStack Swift · view |
Wang, Zizhong · more Zizhong Wang (Tsinghua University) | An Adaptive Erasure-Coded Storage Scheme with an Efficient Code-Switching Algorithm · view |
Wartel, Franck · more Franck Wartel (Airbus Defence and Space) | Experiences on the characterization of parallel applications in embedded systems with Extrae/Paraver · view |
Wei, Xueliang · more Xueliang Wei (Huazhong University of Science and Technology) | CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates · view |
Wen, Mei · more Mei Wen (National University of Defense Technology) | Towards High-Efficiency Data Centers via Job-Aware Network Scheduling · view |
Williams-Young, David B. · more David B. Williams-Young (Lawrence Berkeley National Laboratory) | Parallel Shift-Invert Spectrum Slicing on Distributed Architectures with GPU Accelerators · view |
Wu, Chunghsuan · more Chunghsuan Wu (Shanghai Jiao Tong University) | OPS: Optimized Shuffle Management System for Apache Spark · view |
Wu, Guowei · more Guowei Wu (Dalian University of Technology) | Cooperative Game for Multiple Chargers with Dynamic Network Topology · view |
Wu, Hao · more hao wu (Tusimple) | EPMA: Efficient Partial Message Access in IoT Era · view Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications · view |
Wu, Jie · more Jie Wu (Temple University) | Optimizing Flow Bandwidth Consumption with Traffic-diminishing Middlebox Placement · view |
Wu, Ruofan · more Ruofan Wu (Renmin University of China) | CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs · view |
Wu, Weigang · more Weigang Wu (Sun Yat-sen University) | Dual-Way Gradient Sparsification for Asynchronous Distributed Deep Learning · view |
Wu, Xiaorui · more Xiaorui Wu (City University of Hong Kong) | Saec: Similarity-Aware Embedding Compression in Recommendation Systems · view Jeor: Accelerate Linear Algebra Operation in SSDs · view |
Xia, Wen · more Wen Xia (Harbin Institute of Technology, Shenzhen) | Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity · view |
Xiao, Danyang · more Danyang Xiao (Sun Yat-sen University) | Dual-Way Gradient Sparsification for Asynchronous Distributed Deep Learning · view |
Xiao, Nong · more Nong Xiao (National University of Defense Technology) | FEEL: A Federated Edge Learning System for Efficient and Privacy-Preserving Mobile Healthcare · view |
Xing, Huanlai · more Huanlai Xing (Southwest Jiaotong University) | Selective Coflow Completion for Time-sensitive Distributed Applications with Poco · view |
Xu, Fei · more Fei Xu (East China Normal University) | E-LAS: Design and Analysis of Completion-Time Agnostic Scheduling for Distributed Deep Learning Cluster · view |
Xu, Hong · more Hong Xu (City University of Hong Kong) | Saec: Similarity-Aware Embedding Compression in Recommendation Systems · view Jeor: Accelerate Linear Algebra Operation in SSDs · view OPS: Optimized Shuffle Management System for Apache Spark · view |
Xu, Jie · more Jie Xu (George Mason University) | A Reinforcement Learning Based System for Minimizing Cloud Storage Service Cost · view |
Xu, Kai · more Kai Xu (Shandong University) | SWMapper: Scalable Read Mapper on SunWay TaihuLight · view |
Xu, Wenzheng · more Wenzheng Xu (Sichuan University) | Reliability Augmentation of Requests with Service Function Chain Requirements in Mobile Edge-Cloud Networks · view |
Y. Zomaya, Albert · more Albert Y. Zomaya (The University of Sydney) | Federated Learning with Proximal Stochastic Variance Reduced Gradient Algorithms · view |
Yamamoto, Naoya · more Naoya Yamamoto (Hiroshima University) | Huffman Coding with Gap Arrays for GPU Acceleration · view |
Yamazaki, Ichitaro · more Ichitaro Yamazaki (Sandia National Labs) | Performance Portable Supernode-based Sparse Triangular Solver for Manycore Architectures · view |
Yan, Shengen · more Shengen Yan (SenseTime Research) | DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training · view |
yan, yulong · more yulong yan (School of Computer Science and Engineering, Central South University) | PS : Periodic Strategy for the 40-100Gbps Energy Efficient Ethernet · view |
Yan, Zijie · more Zijie Yan (Sun Yat-sen University) | Dual-Way Gradient Sparsification for Asynchronous Distributed Deep Learning · view |
Yang, Baichen · more Baichen Yang (Hong Kong University of Science and Technology) | DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training · view |
Yang, Bin · more Bin Yang (Intel Corporation) | OPS: Optimized Shuffle Management System for Apache Spark · view |
Yang, Chao · more Chao Yang (Lawrence Berkeley National Laboratory) | Parallel Shift-Invert Spectrum Slicing on Distributed Architectures with GPU Accelerators · view |
Yang, Hailong · more Hailong Yang (Beihang University) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Yang, Puyuan · more Puyuan Yang (Alibaba Inc.) | SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds · view |
Yang, Yong · more Yong Yang (Alibaba Cloud) | URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds · view |
Yang, Ziwei · more Ziwei Yang (Dalian University of Technology) | Cooperative Game for Multiple Chargers with Dynamic Network Topology · view |
Yao, Jie · more Jie Yao (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds · view |
Yasudo, Ryota · more Ryota Yasudo (Hiroshima University) | Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs · view |
Yazane, Takashi · more Takashi Yazane (NTT DATA Corporation) | Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs · view |
Ye, Huang · more Huang Ye (Computer Network Information Center, Chinese Academy of Science) | Large-scale Simulations of Peridynamics on Sunway Taihulight Supercomputer · view |
Ye, Liuqing · more Liuqing Ye (Huazhong University of Science and Technology) | CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates · view |
Ye, Songgao · more Songgao Ye (SenseTime Research) | DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training · view |
Yelick, Kathy · more Kathy Yelick (University of California, Berkeley; Lawrence Berkeley National Laboratory) Katherine (Kathy) Yelick is the Robert S. Pepper Distinguished Professor of Electrical Engineering and Computer Sciences and the Associate Dean for Research in the Division of Computing, Data Science and Society at UC Berkeley. She is also the Senior Advisor on Computing at Lawrence Berkeley National Laboratory. Her research is in high performance computing, programming languages, compilers, parallel algorithms, and automatic performance tuning. She currently leads the ExaBiome project on scalable tools for analyzing microbial data and co-leads the Berkeley Benchmarking and Optimization (Bebop) group. She was the Associate Laboratory Director for Computing Sciences at LBNL from 2010 through 2019 and prior to the led the National Energy Research Scientific Computing Center (NERSC). Yelick is a member of National Academy of Engineering and the American Academy of Arts and Sciences, and she is a Fellow of both the Association for Computing Machinery and the American Association for the Advancement of Science. | Genomic Analysis and Learning at Scale: Mapping Irregular Computations to Advanced Architectures · view |
Yew, Pen · more Pen-Chung Yew (University of Minnesota at Twin Cities) | First Time Miss : Low Overhead Mitigation For Shared Memory Cache Side Channels · view DQEMU: A Scalable Emulator with Retargetable DBT on Distributed Platforms · view |
Yu, Ce · more Ce Yu (Tianjin University) | Balancing Fairness and Efficiency for Cache Sharing in Semi-external Memory System · view |
Yu, Fengwei · more Fengwei Yu (SenseTime Research) | Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures · view |
Yu, Hongfang · more Hongfang Yu (University of Electronic Science and Technology of China) | Selective Coflow Completion for Time-sensitive Distributed Applications with Poco · view |
Yuan, Rui · more Rui Yuan (Shanghai Jiao Tong University) | OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment · view |
Yuan, Xu · more Xu Yuan (University of Louisiana at Lafayette) | E-LAS: Design and Analysis of Completion-Time Agnostic Scheduling for Distributed Deep Learning Cluster · view |
Zahn, Felix · more Felix Zahn (Heidelberg University) | On Network Locality in MPI-Based HPC Applications · view |
Zhai, Antonia · more Antonia Zhai (University of Minnesota) | First Time Miss : Low Overhead Mitigation For Shared Memory Cache Side Channels · view |
Zhai, Jidong · more jidong zhai (Tsinghua University, BNRist) | ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs · view Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications · view |
Zhan, Yufeng · more Yufeng Zhan (The Hong Kong Polytechnic University) | SkyChain: A Deep Reinforcement Learning-Empowered Dynamic Blockchain Sharding System · view |
Zhang, Chenyang · more Chenyang Zhang (Renmin University of China) | ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs · view |
Zhang, Chunyuan · more Chunyuan Zhang (National University of Defense Technology) | Towards High-Efficiency Data Centers via Job-Aware Network Scheduling · view |
Zhang, Feng · more Feng Zhang (Renmin University of China) | ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs · view CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs · view |
Zhang, Hequan · more Hequan Zhang (SenseTime Research) | DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training · view |
Zhang, Honglin · more Honglin Zhang (Tencent) | Saec: Similarity-Aware Embedding Compression in Recommendation Systems · view |
Zhang, Jian · more Jian Zhang (Computer Network Information Center, Chinese Academy of Science) | Large-scale Simulations of Peridynamics on Sunway Taihulight Supercomputer · view |
Zhang, Jianting · more Jianting Zhang (Sun Yat-sen University) | SkyChain: A Deep Reinforcement Learning-Empowered Dynamic Blockchain Sharding System · view |
Zhang, Qi · more Qi Zhang (Microsoft) | Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN · view |
Zhang, Sai Qian · more Sai Qian Zhang (Harvard University) | Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN · view |
Zhang, Tao · more Tao Zhang (Changsha University) | Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric · view |
Zhang, Wei · more Wei Zhang (Shanghai Jiao Tong University) | URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds · view |
Zhang, Weizhe · more Weizhe Zhang (Harbin Institute of Technology, Shenzhen) | Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity · view |
Zhang, Xueying · more Xueying Zhang (Wuhan University) | An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks · view |
Zhang, Zheng · more Zheng Zhang (Harbin Institute of Technology, Shenzhen) | Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity · view |
Zhao, Laiping · more Laiping Zhao (Tianjin Key Laboratory of Advanced Networking (TANKLab), College of Intelligence and Computing (CIC), Tianjin University) | XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN · view |
Zhao, Ziyi · more Ziyi Zhao (Nankai University) | DQEMU: A Scalable Emulator with Retargetable DBT on Distributed Platforms · view |
Zheng, Kevin · more Kevin Zheng (University of Virginia) | A Reinforcement Learning Based System for Minimizing Cloud Storage Service Cost · view |
Zheng, Ningxin · more Ningxin Zheng (Shanghai Jiao Tong University) | URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds · view |
Zheng, Wenli · more Wenli Zheng (Shanghai Jiao Tong University) | OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment · view |
Zhou, Amelie Chi · more Amelie Chi Zhou (Shenzhen University) | ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs · view |
Zhou, Bing B. · more Bing B. Zhou (The University of Sydney) | Federated Learning with Proximal Stochastic Variance Reduced Gradient Algorithms · view |
Zhou, Jieying · more Jieying Zhou (Sun Yat-sen University) | Dual-Way Gradient Sparsification for Asynchronous Distributed Deep Learning · view |
Zhou, Ruiting · more Ruiting Zhou (Wuhan University, Chinese University of Hong Kong) | An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks · view |
Zhou, Wen · more Wen Zhou (Huazhong University of Science & Technology) | An Efficient Wear-level Architecture using Self-adaptive Wear Leveling · view |
Zhou, Zhi · more Zhi Zhou (Sun Yat-sen University) | An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks · view |
Zhuang, Zirui · more Zirui Zhuang (Beijing University of Posts and Telecommunications) | DeepHop on Edge: Hop-by-hop Routing by Distributed Learning with Semantic Attention · view |
Zou, Pengfei · more Pengfei Zou (Clemson University) | Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines · view |
Zou, Xiangyu · more Xiangyu Zou (Harbin Institute of Technology, Shenzhen) | Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity · view |
Zuo, Pengfei · more Pengfei Zuo (Huazhong University of Science & Technology) | An Efficient Wear-level Architecture using Self-adaptive Wear Leveling · view |