John Owens / Electrical and Computer Engineering / UC Davis

John Owens's calculated h-index is 59. This page was automatically generated on 2024-03-06.

3367	Owens:2007:ASO	A Survey of General-Purpose Computation on Graphics Hardware
2850	Owens:2008:GC	GPU Computing
1328	Rixner:2000:MAS	Memory Access Scheduling
1138	Harris:2007:PPS	Parallel Prefix Sum (Scan) with CUDA
907	Liu:2020:EOD	Energy-based Out-of-distribution Detection
843	Sengupta:2007:SPF	Scan Primitives for GPU Computing
646	Owens:2007:RCF	Research Challenges for On-Chip Interconnection Networks
628	Wang:2016:GAH	Gunrock: A High-Performance Graph Processing Library on the GPU
504	Khailany:2001:IMP	Imagine: Media Processing with Streams
446	Kapasi:2003:PSP	Programmable Stream Processors
412	Rixner:2000:ROF	Register Organization for Media Processing
393	Zhang:2011:AQP	A Quantitative Performance Analysis Model for GPU Architectures
368	Kapasi:2002:TIS	The Imagine Stream Processor
364	Rixner:1998:ABA	A Bandwidth-Efficient Architecture for Media Processing
337	Zhang:2010:FTS	Fast Tridiagonal Solvers on the GPU
293	Stuart:2011:MMO	Multi-GPU MapReduce on GPU Clusters
273	Kepner:2016:MFO	Mathematical Foundations of the GraphBLAS
265	Gupta:2012:ASO	A Study of Persistent Threads Style GPU Programming for GPGPU Workloads
264	Alcantara:2009:RPH	Real-Time Parallel Hashing on the GPU
251	Davidson:2014:WPG	Work-Efficient Parallel GPU Methods for Single-Source Shortest Paths
232	Lefohn:2006:GGE	Glift: Generic, Efficient, Random-Access GPU Data Structures
176	Owens:2005:SAA	Streaming Architectures and Technology Trends
171	Tzeng:2010:TMF	Task Management for Irregular-Parallel Workloads on the GPU
167	Muyan-Ozcelik:2008:FDR	Fast Deformable Registration on the GPU: A CUDA Implementation of Demons
155	Silberstein:2008:ECO	Efficient Computation of Sum-products on GPUs Through Software-Managed Cache
153	Park:2006:DSI	Discrete Sibson Interpolation
151	Wang:2017:GGG	Gunrock: GPU Graph Analytics
150	Samant:2008:HPC	High performance computing for deformable image registration: Towards a new paradigm in adaptive radiotherapy
143	Kapasi:2000:ECO	Efficient Conditional Operations for Data-parallel Architectures
138	Kass:2006:IDO	Interactive Depth of Field Using Simulated Diffusion on a GPU
135	Patel:2012:PLD	Parallel Lossless Data Compression on the GPU
135	Ebeida:2011:EMP	Efficient Maximal Poisson-Disk Sampling
126	Owens:2002:MPA	Media Processing Applications on the Imagine Stream Processor
123	Davidson:2011:AAM	An Auto-tuned Method for Solving Large Tridiagonal Systems on the GPU
122	Ebeida:2012:ASA	A Simple Algorithm for Maximal Poisson-Disk Sampling in High Dimensions
121	Sengupta:2006:AWS	A Work-Efficient Step-Efficient Prefix Sum Algorithm
115	Yang:2018:DPF	Design Principles for Sparse Matrix Multiplication on the GPU
113	Stuart:2009:MPO	Message Passing on Data-Parallel Architectures
100	Owens:2000:PRO	Polygon Rendering on a Stream Architecture
97	Phillips:2009:RAP	Rapid Aerodynamic Performance Prediction on a Cluster of Graphics Processing Units
95	Lefohn:2007:RSM	Resolution-Matched Shadow Maps
90	Davidson:2012:EPM	Efficient Parallel Merge Sort for Fixed and Variable Length Keys
85	Stuart:2010:MVR	Multi-GPU Volume Rendering using MapReduce
84	Pan:2017:MGA	Multi-GPU Graph Analytics
81	Alcantara:2011:BAE	Building an Efficient Hash Table on the GPU
78	Khailany:2003:ETV	Exploring the VLSI Scalability of Stream Processors
76	Yang:2022:GAH	GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
76	Patney:2008:RRA	Real-Time Reyes-Style Adaptive Surface Subdivision
75	Kapasi:2001:SS	Stream Scheduling
74	Stuart:2011:ESP	Efficient Synchronization Primitives for GPUs
73	Ashkiani:2018:ADH	A Dynamic Hash Table for the GPU
72	Budge:2009:ODM	Out-of-core Data Management for Path Tracing on Hybrid Resources
69	Wang:2016:ACS	A Comparative Study on Exact Triangle Counting Algorithms on the GPU
69	Mattson:2000:CS	Communication Scheduling
66	Davidson:2011:RPF	Register Packing for Cyclic Reduction: A Case Study
65	Davidson:2012:TTF	Toward Techniques for Auto-tuning GPU Algorithms
64	Patney:2009:PVT	Parallel View-Dependent Tessellation of Catmull-Clark Subdivision Surfaces
63	Jenkins:2011:LLF	Lessons Learned from Exploring the Backtracking Paradigm on the GPU
61	Owens:2002:CGO	Computer Graphics on a Stream Architecture

58	Moerschell:2008:DTM	Distributed Texture Memory in a Multi-GPU Environment
57	Lefohn:2005:IEP	Implementing Efficient Parallel Data Structures on GPUs
57	Szumel:2005:TAM	Towards a Mobile Agent Framework for Sensor Networks
50	Awad:2019:EAH	Engineering a High-Performance GPU B-Tree
45	Yang:2015:FSM	Fast Sparse Matrix and Sparse Vector Multiplication Algorithm on the GPU
45	Tzeng:2012:AGT	A GPU Task-Parallel Model with Dependency Resolution
45	Ebeida:2011:EAG	Efficient and Good Delaunay Meshes From Random Points
44	Abdelkader:2020:VVM	VoroCrust: Voronoi Meshing Without Clipping
41	Yang:2018:IPE	Implementing Push-Pull Efficiently in GraphBLAS
40	Riffel:2004:MFM	Mio: Fast Multipass Partitioning via Priority-Based Instruction Scheduling
40	Owens:2002:CRA	Comparing Reyes and OpenGL on a Stream Architecture
39	Ebeida:2011:ICR	Isotropic conforming refinement of quadrilateral and hexahedral meshes using two-refinement templates
38	Wu:2015:PCO	Performance Characterization of High-Level Programming Models for GPU Graph Analytics
36	Lefohn:2005:DAS	Dynamic Adaptive Shadow Maps on Graphics Hardware
31	Stone:2011:GPA	GPGPU parallel algorithms for structured-grid CFD codes
31	Stuart:2011:EMT	Extending MPI to Accelerators
30	Ashkiani:2018:GLA	GPU LSM: A Dynamic Dictionary Data Structure for the GPU
30	Ashkiani:2016:GM	GPU Multisplit
30	Glavtchev:2011:FSL	Feature-Based Speed Limit Sign Detection Using a Graphics Processing Unit
30	Owens:2005:AOG	Assessment of Graphic Processing Units (GPUs) for Department of Defense (DoD) Digital Signal Processing (DSP) Applications
29	Lin:2019:BDL	Benchmarking Deep Learning Frameworks and Investigating FPGA Deployment for Traffic Sign Classification and Detection
28	Stuart:2010:GC	GPU-to-CPU Callbacks
28	Zhang:2011:AHM	A Hybrid Method for Solving Tridiagonal Systems on the GPU
27	Gosink:2009:DPB	Data Parallel Bin-Based Indexing for Answering Queries on Multi-Core Architectures
27	Tzeng:2012:FCH	Finding Convex Hulls Using Quickhull on the GPU
26	Geil:2018:QFA	Quotient Filters: Approximate Membership Queries on the GPU
25	Kniss:2005:OTO	Octree Textures on Graphics Hardware
24	Awad:2020:DGO	Dynamic Graphs on the GPU
24	Osama:2019:GCO	Graph Coloring on the GPU
24	Patney:2015:PAF	Piko: A Framework for Authoring Programmable Graphics Pipelines
24	Patney:2010:FCA	Fragment-Parallel Composite and Filter
23	Gupta:2009:TOF	Three-Layer Optimizations for Fast GMM Computations on GPU-like Parallel Processors
23	Park:2005:AFF	A Framework for Real-Time Volume Visualization of Streaming Scattered Data
22	Wang:2015:FSA	Fast Parallel Suffix Array on the GPU
22	Phillips:2010:UTS	Unsteady Turbulent Simulations on a Cluster of Graphics Processors
21	Gosink:2008:BIA	Bin-Hash Indexing: A Parallel Method For Fast Query Processing
20	Ebeida:2014:KDS	$k$-d Darts: Sampling by $k$-Dimensional Flat Searches
20	Tzeng:2012:HPD	High-Quality Parallel Depth-of-Field Using Line Samples
20	Muyan-Ozcelik:2010:ATA	A Template-Based Approach for Real-Time Speed-Limit-Sign Recognition on an Embedded System using GPU Computing
20	Ma:2007:UVR	Ultra-Scale Visualization: Research and Education
19	Wang:2016:FPS	Fast Parallel Skew and Prefix-Doubling Suffix Array Construction on the GPU
19	Serebrin:2002:ASP	A Stream Processor Development Platform
18	Zhang:2011:APE	A Parallel Error Diffusion Implementation on a GPU
17	Wang:2019:ADI	Accelerating DNN Inference with GraphBLAS and the GPU
17	Gupta:2011:CAM	Compute \& Memory Optimizations for High-Quality Speech Recognition on Low-End GPU Processors
17	Szumel:2006:TVP	The Virtual Pheromone Communication Primitive
16	Muyan-Ozcelik:2011:RSR	Real-Time Speed-Limit-Sign Recognition on an Embedded System Using a GPU
16	Khailany:2000:ISA	Imagine: Signal and Image Processing Using Streams
15	Abdelkader:2018:SCF	Sampling Conditions for Conforming Voronoi Meshing by the VoroCrust Algorithm
15	Abdelkader:2017:ACR	A Constrained Resampling Strategy for Mesh Improvement
13	Pan:2018:SBS	Scalable Breadth-First Search on a GPU Cluster
13	Ashkiani:2017:GMA	GPU Multisplit: an extended study of a parallel algorithm
13	Geil:2014:WGC	WTF, GPU! Computing Twitter's Who-To-Follow on the GPU
13	Wang:2020:FGS	Fast Gunrock Subgraph Matching (GSM) on GPUs
12	Ebeida:2013:SD	Sifted Disks
11	Muyan-Ozcelik:2016:MRE	Multitasking Real-time Embedded GPU Computing Tasks
11	Zhang:2012:PDE	Plane-dependent Error Diffusion on a GPU
10	Osama:2022:EOP	Essentials of Parallel Graph Analytics
10	Wang:2019:FBT	Fast BFS-Based Triangle Counting on GPUs
10	Ebeida:2016:DDT	Disk Density Tuning of a Maximal Random Packing
9	Mahmoud:2021:RAG	RXMesh: A GPU Mesh Data Structure
9	Yih:2018:FVG	FPGA versus GPU for Speed-Limit-Sign Recognition
9	Ashkiani:2016:PAT	Parallel Approaches to the String Matching Problem on the GPU
8	Seitz:2019:SMF	Staged Metaprogramming for Shader System Development
8	Liu:2018:OLA	Object Localization and Motion Transfer learning with Capsules
7	Lin:2022:BAP	Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
7	Owens:2007:TMS	Towards Multi-GPU Support for Visualization
6	Osama:2023:SWP	Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU
6	Owens:2004:GTF	GPUs tapped for general computing
5	Odemuyiwa:2023:ASD	Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling
5	Seitz:2022:SUS	Supporting Unified Shader Specialization by Co-opting C++ Features
5	Lin:2018:BDL	Benchmarking Deep Learning Frameworks with FPGA-suitable Models on a Traffic Sign Dataset
5	Weber:2015:PRA	Parallel Reyes-style Adaptive Subdivision with Bounded Memory Usage
4	Awad:2023:AAI	Analyzing and Implementing GPU Hash Tables
4	Chen:2022:AAT	Atos: A Task-Parallel GPU Scheduler for Graph Analytics
4	Ebeida:2014:EIH	Exercises in High-Dimensional Sampling: Maximal Poisson-disk Sampling and $k$-d Darts
4	Phillips:2011:AO2	Acceleration of 2-D Compressible Flow Solvers with Graphics Processing Unit Clusters
4	Szumel:2003:OTF	On the Feasibility of the UC Davis Metanet
3	Brock:2019:RVR	RDMA vs.\ RPC for Implementing Distributed Data Structures
3	Abdelkader:2018:VIT	VoroCrust Illustrated: Theory and Challenges (Multimedia Exposition)
3	Mak:2014:GAE	GPU-Accelerated and Efficient Multi-View Triangulation for Scene Reconstruction
2	Osama:2023:APM	A Programming Model for GPU Load Balancing
2	Chen:2022:SIP	Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way
2	Owens:2018:TPG	Technical Perspective: Graphs, Betweenness Centrality, and the GPU
2	Muyan-Ozcelik:2017:MFM	Methods for Multitasking among Real-time Embedded Compute Tasks Running on the GPU
2	Wang:2017:MAL	Mini-Gunrock: A Lightweight Graph Analytics Framework on the GPU
2	Gegan:2016:RGT	Real-Time GPU-based Timing Channel Detection using Entropy
2	Seitz:2013:AGI	A GPU Implementation for Two-Dimensional Shallow Water Modeling
2	Owens:2004:OTS	On The Scalability of Sensor Network Routing and Compression Algorithms
1	Awad:2022:AGM	A GPU Multiversion B-Tree
1	Kemal:2016:MSA	Multidisciplinary simulation acceleration using multiple shared memory graphical processing units
1	Silberstein:2011:ASC	Applying Software-Managed Caching and CPU/GPU Task Scheduling for Accelerating Dynamic Workloads
1	Owens:2006:TIA	The Installation and Use of OpenType Fonts in \LaTeX
1	Awad:2021:BGH	Better GPU Hash Tables
1	Liu:2019:UOS	Unsupervised Object Segmentation with Explicit Localization Module

Navigate