Home X Github About

Papers

Papers Index

Deduplicated index of papers referenced across research docs. Sorted alphabetically by first author within each section.


Database Internals & Query Processing

PaperVenueCited In
Abadi, Madden, Hachem, "The Design and Implementation of Modern Column-Oriented Database Systems"Foundations and Trends in Databases, 2013Arrow Format, Arrow PG Integration
Abadi, Madden, Hachem, "Column-Stores vs. Row-Stores: How Different Are They Really?"SIGMOD 2008ClickHouse Internals
Athanassoulis, Kester et al., "Designing Access Methods: The RUM Conjecture"EDBT 2016WAL & Torn Pages
Begoli, Camacho-Rodríguez, Hyde, Mior, Lemire, "Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources"SIGMOD 2018Calcite Internals
Boncz, Zukowski, Nes, "MonetDB/X100: Hyper-Pipelining Query Execution"CIDR 2005Arrow PG Integration, DuckDB Internals, Velox Internals
Berenson et al., "A Critique of ANSI SQL Isolation Levels"SIGMOD 1995MongoDB & WiredTiger
Bottcher, Leis, Neumann, Kemper, "Scalable Garbage Collection for In-Memory MVCC Systems"VLDB 2019HyPer/Umbra/CedarDB
Budiu et al., "DBSP" (Database Stream Processor framework)VLDB 2023Database Systems
Cahill et al., Serializable Snapshot Isolation2008Database Systems
Diaconu et al., "Hekaton: SQL Server's Memory-Optimized OLTP Engine"SIGMOD 2013WAL & Torn Pages
Freitag, Kemper, Neumann, "Memory-Optimized Multi-Version Concurrency Control for Disk-Based Database Systems"VLDB 2022HyPer/Umbra/CedarDB
Goldstein, Larson, "Optimizing Queries Using Materialized Views: A Practical, Scalable Solution"SIGMOD 2001Calcite Internals
Graefe, "The Cascades Framework for Query Optimization"IEEE Data Eng. Bull., 1995Calcite Internals, CockroachDB Optimizer Rules
Graefe, McKenna, "The Volcano Optimizer Generator: Extensibility and Efficient Search"ICDE 1993Calcite Internals
Graefe, "Modern B-Tree Techniques"Foundations and Trends in Databases, 2011MongoDB & WiredTiger, Data Structures
Graefe, "Query Evaluation Techniques for Large Databases"ACM Computing Surveys, 1993Join Algorithms
Harinarayan, Rajaraman, Ullman, "Implementing Data Cubes Efficiently"SIGMOD 1996Calcite Internals
Gray, Putzolu, "The Five-Minute Rule for Trading Memory for Disc Accesses"SIGMOD 1987WAL & Torn Pages
Gray, Reuter, "Transaction Processing: Concepts and Techniques"Morgan Kaufmann, 1993WAL & Torn Pages
Hellerstein, Stonebraker, Hamilton, "Architecture of a Database System"Foundations and Trends in Databases, 2007WAL & Torn Pages
Kersten, Leis, Neumann, "Tidy Tuples and Flying Start: Fast Compilation and Fast Execution of Relational Queries in Umbra"VLDB Journal 2021Database Systems, HyPer/Umbra/CedarDB
Kersten, Leis, Kemper, Neumann, Pavlo, Boncz, "Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask"VLDB 2018Arrow PG Integration, HyPer/Umbra/CedarDB, Join Algorithms, ISA Critical Instructions, Velox Internals
Leis, Kemper, Neumann, "The Adaptive Radix Tree: ARTful Indexing for Main-Memory Databases"ICDE 2013Database Systems, HyPer/Umbra/CedarDB, Data Structures
Leis, Gubichev, Mirber, Olteanu, Kemper, Neumann, "How Good Are Query Optimizers, Really?"VLDB 2015Join Algorithms, Database Statistics, DuckDB Internals
Mohan, Haderle, Lindsay, Pirahesh, Schwarz, "ARIES: A Transaction Recovery Method Supporting Fine-Granularity Locking"ACM TODS, 1992WAL & Torn Pages
Muhlbauer, Rodiger, Kemper, Neumann, "Fast Serializable Multi-Version Concurrency Control for Main-Memory Database Systems"SIGMOD 2015HyPer/Umbra/CedarDB, DuckDB Internals
Neumann, "Efficiently Compiling Efficient Query Plans for Modern Hardware"VLDB 2011Arrow Format, Database Systems, HyPer/Umbra/CedarDB, ClickHouse Internals
Neumann, Freitag, "Umbra: A Disk-Based System with In-Memory Performance"CIDR 2020Database Systems, HyPer/Umbra/CedarDB, DuckDB Internals
Raasveldt, Muhleisen, "DuckDB: an Embeddable Analytical Database"SIGMOD 2019 (demo)DuckDB Internals
Raasveldt, Muhleisen, "Don't Hold My Data Hostage – A Case for Client Protocol Redesign"VLDB 2017DuckDB Internals
Pelkonen, Franklin, Cavallaro, Huang, Meza, Teller, Veeraraghavan, "Gorilla: A Fast, Scalable, In-Memory Time Series Database"VLDB 2015ClickHouse Internals
Selinger et al., "Access Path Selection in a Relational Database Management System"SIGMOD 1979Join Algorithms
Stonebraker et al., "C-Store: A Column-oriented DBMS"VLDB 2005ClickHouse Internals
Stonebraker, "The Design of POSTGRES Storage System"1987WAL Incremental Conversion
Zukowski, Heman, Nes, Boncz, "Super-Scalar RAM-CPU Cache Compression"ICDE 2006ClickHouse Internals

Join Algorithms

PaperVenueCited In
Albutiu, Kemper, Neumann, "Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems"VLDB 2012Join Algorithms
Atserias, Grohe, Marx, "Size Bounds and Query Plans for Relational Joins" (AGM bound)FOCS 2008 / SICOMP 2013Join Algorithms
Avnur, Hellerstein, "Eddies: Continuously Adaptive Query Processing"SIGMOD 2000Join Algorithms
Balkesen, Alonso, Teubner, Ozsu, "Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited"VLDB 2013Join Algorithms
Bandle, Giceva, Neumann, "To Partition, or Not to Partition, That Is the Join Question"SIGMOD 2021Join Algorithms
Birler, Kemper, Neumann, "Robust Join Processing with Diamond Hardened Joins"VLDB 2024Join Algorithms
Birler, Schmidt, Fent, Neumann, "Simple, Efficient, and Robust Hash Tables for Join Processing"DaMoN 2024 (Best Paper)Join Algorithms
Blanas, Li, Hellerstein, Patel, "Design and Evaluation of Main Memory Hash Join Algorithms for Multi-Core CPUs"SIGMOD 2011Join Algorithms
Blanas, Patel, Ercegovac, Rao, Shekita, Tian, "A Comparison of Join Algorithms for Log Processing in MapReduce"SIGMOD 2010Join Algorithms
Boncz et al., "MonetDB/X100: Hyper-Pipelining Query Execution"CIDR 2005Arrow Format, Arrow PG Integration, DuckDB Internals
DeWitt, Gerber, "Multiprocessor Hash-Based Join Algorithms" (Hybrid hash join)VLDB 1985Join Algorithms
DeWitt, Katz, Olken, Shapiro, Stonebraker, Wood, "Implementation Techniques for Main Memory Database Systems"SIGMOD 1984Join Algorithms
Gross, ten Wolde, Boncz, "Adaptive Factorization Using Linear-Chained Hash Tables"CIDR 2025Join Algorithms
Khayyat et al., "Lightning Fast and Space Efficient Inequality Joins" (IEJoin)VLDB 2015 (pub. 2016)Join Algorithms
Kim, Sedlar, Chhugani et al., "Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs"VLDB 2009Join Algorithms
Kitsuregawa, Tanaka, Moto-oka, "Application of Hash to Database Machine and Its Architecture" (Grace hash join)New Generation Computing, 1983Join Algorithms
Kuiper, Gross, Boncz, Muhleisen, "Saving Private Hash Join"VLDB 2025Join Algorithms
Leis, Boncz, Kemper, Neumann, "Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework"SIGMOD 2014Database Systems, HyPer/Umbra/CedarDB, Join Algorithms, DuckDB Internals, Velox Internals
Pedreira, Erling, Basmanova, Wilfong, Sakka, Pai, He, Chattopadhyay, "Velox: Meta's Unified Execution Engine"PVLDB 2022Velox Internals
Manegold, Boncz, Kersten, "Optimizing Main-Memory Join on Modern Hardware"TKDE 2002Join Algorithms
Moerkotte, Neumann, "Analysis of Two Existing and One New Dynamic Programming Algorithm" (DPccp)VLDB 2006Join Algorithms
Moerkotte, Neumann, "Dynamic Programming Strikes Back" (DPhyp)SIGMOD 2008Join Algorithms, DuckDB Internals
Ngo, Porat, Re, Rudra, "Worst-Case Optimal Join Algorithms"PODS 2012Join Algorithms
Ngo, Re, Rudra, "Skew Strikes Back: New Developments in the Theory of Join Algorithms"SIGMOD Record 2013Join Algorithms
Okcan, Riedewald, "Processing Theta-Joins using MapReduce"SIGMOD 2011Join Algorithms
Polychroniou, Raghavan, Ross, "Rethinking SIMD Vectorization for In-Memory Databases"SIGMOD 2015Join Algorithms, ISA Critical Instructions, Data Structures
Qiao, Huang, Zhang, "Data Chunk Compaction in Vectorized Execution"SIGMOD 2025Join Algorithms
Raman, Hellerstein, "State Modules (SteMs) for Adaptive Query Processing"VLDB 2003Join Algorithms
Schuh, Chen, Dittrich, "An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory"SIGMOD 2016Join Algorithms
Shapiro, "Join Processing in Database Systems with Large Main Memories"ACM Computing Surveys, 1986Join Algorithms
Shatdal, Kant, Naughton, "Cache Conscious Algorithms for Relational Query Processing"VLDB 1994Join Algorithms
Urhan, Franklin, "XJoin: A Reactively-Scheduled Pipelined Join Operator"IEEE Data Engineering Bulletin, 2000Join Algorithms
Veldhuizen, "Leapfrog Triejoin: A Simple, Worst-Case Optimal Join Algorithm"ICDT 2014Join Algorithms

Buffer Management & Storage Engines

PaperVenueCited In
Chang et al., "Resource-Adaptive Query Execution with Paged Memory Management" (LIPAH)CIDR 2025Buffer Management
Johnson et al., "Shore-MT: A Scalable Storage Manager for the Multicore Era"EDBT 2009Buffer Management
Leis, Haubenschild, Neumann, "LeanStore: In-Memory Data Management Beyond Main Memory"ICDE 2018Buffer Management, HyPer/Umbra/CedarDB
Leis et al., "Optimistic Lock Coupling"IEEE Data Eng. Bull. 2019Buffer Management
Leis et al., "Virtual-Memory Assisted Buffer Management" (vmcache)SIGMOD 2023Buffer Management
Vohringer, Leis, "Write-Aware Timestamp Tracking" (WATT eviction)VLDB 2023Buffer Management
Zinsmeister, Nguyen, Leis, Neumann, "Predictive Translation: High-Performance Buffer Management Without the Trade-Offs"SIGMOD 2026Buffer Management
Boncz, Neumann, Leis, "FSST: Fast Random Access String Compression"VLDB 2020DuckDB Internals
Liakos, Papakonstantinopoulou, Kotidis, "Chimp: Efficient Lossless Floating Point Compression for Time Series Databases"VLDB 2022DuckDB Internals
Afroozeh, Kuffo, Boncz, "ALP: Adaptive Lossless floating-Point Compression"SIGMOD 2024DuckDB Internals

LSM Trees & Write-Optimized Storage

PaperVenueCited In
Balmau, Didona et al., "SILK: Preventing Latency Spikes in Log-Structured Merge Key-Value Stores"USENIX ATC 2019WAL & Torn Pages
Bjorling et al., "ZNS: Avoiding the Block Interface Tax for Flash-based SSDs"USENIX ATC 2021LSM Trees
Chang et al., "Bigtable: A Distributed Storage System for Structured Data"OSDI 2006LSM Trees
Dai et al., "Bourbon: Learned Index Structures in Storage Engines"SOSP 2020LSM Trees
Dayan, Idreos, "Dostoevsky: Better Space-Time Trade-Offs for LSM-Tree Based Key-Value Stores"SIGMOD 2018LSM Trees
Dayan et al., "Monkey: Optimal Navigable Key-Value Store"SIGMOD 2017LSM Trees
Dillinger, Walzer, "Ribbon Filter: Practically Smaller Than Bloom and Xor"2021LSM Trees, Data Structures
Dong et al., "RocksDB: Evolution of Development Priorities in a Key-Value Store"TODS 2021LSM Trees
Kaiyrakhmet et al., "SLM-DB: Single-Level Key-Value Store with Persistent Memory"FAST 2019LSM Trees
Lu, Pillai et al., "WiscKey: Separating Keys from Values in SSD-Conscious Storage"FAST 2016WAL & Torn Pages, LSM Trees
Luo, Carey, "LSM-based Storage Techniques: A Survey"VLDB Journal 2020LSM Trees, Data Structures
O'Neil, Cheng, Gawlick, O'Neil, "The Log-Structured Merge-Tree (LSM-Tree)"Acta Informatica, 1996WAL & Torn Pages, LSM Trees, MongoDB & WiredTiger, ClickHouse Internals
Yao et al., "MatrixKV: Reducing Write Stalls and Write Amplification in LSM-tree Based KV Stores"USENIX ATC 2020LSM Trees
Zhong et al., "REMIX: Efficient Range Query for LSM-trees"FAST 2021LSM Trees

Disaggregated Storage & Cloud Databases

PaperVenueCited In
Antonopoulos et al., "Socrates: The New SQL Server in the Cloud"SIGMOD 2019Disaggregated Storage
Armbrust et al., "Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores"VLDB 2020Disaggregated Storage
Bacon et al., "Spanner: Becoming a SQL System"SIGMOD 2017Disaggregated Storage, Distributed Consensus
Cao et al., "PolarFS: An Ultra-low Latency and Failure Resilient Distributed File System"VLDB 2018LSM Trees, Disaggregated Storage
Shvachko, Kuang, Radia, Chansler, "The Hadoop Distributed File System"MSST 2010ClickHouse Internals
Corbett et al., "Spanner: Google's Globally-Distributed Database"OSDI 2012WAL & Torn Pages, Disaggregated Storage, Distributed Consensus
Dageville et al., "The Snowflake Elastic Data Warehouse"SIGMOD 2016Disaggregated Storage
Depoutovitch et al., "Taurus Database: How to Be Fast, Available, and Frugal in the Cloud"SIGMOD 2020Disaggregated Storage
Dong, Zhang et al., "Cloud-Native Databases: A Survey"IEEE TKDE 2024Disaggregated Storage
GaussDB team, "GaussDB: A Cloud-Native Multi-Primary Database with Compute-Memory-Storage Disaggregation"VLDB 2024Disaggregated Storage
Ghemawat, Gobioff, Leung, "The Google File System"SOSP 2003Disaggregated Storage
Pang, Wang, "Understanding the Performance Implications of the Design Principles in Storage-Disaggregated Databases" (OpenAurora)SIGMOD 2024Disaggregated Storage
PolarDB-MP team, "PolarDB-MP: A Multi-Primary Cloud-Native Database via Disaggregated Shared Memory"SIGMOD Companion 2024Disaggregated Storage
PolarDB team, "From Scale-Up to Scale-Out: PolarDB's Journey to Achieving 2 Billion tpmC"VLDB 2025Disaggregated Storage
Verbitski et al., "Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases"SIGMOD 2017WAL & Torn Pages, Disaggregated Storage
Verbitski et al., "Amazon Aurora: On Avoiding Distributed Consensus for I/Os, Commits, and Membership Changes"SIGMOD 2018Disaggregated Storage
Vuppalapati et al., "Building An Elastic Query Engine on Disaggregated Storage"NSDI 2020Disaggregated Storage
Wang, Zhang, "Disaggregated Database Systems" (Tutorial)SIGMOD 2023Disaggregated Storage
Wang et al., "Cache Coherence Over Disaggregated Memory" (SELCC)VLDB 2025Disaggregated Storage
Weisgut et al., "CXL Memory Performance for In-Memory Data Processing"VLDB 2025Disaggregated Storage
Yu et al., "Disaggregation: A New Architecture for Cloud Databases"VLDB 2025Disaggregated Storage

Distributed Consensus & Replication

PaperVenueCited In
Abadi, "Consistency Tradeoffs in Modern Distributed Database System Design"Computer (IEEE), 2012Distributed Consensus
Ailijiang, Charapko, Demirbas, Mitra, "WPaxos: Wide Area Network Flexible Consensus"IEEE TPDS 2020Distributed Consensus
Baudet et al., "State Machine Replication in the Libra Blockchain"The Libra Association, 2019Distributed Consensus
Buchman, Kwon, Milosevic, "The Latest Gossip on BFT Consensus"arXiv 2018Distributed Consensus
Castro, Liskov, "Practical Byzantine Fault Tolerance"OSDI 1999Distributed Consensus
Castro, Liskov, "Practical Byzantine Fault Tolerance and Proactive Recovery"TOCS 2002Distributed Consensus
Chandra, Griesemer, Redstone, "Paxos Made Live: An Engineering Perspective"PODC 2007Distributed Consensus
Chandra, Toueg, "Unreliable Failure Detectors for Reliable Distributed Systems"JACM 1996Distributed Consensus
Danezis, Kokoris-Kogias, Sonnino, Spiegelman, "Narwhal and Tusk: A DAG-based Mempool and Efficient BFT Consensus"EuroSys 2022Distributed Consensus
Dwork, Lynch, Stockmeyer, "Consensus in the Presence of Partial Synchrony"JACM 1988Distributed Consensus
Fischer, Lynch, Paterson, "Impossibility of Distributed Consensus with One Faulty Process" (FLP)JACM 1985Distributed Consensus, Deterministic Simulation Testing
Gafni, Lamport, "Disk Paxos"Distributed Computing, 2003Distributed Consensus
Gilbert, Lynch, "Brewer's Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services"SIGACT News 2002Distributed Consensus
Howard, Malkhi, Spiegelman, "Flexible Paxos: Quorum Intersection Revisited"OPODIS 2016Distributed Consensus
Howard, Mortier, "Paxos vs Raft: Have We Reached Consensus on Distributed Consensus?"HotCloud 2020Distributed Consensus
Huang et al., "TiDB: A Raft-based HTAP Database"VLDB 2020Disaggregated Storage, Distributed Consensus
Hunt, Konar, Junqueira, Reed, "ZooKeeper: Wait-free Coordination for Internet-scale Systems"USENIX ATC 2010Distributed Consensus
Junqueira, Reed, Serafini, "Zab: High-performance Broadcast for Primary-backup Systems"DSN 2011Distributed Consensus
Kreps, Narkhede, Rao, "Kafka: A Distributed Messaging System for Log Processing"NetDB Workshop, 2011Distributed Consensus, Kafka Internals
Lamport, "Fast Paxos"Distributed Computing, 2006Distributed Consensus
Lamport, "Generalized Consensus and Paxos"MSR Technical Report, 2005Distributed Consensus
Lamport, "Paxos Made Simple"SIGACT News, 2001Distributed Consensus
Lamport, "The Part-Time Parliament"TOCS 1998Distributed Consensus
Lamport, "Time, Clocks, and the Ordering of Events in a Distributed System"CACM 1978Deterministic Simulation Testing
Lamport, Massa, "Cheap Paxos"DSN 2004Distributed Consensus
Liskov, Cowling, "Viewstamped Replication Revisited"MIT-CSAIL-TR-2012-021, 2012Distributed Consensus
Mao, Junqueira, Marzullo, "Mencius: Building Efficient Replicated State Machines for WANs"OSDI 2008Distributed Consensus
Moraru, Andersen, Kaminsky, "There Is More Consensus in Egalitarian Parliaments" (EPaxos)SOSP 2013Distributed Consensus
Oki, Liskov, "Viewstamped Replication: A New Primary Copy Method"PODC 1988Distributed Consensus
Ongaro, Ousterhout, "In Search of an Understandable Consensus Algorithm" (Raft)USENIX ATC 2014Distributed Consensus
Santos, Schiper, "Optimizing Paxos with Batching and Pipelining"Theoretical Computer Science, 2012Distributed Consensus
Shapiro, Preguica, Baquero, Zawirski, "Conflict-free Replicated Data Types"SSS 2011Distributed Consensus, Data Structures
Taft et al., "CockroachDB: The Resilient Geo-Distributed SQL Database"SIGMOD 2020Distributed Consensus, CockroachDB Optimizer Rules
Van Renesse, Altinbuken, "Paxos Made Moderately Complex"ACM Computing Surveys, 2015Distributed Consensus
Yin, Malkhi, Reiter, Gueta, Abraham, "HotStuff: BFT Consensus with Linearity and Responsiveness"PODC 2019Distributed Consensus
Zhou et al., "FoundationDB: A Distributed Unbundled Transactional Key Value Store"SIGMOD 2021Disaggregated Storage, Distributed Consensus, Deterministic Simulation Testing

Deterministic Simulation Testing

PaperVenueCited In
Alpern, Schneider, "Defining Liveness"Information Processing Letters, 1985Distributed Consensus, Deterministic Simulation Testing
Chen, Groce, Zhang, Wong, Fern, Eide, Regehr, "Taming Compiler Fuzzers" (Swarm testing)PLDI 2013Deterministic Simulation Testing
Newcombe, Rath, Zhang, Metz, Kelley, "How Amazon Web Services Uses Formal Methods"CACM 2015Deterministic Simulation Testing

Storage Reliability & Crash Consistency

PaperVenueCited In
Alagappan et al., "Protocol-Aware Recovery for Consensus-Based Storage"FAST 2018 (Best Paper)WAL & Torn Pages
He, Kannan et al., "The Unwritten Contract of Solid State Drives"EuroSys 2017WAL & Torn Pages
Michael, "Hazard Pointers: Safe Memory Reclamation for Lock-Free Objects"IEEE TPDS 2004MongoDB & WiredTiger, Data Structures
Pillai et al., "All File Systems Are Not Created Equal: On the Complexity of Crafting Crash-Consistent Applications"OSDI 2014WAL & Torn Pages
Rebello et al., "Can Applications Recover from fsync Failures?"USENIX ATC 2020WAL & Torn Pages
Schroeder, Merchant et al., "Flash Reliability in Production: The Expected and the Unexpected"FAST 2016WAL & Torn Pages

File Systems

PaperVenueCited In
Boyd-Wickizer et al., "An Analysis of Linux Scalability to Many Cores"OSDI 2010Filesystem Design
Ganger, Patt, "Soft Updates: A Solution to the Metadata Update Problem in File Systems"1994Filesystem Design
Harter et al., "All File Systems Are Not Created Equal"2011Filesystem Design
Kadekodi et al., "SplitFS: Reducing Software Overhead in File Systems for Persistent Memory"SOSP 2019Filesystem Design
Lee et al., "F2FS: A New File System for Flash Storage"2015Filesystem Design
McKusick et al., "A Fast File System for UNIX"1984Filesystem Design
McVoy, Kleiman, "Extent-like Performance from a UNIX File System"1991Filesystem Design
Prabhakaran et al., "Analysis and Evolution of Journaling File Systems"2005Filesystem Design
Rosenblum, Ousterhout, "The Design and Implementation of a Log-Structured File System"1992Filesystem Design
Xu, Swanson, "NOVA: A Log-structured File System for Hybrid Volatile/Non-volatile Main Memories"2016Filesystem Design

Virtualization & Device Passthrough

PaperVenueCited In
Amit et al., "vIOMMU: Efficient IOMMU Emulation"USENIX ATC 2011VFIO Internals
Ben-Yehuda et al., "The Turtles Project: Design and Implementation of Nested Virtualization"OSDI 2010VFIO Internals
Bugnion, Nieh, Tsafrir, "Hardware and Software Support for Virtualization"Morgan & Claypool, 2017ISA Critical Instructions
Dall, Nieh, "KVM/ARM: The Design and Implementation of the Linux ARM Hypervisor"ASPLOS 2014ISA Critical Instructions
Markuze et al., "True IOMMU Protection from DMA Attacks: When Copy is Faster than Zero Copy"ASPLOS 2016VFIO Internals
Neugebauer et al., "Understanding PCIe Performance for End Host Networking"SIGCOMM 2018VFIO Internals
Tian et al., "A Full GPU Virtualization Solution with Mediated Pass-Through" (GVT-g)USENIX ATC 2014VFIO Internals

GPU, TPU & Accelerator Architecture

PaperVenueCited In
Abts et al., "Think Fast: A Tensor Streaming Processor for Accelerating Deep Learning Workloads" (Groq TSP)ISCA 2020GPU/TPU Accelerator Design
Ajayi et al., "OpenROAD: Toward a Self-Driving, Open-Source Digital Layout Implementation Tool Chain"GOMAC 2019GPU/TPU Accelerator Design
Chen et al., "Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep CNNs"JSSC 2017GPU/TPU Accelerator Design
Chen et al., "Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks"JETCAS 2019GPU/TPU Accelerator Design
Dao et al., "FlashAttention: Fast and Memory-Efficient Exact Attention"NeurIPS 2022GPU/TPU Accelerator Design
Dao, "FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning"ICLR 2024GPU/TPU Accelerator Design
Darvish Rouhani et al., "Microscaling Data Formats for Deep Learning" (OCP MX spec)arXiv 2023GPU/TPU Accelerator Design
Genc et al., "Gemmini: Enabling Systematic Deep-Learning Architecture Evaluation via Full-Stack Integration"DAC 2021GPU/TPU Accelerator Design
Gustafson, Yonemoto, "Beating Floating Point at its Own Game: Posit Arithmetic"Supercomputing Frontiers, 2017GPU/TPU Accelerator Design
Jia et al., "Dissecting the NVIDIA Volta GPU Architecture via Microbenchmarking"arXiv 2018GPU/TPU Accelerator Design
Jia et al., "Dissecting the NVidia Turing T4 GPU via Microbenchmarking"arXiv 2019GPU/TPU Accelerator Design
Jouppi et al., "In-Datacenter Performance Analysis of a Tensor Processing Unit" (TPU v1)ISCA 2017GPU/TPU Accelerator Design
Jouppi et al., "A Domain-Specific Supercomputer for Training Deep Neural Networks" (TPU v2/v3)Comm. ACM 2020GPU/TPU Accelerator Design
Jouppi et al., "TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning"ISCA 2023GPU/TPU Accelerator Design
Kung, "Why Systolic Architectures?"IEEE Computer, 1982GPU/TPU Accelerator Design
Kwon et al., "Efficient Memory Management for Large Language Model Serving with PagedAttention" (vLLM)SOSP 2023GPU/TPU Accelerator Design
Kwon et al., "MAERI: Enabling Flexible Dataflow Mapping over DNN Accelerators"ASPLOS 2018GPU/TPU Accelerator Design
Leviathan et al., "Fast Inference from Transformers via Speculative Decoding"ICML 2023GPU/TPU Accelerator Design
Lindholm et al., "NVIDIA Tesla: A Unified Graphics and Computing Architecture"IEEE Micro, 2008GPU/TPU Accelerator Design
Micikevicius et al., "Mixed Precision Training"ICLR 2018GPU/TPU Accelerator Design
Naffziger et al., "AMD Chiplet Architecture for High-Performance Server and Desktop Products"ISSCC 2020GPU/TPU Accelerator Design
Norrie et al., "The Design Process for Google's Training Chips: TPUv2 and TPUv3"IEEE Micro, 2021GPU/TPU Accelerator Design
Noune et al., "8-bit Numerical Formats for Deep Neural Networks"NeurIPS 2022GPU/TPU Accelerator Design
Parashar et al., "Timeloop: A Systematic Approach to DNN Accelerator Evaluation"ISPASS 2019GPU/TPU Accelerator Design
Pope et al., "Efficiently Scaling Transformer Inference"MLSys 2023GPU/TPU Accelerator Design
Rogers et al., "Cache-Conscious Wavefront Scheduling" (CCWS)MICRO 2012GPU/TPU Accelerator Design
Shazeer, "Fast Transformer Decoding: One Write-Head is All You Need" (Multi-Query Attention)arXiv 2019GPU/TPU Accelerator Design
Sze et al., "Efficient Processing of Deep Neural Networks: A Tutorial and Survey"Proceedings of the IEEE, 2017GPU/TPU Accelerator Design

ISA, Memory Models & Microarchitecture

PaperVenueCited In
Alglave et al., "Frightening Small Children and Disconcerting Grown-ups: Concurrency in the Linux Kernel"ASPLOS 2018ISA Critical Instructions
Herlihy, Shavit, "The Art of Multiprocessor Programming"2008 (revised 2020)ISA Critical Instructions
Kocher et al., "Spectre Attacks: Exploiting Speculative Execution"S&P 2019ISA Critical Instructions
Lahav et al., "Repairing Sequential Consistency in C/C++11"PLDI 2017ISA Critical Instructions
Lipp et al., "Meltdown: Reading Kernel Memory from User Space"USENIX Security 2018ISA Critical Instructions
Luo, Wen, "GCoM: A Detailed GPU Core Model for Accurate Analytical Modeling of Modern GPUs"ISPASS 2019GPU/TPU Accelerator Design
Maranget, Sarkar, Sewell, "A Tutorial Introduction to the ARM and POWER Relaxed Memory Models"2012ISA Critical Instructions
Mellor-Crummey, Scott, "Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors" (MCS lock)TOCS 1991ISA Critical Instructions
Michael, Scott, "Simple, Fast, and Practical Non-Blocking and Blocking Concurrent Queue Algorithms"PODC 1996ISA Critical Instructions
Narasiman et al., "Improving GPU Performance via Large Warps and Two-Level Warp Scheduling"MICRO 2011GPU/TPU Accelerator Design
Sewell et al., "x86-TSO: A Rigorous and Usable Programmer's Model for x86 Multiprocessors"CACM 2010ISA Critical Instructions
Stephens et al., "The Scalable Vector Extension for AArch64"IEEE Micro, 2017ISA Critical Instructions

Data Structures & Algorithms

PaperVenueCited In
Almeida et al., "Delta State Replicated Data Types"J. Parallel & Distributed Computing, 2018Data Structures
Bender, Hu, "Packed Memory Arrays"FOCS 2007Data Structures
Cormode, Muthukrishnan, "An Improved Data Stream Summary: The Count-Min Sketch"2005Data Structures
Ding et al., "ALEX: An Updatable Adaptive Learned Index"SIGMOD 2020Data Structures
Esposito et al., "RecSplit: Minimal Perfect Hashing via Recursive Splitting"ALENEX 2020Data Structures
Fan et al., "Cuckoo Filter: Practically Better Than Bloom"CoNEXT 2014Data Structures
Ferragina, Manzini, "Opportunistic Data Structures with Applications" (FM-Index)FOCS 2000Data Structures
Ferragina, Vinciguerra, "PGM-Index: A Fully-Dynamic Compressed Learned Index"VLDB 2020Data Structures
Hendler et al., "Flat Combining and the Synchronization-Parallelism Tradeoff"SPAA 2010Data Structures
Karnin, Lang, Liberty, "Optimal Streaming Quantile Sketches" (KLL)VLDB 2016Data Structures
Kraska et al., "The Case for Learned Index Structures"SIGMOD 2018LSM Trees, Data Structures
Kulkarni et al., "Logical Physical Clocks and Consistent Snapshots" (HLC)OPODIS 2014Data Structures
Lemire et al., "Roaring Bitmaps: Implementation of an Optimized Software Library"Software: Practice & Experience, 2018Data Structures
Levandoski et al., "The Bw-Tree: A B-tree for New Hardware Platforms"ICDE 2013Data Structures
Malkov, Yashunin, "Efficient and Robust Approximate Nearest Neighbor Search Using HNSW Graphs"TPAMI 2020Data Structures
Mao et al., "Cache Craftiness for Fast Multicore Key-Value Storage" (Masstree)EuroSys 2012Data Structures
Masson et al., "DDSketch: A Fast and Fully-Mergeable Quantile Sketch"VLDB 2019Data Structures
Megiddo, Modha, "ARC: A Self-Tuning, Low Overhead Replacement Cache"FAST 2003Data Structures
Prokopec et al., "Concurrent Tries with Efficient Non-Blocking Snapshots" (CTrie)PPoPP 2012Data Structures
Sleator, Tarjan, "Self-Adjusting Binary Search Trees" (Splay)JACM 1985Data Structures
Stoica et al., "Chord: A Scalable Peer-to-peer Lookup Protocol"IEEE/ACM TON 2003Data Structures
Subramanya et al., "DiskANN: Fast Accurate Billion-point Nearest Neighbor Search on a Single Node"NeurIPS 2019Data Structures
Wongkham et al., "Are Updatable Learned Indexes Ready?"VLDB 2022Data Structures

Superscalar & CPU Microarchitecture

PaperVenueCited In
AMD, "Software Optimization Guide for AMD EPYC 9004 Series Processors (Zen 4)"AMD Publication 57647, 2024Superscalar OoO CPU
Bera, Subramanian, Singh, Mutlu, "Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning"MICRO 2021Superscalar OoO CPU
Binkert et al., "The gem5 Simulator"ACM SIGARCH Computer Architecture News 39(2), 2011Superscalar OoO CPU
Celio, Patterson, Asanovic, "The Berkeley Out-of-Order Machine (BOOM): An Industry-Competitive, Synthesizable, Parameterized RISC-V Processor"UCB/EECS-2017-2, 2017Superscalar OoO CPU
Chrysos, Emer, "Memory Dependence Prediction Using Store Sets"ISCA 1998Superscalar OoO CPU
Esmaeilzadeh, Blem, St.Amant, Sankaralingam, Burger, "Dark Silicon and the End of Multicore Scaling"ISCA 2011Superscalar OoO CPU
Ferdman et al., "Clearing the Clouds: A Study of Emerging Scale-Out Workloads on Modern Hardware"ASPLOS 2012Superscalar OoO CPU
Fog, A., "Microarchitecture of Intel, AMD and VIA CPUs"agner.org/optimize (continuously updated)Superscalar OoO CPU
Gabbay, Mendelson, "Speculative Execution Based on Value Prediction"Technion Tech Report CS0974, 1997Superscalar OoO CPU
Gochman et al., "The Intel Pentium M Processor: Microarchitecture and Performance"Intel Technology Journal 7(2), 2003Superscalar OoO CPU
Intel, "Intel Core Ultra (Series 2) — Microarchitecture Overview (Lion Cove)"Intel Architecture Disclosure, 2024Superscalar OoO CPU
Kessler, "The Alpha 21264 Microprocessor"IEEE Micro 19(2), 1999Superscalar OoO CPU
Kim, Jeong, Chang, Suh, "Revisiting Virtual Memory Translation for Hardware Prefetchers"ISCA 2019Superscalar OoO CPU
Lam, Wilson, "Limits of Control Flow on Parallelism"ISCA 1992Superscalar OoO CPU
Nesbit, Smith, "Data Cache Prefetching Using a Global History Buffer"HPCA 2004Superscalar OoO CPU
Seznec, "A New Case for the TAGE Branch Predictor" (TAGE-SC-L)MICRO 2011Superscalar OoO CPU
Seznec, "TAGE-SC-L Branch Predictors Again" (ITTAGE)5th JILP Workshop on Computer Architecture Competitions (JWAC-5), 2014Superscalar OoO CPU
Seznec, Michaud, "A Case for (Partially) Tagged GEometric History Length Branch Prediction"JILP 8, 2006Superscalar OoO CPU
Smith, Pleszkun, "Implementing Precise Interrupts in Pipelined Processors"IEEE Transactions on Computers 37(5), 1988Superscalar OoO CPU
Sohi, Vajapeyam, "Tradeoffs in Instruction Format Design for Horizontal Architectures"ASPLOS 1987Superscalar OoO CPU
Srinath, Mutlu, Kim, Patt, "Feedback Directed Prefetching: Improving the Performance and Bandwidth-Efficiency of Hardware Prefetchers"HPCA 2007Superscalar OoO CPU
Tendler, Dodson, Fields, Le, Sinharoy, "POWER4 System Microarchitecture"IBM Journal of Research and Development 46(1), 2002Superscalar OoO CPU
Tomasulo, "An Efficient Algorithm for Exploiting Multiple Arithmetic Units"IBM Journal of Research and Development 11(1), 1967Superscalar OoO CPU
Xu et al., "Towards Developing High Performance RISC-V Processors Using Agile Methodology" (XiangShan)MICRO 2022Superscalar OoO CPU
Yeh, Patt, "Alternative Implementations of Two-Level Adaptive Branch Prediction"ISCA 1992Superscalar OoO CPU

Text Search & Information Retrieval

PaperVenueCited In
Broder, Carmel, Herscovici, Soffer, Zien, "Efficient Query Evaluation Using a Two-Level Retrieval Process" (WAND)CIKM 2003Text & Vector Search
Burkhard, Keller, "Some Approaches to Best-Match File Searching" (BK-trees)CACM 1973Text & Vector Search
Chambi, Lemire, Kaser, Godin, "Better Bitmap Performance with Roaring Bitmaps"Software: Practice & Experience 2016Text & Vector Search
Ding, Suel, "Faster Top-k Document Retrieval Using Block-Max Indexes" (BMW)SIGIR 2011Text & Vector Search
Ferragina, Manzini, "Opportunistic Data Structures with Applications" (FM-index)FOCS 2000Text & Vector Search
Lemire, Boytsov, "Decoding Billions of Integers per Second Through Vectorization" (SIMD-BP128)Software: Practice & Experience 2015Text & Vector Search
Ottaviano, Venturini, "Partitioned Elias-Fano Indexes"SIGIR 2014Text & Vector Search
Robertson, Walker, Jones, Hancock-Beaulieu, Gatford, "Okapi at TREC-3" (BM25)TREC 1994Text & Vector Search
Robertson, Zaragoza, "The Probabilistic Relevance Framework: BM25 and Beyond"FnTIR 2009Text & Vector Search
Schulz, Mihov, "Fast String Correction with Levenshtein-Automata"IJDAR 2002Text & Vector Search
Turtle, Flood, "Query Evaluation: Strategies and Optimizations" (MaxScore)IPM 1995Text & Vector Search
PaperVenueCited In
Douze, Guzhva, Deng, Johnson et al., "The FAISS Library"arXiv 2024Text & Vector Search
Fu, Cai, Du, "Fast Approximate Nearest Neighbor Search With the Navigating Spreading-out Graph" (NSG)VLDB 2019Text & Vector Search
Gao, Long, "RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for ANN Search"SIGMOD 2024Text & Vector Search
Guo, Sun, Lindgren, Geng, Simcha, Chern, Kumar, "Accelerating Large-Scale Inference with Anisotropic Vector Quantization" (ScaNN)ICML 2020Text & Vector Search
Indyk, Motwani, "Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality"STOC 1998Text & Vector Search
Jégou, Douze, Schmid, "Product Quantization for Nearest Neighbor Search"IEEE TPAMI 2011Text & Vector Search
Johnson, Douze, Jégou, "Billion-scale Similarity Search with GPUs" (FAISS)arXiv 2017Text & Vector Search
Malkov, Yashunin, "Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs" (HNSW)IEEE TPAMI 2018Text & Vector Search
Ootomo, Ozaki, Itasaka, Yokota, Tanaka, "CAGRA: Highly Parallel Graph Construction and ANN Search for GPUs"arXiv 2308.15136Text & Vector Search
Patel, Kraft, Guestrin, Zaharia, "ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data"SIGMOD 2024Text & Vector Search
Singh, Jaiswal et al., "FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search"arXiv 2021Text & Vector Search
Subramanya, Devvrit, Kakde, Krishnaswamy, Shrivastava, "DiskANN: Fast Accurate Billion-point Nearest Neighbor Search on a Single Node"NeurIPS 2019Text & Vector Search

Neural & Learned Retrieval

PaperVenueCited In
Cormack, Clarke, Buettcher, "Reciprocal Rank Fusion Outperforms Condorcet and Individual Rank Learning Methods" (RRF)SIGIR 2009Text & Vector Search
Formal, Piwowarski, Clinchant, "SPLADE: Sparse Lexical and Expansion Model for First Stage Retrieval"SIGIR 2021Text & Vector Search
Karpukhin, Oğuz, Min, Lewis et al., "Dense Passage Retrieval for Open-Domain Question Answering" (DPR)EMNLP 2020Text & Vector Search
Khattab, Zaharia, "ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT"SIGIR 2020Text & Vector Search
Kusupati, Bhatt, Rege et al., "Matryoshka Representation Learning" (MRL)NeurIPS 2022Text & Vector Search
Muennighoff, Tazi, Magne, Reimers, "MTEB: Massive Text Embedding Benchmark"EACL 2023Text & Vector Search
Radford, Kim, Hallacy et al., "Learning Transferable Visual Models From Natural Language Supervision" (CLIP)ICML 2021Text & Vector Search
Santhanam, Khattab, Saad-Falcon, Potts, Zaharia, "ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction"NAACL 2022Text & Vector Search
Santhanam, Khattab, Potts, Zaharia, "PLAID: An Efficient Engine for Late Interaction Retrieval"CIKM 2022Text & Vector Search
Thakur, Reimers, Rücklé, Srivastava, Gurevych, "BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of IR Models"NeurIPS 2021Text & Vector Search
Xiong, Xiong, Li et al., "Approximate Nearest Neighbor Negative Contrastive Estimation for Dense Text Retrieval" (ANCE)ICLR 2021Text & Vector Search

Statistics, Cardinality Estimation & Query Optimization

PaperVenueCited In
Alon, Matias, Szegedy, "The Space Complexity of Approximating the Frequency Moments"J. Comput. Syst. Sci. 1999Database Statistics
Bruno, Chaudhuri, Gravano, "STHoles: A Multidimensional Workload-Aware Histogram"SIGMOD 2001Database Statistics
Cormode, Muthukrishnan, "An Improved Data Stream Summary: The Count-Min Sketch and its Applications"J. Algorithms 2005Database Statistics, Data Structures
Dunning, Ertl, "Computing Extremely Accurate Quantiles Using t-Digests"arXiv 2019Database Statistics, Data Structures
Estan, Naughton, "End-Biased Samples for Join Cardinality Estimation"SIGMOD 2006Database Statistics
Flajolet, Martin, "Probabilistic Counting Algorithms for Data Base Applications"J. Comput. Syst. Sci. 1985Database Statistics
Flajolet, Fusy, Gandouet, Meunier, "HyperLogLog: the Analysis of a Near-Optimal Cardinality Estimation Algorithm"DMTCS 2007Database Statistics, Data Structures
Haas, Naughton, Seshadri, Stokes, "Sampling-Based Estimation of the Number of Distinct Values of an Attribute"VLDB 1995Database Statistics
Han, Wu, Wu, Zhu, Pfadler, Qin, Li, Pfadler, "Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation"VLDB 2021Database Statistics
Heule, Nunkesser, Hall, "HyperLogLog in Practice: Algorithmic Engineering of a State of the Art Cardinality Estimation Algorithm"EDBT 2013Database Statistics
Hilprecht, Schmidt, Kulessa, Molina, Kersting, Binnig, "DeepDB: Learn from Data, not from Queries!"VLDB 2020Database Statistics
Ioannidis, Kang, "Randomized Algorithms for Optimizing Large Join Queries"SIGMOD 1990Database Statistics
Ioannidis, Kang, "Optimal Histograms for Limiting Worst-Case Error Propagation in the Size of Join Results"TODS 1993Database Statistics
Karnin, Lang, Liberty, "Optimal Quantile Approximation in Streams"FOCS 2016Database Statistics, Data Structures
Kipf, Kipf, Radke, Leis, Boncz, Kemper, "Learned Cardinalities: Estimating Correlated Joins with Deep Learning"CIDR 2019Database Statistics
Li, Quoc, Ho, Gubichev, Kemper, Abo Khamis, Olteanu, Schleich, "Wander Join: Online Aggregation via Random Walks"SIGMOD 2016Database Statistics
Marcus, Negi, Mao, Zhang, Alizadeh, Kraska, Papaemmanouil, Tatbul, "Bao: Making Learned Query Optimization Practical"SIGMOD 2021Database Statistics
Masson, Rim, Lee, "DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees"VLDB 2019Database Statistics, Data Structures
Metwally, Agrawal, El Abbadi, "Efficient Computation of Frequent and Top-k Elements in Data Streams"ICDT 2005Database Statistics
Misra, Gries, "Finding Repeated Elements"Science of Computer Programming 1982Database Statistics
Moerkotte, Steinbrunn, Moerkotte, "Analysis of Two Existing and One New Dynamic Programming Algorithm for the Generation of Optimal Bushy Join Trees without Cross Products" (Q-error)VLDB 2006Database Statistics, Join Algorithms
Poosala, Ioannidis, Haas, Shekita, "Improved Histograms for Selectivity Estimation of Range Predicates"SIGMOD 1996Database Statistics
Vitter, "Random Sampling with a Reservoir"ACM Trans. Math. Software 1985Database Statistics
Vitter, Wang, Iyer, "Data Cube Approximation and Histograms via Wavelets"VLDB 1998Database Statistics
Yang, Liang, Kamsetty, Wu, Guestrin, Stoica, Krishnan, Abbeel, "NeuroCard: One Cardinality Estimator for All Tables"NeurIPS 2020Database Statistics

Datacenter Networking & RDMA

PaperVenueCited In
Addanki, Apostolaki, Ghobadi, Schmid, Vanbever, "PowerTCP: Pushing the Performance Limits of Datacenter Networks"NSDI 2022Interconnects
Alizadeh, Greenberg, Maltz, Padhye, Patel, Prabhakar, Sengupta, Sridharan, "Data Center TCP (DCTCP)"SIGCOMM 2010Interconnects
Bansal, Khan, Goyal et al., "Meta's RoCE Networks: Building, Operating, and Lessons Learned"SIGCOMM 2023Interconnects
Dragojević, Narayanan, Hodson, Castro, "FaRM: Fast Remote Memory"NSDI 2014Interconnects
Gibson, Hartl, Wlodarczyk, Vahdat, Mogul, Goldberg, Sjödin, Sosa, Yang, Singh, "Aquila: A Unified, Low-Latency Fabric for Datacenter Networks"NSDI 2022Interconnects
Guo, Wu, Deng, Liu, Haridas, Liu, Xu, Yu, Xiang, Wang, Yu, Zhang, Zhang, Padhye, Lipshteyn, "RDMA over Commodity Ethernet at Scale"SIGCOMM 2016Interconnects
Kumar, Dukkipati, Jouppi, Lam, Madhavan, Mittal, Mittal, Wassel, Wetherall, Wu, Yang, Zats, "Swift: Delay is Simple and Effective for Congestion Control in the Datacenter"SIGCOMM 2020Interconnects
Li, Miao, Liu, Zhou, Sridharan, Kumar, Bao, Zhou, Yang, Tewari, "HPCC: High Precision Congestion Control"SIGCOMM 2019Interconnects
Mellette, McGuinness, Roy, Forencich, Papen, Snoeren, Porter, "RotorNet: A Scalable, Low-Complexity, Optical Datacenter Network"SIGCOMM 2017Interconnects
Mittal, Lam, Dukkipati, Blem, Wassel, Ghobadi, Vahdat, Wang, Wetherall, Zats, "TIMELY: RTT-based Congestion Control for the Datacenter"SIGCOMM 2015Interconnects
Mittal, Shpiner, Panda, Zahavi, Krishnamurthy, Ratnasamy, Shenker, "Revisiting Network Support for RDMA" (IRN)SIGCOMM 2018Interconnects
Olteanu, Agache, Voinescu, Raiciu, "An Edge-Queued Datagram Service for All Datacenter Traffic" (EQDS)NSDI 2022Interconnects
Poutievski, Mashayekhi, Ong, Singhvi, Tariq, Tariq, Vahdat, Wanderer, "Jupiter Evolving: Transforming Google's Datacenter Network via Optical Circuit Switches and Software-Defined Networking"SIGCOMM 2022Interconnects
Singh, Ong, Agarwal, Anderson, Armistead, Bannon, Boving, Desai, Felderman, Germano, Kanagala, Provost, Simmons, Tanda, Wanderer, Hölzle, Stuart, Vahdat, "Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network"SIGCOMM 2015Interconnects
Stephens, Akella, Swift, "Loom: Flexible and Efficient NIC Packet Scheduling" (Annulus follow-on)SIGCOMM 2019Interconnects
Zhu, Eran, Firestone, Guo, Lipshteyn, Liron, Padhye, Raindel, Yahia, Zhang, "Congestion Control for Large-Scale RDMA Deployments" (DCQCN)SIGCOMM 2015Interconnects

HPC Fabrics & Optical Networks

PaperVenueCited In
Ajima, Inoue, Hiramoto, Takagi, Shimizu, "The Tofu Interconnect D" (Fugaku)HotI 2018Interconnects
Alverson, Roweth, Kaplan, "The Gemini System Interconnect" (Cray XE/XK)HotI 2010Interconnects
Ballani, Costa, Behrendt, Cletheroe, Haller, Jozwik, Karinou, Lange, Shi, Thomsen, Williams, "Sirius: A Flat Datacenter Network with Nanosecond Optical Switching"SIGCOMM 2020Interconnects
De Sensi, Di Girolamo, McMahon, Roweth, Hoefler, "An In-Depth Analysis of the Slingshot Interconnect"SC 2020Interconnects
Faanes, Bataineh, Roweth, Court, Froese, Alverson, Johnson, Kopnick, Higgins, Reinhard, "Cray Cascade: A Scalable HPC System Based on a Dragonfly Network" (Aries)SC 2012Interconnects
Khani, Ghobadi, Alizadeh, Zhu, Glick, Bergman, Vahdat, Klenk, Ebrahimi, "SiP-ML: High-Bandwidth Optical Network Interconnects for Machine Learning Training"SIGCOMM 2021Interconnects
Liu, Theogarajan, Pinheiro, Vahdat, "Apollo: A Sequencing-Based Approach to Reconfigurable Optical Networks"SIGCOMM 2021Interconnects
Mellette, Das, Guo, McGuinness, Snoeren, Porter, Papen, "Expanding Across Time to Deliver Bandwidth Efficiency and Low Latency" (Opera)NSDI 2020Interconnects
Shaw, Adams, Azaria, Bank, Batson, Bell, Bergdorf et al., "Anton 2: Raising the Bar for Performance and Programmability in a Special-Purpose Molecular Dynamics Supercomputer"SC 2014Interconnects

Standards & Specifications

SpecYearCited In
ARM, "AMBA AXI and ACE Protocol Specification" Issue G2021Interconnects
ARM, "AMBA CHI Architecture Specification" Issue F2023Interconnects
CXL Consortium, "Compute Express Link 3.2 Specification"Dec 2024Interconnects
IEEE 802.1Qbb, "Priority-based Flow Control"2011Interconnects
IEEE 802.3df-2024, "200/400/800 Gb/s Ethernet"2024Interconnects
IEEE 802.3dj (draft), "1.6 Tb/s Ethernet"2026 (project)Interconnects
InfiniBand Trade Association, "InfiniBand Architecture Specification 1.7 (Vol 1)"2023Interconnects
NVMe Express, "NVMe over Fabrics Specification 1.1a"2023Interconnects
PCI-SIG, "PCI Express Base Specification Revision 7.0"2025Interconnects, PCIe Internals
UCIe Consortium, "Universal Chiplet Interconnect Express 2.1 Specification"Aug 2025Interconnects
Ultra Ethernet Consortium, "Ultra Ethernet Specification 1.0"Jun 2025Interconnects

Cache Eviction, Admission & Prefetching

PaperVenueCited In
Belady, "A Study of Replacement Algorithms for a Virtual-Storage Computer"IBM Systems Journal, 1966Cache Algorithms
Mattson, Gecsei, Slutz, Traiger, "Evaluation Techniques for Storage Hierarchies"IBM Systems Journal, 1970Cache Algorithms
Denning, "The Working Set Model for Program Behavior"CACM, 1968Cache Algorithms
O'Neil, O'Neil, Weikum, "The LRU-K Page Replacement Algorithm for Database Disk Buffering"SIGMOD 1993Cache Algorithms
Johnson, Shasha, "2Q: A Low Overhead High Performance Buffer Management Replacement Algorithm"VLDB 1994Cache Algorithms
Jiang, Zhang, "LIRS: An Efficient Low Inter-reference Recency Set Replacement Policy to Improve Buffer Cache Performance"SIGMETRICS 2002Cache Algorithms
Megiddo, Modha, "ARC: A Self-Tuning, Low Overhead Replacement Cache"FAST 2003Cache Algorithms
Bansal, Modha, "CAR: Clock with Adaptive Replacement"FAST 2004Cache Algorithms
Jiang, Chen, Zhang, "CLOCK-Pro: An Effective Improvement of the CLOCK Replacement"USENIX ATC 2005Cache Algorithms
Park, Kang, Park, "CFLRU: A Replacement Algorithm for Flash Memory"EMSOFT 2006Cache Algorithms
Gill, Bathen, "AMP: Adaptive Multi-stream Prefetching in a Shared Cache"FAST 2007Cache Algorithms
Zhou, Philbin, Li, "The Multi-Queue Replacement Algorithm for Second Level Buffer Caches"USENIX ATC 2001Cache Algorithms
Cherkasova, "Improving WWW Proxies Performance with Greedy-Dual-Size-Frequency Caching Policy"HPL Technical Report, 1998Cache Algorithms
Wires, Ingram, Drudi, Harvey, Warfield, "Characterizing Storage Workloads with Counter Stacks"OSDI 2014Cache Algorithms
Waldspurger, Park, Garthwaite, Ahmad, "Efficient MRC Construction with SHARDS"FAST 2015Cache Algorithms
Blankstein, Shah, Wenisch, "Hyperbolic Caching: Flexible Caching for Web Applications"USENIX ATC 2017Cache Algorithms
Berger, Sitaraman, Harchol-Balter, "AdaptSize: Orchestrating the Hot Object Memory Cache in a CDN"NSDI 2017Cache Algorithms
Yang, Karimi, Sæmundsson, Wildani, Vigfusson, "Mithril: Mining Sporadic Associations for Cache Prefetching"ACM SoCC 2017Cache Algorithms
Einziger, Friedman, Manes, "TinyLFU: A Highly Efficient Cache Admission Policy"USENIX SYSTOR 2017Cache Algorithms
Einziger, Friedman, Manes, "Adaptive Software Cache Management" (W-TinyLFU / Caffeine)IPDPS 2018Cache Algorithms
Beckmann, Sanfilippo, "LHD: Improving Cache Hit Rate by Maximizing Hit Density"NSDI 2018Cache Algorithms
Vietri, Rodriguez, Bux, Singla, Smirni, "Driving Cache Replacement with ML-based LeCaR"USENIX HotStorage 2018Cache Algorithms
Song, Yang, Rashmi, "Learning Relaxed Belady for Content Distribution Network Caching" (LRB)NSDI 2020Cache Algorithms
Yang, Yue, Rashmi, "A Large-scale Analysis of Hundreds of In-memory Key-value Cache Clusters at Twitter"OSDI 2020 / ACM TOS 2021Cache Algorithms
Shi, Akram, Pickett, Lustig, "Voyager: Combining Local and Global Features for Practical Learned Memory Prefetching"ASPLOS 2021Cache Algorithms
Bera, Kanellopoulos, Nori, Shahroodi, Subramoney, Mutlu, "Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning"MICRO 2021Cache Algorithms, Superscalar OoO CPU
Rodriguez, Sundaram, "Learning Cache Replacement with CACHEUS"FAST 2021Cache Algorithms
Berg, Berger, McAllister, Grosof, et al., "The CacheLib Caching Engine: Design and Experiences at Scale"OSDI 2020Cache Algorithms
Yang, Zhang, Yue, Rashmi, "GL-Cache: Group-level Learning for Efficient and High-Performance Caching"FAST 2023Cache Algorithms
Yang, Qiu, Zhang, Yue, Rashmi, "FIFO Queues Are All You Need for Cache Eviction" (S3-FIFO)SOSP 2023Cache Algorithms
Kwon, Li, Zhuang, Sheng, et al., "Efficient Memory Management for Large Language Model Serving with PagedAttention"SOSP 2023Cache Algorithms
Zhang, Sheng, Hou, et al., "H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models"NeurIPS 2023Cache Algorithms
Vöhringer, Leis, "WATT: Write-Aware Timestamp Tracking for Efficient Buffer Management"VLDB 2023Cache Algorithms, Buffer Management
Zhang, Yang, Yue, Rashmi, "SIEVE is Simpler than LRU: an Efficient Turn-Key Eviction Algorithm for Web Caches"NSDI 2024Cache Algorithms
Xiao, Tian, Chen, Han, "Efficient Streaming Language Models with Attention Sinks" (StreamingLLM)ICLR 2024Cache Algorithms

Textbooks & Reference Works

PaperVenueCited In
Dally, Towles, "Principles and Practices of Interconnection Networks"Textbook (Morgan Kaufmann 2003)GPU/TPU Accelerator Design, Interconnects
Duato, Yalamanchili, Ni, "Interconnection Networks: An Engineering Approach"Textbook (Morgan Kaufmann 2003)Interconnects
Gregg, "BPF Performance Tools"Addison-Wesley, 2019Linux Expert Syscalls
Gregg, "Systems Performance" (2nd ed.)Addison-Wesley, 2020Linux Expert Syscalls
Hennessy, Patterson, "Computer Architecture: A Quantitative Approach" (6th ed.)TextbookGPU/TPU Accelerator Design, ISA Critical Instructions
Kerrisk, "The Linux Programming Interface"No Starch Press, 2010Linux Expert Syscalls
Love, "Linux Kernel Development" (3rd ed.)Addison-Wesley, 2010Linux Expert Syscalls
Navarro, "Succinct Data Structures"Cambridge University Press, 2016Data Structures
Weste, Harris, "CMOS VLSI Design: A Circuits and Systems Perspective"TextbookGPU/TPU Accelerator Design

Last updated: 2026-05-30