Papers
- Published in 2014
- Published in 2013
- Published in 2012
- Published in 2011
- Published in 2010
- Published in 2009
- Published in 2008
- Published in 2007
- Published in 2006
- Published in 2005
- Published in 2004
- Published in 2003
- Published in 2002
- Published in 2001
- Published in 2000
- Published in 1999
- Published in 1998
- Published in 1997
2014
- 2014 - 3D Object Manipulation in a Single Photograph using Stock 3D Models
- 2014 - A Partitioning Framework for Aggressive Data Skipping
- 2014 - A Self-Configurable Geo-Replicated Cloud Storage System
- 2014 - Coordination Avoidance in Database Systems
- 2014 - DeepFace: Closing the Gap to Human-Level Performance in Face Verification
- 2014 - Execution Primitives for Scalable Joins and Aggregations in Map Reduce
- 2014 - f4: Facebookâs Warm BLOB Storage System
- 2014 - Fastpass: A Centralized "Zero-Queue" Datacenter Network
- 2014 - First-person Hyper-lapse Videos
- 2014 - Guess Who Rated This Movie: Identifying Users Through Subspace Clustering
- 2014 - In Search of an Understandable Consensus Algorithm
- 2014 - Log-structured Memory for DRAM-based Storage
- 2014 - Logical Physical Clocks and Consistent Snapshots in Globally Distributed Databases
- 2014 - MapGraph: A High Level API for Fast Development of High Performance Graph Analytics on GPUs
- 2014 - Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing
- 2014 - Orca A Modular Query Optimizer Architecture for Big Data
- 2014 - Pigeon: A Spatial MapReduce Language
- 2014 - Scalable Object Detection using Deep Neural Networks
- 2014 - Sequence to Sequence Learning with Neural Networks
- 2014 - Show and Tell: A Neural Image Caption Generator
2013
- 2013 - A Demonstration of SpatailHadoop: An Efficient MapReduce Framework for Spatial Data
- 2013 - CG_Hadoop: Computational Geometry in MapReduce
- 2013 - Consistency-Based Service Level Agreements for Cloud Storage
- 2013 - Dimension Independent Matrix Square using MapReduce
- 2013 - Druid A Real-time Analytical Data Store
- 2013 - Event labeling combining ensemble detectors and background knowledge
- 2013 - Everything You Always Wanted to Know About Synchronization but Were Afraid to Ask
- 2013 - F1: A Distributed SQL Database That Scales
- 2013 - GraphX: A Resilient Distributed Graph System on Spark
- 2013 - HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardinality 2013 Estimation Algorithm
- 2013 - MillWheel: Fault-Tolerant Stream Processing at Internet Scale
- 2013 - MLbase: A Distributed Machine-learning System
- 2013 - Naiad: A Timely Dataflow System
- 2013 - Online, Asynchronous Schema Change in F1
- 2013 - Presto: Distributed Machine Learning and Graph Processing with Sparse Matrices
- 2013 - Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
- 2013 - Rich feature hierarchies for accurate object detection and semantic segmentation
- 2013 - Scalable Progressive Analytics on Big Data in the Cloud
- 2013 - Scaling Memcache at Facebook
- 2013 - Scuba: Diving into Data at Facebook
- 2013 - Shark: SQL and Rich Analytics at Scale
- 2013 - Some Improvements on Deep Convolutional Neural Network Based Image Classification
- 2013 - TAO: Facebookâs Distributed Data Store for the Social Graph
- 2013 - Toward Common Patterns for Distributed, Concurrent, Fault-Tolerant Code
- 2013 - Unicorn: A System for Searching the Social Graph
- 2013 - Warp: Lightweight Multi-Key Transactions for Key-Value Stores
2012
- 2012 - A Few Useful Things to Know about Machine Learning
- 2012 - A Sublinear Time Algorithm for PageRank Computations
- 2012 - Avatara: OLAP for Web-scale Analytics Products
- 2012 - Blink and It's Done. Interactive Queries on Very Large Data
- 2012 - BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data
- 2012 - Dimension Independent Similarity Computation
- 2012 - Earlybird: Real-Time Search at Twitter
- 2012 - Fast and Interactive Analytics over Hadoop Data with Spark
- 2012 - HyperDex: A Distributed, Searchable Key-Value Store
- 2012 - ImageNet Classification with Deep Convolutional Neural Networks
- 2012 - Large:Scale Machine Learning at Twitter
- 2012 - Multi-Scale Matrix Sampling and Sublinear-Time PageRank Computation
- 2012 - Paxos Made Parallel
- 2012 - Paxos Replicated State Machines as the Basis of a High-Performance Data Store
- 2012 - Processing a Trillion Cells per Mouse Click
- 2012 - Shark: Fast Data Analysis Using Coarse-grained Distributed Memory
- 2012 - Spanner: Google's Globally-Distributed Database
- 2012 - The Unified Logging Infrastructure for Data Analytics at Twitter
- 2012 - The Vertica Analytic Database- C-Store 7 Years Later
2011
- 2011 - CrowdDB: Answering Queries with Crowdsourcing
- 2011 - CrowdDB: Query Processing with the VLDB Crowd
- 2011 - Fast Crash Recovery in RAMCloud
- 2011 - Hogwild!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent
- 2011 - It's Time for Low Latency
- 2011 - Matching Unstructured Product Offers to Structured Product Specifications
- 2011 - Megastore: Providing Scalable, Highly Available Storage for Interactive Services
- 2011 - Resilient Distributed Datasets- A Fault-Tolerant Abstraction for In-Memory Cluster Computing
- 2011 - Scarlett: Coping with Skewed Content Popularity in MapReduce Clusters
2010
- 2010 - Dapper, a Large-Scale Distributed Systems Tracing Infrastructure
- 2010 - Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers
- 2010 - Dremel: Interactive Analysis of Web-Scale Datasets
- 2010 - Finding a needle in Haystack- Facebook's photo storage
- 2010 - FlumeJava: Easy, Eff¥cient Data-Parallel Pipelines
- 2010 - Large:scale Incremental Processing Using Distributed Transactions and Notifications
- 2010 - Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center
- 2010 - Pregel: A System for Large-Scale Graph Processing
- 2010 - S4: Distributed Stream Computing Platform
- 2010 - Spark: Cluster Computing with Working Sets
- 2010 - The Learning Behind Gmail Priority Inbox
- 2010 - ZooKeeper: Wait-free coordination for Internet-scale systems
2009
- 2009 - Cassandra - A Decentralized Structured Storage System
- 2009 - HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads
- 2009 - Vertical Paxos and Primary-Backup Replication
2008
- 2008 - Chukwa: A large-scale monitoring system
- 2008 - Column:Stores vs. Row-Stores- How Different Are They Really?
- 2008 - PNUTS: Yahoo!Õs Hosted Data Serving Platform
- 2008 - Top 10 algorithms in data mining
2007
- 2007 - Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks
- 2007 - Dynamo: Amazon's Highly Available Key-value Store
- 2007 - Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments
- 2007 - Life beyond Distributed Transactions: an ApostateÕs Opinion
- 2007 - Paxos Made Live - An Engineering Perspective
2006
- 2006 - Bigtable: A Distributed Storage System for Structured Data
- 2006 - Ceph: A Scalable, High-Performance Distributed File System
- 2006 - Map-Reduce for Machine Learning on Multicore
- 2006 - The Chubby lock service for loosely-coupled distributed systems
2005
- 2005 - Fast Paxos
2004
2003
- 2003 - The Google File System
2002
- 2002 - Brewer's Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services
2001
- 2001 - Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications
- 2001 - Paxos Made Simple
- 2001 - Random Forrest
1999
- 1999 - Pasting Small Votes for Classification in Large Databases and On-Line
- 1999 - The PageRank Citation Ranking: Bringing Order to the Web