Massive Datasets and Data Streams

RadixZip: Linear Time Compression of Token Streams

**VLDB 2007**(33rd International Conference on Very Large Data Bases), p 1162-1172, Sep 2007

Detecting Near-Duplicates for Web Crawling

**WWW 2007**(16th International World Wide Web Conference), p 141-149, May 2007

Approximate Counts and Quantiles over Sliding Windows

**PODS 2004**(23rd ACM Symposium on Principles of Database Systems), p 286-296, June 2004

Query Processing, Resource Management and Approximation in a Data Stream Management System

**CIDR 2003**(1st Biennial Conference On Innovative Data Systems Research), p 245-254, Jan 2003

Random Sampling Techniques for Space Efficient Online Computation of Order Statistics of Large Datasets

**SIGMOD 1999**(1999 ACM SIGMOD), p 251-262, June 1999

Approximate Medians and other Quantiles in One Pass and with Limited Memory

**SIGMOD 1998**(1998 ACM SIGMOD), p 426-435, June 1998

Peer to Peer Systems -- Distributed Hash Tables

Brief Announcement: Papillon: Greedy Routing in Rings

**DISC 2005**(19th International Symposium on Distributed Computing), p 514-515, September 2005

Decentralized Algorithms using Both Local and Random Probes for P2P Load Balancing

**SPAA 2005**(17th ACM Symposium on Parallelism in Algorithms and Architectures), p 135-144, July 2005

Balanced Binary Trees for ID Management and Load Balance in Distributed Hash Tables

**PODC 2004**(23rd ACM Symposium on Principles of Distributed Computing), p 197-205, July 2004

Know thy Neighbor's Neighbor: the Power of Lookahead in Randomized P2P Networks

**STOC 2004**(36th ACM Symposium on Theory of Computing), p 54-63, June 2004

Optimal Routing in Chord

**SODA 2004**(15th Annual ACM-SIAM Symposium on Discrete Algorithms), p 169-178, January 2004

Routing Networks for Distributed Hash Tables

**PODC 2003**(22nd ACM Symposium on Principles of Distributed Computing), p 133-142, June 2003

Symphony: Distributed Hashing in a Small World

**USITS 2003**(4th USENIX Symposium on Internet Technologies and Systems), p 127-140, March 2003

SETS: Search Enhanced by Topic Segmentation

**SIGIR 2003**(26th International ACM SIGIR 2003), p 306-313, July 2003

Miscellaneous Subjects

A Loop-free Gray Code for Minimal Signed-Binary Representations

**ESA 2005**(13th Annual European Symposium on Algorithms), p 438-447, Oct 2005

Structural Symmetry and Model Checking

**CAV 1998**(10th International Conference on Computer-Aided Verification), p 159-171, July 1998

Object Tracking using Affine Structure for Point Correspondences

**CVPR 1997**(IEEE Conf. for Computer Vision and Pattern Recognition), p 704-709, June 1997

A New Voting Based Hardware Data Prefetch Scheme

**HiPC 1997**(Fourth International Conference on High Performance Computing), p 100-105, December 1997

A Linear Time Algorithm for the Bottleneck Biconnected Spanning Subgraph Problem

**IPL 1996**(Information Processing Letters), p 1-7, July 1996

Circuit Partitioning with Partial Order for Mixed Simulation Emulation Environment

**RSP 1995**(Sixth Intl. Conf. on Rapid System Prototyping), p 201-207, June 1995

Patents

System and Method for Searching Peer-to-Peer Computer Networks by Selecting a Computer Based on At Least a Number of Files Shared by the Computer

**US Patent #07089301**(Issued: Aug 8, 2006), p 1-14, August 2006

Single Pass Space Efficient System and Method for Generating an Approximate Quantile in a Data Set Having an Unknown Size

**US Patent #06343288**(Issued: Jan 29, 2002), p 1-20, January 2002

Single Pass Space Efficient System and Method for Generating Approximate Quantiles Satisfying an Apriori User-Defined Approximation Error

**US Patent #06108658**(Issued: Aug 22, 2000), p 1-17, August 2000

Theses

Structural Symmetries and Model Checking

**M. S. Thesis**(U C Berkeley, Tech Report UCB/ERL M97/92), p 1-76, December 1997

Object Tracking using Affine Multiple Views Geometry

**B. Tech. Thesis**(IIT Delhi (won the Best B.Tech. Project Award)), p 1-56, May 1995