Design

Good Hash Tables & Multiple Hash Functions

By Michael Mitzenmacher, May 01, 2002

Michael's multiple hash method produces good hash tables for applications ranging from employee databases to Internet routers.

May02: Hash Function Performance

Hash Function Performance

Let x_k(t) be the fraction of the m buckets that hold at least k keys when there are tm keys in the hash table. Note that x₀(t) always equals 1. Consider now what happens when you add one more key. The "time" t increases to t+1/m. The value of x_k(t) increases by 1/m if the new key lands in a bin with exactly k-1 keys already there. Specifically, x_k(t) increases if and only if both choices have at least k-1 keys but both choices do not have at least k keys. The probability that both choices have at least j keys is (x_j(t))², so the probability that x_k(t) increases is (x_k_-1(t))²-(x_k(t))². You can express this result as in Figure 3(a): that is, the expected change in x_k when a new key arrives is just ((x_k-₁(t))²- (x_k(t))²)/m. You can rewrite the equation, as in Figure 3(b). The left side looks much like the equation for a derivative, and if m is large, a good approximation for the aforementioned equation is in Figure 3(c).

The analysis therefore yields a family of differential equations that accurately describe the behavior of the x_k(t) when the number of buckets m and the number of keys n are reasonably large, and n=mt for some constant t. By calculating the solution to the family numerically, you obtain Table 2. Using similar methods, you can develop families of differential equations that lead to Tables 3 and 4.

— M.M.

Previous 1 2 3 4 5 6 7 8 9 Next

More Insights

INFO-LINK


	To upload an avatar photo, first complete your Disqus profile. \| View the list of supported HTML tags you can use to style comments. \| Please read our commenting policy.

Design

Good Hash Tables & Multiple Hash Functions

Hash Function Performance

Related Reading

More Insights

Currently we allow the following HTML tags in comments:

Single tags

Matching tags

Design Recent Articles

Most Popular

This month's Dr. Dobb's Journal

Upcoming Events

Featured Reports

Featured Whitepapers

Most Recent Premium Content

Design

Good Hash Tables & Multiple Hash Functions

Hash Function Performance

Related Reading

News

Commentary

Slideshow

Video

Most Popular

More Insights

White Papers

Reports

Webcasts

Currently we allow the following HTML tags in comments:

Single tags

Matching tags

Design Recent Articles

Most Popular

This month's Dr. Dobb's Journal

Upcoming Events

Featured Reports

Featured Whitepapers

Most Recent Premium Content