The Hash Function

Explore the role of hash functions in managing key sizes within C++ hash tables. Understand how these functions convert large keys into valid array indices using methods like modular arithmetic, truncation, and folding to optimize data structure performance.

We'll cover the following...

Restricting the Key Size
What Hash Functions Do?

Restricting the Key Size

In the last lesson, we learned that an array can be used to implement a hash table in C++. A key maps a value to a slot in the array, and the efficiency of a hash table depends on how that slot is computed from the key. At first glance, it seems we could use the keys directly as array indices, because each index is unique.

The problem is that keys would eventually exceed the size of the array, and the array would have to be resized every time that happens. We can grow the array's capacity exponentially so that resizes become less frequent, but each resize still takes O(n) time because all the elements must be copied into the new array.

In order to limit the range of the keys to the boundaries of the array, we need a function that converts a large key into a smaller key. This is the job of the hash function.

What Hash Functions Do?

The following illustration gives an intuition for how a hash function works.

A hash function takes an item’s key and returns the corresponding index in the array for that item. Depending on your program, this calculation can be simple arithmetic or a far more involved computation, such as a cryptographic hash. Either way, it is important to choose an efficient hash function, because its cost and the way it spreads keys across the array directly affect the performance of the hash table.

Let’s have a look at some of the most common hash functions used in modern programming.

Arithmetic Modular

In this approach, we take the key modulo the array size:

index = key \bmod tableSize

Hence, for non-negative keys, the index will always stay between 0 and tableSize - 1.

C++

#include <cstddef>
#include <iostream>
#include <string>
using namespace std;

// Folding: split the key's decimal digits into chunks of chunkSize digits
// and add the chunks together. Assumes key >= 0 and chunkSize > 0.
int hashFold(int key, int chunkSize) {
    cout << "Key: " << key << endl;
    string strKey = to_string(key); // Convert the integer into a string for slicing
    int hashVal = 0;
    cout << "Chunks: ";
    // Advance i by chunkSize on every iteration
    for (size_t i = 0; i < strKey.length(); i += chunkSize) {
        // substr stops at the end of the string, so a shorter final chunk is handled as well
        string temp = strKey.substr(i, chunkSize);
        cout << temp << " ";
        hashVal += stoi(temp); // Convert the chunk back to an integer and add it to hashVal
    }
    cout << endl;
    return hashVal;
}

int main() {
    int key = 456789;
    int chunkSize = 2;
    // Compute the hash first so that its trace output is printed before the result
    int hashVal = hashFold(key, chunkSize);
    // The sum can still exceed the table size, so it may need a further % tableSize
    cout << "Hash Key: " << hashVal << endl;
    return 0;
}
