Hashtable源码探讨（基于JDK1.8）

程序员文章站 2022-05-12 11:30:19

...

Hashtable 简介

Hashtable 存储的内容是键值对(key-value)映射，其底层实现是一个Entry数组+链表。

Hashtable是线程安全的它的key、value都不可以为null。此外，Hashtable中的映射不是有序的。

Hashtable的继承结构


public class Hashtable<K,V>
    extends Dictionary<K,V>
    implements Map<K,V>, Cloneable, java.io.Serializable {

Hashtable的重要成员变量


private transient Entry<?,?>[] table; //数组+链表实现
private transient int count; //Entry的总数
private int threshold; //count>=此值时，扩容rehash
private float loadFactor; //负载因子
private transient int modCount = 0;  //用来帮助实现fail-fast机制

我们必须要知道Hashtable底层是采用数组+链表的结构来实现的。

加载因子loadFactor是Hashtable扩容前可以达到多满的一个尺度。这个参数是可以设置的，默认是0.75。加载因子过高虽然减少了空间开销，但同时也增加了查找某个条目的时间（在大多数 Hashtable 操作中，包括 get 和 put 操作，都反映了这一点）。

Hashtable的四个构造函数


public Hashtable(int initialCapacity) {  //指定初始容量来构造Hashtable
        this(initialCapacity, 0.75f);
    }


public Hashtable() {
        this(11, 0.75f);
    }



public Hashtable(Map<? extends K, ? extends V> t) {

        this(Math.max(2*t.size(), 11), 0.75f);
        putAll(t);
    }


 public Hashtable(int initialCapacity, float loadFactor) {
        if (initialCapacity < 0)
            throw new IllegalArgumentException("Illegal Capacity: "+
                                               initialCapacity);
        if (loadFactor <= 0 || Float.isNaN(loadFactor))
            throw new IllegalArgumentException("Illegal Load: "+loadFactor);

        if (initialCapacity==0)
            initialCapacity = 1;
        this.loadFactor = loadFactor;
        table = new Entry<?,?>[initialCapacity];
        threshold = (int)Math.min(initialCapacity * loadFactor, MAX_ARRAY_SIZE + 1);
    }

其实上面的前三个构造方法，最终都调用了第四个构造方法。如果在初始化Hashtable时，不指定加载因子loadFactor，那么加载因子会被设置为0.75f。

Hashtable的contains方法


public synchronized boolean contains(Object value) {
        if (value == null) { //不允许val为null
            throw new NullPointerException();
        }

        Entry<?,?> tab[] = table;
        for (int i = tab.length ; i-- > 0 ;) {
            for (Entry<?,?> e = tab[i] ; e != null ; e = e.next) {
                if (e.value.equals(value)) {
                    return true;
                }
            }
        }
        return false;
    }

这里使用的是很简单的两层for循环，外层是在table上，内层是在链表上检索，通过equals方法来比对。

Hashtable的get方法


public synchronized V get(Object key) {
        Entry<?,?> tab[] = table;
        int hash = key.hashCode(); 
        int index = (hash & 0x7FFFFFFF) % tab.length; //index表示在table中的位置，也就是桶的位置
        for (Entry<?,?> e = tab[index] ; e != null ; e = e.next) { //在链表中循环找到key对应的value
            if ((e.hash == hash) && e.key.equals(key)) {
                return (V)e.value;
            }
        }
        return null;
    }

Hashtable的put方法


public synchronized V put(K key, V value) {
        // Make sure the value is not null
        if (value == null) {
            throw new NullPointerException(); //value不可以为null
        }

        // Makes sure the key is not already in the hashtable.
        Entry<?,?> tab[] = table;
        int hash = key.hashCode(); //如果key为null，这里会有异常抛出
        int index = (hash & 0x7FFFFFFF) % tab.length;  //桶的位置
        @SuppressWarnings("unchecked")
        Entry<K,V> entry = (Entry<K,V>)tab[index];
        for(; entry != null ; entry = entry.next) {
            if ((entry.hash == hash) && entry.key.equals(key)) { //如果键值对已经存在，更新值
                V old = entry.value;
                entry.value = value;
                return old;
            }
        }

        addEntry(hash, key, value, index);  //加入新的键值对
        return null;
    }

使用put方法时，需要注意Hashtable不需要key或value为null。如果key和value的映射已经存在，该方法更新旧的值。addEntry是个很重要的方法，后面会详细讲。

Hashtable的addEntry方法


private void addEntry(int hash, K key, V value, int index) {
        modCount++; //修改次数

        Entry<?,?> tab[] = table;
        if (count >= threshold) {
            // Rehash the table if the threshold is exceeded
            rehash();  //如果实际大小count大于阀值，需要rehash，调整整个table

            tab = table;
            hash = key.hashCode();  //直接调用Object类的hashCode方法
            index = (hash & 0x7FFFFFFF) % tab.length;  //index就是桶的位置
        }

        // Creates the new entry.
        @SuppressWarnings("unchecked")
        Entry<K,V> e = (Entry<K,V>) tab[index]; //index对应的桶
        tab[index] = new Entry<>(hash, key, value, e);  //链接新的结点，新结点的next域指向旧结点，也就是新的结点是链表表头
        count++;
    }

这个方法需要注意链接新节点的时候，新的结点是链表表头。

Hashtable的rehash方法


protected void rehash() {
        int oldCapacity = table.length;
        Entry<?,?>[] oldMap = table;

        // overflow-conscious code
        int newCapacity = (oldCapacity << 1) + 1;  //在原来大小上扩大两倍+1，关于为什么是这样后面会说
        if (newCapacity - MAX_ARRAY_SIZE > 0) {
            if (oldCapacity == MAX_ARRAY_SIZE)
                // Keep running with MAX_ARRAY_SIZE buckets
                return;
            newCapacity = MAX_ARRAY_SIZE;
        }
        Entry<?,?>[] newMap = new Entry<?,?>[newCapacity];  //新的数组

        modCount++; //修改次数加1
        threshold = (int)Math.min(newCapacity * loadFactor, MAX_ARRAY_SIZE + 1);  //更新阀值
        table = newMap;
   

        for (int i = oldCapacity ; i-- > 0 ;) { 
            for (Entry<K,V> old = (Entry<K,V>)oldMap[i] ; old != null ; ) {
                Entry<K,V> e = old;
                old = old.next;

                int index = (e.hash & 0x7FFFFFFF) % newCapacity;  //新的index，结点Entry的hash没有变化
                e.next = (Entry<K,V>)newMap[index];  //同一个桶中，新的结点链接到表头
                newMap[index] = e;
            }
        }
    }

Hashtable遍历方式

测试demo


public class TestHashtable {
    private static Hashtable table = new Hashtable();

    public static void main(String[] args) {
        init();
        entrySetAccess();
        iteratorAccess();
        enumerationAccess();
    }


    public static void init() {
        long startTime;
        long endTime;
        startTime = System.currentTimeMillis();
        for(int i=0; i<100000; i++) {
            table.put(i,i);
        }
        endTime = System.currentTimeMillis();
        long interval = endTime - startTime;
        System.out.println("init time：" + interval+" ms");
    }


    public static void entrySetAccess() {
        Integer integ = null;
        Integer key = null;
        Iterator iter = table.entrySet().iterator();
        long startTime;
        long endTime;
        startTime = System.currentTimeMillis();
        while(iter.hasNext()) {
            Map.Entry entry = (Map.Entry)iter.next();
            // 获取key
            key = (Integer) entry.getKey();
            // 获取value
            integ = (Integer)entry.getValue();
        }
        endTime = System.currentTimeMillis();
        long interval = endTime - startTime;
        System.out.println("entrySetAccess time：" + interval+" ms");
    }

    public static void iteratorAccess() {
        Integer key = null;
        Integer integ = null;
        Iterator iter = table.keySet().iterator();
        long startTime;
        long endTime;
        startTime = System.currentTimeMillis();
        while (iter.hasNext()) {
            // 获取key
            key = (Integer) iter.next();
            // 根据key，获取value
            integ = (Integer)table.get(key);
        }
        endTime = System.currentTimeMillis();
        long interval = endTime - startTime;
        System.out.println("iteratorAccess time：" + interval+" ms");
    }

    public static void enumerationAccess() {
        long startTime;
        long endTime;
        startTime = System.currentTimeMillis();
        Enumeration enu = table.keys();
        while(enu.hasMoreElements()) {
            enu.nextElement();
        }
        endTime = System.currentTimeMillis();
        long interval = endTime - startTime;
        System.out.println("enumerationAccess time：" + interval+" ms");
    }


}

控制台输出：

init time：29 ms

entrySetAccess time：17 ms

iteratorAccess time：16 ms

enumerationAccess time：6 ms

可知通过Enumeration来遍历Hashtable效率更高。

Hashtable源码探讨（基于JDK1.8）

源码分析--ConcurrentHashMap与HashTable（JDK1.8）

基于JDK1.8，Java容器源码分析

Java集合类源码解析：HashMap (基于JDK1.8)

HashSet源码解析（基于JDK1.8）

ArrayList源码解析（基于JDK1.8）

LinkedList源码分析，基于JDK1.8逐行分析

Java集合框架--LinkedList源码分析(基于JDK1.8)

LinkedList源码解析——基于JDK1.8

HashMap源码分析，基于jdk1.8，附源码注释

HashMap源码分析基于JDK1.8