A Few Short Stories About Map and null

程序员文章站 (Programmer Article Site), 2022-07-05 11:46:32


Reposted from https://blog.csdn.net/u010666119/article/details/53873876

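A ConcurrentHashMap put in our project threw an exception when the value was null. I was shocked at the time: isn't it the same as HashMap? Isn't it? Really? Well, as it turns out, it really is not the same.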

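A few days ago I read the explanation that Google's Guava gives for HashMap#get(Object key). When it returns null, there are two possible cases:

1. The key is present, and the value mapped to it is null.

2. The key is not present, so null is returned.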

This is genuinely a little confusing. For these cases you can, of course, use HashMap#containsKey(Object key) to tell them apart.
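To make the two cases concrete, here is a minimal sketch. The class name NullAmbiguityDemo and the helper describe are made up for illustration; only HashMap#get and HashMap#containsKey come from the text above.

```java
import java.util.HashMap;
import java.util.Map;

class NullAmbiguityDemo {
    // Hypothetical helper: reports which of the cases applies to a key.
    static String describe(Map<String, String> map, String key) {
        String value = map.get(key);
        if (value != null) {
            return "present with value " + value;
        }
        // get() returned null, so containsKey() is needed to tell the cases apart.
        return map.containsKey(key) ? "present with null value" : "absent";
    }

    public static void main(String[] args) {
        Map<String, String> map = new HashMap<>();
        map.put("a", "1");
        map.put("b", null); // HashMap accepts a null value

        System.out.println(describe(map, "a")); // present with value 1
        System.out.println(describe(map, "b")); // present with null value
        System.out.println(describe(map, "c")); // absent
    }
}
```

For "b" and "c", get() alone returns null in both cases; only the second call distinguishes them.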

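I remember reading that Java's Map implementations handle null keys and values differently, and that the two most often compared are Hashtable and HashMap.

Over the past few days I went through the source code to see how each one handles this internally and to deepen my understanding.

I focused on the put and get operations; the checks in the other operations should follow the same logic. Here are put and get.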

1.put

1. Hashtable: neither K nor V may be null. The code explicitly checks the value for null, but note the key.hashCode() call further down: if the key is null, that call itself throws a NullPointerException.

public synchronized V put(K key, V value) {
    // Make sure the value is not null
    if (value == null) {
        throw new NullPointerException();
    }

    // Makes sure the key is not already in the hashtable.
    Entry<?,?> tab[] = table;
    // key must not be null, or key.hashCode() throws NullPointerException
    int hash = key.hashCode();
    .....
}
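Both rejection paths can be observed with a small sketch. The class HashtableNullDemo and the helper throwsNpe are illustrative names, not part of the JDK.

```java
import java.util.Hashtable;
import java.util.Map;

class HashtableNullDemo {
    // Hypothetical helper: returns true if the action throws NullPointerException.
    static boolean throwsNpe(Runnable action) {
        try {
            action.run();
            return false;
        } catch (NullPointerException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        Map<String, String> table = new Hashtable<>();

        // Null value: rejected by the explicit check at the top of put().
        System.out.println(throwsNpe(() -> table.put("k", null))); // true

        // Null key: no explicit check, but key.hashCode() fails on null.
        System.out.println(throwsNpe(() -> table.put(null, "v"))); // true

        // Neither call stored anything.
        System.out.println(table.isEmpty()); // true
    }
}
```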

2. HashMap: both K and V may be null. The hash of a null key is 0, so repeated puts with a null key overwrite the previous value, while any number of distinct keys may map to a null value.

Note the hash() method: it checks whether the key is null. For a null key it returns 0; otherwise it returns key.hashCode() XORed with its own upper 16 bits.

public V put(K key, V value) {
    return putVal(hash(key), key, value, false, true);
}

static final int hash(Object key) {
    int h;
    return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
}
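As a small illustration of what this hash() does, the sketch below copies its logic into a standalone class. HashSpreadDemo, spread and bucketIndex are names invented here; the index formula (n - 1) & hash is the one used in putVal.

```java
class HashSpreadDemo {
    // Copy of the hash() logic quoted above, for illustration only.
    static int spread(Object key) {
        int h;
        return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
    }

    // Bucket index for a table whose length is a power of two, as in putVal().
    static int bucketIndex(Object key, int tableLength) {
        return (tableLength - 1) & spread(key);
    }

    public static void main(String[] args) {
        // A null key always hashes to 0 and therefore lands in bucket 0.
        System.out.println(spread(null));          // 0
        System.out.println(bucketIndex(null, 16)); // 0

        // "a".hashCode() is 97; its upper 16 bits are zero, so the hash stays 97.
        System.out.println(spread("a"));           // 97

        // Integer 65536 has hashCode 65536 (only bit 16 set). Without the XOR
        // it would land in bucket 0 of a 16-slot table; with it, bucket 1.
        System.out.println(bucketIndex(65536, 16)); // 1
    }
}
```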

final V putVal(int hash, K key, V value, boolean onlyIfAbsent,  
               boolean evict) {  
    Node<K,V>[] tab; Node<K,V> p; int n, i;  
    if ((tab = table) == null || (n = tab.length) == 0)  
        n = (tab = resize()).length;  
    if ((p = tab[i = (n - 1) & hash]) == null)  
        tab[i] = newNode(hash, key, value, null);  
    else {  
        Node<K,V> e; K k;  
        if (p.hash == hash &&  
            ((k = p.key) == key || (key != null && key.equals(k))))  
            e = p;  
        else if (p instanceof TreeNode)  
            e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value);  
        else {  
            for (int binCount = 0; ; ++binCount) {  
                if ((e = p.next) == null) {  
                    p.next = newNode(hash, key, value, null);  
                    if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st  
                        treeifyBin(tab, hash);  
                    break;  
                }  
                if (e.hash == hash &&  
                    ((k = e.key) == key || (key != null && key.equals(k))))  
                    break;  
                p = e;  
            }  
        }  
        if (e != null) { // existing mapping for key  
            V oldValue = e.value;  
            if (!onlyIfAbsent || oldValue == null)  
                e.value = value;  
            afterNodeAccess(e);  
            return oldValue;  
        }  
    }  
    ++modCount;  
    if (++size > threshold)  
        resize();  
    afterNodeInsertion(evict);  
    return null;  
}  
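The observable effect of this logic on null keys and null values can be sketched as follows. HashMapNullKeyDemo and build are illustrative names only.

```java
import java.util.HashMap;
import java.util.Map;

class HashMapNullKeyDemo {
    // Builds a map that exercises both the null key and null values.
    static Map<String, String> build() {
        Map<String, String> map = new HashMap<>();
        map.put(null, "first");
        map.put(null, "second"); // same (null) key: replaces "first"
        map.put("x", null);
        map.put("y", null);      // distinct keys may both map to null
        return map;
    }

    public static void main(String[] args) {
        Map<String, String> map = build();
        System.out.println(map.size());           // 3
        System.out.println(map.get(null));        // second
        System.out.println(map.containsKey("x")); // true
        System.out.println(map.get("x"));         // null
    }
}
```

The null key occupies a single entry, which is why the size is 3 rather than 4.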

3. ConcurrentHashMap: neither K nor V may be null. ConcurrentHashMap validates both the key and the value with an explicit null check.

public V put(K key, V value) {
    return putVal(key, value, false);
}

final V putVal(K key, V value, boolean onlyIfAbsent) {
    if (key == null || value == null) throw new NullPointerException();
    int hash = spread(key.hashCode());
    //....
}
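The failure described at the start of this post can be reproduced in a few lines. The sketch also shows one possible way to cope with a value that may be null, by treating null as "no mapping". The helper putOrRemove is a hypothetical name, and whether removing the key is the right behavior depends on the application.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

class ConcurrentPutNullDemo {
    // Hypothetical helper: a null value removes the mapping instead of being stored.
    // The key itself is assumed to be non-null.
    static <K, V> void putOrRemove(ConcurrentMap<K, V> map, K key, V value) {
        if (value == null) {
            map.remove(key);
        } else {
            map.put(key, value);
        }
    }

    public static void main(String[] args) {
        ConcurrentMap<String, String> map = new ConcurrentHashMap<>();

        try {
            map.put("k", null); // rejected by the explicit check in putVal()
        } catch (NullPointerException e) {
            System.out.println("put(\"k\", null) threw NullPointerException");
        }

        putOrRemove(map, "k", "v");
        System.out.println(map.get("k"));         // v
        putOrRemove(map, "k", null);
        System.out.println(map.containsKey("k")); // false
    }
}
```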

2.get

1. Hashtable: K must not be null

public synchronized V get(Object key) {
    Entry<?,?> tab[] = table;
    int hash = key.hashCode();
    //....
}

2. HashMap: K may be null

public V get(Object key) {
    Node<K,V> e;
    return (e = getNode(hash(key), key)) == null ? null : e.value;
}

3. ConcurrentHashMap: K must not be null

public V get(Object key) {
    Node<K,V>[] tab; Node<K,V> e, p; int n, eh; K ek;
    int h = spread(key.hashCode());
    //......
}
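The three get implementations can be compared side by side with a short sketch. GetNullKeyDemo and getNullKey are names made up for this example.

```java
import java.util.HashMap;
import java.util.Hashtable;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

class GetNullKeyDemo {
    // Hypothetical helper: returns the result of get(null) as text,
    // or "NPE" if the map throws NullPointerException.
    static String getNullKey(Map<String, String> map) {
        try {
            return String.valueOf(map.get(null));
        } catch (NullPointerException e) {
            return "NPE";
        }
    }

    public static void main(String[] args) {
        Map<String, String> hashMap = new HashMap<>();
        hashMap.put(null, "v");

        System.out.println(getNullKey(hashMap));                   // v
        System.out.println(getNullKey(new Hashtable<>()));         // NPE
        System.out.println(getNullKey(new ConcurrentHashMap<>())); // NPE
    }
}
```

In both Hashtable and ConcurrentHashMap the exception comes from calling key.hashCode() on null, as the excerpts above show.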

Finally, the main point: with the same key-value structure, why is it that HashMap can put null? Huh?

I found this answer: The main reason that nulls aren’t allowed in ConcurrentMaps (ConcurrentHashMaps, ConcurrentSkipListMaps) is that ambiguities that may be just barely tolerable in non-concurrent maps can’t be accommodated. The main one is that if map.get(key) returns null, you can’t detect whether the key explicitly maps to null vs the key isn’t mapped. In a non-concurrent map, you can check this via map.contains(key), but in a concurrent one, the map might have changed between calls.

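My understanding: ConcurrentHashMap and Hashtable both support concurrent access, which creates a problem. When get(k) returns null, you cannot tell whether the value was null at put(k, v) time or whether the key was never mapped at all. HashMap is not concurrent, so you can make that distinction with containsKey(key). With a map that supports concurrency, however, the map m may already have changed between the call to m.containsKey(key) and the call to m.get(key).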
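The window between the two calls can be illustrated deterministically, without real threads. In the sketch below, the interleaved hook is a stand-in for whatever another thread might do at that moment; the names CheckThenActDemo and checkThenGet are invented for this example.

```java
import java.util.HashMap;
import java.util.Map;

class CheckThenActDemo {
    // Check-then-act lookup. The 'interleaved' hook runs between containsKey()
    // and get(), simulating a modification made by another thread.
    static String checkThenGet(Map<String, String> map, String key, Runnable interleaved) {
        if (map.containsKey(key)) {
            interleaved.run();
            return map.get(key);
        }
        throw new IllegalStateException("key not present: " + key);
    }

    public static void main(String[] args) {
        Map<String, String> map = new HashMap<>();
        map.put("k", "v");

        // The key exists at check time but is removed before get() runs.
        String result = checkThenGet(map, "k", () -> map.remove("k"));

        // The caller sees null even though null was never stored in the map.
        System.out.println(result); // null
    }
}
```

If null values were allowed in a concurrent map, this result would be indistinguishable from a legitimately stored null.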

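Personally I find this answer quite convincing, and it resolved a question that had been nagging me. The experts clearly weighed a lot in their design, so I am sharing it here.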

There is another, similar answer:
I believe it is, at least in part, to allow you to combine containsKey and get into a single call. If the map can hold nulls, there is no way to tell if get is returning a null because there was no key for that value, or just because the value was null.

Why is that a problem? Because there is no safe way to do that yourself. Take the following code:

if (m.containsKey(k)) {
   return m.get(k);
} else {
   throw new KeyNotPresentException();
}

Since m is a concurrent map, key k may be deleted between the containsKey and get calls, causing this snippet to return a null that was never in the table, rather than the desired KeyNotPresentException.

Normally you would solve that by synchronizing, but with a concurrent map that of course won’t work. Hence the signature for get had to change, and the only way to do that in a backwards-compatible way was to prevent the user inserting null values in the first place, and continue using that as a placeholder for “key not found”.
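Because a ConcurrentHashMap never holds null values, the containsKey-then-get pair can be collapsed into a single get. A minimal sketch, assuming the map rejects null values: getOrThrow is a hypothetical helper, and NoSuchElementException stands in for the KeyNotPresentException of the snippet above.

```java
import java.util.NoSuchElementException;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

class SingleCallLookupDemo {
    // One read of the map: since values cannot be null, a null result can
    // only mean that the key was absent at the moment of the call.
    static <K, V> V getOrThrow(ConcurrentMap<K, V> map, K key) {
        V value = map.get(key);
        if (value == null) {
            throw new NoSuchElementException("key not present: " + key);
        }
        return value;
    }

    public static void main(String[] args) {
        ConcurrentMap<String, String> map = new ConcurrentHashMap<>();
        map.put("k", "v");

        System.out.println(getOrThrow(map, "k")); // v

        try {
            getOrThrow(map, "missing");
        } catch (NoSuchElementException e) {
            System.out.println(e.getMessage()); // key not present: missing
        }
    }
}
```

There is no gap between a check and a read here, so a concurrent removal can only make the call throw; it cannot produce a null that was never in the table.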
