Java - PriorityQueue

程序员文章站 2022-07-09 12:46:16

JDK 10.0.2 前段时间在网上刷题，碰到一个求中位数的题，看到有网友使用PriorityQueue来实现，感觉其解题思想挺不错的。加上我之前也没使用过PriorityQueue，所以我也试着去读该类源码，并用同样的思想解决了那个题目。现在来对该类做个总结，需要注意，文章内容以算法和数据结构为中 ......

jdk 10.0.2

前段时间在网上刷题，碰到一个求中位数的题，看到有网友使用priorityqueue来实现，感觉其解题思想挺不错的。加上我之前也没使用过priorityqueue，所以我也试着去读该类源码，并用同样的思想解决了那个题目。现在来对该类做个总结，需要注意，文章内容以算法和数据结构为中心，不考虑其他细节内容。如果小伙伴想看那个题目，可以直接跳转到（）。

一. ：queue[]、size、comparator
二. ：heapify()、siftdowncomparable(k, e)
三. ：offer(e)、siftupusingcomparator(k, e)
四. ：indexof(o)
五. ：remove(o)、removeat(i)、removeeq(o)
六. ：peek()
七. ：poll()
八. ：clear()
九. ：iterator()、toarray()、toarray(t[] a)
十. ：数据流中的中位数

一. 数据结构

我只列出了讲解需要的重要属性，不考虑其他细节。priorityqueue（优先队列）内部是以堆来实现的。为了描述方便，接下来的内容我将用pq[ ]代替queue[ ]。

priorityqueue<e> {
    /* 平衡二叉堆 用于存储元素
     * n : 0 -> size-1
     * pq[n].left = pq[2*n+1]
     * pq[n].right = pq[2*(n+1)]
     */
    object[] queue; 
    int size; // pq中元素个数
    comparator<? super e> comparator; // 自定义比较器
}

二. 初始化（堆化）

如果使用已有集合来构造priorityqueue，就会用到heapify()来对pq[ ]进行初始化（即：二叉堆化），使其满足堆的性质。而heapify()又通过调用siftdowncomparable(k, e)来完成堆化。源码如下：

 1 @suppresswarnings("unchecked")
 2 private void heapify() {
 3     final object[] es = queue;
 4     int i = (size >>> 1) - 1;
 5     if (comparator == null)
 6         for (; i >= 0; i--)
 7             siftdowncomparable(i, (e) es[i]);
 8     else
 9         for (; i >= 0; i--)
10             siftdownusingcomparator(i, (e) es[i]);
11 }
12 
13 @suppresswarnings("unchecked")
14 private void siftdowncomparable(int k, e x) {
15     comparable<? super e> key = (comparable<? super e>)x;
16     int half = size >>> 1;        // loop while a non-leaf
17     while (k < half) {
18         int child = (k << 1) + 1; // assume left child is least
19         object c = queue[child];
20         int right = child + 1;
21         if (right < size &&
22             ((comparable<? super e>) c).compareto((e) queue[right]) > 0)
23             c = queue[child = right];
24         if (key.compareto((e) c) <= 0)
25             break;
26         queue[k] = c;
27         k = child;
28     }
29     queue[k] = key;
30 }

view code

如果有自定义比较器的话，调用：siftdownusingcomparator(k, e)，否则调用：siftdowncomparable(k, e)。这两个方法只是在比较两个元素大小时的表现形式不同，其他内容相同，所以我们只需要看其中一种情况就行。为了描述方便，下面的例子中，我使用integer作为pq[ ]存储元素类型，所以调用的是siftdowncomparable(k, e)。（size >>> 1 表示 size 无符号右移1位，等价于size / 2）

我不会去细抠源码，一行一行地为大家讲解，而是尽量使用简单的例子来展示，我觉得通过例子以及后期大家自己阅读源码，会更容易理解算法内容。

现在我们来看看，使用集合{2, 9, 8, 4, 7, 1, 3, 6, 5}来构造priorityqueue的过程。算法时间复杂度为o(n)，n = size。（时间复杂度证明：《算法导论》（第3版）第6章6.3建堆）

Java - PriorityQueue

首先，从下到上，从右到左，找到第一个父结点 i，满足规律：i = (size >>> 1) - 1，这里size = 9，i = 3；
比较pq[3, 7, 8]中的元素，将最小的元素pq[x]与堆顶元素pq[3]互换，由于pq[x] = pq[3]，所以无互换；
移动到下一个父结点 i = 2，同理，比较pq[2, 5, 6]中的元素，将最小的元素pq[5]与pq[2]互换，后面的操作同理；
需要注意，当pq[1]（9）和pq[3]（4）互换后（如图2.d），pq[3, 7, 8]违背了最小堆的性质，所以需要进一步调整（向下调整），当调整到叶结点时（i >= size/2）结束；

三. 添加元素

添加元素：add(e)，offer(e)，由于添加元素可能破坏堆的性质，所以需要调用siftup(i, e)向上调整来维护堆性质。同样，siftup(i, e)根据有无自定义比较器来决定调用siftupusingcomparator(k, e)还是siftupcomparable(k, e)。在我举的例子中，使用的是siftupcomparable(k, e)。下面是添加元素的相关源码：

 1 public boolean offer(e e) {
 2     if (e == null)
 3         throw new nullpointerexception();
 4     modcount++;
 5     int i = size;
 6     if (i >= queue.length)
 7         grow(i + 1);
 8     siftup(i, e);
 9     size = i + 1;
10     return true;
11 }
12 
13 @suppresswarnings("unchecked")
14 private void siftupcomparable(int k, e x) {
15     comparable<? super e> key = (comparable<? super e>) x;
16     while (k > 0) {
17         int parent = (k - 1) >>> 1;
18         object e = queue[parent];
19         if (key.compareto((e) e) >= 0)
20             break;
21         queue[k] = e;
22         k = parent;
23     }
24     queue[k] = key;
25 }

view code

源码中 grow(i + 1) 是当pq[ ]容量不够时的增长策略，目前可以不用考虑。现在来看往最小堆 pq = {3, 5, 6, 7, 8, 9} 中添加元素 1的过程。算法时间复杂度为o(lgn)，n = size。

Java - PriorityQueue

首先，把要添加的元素 1 放到pq[size]，然后调用siftup(k, e)来维护堆，调整结束后 size++；
向上调整(k, e)时，先找到结点pq[k]的父结点，满足规律 parent = (k - 1) >>> 1，例子中，k = 6, parent = 2；
比较pq[k]与pq[parent]，将较小者放到高处，较大者移到低处，例子中，交换pq[6]（1）与pq[2]（6）的位置；
此次交换结束后，令 k = parent，继续以同样的方法操作，直到 k <= 0 时（到达根结点）结束；

四. 索引

indexof(o)是个私有方法，但好多公开方法中都调用了它，比如：remove(o)，contains(o)等，所以在这里也简单提一下。该算法并不复杂。时间复杂度为o(n)，n = size。

1 private int indexof(object o) {
2     if (o != null) {
3         for (int i = 0; i < size; i++)
4             if (o.equals(queue[i]))
5                 return i;
6     }
7     return -1;
8 }

view code

indexof(o)中比较两个元素是否相等，使用的是equals()，而接下来要提的removeeq(o)中直接使用了 == 来判断，请读者注意区别。

五. 删除元素

remove(o)、removeeq(o)，二者只是在判断两个元素是否相等时使用的方法不同（前者使用equals()，后者使用==），其他内容相同，它们都调用了removeat(i)来执行删除操作。删除元素后很可能会破坏堆的性质，所以同样需要进行维护。删除元素的维护要比添加元素的维护稍微复杂一点，因为可能同时涉及了：向上调整siftup和向下调整siftdown。源码如下：

 1 public boolean remove(object o) {
 2     int i = indexof(o);
 3     if (i == -1)
 4         return false;
 5     else {
 6         removeat(i);
 7         return true;
 8     }
 9 }
10 
11 boolean removeeq(object o) {
12     for (int i = 0; i < size; i++) {
13         if (o == queue[i]) {
14             removeat(i);
15             return true;
16         }
17     }
18     return false;
19 }
20 
21 @suppresswarnings("unchecked")
22 e removeat(int i) {
23     // assert i >= 0 && i < size;
24     modcount++;
25     int s = --size;
26     if (s == i) // removed last element
27         queue[i] = null;
28     else {
29         e moved = (e) queue[s];
30         queue[s] = null;
31         siftdown(i, moved);
32         if (queue[i] == moved) {
33             siftup(i, moved);
34             if (queue[i] != moved)
35                 return moved;
36         }
37     }
38     return null;
39 }

view code

我们还是通过例子来学习吧，通过对 pq = {0, 1, 7, 2, 3, 8, 9, 4, 5, 6} 进行一系列删除操作，来理解算法的运作过程。算法时间复杂度o(lgn)，n = size。

Java - PriorityQueue

第1步，remove(6)，indexof(6) = 9，removeat(9)（用r(9)表示，后面同理）,由于i = 9为队列末端，删除后不会破坏堆性质，所以可以直接删除；
第2步，remove(1)，即r(1)，根据图(5.b)可以看出，算法是拿队列尾部pq[8]去替换pq[1]，替换后破坏了最小堆的性质，需要向下调整进行维护；
第3步，remove(8)，即r(5)，使用队列尾部元素pq[7]替换pq[5]，替换后破坏了最小堆的性质，需要向上调整进行维护；

六. 取堆顶

peek()可以在o(1)的时间复杂度下取到堆顶元素pq[0]，看源码一目了然：

1 @suppresswarnings("unchecked")
2 public e peek() {
3     return (size == 0) ? null : (e) queue[0];
4 }

view code

七. 删除堆顶

删除堆顶使用poll()方法，其算法思想等价于removeat(0)（时间复杂度o(lgn)），稍微有点区别的是，其只涉及到向下调整，不涉及向上调整。不清楚的朋友可以参看（），下面是源码：

 1 @suppresswarnings("unchecked")
 2 public e poll() {
 3     if (size == 0)
 4         return null;
 5     int s = --size;
 6     modcount++;
 7     e result = (e) queue[0];
 8     e x = (e) queue[s];
 9     queue[s] = null;
10     if (s != 0)
11         siftdown(0, x);
12     return result;
13 }

view code

八. 清除队列

清除队列clear()，就是依次把pq[i]置为null，然后size置0，但是pq.length没有改变。时间复杂度为o(n)，n = size。源码如下：

1 public void clear() {
2     modcount++;
3     for (int i = 0; i < size; i++)
4         queue[i] = null;
5     size = 0;
6 }

view code

九. 遍历

可以使用迭代器（iterator）来遍历pq[ ]本身，或者调用toarray()、toarray(t[] a)方法来生成一个pq[ ]的副本进行遍历。遍历本身的时间复杂度为o(n)，n = size。

使用迭代器遍历 pq = {0, 1, 7, 2, 3, 8, 9, 4, 5, 6}，方法如下：

 1 public static void traverse1(priorityqueue<integer> x) {
 2     iterator<integer> it = x.iterator();
 3     while (it.hasnext()) {
 4         system.out.print(it.next() + " ");
 5     }
 6     system.out.println();
 7 }
 8 // 或者更简单的，结合java语法糖，可以写成如下形式
 9 public static void traverse2(priorityqueue<integer> x) {
10     for (int a : x) {
11         system.out.print(a + " ");
12     }
13     system.out.println();
14 }
15 /* 输出
16 0 1 7 2 3 8 9 4 5 6 
17 */

view code

通过拷贝pq[ ]副本来遍历，方法如下：

 1 public static void traverse3(priorityqueue<integer> x) {
 2     object[] ins = x.toarray();
 3     for (object a : ins) {
 4         system.out.print((integer)a + " ");
 5     }
 6     system.out.println();
 7 }
 8 
 9 public static void traverse4(priorityqueue<integer> x) {
10     integer[] ins = new integer[100];
11     ins = x.toarray(ins);
12     for (int i = 0, len = x.size(); i < len; i++) {
13         system.out.print(ins[i] + " ");
14     }
15     system.out.println();
16 }
17 /* 输出
18 0 1 7 2 3 8 9 4 5 6 
19 */

view code

在使用toarray(t[] a)拷贝来进行遍历时，需要注意（x表示priorityqueue对象）：

如果ins[ ]的容量大于x.size()，请使用for (int i = 0; i < x.size(); i++) 来遍历，否则可能会获取到多余的数据；或者你使用for (int a : ins)来遍历时，可能导致nullpointerexception异常；
请使用 ins = x.toarray(ins) 的写法来确保正确获取到pq[ ]副本。当ins[ ]容量大于x.size()时，写为 x.toarray(ins) 能正确获取到副本，但当ins[ ]容量小于x.size()时，该写法就无法正确获取副本。因为此情况下toarray(t[] a)内部会重新生成一个大小为x.size()的integer数组进行拷贝，然后return该数组；

toarray(t[] a)源码如下：

 1 @suppresswarnings("unchecked")
 2 public <t> t[] toarray(t[] a) {
 3     final int size = this.size;
 4     if (a.length < size)
 5         // make a new array of a's runtime type, but my contents:
 6         return (t[]) arrays.copyof(queue, size, a.getclass());
 7     system.arraycopy(queue, 0, a, 0, size);
 8     if (a.length > size)
 9         a[size] = null;
10     return a;
11 }

view code

十. 小测试

下面来说说文章开头我提到的那个题目吧，如下（点击这里在线做题）（请使用priorityqueue来完成）：

/* 数据流中的中位数
题目描述
如何得到一个数据流中的中位数？如果从数据流中读出奇数个数值，那么中位数就是所有数值排序之后位于中间的数值。
如果从数据流中读出偶数个数值，那么中位数就是所有数值排序之后中间两个数的平均值。我们使用insert()方法读取数据流，
使用getmedian()方法获取当前读取数据的中位数。
*/

public class solution {
    public void insert(integer num) {}
    public double getmedian() {}
}

我写的参考代码（带解析），如下：

 1 /*
 2 关键点：
 3  大根堆maxq       小根堆minq
 4 ----------      -------------
 5           \    /
 6   <= a     a  b   >= b
 7           /    \
 8 ----------      -------------
 9  
10 每次insert(num)前要确保 ：
11     1) maxq.size == q.size // 偶数个时，二者元素个数相等
12 或  2) minq.size == maxq.size + 1 // 奇数个时把多余的1个放到小根堆minq
13 这样一来，获取中位数时：
14 奇数个：minq.top;
15 偶数个：(minq.top + maxq.top) / 2
16  
17 每次isnert(num)后，可能会打破上面的条件，出现下面的情况：
18     1) maxq.size == q.size + 1 // 打破条件(1) => 这时需要把maxq.top放到minq中
19 或  2) minq.size == maxq.size + 2 // 打破条件(2) => 这时需要把minq.top放到maxq中
20 */
21 
22 import java.util.comparator;
23 import java.util.priorityqueue;
24  
25 public class jzoffer_63_solution_02 {
26     priorityqueue<integer> minq = new priorityqueue<integer>();
27     priorityqueue<integer> maxq = new priorityqueue<integer>((o1, o2) -> o2.compareto(o1));
28 
29     public void insert(integer num) {
30         if (minq.isempty() || num >= minq.peek()) minq.offer(num);
31         else maxq.offer(num);
32         if (minq.size() == maxq.size()+2) maxq.offer(minq.poll());
33         if (maxq.size() == minq.size()+1) minq.offer(maxq.poll());
34     }
35 
36     public double getmedian() {
37         return minq.size() == maxq.size() ? (double)(minq.peek()+maxq.peek())/2.0 : (double)minq.peek();
38     }
39 }

view code

转载请说明出处，have a good time! :d

上一篇： Perl中捕获警告信息、异常信息并写入日志详解

下一篇： ASP漏洞集-虚拟web目录容易泄露ASP源代码 (MS,缺陷)

Java - PriorityQueue

Ubuntu 12.10下安装 Oracle Java分享教程

PHP、Java des加密解密实例

Java字符串面试问答

Java中Integer.parseInt和Integer.valueOf，你还傻傻分不清吗？

Java锁-Synchronized深层剖析

并发编程之Java锁

Java描述设计模式(18)：享元模式

面试开挂：近百道Java面试题整理

java打jar包的几种方式详解

Java内存模型以及happens-before规则