Kotlin Coroutines不复杂, 我来帮你理一理

程序员文章站 2022-11-07 11:28:05

Coroutines 协程最近在总结Kotlin的一些东西, 发现协程这块确实不容易说清楚. 之前的那篇就写得不好, 所以决定重写. 反复研究了官网文档和各种教程博客, 本篇内容是最基础也最主要的内容, 力求小白也能看懂并理解. Coroutines概念 Coroutines(协程), 计算机程序 ......

coroutines 协程

最近在总结kotlin的一些东西, 发现协程这块确实不容易说清楚. 之前的那篇就写得不好, 所以决定重写.
反复研究了官网文档和各种教程博客, 本篇内容是最基础也最主要的内容, 力求小白也能看懂并理解.

coroutines概念

coroutines(协程), 计算机程序组件, 通过允许任务挂起和恢复执行, 来支持非抢占式的多任务. (见wiki).

协程主要是为了异步, 非阻塞的代码. 这个概念并不是kotlin特有的, go, python等多个语言中都有支持.

kotlin coroutines

kotlin中用协程来做异步和非阻塞任务, 主要优点是代码可读性好, 不用回调函数. (用协程写的异步代码乍一看很像同步代码.)

kotlin对协程的支持是在语言级别的, 在标准库中只提供了最低程度的apis, 然后把很多功能都代理到库中.

kotlin中只加了suspend作为关键字.
async和await不是kotlin的关键字, 也不是标准库的一部分.

比起futures和promises, kotlin中suspending function的概念为异步操作提供了一种更安全和不易出错的抽象.

kotlinx.coroutines是协程的库, 为了使用它的核心功能, 项目需要增加kotlinx-coroutines-core的依赖.

coroutines basics: 协程到底是什么?

先上一段官方的demo:

import kotlinx.coroutines.globalscope
import kotlinx.coroutines.delay
import kotlinx.coroutines.launch


fun main() {
    globalscope.launch { // launch a new coroutine in background and continue
        delay(1000l) // non-blocking delay for 1 second (default time unit is ms)
        println("world!") // print after delay
    }
    println("hello,") // main thread continues while coroutine is delayed
    thread.sleep(2000l) // block main thread for 2 seconds to keep jvm alive
}

这段代码的输出:
先打印hello, 延迟1s之后, 打印world.

对这段代码的解释:

launch开始了一个计算, 这个计算是可挂起的(suspendable), 它在计算过程中, 释放了底层的线程, 当协程执行完成, 就会恢复(resume).

这种可挂起的计算就叫做一个协程(coroutine). 所以我们可以简单地说launch开始了一个新的协程.

注意, 主线程需要等待协程结束, 如果注释掉最后一行的thread.sleep(2000l), 则只打印hello, 没有world.

协程和线程的关系

coroutine(协程)可以理解为轻量级的线程. 多个协程可以并行运行, 互相等待, 互相通信. 协程和线程的最大区别就是协程非常轻量(cheap), 我们可以创建成千上万个协程而不必考虑性能.

协程是运行在线程上可以被挂起的运算. 可以被挂起, 意味着运算可以被暂停, 从线程移除, 存储在内存里. 此时, 线程就可以*做其他事情. 当计算准备好继续进行时, 它会返回线程(但不一定要是同一个线程).

默认情况下, 协程运行在一个共享的线程池里, 线程还是存在的, 只是一个线程可以运行多个协程, 所以线程没必要太多.

调试

在上面的代码中加上线程的名字:

fun main() {
    globalscope.launch {
        // launch a new coroutine in background and continue
        delay(1000l) // non-blocking delay for 1 second (default time unit is ms)
        println("world! + ${thread.currentthread().name}") // print after delay
    }
    println("hello, + ${thread.currentthread().name}") // main thread continues while coroutine is delayed
    thread.sleep(2000l) // block main thread for 2 seconds to keep jvm alive
}

可以在ide的edit configurations中设置vm options: -dkotlinx.coroutines.debug, 运行程序, 会在log中打印出代码运行的协程信息:

hello, + main
world! + defaultdispatcher-worker-1 @coroutine#1

suspend function

上面例子中的delay方法是一个suspend function.
delay()和thread.sleep()的区别是: delay()方法可以在不阻塞线程的情况下延迟协程. (it doesn't block a thread, but only suspends the coroutine itself). 而thread.sleep()则阻塞了当前线程.

所以, suspend的意思就是协程作用域被挂起了, 但是当前线程中协程作用域之外的代码不被阻塞.

如果把globalscope.launch替换为thread, delay方法下面会出现红线报错:

suspend functions are only allowed to be called from a coroutine or another suspend function

suspend方法只能在协程或者另一个suspend方法中被调用.

在协程等待的过程中, 线程会返回线程池, 当协程等待结束, 协程会在线程池中一个空闲的线程上恢复. (the thread is returned to the pool while the coroutine is waiting, and when the waiting is done, the coroutine resumes on a free thread in the pool.)

启动协程

启动一个新的协程, 常用的主要有以下几种方式:

launch
async
runblocking

它们被称为coroutine builders. 不同的库可以定义其他更多的构建方式.

runblocking: 连接blocking和non-blocking的世界

runblocking用来连接阻塞和非阻塞的世界.

runblocking可以建立一个阻塞当前线程的协程. 所以它主要被用来在main函数中或者测试中使用, 作为连接函数.

比如前面的例子可以改写成:

fun main() = runblocking<unit> {
    // start main coroutine
    globalscope.launch {
        // launch a new coroutine in background and continue
        delay(1000l)
        println("world! + ${thread.currentthread().name}")
    }
    println("hello, + ${thread.currentthread().name}") // main coroutine continues here immediately
    delay(2000l) // delaying for 2 seconds to keep jvm alive
}

最后不再使用thread.sleep(), 使用delay()就可以了.
程序输出:

hello, + main @coroutine#1
world! + defaultdispatcher-worker-1 @coroutine#2

launch: 返回job

上面的例子delay了一段时间来等待一个协程结束, 不是一个好的方法.

launch返回job, 代表一个协程, 我们可以用job的join()方法来显式地等待这个协程结束:

fun main() = runblocking {
    val job = globalscope.launch {
        // launch a new coroutine and keep a reference to its job
        delay(1000l)
        println("world! + ${thread.currentthread().name}")
    }
    println("hello, + ${thread.currentthread().name}")
    job.join() // wait until child coroutine completes
}

输出结果和上面是一样的.

job还有一个重要的用途是cancel(), 用于取消不再需要的协程任务.

async: 从协程返回值

async开启线程, 返回deferred<t>, deferred<t>是job的子类, 有一个await()函数, 可以返回协程的结果.

await()也是suspend函数, 只能在协程之内调用.

fun main() = runblocking {
    // @coroutine#1
    println(thread.currentthread().name)
    val deferred: deferred<int> = async {
        // @coroutine#2
        loaddata()
    }
    println("waiting..." + thread.currentthread().name)
    println(deferred.await()) // suspend @coroutine#1
}

suspend fun loaddata(): int {
    println("loading..." + thread.currentthread().name)
    delay(1000l) // suspend @coroutine#2
    println("loaded!" + thread.currentthread().name)
    return 42
}

运行结果:

main @coroutine#1
waiting...main @coroutine#1
loading...main @coroutine#2
loaded!main @coroutine#2
42

context, dispatcher和scope

看一下launch方法的声明:

public fun coroutinescope.launch(
    context: coroutinecontext = emptycoroutinecontext,
    start: coroutinestart = coroutinestart.default,
    block: suspend coroutinescope.() -> unit
): job {
...
}

其中有几个相关概念我们要了解一下.

协程总是在一个context下运行, 类型是接口coroutinecontext. 协程的context是一个索引集合, 其中包含各种元素, 重要元素就有job和dispatcher. job代表了这个协程, 那么dispatcher是做什么的呢?

构建协程的coroutine builder: launch, async, 都是coroutinescope类型的扩展方法. 查看coroutinescope接口, 其中含有coroutinecontext的引用. scope是什么? 有什么作用呢?

下面我们就来回答这些问题.

dispatchers和线程

context中的coroutinedispatcher可以指定协程运行在什么线程上. 可以是一个指定的线程, 线程池, 或者不限.

看一个例子:

fun main() = runblocking<unit> {
    launch {
        // context of the parent, main runblocking coroutine
        println("main runblocking      : i'm working in thread ${thread.currentthread().name}")
    }
    launch(dispatchers.unconfined) {
        // not confined -- will work with main thread
        println("unconfined            : i'm working in thread ${thread.currentthread().name}")
    }
    launch(dispatchers.default) {
        // will get dispatched to defaultdispatcher
        println("default               : i'm working in thread ${thread.currentthread().name}")
    }
    launch(newsinglethreadcontext("myownthread")) {
        // will get its own new thread
        println("newsinglethreadcontext: i'm working in thread ${thread.currentthread().name}")
    }
}

运行后打印出:

unconfined            : i'm working in thread main
default               : i'm working in thread defaultdispatcher-worker-1
newsinglethreadcontext: i'm working in thread myownthread
main runblocking      : i'm working in thread main

api提供了几种选项:

dispatchers.default代表使用jvm上的共享线程池, 其大小由cpu核数决定, 不过即便是单核也有两个线程. 通常用来做cpu密集型工作, 比如排序或复杂计算等.
dispatchers.main指定主线程, 用来做ui更新相关的事情. (需要添加依赖, 比如kotlinx-coroutines-android.) 如果我们在主线程上启动一个新的协程时, 主线程忙碌, 这个协程也会被挂起, 仅当线程有空时会被恢复执行.
dispatchers.io: 采用on-demand创建的线程池, 用于网络或者是读写文件的工作.
dispatchers.unconfined: 不指定特定线程, 这是一个特殊的dispatcher.

如果不明确指定dispatcher, 协程将会继承它被启动的那个scope的context(其中包含了dispatcher).

在实践中, 更推荐使用外部scope的dispatcher, 由调用方决定上下文. 这样也方便测试.

newsinglethreadcontext创建了一个线程来跑协程, 一个专注的线程算是一种昂贵的资源, 在实际的应用中需要被释放或者存储复用.

切换线程还可以用withcontext, 可以在指定的协程context下运行代码, 挂起直到它结束, 返回结果.
另一种方式是新启一个协程, 然后用join明确地挂起等待.

在android这种ui应用中, 比较常见的做法是, 顶部协程用coroutinedispatchers.main, 当需要在别的线程上做一些事情的时候, 再明确指定一个不同的dispatcher.

scope是什么?

当launch, async或runblocking开启新协程的时候, 它们自动创建相应的scope. 所有的这些方法都有一个带receiver的lambda参数, 默认的receiver类型是coroutinescope.

ide会提示this: coroutinescope:

launch { /* this: coroutinescope */
}

当我们在runblocking, launch, 或async的大括号里面再创建一个新的协程的时候, 自动就在这个scope里创建:

fun main() = runblocking {
    /* this: coroutinescope */
    launch { /* ... */ }
    // the same as:
    this.launch { /* ... */ }
}

因为launch是一个扩展方法, 所以上面例子中默认的receiver是this.
这个例子中launch所启动的协程被称作外部协程(runblocking启动的协程)的child. 这种"parent-child"的关系通过scope传递: child在parent的scope中启动.

协程的父子关系:

当一个协程在另一个协程的scope中被启动时, 自动继承其context, 并且新协程的job会作为父协程job的child.

所以, 关于scope目前有两个关键知识点:

我们开启一个协程的时候, 总是在一个coroutinescope里.
scope用来管理不同协程之间的父子关系和结构.

协程的父子关系有以下两个特性:

父协程被取消时, 所有的子协程都被取消.
父协程永远会等待所有的子协程结束.

值得注意的是, 也可以不启动协程就创建一个新的scope. 创建scope可以用工厂方法: mainscope()或coroutinescope().

coroutinescope()方法也可以创建scope. 当我们需要以结构化的方式在suspend函数内部启动新的协程, 我们创建的新的scope, 自动成为suspend函数被调用的外部scope的child.

上面的父子关系, 可以进一步抽象到, 没有parent协程, 由scope来管理其中所有的子协程.

scope在实际应用中解决什么问题呢? 如果我们的应用中, 有一个对象是有自己的生命周期的, 但是这个对象又不是协程, 比如android应用中的activity, 其中启动了一些协程来做异步操作, 更新数据等, 当activity被销毁的时候需要取消所有的协程, 来避免内存泄漏. 我们就可以利用coroutinescope来做这件事: 创建一个coroutinescope对象和activity的生命周期绑定, 或者让activity实现coroutinescope接口.

所以, scope的主要作用就是记录所有的协程, 并且可以取消它们.

a coroutinescope keeps track of all your coroutines, and it can cancel all of the coroutines started in it.

structured concurrency

这种利用scope将协程结构化组织起来的机制, 被称为"structured concurrency".
好处是:

scope自动负责子协程, 子协程的生命和scope绑定.
scope可以自动取消所有的子协程.
scope自动等待所有的子协程结束. 如果scope和一个parent协程绑定, 父协程会等待这个scope中所有的子协程完成.

通过这种结构化的并发模式: 我们可以在创建top级别的协程时, 指定主要的context一次, 所有嵌套的协程会自动继承这个context, 只在有需要的时候进行修改即可.

globalscope: daemon

globalscope启动的协程都是独立的, 它们的生命只受到application的限制. 即globalscope启动的协程没有parent, 和它被启动时所在的外部的scope没有关系.

launch(dispatchers.default) { ... }和globalscope.launch { ... }用的dispatcher是一样的.

globalscope启动的协程并不会保持进程活跃. 它们就像daemon threads(守护线程)一样, 如果jvm发现没有其他一般的线程, 就会关闭.

参考

第三方博客:

上一篇：微软更新Windows 10 v1903/v1903 CPU需求：十代酷睿入列

下一篇：凭借15亿下载量抖音成了全球第三流行的App