Linux学习教程之redis哨兵集群详解

程序员文章站 2022-05-21 11:38:00

前言 sentinel(哨兵)是用于监控redis集群中master状态的工具，其已经被集成在redis2.4+的版本中，下面话不多说了，来一起看看详细的介绍吧 1、s...

前言

sentinel(哨兵)是用于监控redis集群中master状态的工具，其已经被集成在redis2.4+的版本中，下面话不多说了，来一起看看详细的介绍吧

1、sentinel 哨兵

sentinel（哨兵）是redis 的高可用性解决方案：由一个或多个sentinel 实例组成的sentinel 系统可以监视任意多个主服务器，以及这些主服务器属下的所有从服务器，并在被监视的主服务器进入下线状态时，自动将下线主服务器属下的某个从服务器升级为新的主服务器。

例如：

Linux学习教程之redis哨兵集群详解

在server1 掉线后：

Linux学习教程之redis哨兵集群详解

升级server2 为新的主服务器：

　　 Linux学习教程之redis哨兵集群详解

2、redis 主从分离

在讲解sentinel 哨兵集群之前，我们先来搭建一个简单的主从分离（读写分离）。

首先，我们默认大家都已经安装了redis，然后我们将redis.conf 拷贝多份，并且创建多个目录，用于区分多个redis 服务：

　　　 Linux学习教程之redis哨兵集群详解

这里面，每个目录中都有自己的redis.conf 配置文件，接下来，我们先设置主服务器的配置文件。

一、配置master

1、修改端口

# accept connections on the specified port, default is 6379 (iana #815344).
# if port 0 is specified redis will not listen on a tcp socket.
port 6380

redis 的默认端口是6379，这里我们把主服务器的端口设置为6380

2、修改pidfile

# if a pid file is specified, redis writes it where specified at startup
# and removes it at exit.
#
# when the server runs non daemonized, no pid file is created if none is
# specified in the configuration. when the server is daemonized, the pid file
# is used even if not specified, defaulting to "/var/run/redis.pid".
#
# creating a pid file is best effort: if redis is not able to create it
# nothing bad happens, the server will start and run normally.
pidfile /var/run/redis_6380.pid

pidfile 是我们启动redis 的时候，linux 为我们分配的一个pid 进程号，如果这里不作修改，会影响后面redis服务的启动

3、启动 redis

Linux学习教程之redis哨兵集群详解

启动redis，我们可以看到，redis已经占领了6380 端口

进入客户端

redis-cli -p 6380
127.0.0.1:6380> info
...
# replication
role:master
connected_slaves:0
master_repl_offset:0
repl_backlog_active:0
repl_backlog_size:1048576
repl_backlog_first_byte_offset:0
repl_backlog_histlen:0
...

我们可以看到，redis 现在的角色是一个master 启动的服务。

二、配置slave

和上面配置 master一样，我们需要修改端口号和pid 文件，在修改完之后，我们有两种方法配置从服务

1、在配置文件中配置从服务

################################# replication #################################

# master-slave replication. use slaveof to make a redis instance a copy of
# another redis server. a few things to understand asap about redis replication.
#
# 1) redis replication is asynchronous, but you can configure a master to
# stop accepting writes if it appears to be not connected with at least
# a given number of slaves.
# 2) redis slaves are able to perform a partial resynchronization with the
# master if the replication link is lost for a relatively small amount of
# time. you may want to configure the replication backlog size (see the next
# sections of this file) with a sensible value depending on your needs.
# 3) replication is automatic and does not need user intervention. after a
# network partition slaves automatically try to reconnect to masters
# and resynchronize with them.
#
# slaveof <masterip> <masterport>
slaveof 127.0.0.1 6380

我们可以在配置文件中直接修改 slaveof 属性，我们直接配置主服务器的ip 地址，和端口号，如果这里主服务器有配置密码

可以通过配置masterauth 来设置链接密码

# if the master is password protected (using the "requirepass" configuration
# directive below) it is possible to tell the slave to authenticate before
# starting the replication synchronization process, otherwise the master will
# refuse the slave request.
#
# masterauth <master-password>

启动redis 服务：

Linux学习教程之redis哨兵集群详解

我们可以看到，现在有两个现在在运行，我们进入6381的客户端，看一下他的状态，

# replication
role:slave
master_host:127.0.0.1
master_port:6380
master_link_status:up
master_last_io_seconds_ago:1
master_sync_in_progress:0
slave_repl_offset:71
slave_priority:100
slave_read_only:1
connected_slaves:0
master_repl_offset:0
repl_backlog_active:0
repl_backlog_size:1048576
repl_backlog_first_byte_offset:0
repl_backlog_histlen:0

我们可以看到，现在的redis 是一个从服务的角色，连接着6380的服务。

2、在服务启动后设置

我们修改6382端口的服务器配置文件之后，启动服务

Linux学习教程之redis哨兵集群详解

进入客户端，查看当前服务器的状态：

# replication
role:master
connected_slaves:0
master_repl_offset:0
repl_backlog_active:0
repl_backlog_size:1048576
repl_backlog_first_byte_offset:0
repl_backlog_histlen:0

我们可以看到，当前服务器的状态时作为一个主服务的角色在运行，我们接下来修改他的状态：

127.0.0.1:6382> slaveof 127.0.0.1 6380

//修改后状态
# replication
role:slave
master_host:127.0.0.1
master_port:6380
master_link_status:up
master_last_io_seconds_ago:1
master_sync_in_progress:0
slave_repl_offset:617
slave_priority:100
slave_read_only:1
connected_slaves:0
master_repl_offset:0
repl_backlog_active:0
repl_backlog_size:1048576
repl_backlog_first_byte_offset:0
repl_backlog_histlen:0

3、总结

我们先看一下目前master 的状态：

# replication
role:master
connected_slaves:2
slave0:ip=127.0.0.1,port=6381,state=online,offset=785,lag=0
slave1:ip=127.0.0.1,port=6382,state=online,offset=785,lag=0
master_repl_offset:785
repl_backlog_active:1
repl_backlog_size:1048576
repl_backlog_first_byte_offset:2
repl_backlog_histlen:784

我们可以可以看到，两个从服务已经在连着主服务器，上面两种配置的区别在于，当salve 断线重连之后，

如果我们是修改类配置文件，重连之后会自己链接上去master，并且同步master 上面的数据，

如果我们是手动连接上去的主服务器，重连之后，从服务器会读取自己本地的 rdb 回复数据，而不会去自动链接主服务

我们如果需要设置读写分离，只需要在主服务器中设置：

# note: read only slaves are not designed to be exposed to untrusted clients
# on the internet. it's just a protection layer against misuse of the instance.
# still a read only slave exports by default all the administrative commands
# such as config, debug, and so forth. to a limited extent you can improve
# security of read only slaves using 'rename-command' to shadow all the
# administrative / dangerous commands.
slave-read-only yes

3、sentinel 哨兵

1、配置端口

在sentinel.conf 配置文件中，我们可以找到port 属性，这里是用来设置sentinel 的端口，一般情况下，至少会需要三个哨兵对redis 进行监控，我们可以通过修改端口启动多个sentinel 服务。

# note: read only slaves are not designed to be exposed to untrusted clients
# on the internet. it's just a protection layer against misuse of the instance.
# still a read only slave exports by default all the administrative commands
# such as config, debug, and so forth. to a limited extent you can improve
# security of read only slaves using 'rename-command' to shadow all the
# administrative / dangerous commands.
slave-read-only yes

2、配置主服务器的ip 和端口

我们把监听的端口修改成6380，并且加上权值为2，这里的权值，是用来计算我们需要将哪一台服务器升级升主服务器

# sentinel monitor <master-name> <ip> <redis-port> <quorum>
#
# tells sentinel to monitor this master, and to consider it in o_down
# (objectively down) state only if at least <quorum> sentinels agree.
#
# note that whatever is the odown quorum, a sentinel will require to
# be elected by the majority of the known sentinels in order to
# start a failover, so no failover can be performed in minority.
#
# slaves are auto-discovered, so you don't need to specify slaves in
# any way. sentinel itself will rewrite this configuration file adding
# the slaves using additional configuration options.
# also note that the configuration file is rewritten when a
# slave is promoted to master.
#
# note: master name should not include special characters or spaces.
# the valid charset is a-z 0-9 and the three characters ".-_".
sentinel monitor mymaster 127.0.0.1 6380 2

3、启动sentinel

/sentinel$ redis-sentinel sentinel.conf

Linux学习教程之redis哨兵集群详解

sentinel 启动之后，就会监视到现在有一个主服务器，两个从服务器

当我们把其中一个从服务器器关闭之后，我们可以看到日志：

10894:x 30 dec 16:27:03.670 # +sdown slave 127.0.0.1:6381 127.0.0.1 6381 @ mymaster 127.0.0.1 6380

日志表示，6381这个从服务器已经从主服务器中脱离了出来，我们重新把6381 接回去。

10894:x 30 dec 16:28:43.288 * +reboot slave 127.0.0.1:6381 127.0.0.1 6381 @ mymaster 127.0.0.1 6380
10894:x 30 dec 16:28:43.365 # -sdown slave 127.0.0.1:6381 127.0.0.1 6381 @ mymaster 127.0.0.1 6380

4、关闭master

我们手动关闭master 之后，sentinel 在监听master 确实是断线了之后，将会开始计算权值，然后重新分配主服务器

Linux学习教程之redis哨兵集群详解

我们可以看到，6380主服务器断了之后，sentinel 帮我们选了6382作为新的主服务器

我们进到6382的客户端，查看他的状态：

# replication
role:master
connected_slaves:1
slave0:ip=127.0.0.1,port=6381,state=online,offset=13751,lag=0
master_repl_offset:13751
repl_backlog_active:1
repl_backlog_size:1048576
repl_backlog_first_byte_offset:2
repl_backlog_histlen:13750

我们可以看到 6382，重slave 荣升为master

127.0.0.1:6382> set name jaycekon
ok

原本的没有权限写，也得到了相应的权限

5、重连master

大家可能会好奇，如果master 重连之后，会不会抢回属于他的位置，答案是否定的，就比如你被一个小弟抢了你老大的位置，他肯给回你这个位置吗。因此当master 回来之后，他也只能当个小弟　　

Linux学习教程之redis哨兵集群详解

4、sentinel 总结

一、sentinel的作用：

a、master 状态监测

b、如果master 异常，则会进行master-slave 转换，将其中一个slave作为master，将之前的master作为slave

c、master-slave切换后，master_redis.conf、slave_redis.conf和sentinel.conf的内容都会发生改变，即master_redis.conf中会多一行slaveof的配置，sentinel.conf的监控目标会随之调换

二、sentinel的工作方式:

1)：每个sentinel以每秒钟一次的频率向它所知的master，slave以及其他 sentinel 实例发送一个 ping 命令

2)：如果一个实例（instance）距离最后一次有效回复 ping 命令的时间超过 down-after-milliseconds 选项所指定的值，则这个实例会被 sentinel 标记为主观下线。

3)：如果一个master被标记为主观下线，则正在监视这个master的所有 sentinel 要以每秒一次的频率确认master的确进入了主观下线状态。

4)：当有足够数量的 sentinel（大于等于配置文件指定的值）在指定的时间范围内确认master的确进入了主观下线状态，则master会被标记为客观下线

5)：在一般情况下，每个 sentinel 会以每 10 秒一次的频率向它已知的所有master，slave发送 info 命令

6)：当master被 sentinel 标记为客观下线时，sentinel 向下线的 master 的所有 slave 发送 info 命令的频率会从 10 秒一次改为每秒一次

7)：若没有足够数量的 sentinel 同意 master 已经下线， master 的客观下线状态就会被移除。

若 master 重新向 sentinel 的 ping 命令返回有效回复， master 的主观下线状态就会被移除。

总结

以上就是这篇文章的全部内容了，希望本文的内容对大家的学习或者工作具有一定的参考学习价值，如果有疑问大家可以留言交流，谢谢大家对的支持。

上一篇： Nginx+SSL搭建 HTTPS 网站

下一篇： Linux下如何查杀stopped进程详解