欢迎您访问程序员文章站本站旨在为大家提供分享程序员计算机编程知识!
您现在的位置是: 首页  >  数据库

利用ycsb测试cassandra性能

程序员文章站 2022-04-10 16:43:19
...

java 、maven、ycsb 的安装及配置见这篇博客: http://blog.csdn.net/hs794502825/article/details/17309845 本篇博客主要介绍 cassandra 的安装,以及利用 ycsb 对 cassandra 进行基本的测试 在 http://cassandra.apache.org/download/ 上面下载了最新版本的

java 、maven、ycsb 的安装及配置见这篇博客:
http://blog.csdn.net/hs794502825/article/details/17309845

本篇博客主要介绍 cassandra 的安装,以及利用 ycsb 对 cassandra 进行基本的测试
在 http://cassandra.apache.org/download/ 上面下载了最新版本的 apache-cassandra-2.0.3-bin.tar.gz 存放在 /home/hs/program目录下
cd /home/hs/program
tar -zxvf apache-cassandra-2.0.3-bin.tar.gz
然后为 cassandra 设置环境变量
sudo gedit /etc/profile
在文件的最后加入:
#set cassandra environment
export CASSANDRA_HOME=/home/hs/program/apache-cassandra-2.0.3
export PATH=$PATH:$CASSANDRA_HOME/bin:$CASSANDRA_HOME/lib
之后,我就直接以普通用户(hs)执行 cassandra -f
显示了很多错误,大多都是与此相关:
无法生成目录:/var/lib/cassandra/......以及/var/log/cassandra/......

var目录的权限如下:
drwxr-xr-x 13 root root 4096 2013-12-14 21:50
只有所有者root对其有写的权限

cassandra 需要生成数据和日志信息的目录,默认情况下就是
/var/lib/cassandra/ 和 /var/log/cassandra/

然后我就在 hs 用户下执行如下命令:
sudo mkdir /var/lib/cassandra
sudo mkdir /var/log/cassandra

chown -R hs:hs /var/lib/cassandra
chown -R hs:hs /var/log/cassandra

如此一来,hs 就具有写 /var/lib/cassandra/ 和 /var/log/cassandra/ 的权限

在终端1中运行cassandra:
cassandra -f
如果有 Listening for thrift clients... 则说明成功启动 cassandra

在终端2中运行cassandra-cli:
cassandra-cli
显示:
Connected to: "Test Cluster" on 127.0.0.1/9160
Welcome to Cassandra CLI version 2.0.3

The CLI is deprecated and will be removed in Cassandra 3.0.  Consider migrating to cqlsh.
CQL is fully backwards compatible with Thrift data; see http://www.datastax.com/dev/blog/thrift-to-cql3

根据提示我就终止了 cassandra-cli,转而去使用 cqlsh
hs@hs-virtual-machine:~$ cqlsh
Connected to Test Cluster at localhost:9160.
[cqlsh 4.1.0 | Cassandra 2.0.3 | CQL spec 3.1.1 | Thrift protocol 19.38.0]

接下来在创建 keyspace 的时候,出现了如下错误:
cqlsh> create keyspace with strategy_class = 'SimpleStrategy' and strategy_options:replication_factor = '1';
Bad Request: line 1:75 mismatched input ':' expecting '='
上面那一行创建 keyspace 的命令我是从 cqlsh 的官网上 copy 过来的,所以我不知道怎么解决,第一次接触 cqlsh

后来还是去使用 cassandra-cli(注意所有的命令都需要以;结束)
create keyspace usertable;
use usertable;
create column family data;

在终端3中运行ycsb:
./bin/ycsb load cassandra-10 -P workloads/workloada -p hosts=localhost -p columnfamily=data > ./my-results/load-cassandra-a

得到如下错误:
Loading workload...
Starting test.
InvalidRequestException(why:unconfigured columnfamily data)
	at org.apache.cassandra.thrift.Cassandra$batch_mutate_result.read(Cassandra.java:20833)
	at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:78)
	at org.apache.cassandra.thrift.Cassandra$Client.recv_batch_mutate(Cassandra.java:964)
	at org.apache.cassandra.thrift.Cassandra$Client.batch_mutate(Cassandra.java:950)
	at com.yahoo.ycsb.db.CassandraClient10.insert(CassandraClient10.java:477)
	at com.yahoo.ycsb.DBWrapper.insert(DBWrapper.java:148)
	at com.yahoo.ycsb.workloads.CoreWorkload.doInsert(CoreWorkload.java:461)
	at com.yahoo.ycsb.ClientThread.run(Client.java:269)

目测是 column family 的创建有问题
所以我在终端2中删除掉该 column family,然后重建
drop column family data;
create column family data with column_type = 'Standard' and comparator = 'UTF8Type';

返回终端3重新运行ycsb:
./bin/ycsb load cassandra-10 -P workloads/workloada -p hosts=localhost -p columnfamily=data > ./my-results/load-cassandra-a

得到如下结果:
YCSB Client 0.1
Command line: -db com.yahoo.ycsb.db.CassandraClient10 -P workloads/workloada -p hosts=localhost -p columnfamily=data -load
[OVERALL], RunTime(ms), 2287.0
[OVERALL], Throughput(ops/sec), 437.25404459991256
[INSERT], Operations, 1000
[INSERT], AverageLatency(us), 1670.687
[INSERT], MinLatency(us), 476
[INSERT], MaxLatency(us), 280228
[INSERT], 95thPercentileLatency(ms), 3
[INSERT], 99thPercentileLatency(ms), 12
[INSERT], Return=0, 1000
......

执行:
./bin/ycsb run cassandra-10 -P workloads/workloada -p hosts=localhost -p columnfamily=data > ./my-results/run-cassandra-a

得到如下结果:
YCSB Client 0.1
Command line: -db com.yahoo.ycsb.db.CassandraClient10 -P workloads/workloada -p hosts=localhost -p columnfamily=data -t
[OVERALL], RunTime(ms), 5574.0
[OVERALL], Throughput(ops/sec), 179.4043774668102
[UPDATE], Operations, 475
[UPDATE], AverageLatency(us), 2095.0547368421053
[UPDATE], MinLatency(us), 327
[UPDATE], MaxLatency(us), 143093
[UPDATE], 95thPercentileLatency(ms), 9
[UPDATE], 99thPercentileLatency(ms), 33
[UPDATE], Return=0, 475
......
[READ], Operations, 525
[READ], AverageLatency(us), 5054.5085714285715
[READ], MinLatency(us), 492
[READ], MaxLatency(us), 674167
[READ], 95thPercentileLatency(ms), 11
[READ], 99thPercentileLatency(ms), 85
[READ], Return=0, 525

下一阶段需要熟悉对 cassandra 的操作,以及使用 cqlsh