R语言数据框中的负索引介绍

程序员文章站 2022-06-04 07:56:18

以r语言自带的mtcars数据框为例：这是原始的mtcars数据：这里只列出了前面几行数据。然后负索引mtcars[,-2:-3],得到的结果删除了第二列和第三列数据所以r语言数据框中的负索引是指删除...

以r语言自带的mtcars数据框为例：

这是原始的mtcars数据：

R语言数据框中的负索引介绍

这里只列出了前面几行数据。

然后负索引mtcars[,-2:-3],得到的结果

R语言数据框中的负索引介绍

删除了第二列和第三列数据

所以r语言数据框中的负索引是指删除数据框中对应的列（或者行）

ps：这和python里面的规则好像不太一样，python里的负索引好像是指倒数第几列（或者第几行），这里这两个软件区别还挺大的~~写个笔记提醒一下自己~

补充：r语言中的负整数索引

看代码吧~

> x<-matrix(c(1,2,3,4,5,6,7,8,9),nrow = 3,ncol = 3,byrow = true)
> x
     [,1] [,2] [,3]
[1,]    1    2    3
[2,]    4    5    6
[3,]    7    8    9
> x[-1,]
     [,1] [,2] [,3]
[1,]    4    5    6
[2,]    7    8    9

这在r中称为负整数索引向量，这种索引向量指定被排除的元素而不是包括进来，因此x[-1,]表示取出矩阵x的除了第一行元素外的其他元素。

补充：r语言-基本语法、数据类型及索引

1. 基本语法

print() 、cat()：打印输出

#：单行注释

if(false){code block}：多行注释

2. 数据类型

class()：查看数据类型

2.1 基本数据类型

基本数据类型	示例
逻辑值(logical)	真：true、t，假：false、f
数字(numeric)	123、5
整型(integer)	2l、34l
复数(complex)	3+2i
字符(character)	'good'

2.2 向量vector

c()函数创建向量。

注意：必须保证元素类型相同，否则会默认进行类型转换。

> x <- c(1, 2)
> class(x)
[1] "numeric"
> x <- c('s')
> class(x)
[1] "character"
> x <- c(1, 2, 's')
> class(x)
[1] "character"

2.3 列表list

列表可以包含许多不同类型的元素，如向量、函数、嵌套列表。

注意：[]与[[]]的区别。[]取出来的仍是一个列表，[[]]取出来的是本身的数据类型。

> list1 <- list(c(2,3), 21, 's', sin)  # 分别包含列表、数字、字符、函数
> class(list1)
[1] "list"
> list1[1] # 取出来的仍是一个列表
[[1]]
[1] 2 3
> list1[[1]] # 取出来的是子列表中的元素
[1] 2 3
> class(list1[1])
[1] "list"
> class(list1[[1]])
[1] "numeric"
> list1[[2]]
[1] 21
> list1[2] + 2
error in list1[2] + 2 : non-numeric argument to binary operator
> list1[[2]] + 2
[1] 23
> list1[[4]]
function (x)  .primitive("sin")
> class(list1[[4]])
[1] "function"

2.4 矩阵matrix

矩阵是二维数据集，它可以使用矩阵函数的向量输入创建。

byrow参数决定元素存放的顺序。

> m <- matrix( c('a','a','b','c','b','a'), nrow = 2, ncol = 3, byrow = true)
> m
     [,1] [,2] [,3]
[1,] "a"  "a"  "b" 
[2,] "c"  "b"  "a" 
> m[,1] # 取出第一列数据
[1] "a" "c"
> m[1,] # 取出第一行数据
[1] "a" "a" "b"
> m[2,1] # 取出单个元素
[1] "c"

2.5 数组array

利用数组可以创建任意维度的数据。

> array1 <- array(c('green','yellow'), dim=c(3,3,2))
> array1
, , 1
     [,1]     [,2]     [,3]    
[1,] "green"  "yellow" "green" 
[2,] "yellow" "green"  "yellow"
[3,] "green"  "yellow" "green" 
, , 2
     [,1]     [,2]     [,3]    
[1,] "yellow" "green"  "yellow"
[2,] "green"  "yellow" "green" 
[3,] "yellow" "green"  "yellow"

2.6 因子factor

因子是使用向量创建的对象。它将向量与向量中元素的不同值一起存储为标签。标签是字符类型。它们在统计建模中非常有用。

使用factor()函数创建因子。nlevels函数给出级别计数。

> apple_colors <- c('green','green','yellow','red','red','red','green')
> factor_apple <- factor(apple_colors)
> factor_apple
[1] green  green  yellow red    red    red    green 
levels: green red yellow
> nlevels(factor_apple)
[1] 3

2.7 数据框data frame

表格数据对象。每列可以包含不同的数据类型。第一列可以是数字，而第二列可以是字符，第三列可以是逻辑的。它是等长度的向量的列表。

使用data.frame()函数创建数据框。

# 创建数据框，表格对象
> bmi <- data.frame(
     gender = c("male", "male","female"), 
     height = c(152, 171.5, 165), 
     weight = c(81,93, 78),
     age = c(42,38,26)
 )
 
> bmi
  gender height weight age
1   male  152.0     81  42
2   male  171.5     93  38
3 female  165.0     78  26
# 获取第二列
> bmi[2]
  height
1  152.0
2  171.5
3  165.0
# 获取第一行
> bmi[1,]
  gender height weight age
1   male    152     81  42
# 获取第一列数据，类型为dataframe
> bmi[1]
  gender
1   male
2   male
3 female
> class(bmi[1])
[1] "data.frame"
# 获取第一列，并将其转换为factor类型
> bmi[,1]
[1] male   male   female
levels: female male
# 获取第一个元素，转换为factor类型
> bmi[1,1]
[1] male
levels: female male
# 获取第二列，不改变数据类型
> bmi[2]
  height
1  152.0
2  171.5
3  165.0
# 获取第二列，改变数据类型
> bmi[,2]
[1] 152.0 171.5 165.0
# 根据列的名称获取factor类型数据
data_frame$col_name

以上为个人经验，希望能给大家一个参考，也希望大家多多支持。如有错误或未考虑完全的地方，望不吝赐教。

R语言数据框中的负索引介绍

以r语言自带的mtcars数据框为例：

看代码吧~

1. 基本语法

2. 数据类型

2.1 基本数据类型

2.2 向量vector

2.3 列表list

2.4 矩阵matrix

2.5 数组array

2.6 因子factor

2.7 数据框data frame

R语言中的多维数组、矩阵、列表、数据框

R语言实现将数据框中的字符类型数字转换为数值

SQLite中R树索引的存储结构介绍

利用python编写R语言案例研究方法中的定位算法（1）——原始数据处理

R语言删除/添加数据框中的某一行/列

R语言中矩阵matrix和数据框data.frame的使用详解

数据库中关于SQL语言的简单介绍

C语言和C++中的数据类型介绍

R语言列表和数据框的具体使用