[Jupyter] Exercises

程序员文章站 2024-03-05 14:52:26


Part 1

For each of the four datasets...

Compute the mean and variance of both x and y
Compute the correlation coefficient between x and y
Compute the linear regression line: $y = \beta_0 + \beta_1 x + \epsilon$ (hint: use statsmodels and look at the Statsmodels notebook)

OUTPUT


Code

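import numpy as np
import statsmodels.formula.api as smf

# `anascombe` is assumed to be a DataFrame with columns 'dataset', 'x' and 'y',
# loaded earlier in the notebook.
print('The mean of x is : ', end="")
print(anascombe['x'].mean())
print('The mean of y is : ', end="")
print(anascombe['y'].mean())
print('The variance of x is : ', end="")
print(anascombe['x'].var())
print('The variance of y is : ', end="")
print(anascombe['y'].var())

# Pearson correlation: off-diagonal entry of the 2x2 correlation matrix
print("The correlation coefficient between x and y: ", end="")
print(np.corrcoef(np.array([anascombe['x'], anascombe['y']]))[0][1])

# Fit y ~ x on a random subset of roughly 70% of the rows; the rest is held out
n = len(anascombe)
is_train = np.random.rand(n) < 0.7
train = anascombe[is_train].reset_index(drop=True)
test = anascombe[~is_train].reset_index(drop=True)
lin_model = smf.ols('y ~ x', train).fit()
lin_model.summary()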
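
The exercise asks for these statistics per dataset, whereas the code above works on the pooled DataFrame. As a rough cross-check, here is a minimal standard-library sketch that uses the closed-form least-squares estimates in place of statsmodels. The embedded values are the published Anscombe quartet; `QUARTET`, `summarize` and `summaries` are hypothetical names introduced for illustration and do not appear in the original notebook.

```python
from statistics import mean, variance

# Anscombe's quartet (Anscombe, 1973), embedded so the sketch runs standalone.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(x, y):
    """Return means, sample variances, Pearson r and the OLS fit of y on x."""
    mx, my = mean(x), mean(y)
    vx, vy = variance(x), variance(y)  # sample variance (n - 1), like pandas .var()
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / (len(x) - 1)
    beta1 = cov / vx           # slope of the least-squares line
    beta0 = my - beta1 * mx    # intercept of the least-squares line
    r = cov / (vx * vy) ** 0.5
    return {"mean_x": mx, "mean_y": my, "var_x": vx, "var_y": vy,
            "r": r, "beta0": beta0, "beta1": beta1}


summaries = {name: summarize(x, y) for name, (x, y) in QUARTET.items()}
for name, s in summaries.items():
    print(name, {k: round(v, 3) for k, v in s.items()})
```

With the `anascombe` DataFrame, the equivalent would presumably be a loop over `anascombe.groupby('dataset')`, fitting `smf.ols('y ~ x', group)` for each group.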

Part 2

Using Seaborn, visualize all four datasets.

OUTPUT

Code

import matplotlib.pyplot as plt
import seaborn as sns

# One scatter panel per dataset, side by side
m = sns.FacetGrid(anascombe, col="dataset")
m.map(plt.scatter, "x", "y")
