dowhy学习1

程序员文章站 2024-02-11 13:32:52

...

特点：

与前人相比，DoWhy对因果推理模型的实现做出了三个关键贡献。

提出了一种将给定问题建模为因果图的原则方法，使所有假设都清晰可见。
为许多流行的因果推理方法提供统一的接口，结合了图形模型和潜在结果的两个主要框架。
如果可能的话，自动测试假设的有效性，并评估对违规的估计的稳健性能。

安装：

pycharm里一键添加

或者使用pip（参考官网）

原理与使用：

dowhy支持gml（首选）和dot两种因果图格式，加载数据后，主要可以实现以下四种操作：建模、识别、估计和反驳。

# Create a causal model from the data and given graph.
model = CausalModel(
    data=data["df"],
    treatment=data["treatment_name"],
    outcome=data["outcome_name"],
    graph=data["gml_graph"])

# Identify causal effect and return target estimands
identified_estimand = model.identify_effect()

# Estimate the target estimand using a statistical method.
estimate = model.estimate_effect(identified_estimand,
                                 method_name="backdoor.propensity_score_matching")

# Refute the obtained estimate using multiple robustness checks.
refute_results = model.refute_estimate(identified_estimand, estimate,
                                       method_name="random_common_cause")

四步因果推理

1，对因果问题建模。可以提供一个局部图，以表示有关某些变量的先验知识。DoWhy会自动将其余变量视为潜在的混杂因素。

2，确定目标估计。基于因果图，DoWhy根据图形模型找到识别所需因果效应的所有可能方法。

3，根据确定的估计值来计算因果效应。DoWhy支持基于后门准则和工具变量的方法。

4，反驳所得的估计。可以使用多种反驳方法来验证因果推断是使用DoWhy的主要好处。

一个例子

估计治疗对疾病痊愈的因果效应

1，导入数据集

import dowhy.datasets
data = dowhy.datasets.linear_dataset(beta=10,
        num_common_causes=5,
        num_instruments = 2,
        num_samples=10000,
        treatment_is_binary=True)
df = data["df"]
print(df.head())
print(data["dot_graph"])
print("\n")
print(data["gml_graph"])

2，以GML图格式输入因果图

# With graph
model=CausalModel(
        data = df,
        treatment=data["treatment_name"],
        outcome=data["outcome_name"],
        graph=data["gml_graph"]
        )

3，可以不用数据，仅通过因果图实现识别

identified_estimand = model.identify_effect()
print(identified_estimand)

dowhy学习1

4，预测

causal_estimate = model.estimate_effect(identified_estimand,
        method_name="backdoor.propensity_score_stratification")
print(causal_estimate)
print("Causal Estimate is " + str(causal_estimate.value))

dowhy学习1

特点：

安装：

原理与使用：

一个例子

jzoj 5662. 【GDOI2018Day1模拟4.17】尺树寸泓

dowhy学习1

Python3机器学习实战ID3决策树实现+matplotlib绘制树形图

【因果学习】VC RCNN（CVPR 2020）代码

机器学习之决策树（Decision Tree）模型

机器学习中的数学(3)——协方差矩阵和散布（散度）矩阵

Maven学习笔记（三）——*仓库与依赖管理篇

机器学习实战—决策树

Opencv2.4学习：：（二维）最大熵阈值分割----信息量最大化分割

学习笔记6（opencv+python阈值分割（最大熵））