贝叶斯信念网络,朴素贝叶斯与贝叶斯信念网络

贝叶斯分类是统计学分类方法。它们可以预测类成员关系的可能性,如给定样本属于一个特定类的概率。
贝叶斯定理是就是在给定的数据概率来表示未知的后验概率。比如已知某水果是红色的情况下,判断该水果有多大的概率是苹果,用数学符号表示就是clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络(后验概率),其中X表示“这个水果是红色的”,H表示“这个水果是苹果”。这个概率我们是不知道的,但是如果我们有大量的水果样本,就可以计算水果样本中的统计信息来逼近这个概率。
下面的三个概率都是可以通过样本统计简单计算得到的:“一个水果的颜色是红色”的概率P(X);“一个水果是苹果” 的概率 P(H)(先验概率);“如果一个水果是苹果那么它的颜色是红色”的概率clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
我们可以推理得到:
clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络=clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
=clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络/clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
=clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
=clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
抽象得到的公式clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络就是著名的贝叶斯定理。
贝叶斯分类的基本思路就是把clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络……的概率都算出来,当有未知分类的样本时,就认为这个样本是后验概率最大的那个分类的。
朴素贝叶斯分类
实际情况要比上面的那个例子复杂一些,因为一个事物的属性是多维的,一个水果的属性可能就要包括:颜色、形状、重量、体积……。
实际上的分类可能是要算这样一个概率:
clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
如果属性相互独立,那么
clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
朴素贝叶斯的朴素之处在于不管属性独不独立,都按独立来算,这样可以使运算大大简化。
归纳一下朴素贝叶斯的运算流程:
1. 每个数据样本用一个n维特征向量clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络表示,描述由属性clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络对样本的n个度量。
2. 假定有m个类clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络。给定一个未知的数据样本X(即,没有类标号),分类法将预测X属于具有最高后验概率(条件X下)的类。即,朴素贝叶斯分类将未知的样本分配给类Ci ,当且仅当:
clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
这样,我们最大化clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络。其clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络最大的类Ci称为最大后验假定。根据贝叶斯定理:
clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
3.由于P(X) 对于所有类为常数,只需要clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络最大即可。如果类的先验概率未知,则通常假定这些类是等概率的;即,clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络。并据此对只clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络最大化。否则,我们最大化clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络。注意,类的先验概率可以用clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络计算;其中,si是类C中的训练样本数,而s是训练样本总数。
4.给定具有许多属性的数据集,计算clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络的开销可能非常大。为降低计算clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络的开销,可以做类条件独立的朴素假定。给定样本的类标号,假定属性值条件地相互独立。即,在属性间,不存在依赖关系。这样,
clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
概率clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络,clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络,...,clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络可以由训练样本估值,其中,
(a) 如果Ak是分类属性,则clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络;其中sik 是在属性Ak 上具有值xk 的类Ci 的训练样本数,而si 是Ci中的训练样本数。
(b) 如果是连续值属性,则通常假定该属性服从高斯分布。因而,
clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
其中,给定类Ci的训练样本属性Ak的值,clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络是属性Ak的高斯密度函数,而clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络分别为平均值和标准差。
5.为对未知样本X分类,对每个类Ci,计算clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络。样本X被指派到类Ci,当且仅当:
clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
换言之,X被指派到其clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络最大的类Ci。
贝叶斯信念网络
朴素贝叶斯假定属性之间是独立的。贝叶斯信念网络说明联合概率分布,它提供一种因果关系的图形,可以在其上进行学习。
信念网络由两部分定义。第一部分是有向无环图,其每个结点代表一个随机变量,而每条弧代表一个概率依赖。如果一条弧由结点Y到Z,则Y是Z的双亲或直接前驱,而Z是Y的后继。第二部分是每个属性一个条件概率表(CPT)。
下面是一个LungCancer的CPT。
clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
在贝叶斯信念网络中对应于属性或变量clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络的任意元组clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络的联合概率由下式计算:
clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
如上图,对于FamilyHistory,Smoker,LungCancer这三个属性,用朴素贝叶斯计算,得到的联合概率是clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
但是如果用贝叶斯信念网络计算得到的联合概率将会是:
clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络,会更为准确。
贝叶斯信念网络的问题
1、如果贝叶斯信念网络的网络结构和所有数值都是给定的,那么可以直接进行计算。但是,数据是隐藏的,比如上图中的FamilyHistory/Somker到LungCancer的条件概率是未知的,只是知道存在这样的依存关系,这时就需要进行条件概率的估算。梯度训练算法和EM算法常被用于处理此问题。
2、贝叶斯网络的数据结构可能是未知的,这时就需要根据已知数据启发式学习贝叶斯网络结构。K2算法可用于解决此问题。
梯度训练算法
梯度训练是用于解决信念网络中隐藏数据问题的,就是已知上图(a),但是不知道上图(b)。
设D是d个训练样本clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络的集合,clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络是具有双亲clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络= clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络的变量clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络 = clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络的CPT项。例如,如果clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络是上图(b)左上角的CPT项,则clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络是LungCancer;clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络是其值“yes”; clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络列出clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络的双亲结点{FamilyHistory, Smoker};而clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络列出双亲结点的值{“yes”, “yes”}。clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络可以看作权,类似于神经网络中隐藏单元的权。权的集合记作clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
梯度训练算法就是求出最为满足clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络训练版本集的权的集合clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络,用数学公式表示就是clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络最大(clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络就表示clip_image104clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络)。
具体的算法:
1、就是对每一个clip_image087[3]clip_image104clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络取偏导数,clip_image107clip_image087[3]clip_image104clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
2、更新权值:沿梯度方向前进一小步。clip_image109clip_image107clip_image087[3]clip_image104clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络,其中l是学习率,是一个小常数。
3、由于权值clip_image087[4]clip_image109clip_image107clip_image087[3]clip_image104clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络是概率值,它们必须在0.0和1.0之间,并且对于所有的i,k,clip_image112clip_image087[4]clip_image109clip_image107clip_image087[3]clip_image104clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络必须等于1。在权值被式更新后,可以对它们重新规格化来保证这一条件。clip_image114clip_image112clip_image087[4]clip_image109clip_image107clip_image087[3]clip_image104clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
关于clip_image087[5]clip_image114clip_image112clip_image087[4]clip_image109clip_image107clip_image087[3]clip_image104clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络的偏导数推导如下:
clip_image116clip_image087[5]clip_image114clip_image112clip_image087[4]clip_image109clip_image107clip_image087[3]clip_image104clip_image102clip_image100clip_image097[1]clip_image085[1]clip_image097clip_image087[2]clip_image091[1]clip_image093[2]clip_image089[1]clip_image095[1]clip_image093[1]clip_image087[1]clip_image095clip_image093clip_image091clip_image089clip_image087clip_image085clip_image083clip_image081clip_image079clip_image077clip_image075clip_image073clip_image043[3]clip_image071clip_image043[2]clip_image068clip_image066clip_image064clip_image062clip_image060clip_image058clip_image056clip_image054clip_image051[1]clip_image051clip_image049clip_image043[1]clip_image038[2]clip_image045clip_image043clip_image041clip_image038[1]clip_image038clip_image036clip_image034clip_image032clip_image030clip_image028clip_image026clip_image024clip_image022clip_image020clip_image018clip_image016clip_image014clip_image012clip_image010clip_image008clip_image006clip_image002[1]clip_image004clip_image002朴素贝叶斯与贝叶斯信念网络贝叶斯信念网络
Tags:  朴素贝叶斯分类 朴素贝叶斯算法 朴素贝叶斯分类器 朴素贝叶斯 贝叶斯信念网络

延伸阅读

最新评论

发表评论