libsvm
Support Vector Machines (SVM)
SVM formulations
$C$-Support Vector Classification
Training vectors: $x_i \in R^n$, $i = 1, \dots, l$
Two classes
Indicator vector: $y \in R^l$ with $y_i \in \{1, -1\}$
$C$-SVC solves the following primal optimization problem:
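$$
\begin{aligned}
\min_{w, b, \xi} \quad & \frac{1}{2} w^T w + C \sum_{i=1}^{l} \xi_i \\
\text{subject to} \quad & y_i (w^T \phi(x_i) + b) \ge 1 - \xi_i, \\
& \xi_i \ge 0, \quad i = 1, \dots, l
\end{aligned}
$$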
Here $\phi(x_i)$ maps $x_i$ into a higher-dimensional space, and $C > 0$ is the regularization parameter.
Because the vector variable $w$ may be very high-dimensional, we usually solve the following dual problem instead:
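$$
\begin{aligned}
\min_{\alpha} \quad & \frac{1}{2} \alpha^T Q \alpha - e^T \alpha \\
\text{subject to} \quad & y^T \alpha = 0, \\
& 0 \le \alpha_i \le C, \quad i = 1, \dots, l
\end{aligned}
\tag{2}
$$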
$e = [1, \dots, 1]^T$: the vector of all ones
$Q$: an $l \times l$ positive semidefinite matrix
$Q_{ij} \equiv y_i y_j K(x_i, x_j)$
$K(x_i, x_j) \equiv \phi(x_i)^T \phi(x_j)$: the kernel function
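To make the definition of $Q$ concrete, here is a minimal pure-Python sketch that builds $Q_{ij} = y_i y_j K(x_i, x_j)$ for a tiny data set. The RBF kernel, the value of `gamma`, the toy data, and the helper names `rbf_kernel` and `build_q` are all assumptions chosen for illustration; they are not part of LIBSVM's API.

```python
import math


def rbf_kernel(u, v, gamma=0.5):
    """K(u, v) = exp(-gamma * ||u - v||^2); gamma=0.5 is an arbitrary example value."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-gamma * squared_distance)


def build_q(xs, ys, kernel=rbf_kernel):
    """Return Q as a list of rows, with Q[i][j] = y_i * y_j * K(x_i, x_j)."""
    l = len(xs)
    return [[ys[i] * ys[j] * kernel(xs[i], xs[j]) for j in range(l)]
            for i in range(l)]


# Hypothetical toy data: three points in R^2 with labels in {1, -1}.
xs = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
ys = [1, -1, 1]
Q = build_q(xs, ys)
```

Entries for pairs with opposite labels are negative, entries for pairs with the same label are positive, and the matrix is symmetric.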
After problem (2) is solved, the primal-dual relationship shows that the optimal $w$ satisfies:
$$
w = \sum_{i=1}^{l} y_i \alpha_i \phi(x_i)
$$
The decision function is
$$
\operatorname{sgn}\left(w^T \phi(x) + b\right) = \operatorname{sgn}\left(\sum_{i=1}^{l} y_i \alpha_i K(x_i, x) + b\right)
$$
For prediction, the following are stored in the model:
$y_i \alpha_i, \forall i$
$b$
Label names
Other parameters, such as kernel parameters
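The stored quantities are enough to evaluate the $C$-SVC decision function $\operatorname{sgn}\left(\sum_i y_i \alpha_i K(x_i, x) + b\right)$. The sketch below shows this with a linear kernel. It assumes the support vectors are kept alongside the coefficients $y_i \alpha_i$ (they are needed to evaluate the kernel), and the function names, the toy model, and the convention that a value of exactly zero maps to the first label are hypothetical choices, not LIBSVM's own code.

```python
def linear_kernel(u, v):
    """K(u, v) = u^T v."""
    return sum(a * b for a, b in zip(u, v))


def decision_value(x, support_vectors, coefs, b, kernel=linear_kernel):
    """Compute sum_i coefs[i] * K(x_i, x) + b, where coefs[i] = y_i * alpha_i."""
    return sum(c * kernel(sv, x) for sv, c in zip(support_vectors, coefs)) + b


def predict(x, support_vectors, coefs, b, labels=(1, -1), kernel=linear_kernel):
    """Map the sign of the decision value to a label name (zero goes to labels[0])."""
    value = decision_value(x, support_vectors, coefs, b, kernel)
    return labels[0] if value >= 0 else labels[1]


# Hypothetical two-point model: alpha = (0.5, 0.5), y = (1, -1), so y^T alpha = 0.
support_vectors = [(1.0, 0.0), (-1.0, 0.0)]
coefs = [0.5, -0.5]  # y_i * alpha_i
b = 0.0

value = decision_value((2.0, 3.0), support_vectors, coefs, b)
label = predict((-0.5, 1.0), support_vectors, coefs, b)
```

For this toy model $w = \sum_i y_i \alpha_i x_i = (1, 0)$, so the decision value is simply the first coordinate of $x$.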
$\nu$-Support Vector Classification
A new parameter $\nu \in (0, 1]$ is introduced. The primal optimization problem is:
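$$
\begin{aligned}
\min_{w, b, \xi, \rho} \quad & \frac{1}{2} w^T w - \nu \rho + \frac{1}{l} \sum_{i=1}^{l} \xi_i \\
\text{subject to} \quad & y_i (w^T \phi(x_i) + b) \ge \rho - \xi_i, \\
& \xi_i \ge 0, \quad i = 1, \dots, l, \quad \rho \ge 0
\end{aligned}
\tag{4}
$$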
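The dual problem is
$$
\begin{aligned}
\min_{\alpha} \quad & \frac{1}{2} \alpha^T Q \alpha \\
\text{subject to} \quad & 0 \le \alpha_i \le 1/l, \quad i = 1, \dots, l, \\
& e^T \alpha \ge \nu, \quad y^T \alpha = 0
\end{aligned}
\tag{5}
$$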
The problem is feasible if and only if
$$
\nu \le \frac{2 \min\left(\#\{i : y_i = +1\}, \#\{i : y_i = -1\}\right)}{l} \le 1
$$
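The feasibility condition for $\nu$-SVC, $\nu \le 2 \min(\#\{y_i = +1\}, \#\{y_i = -1\}) / l$, depends only on the class counts, so it can be checked before training. A minimal sketch follows; the function names and the label list are hypothetical.

```python
def max_feasible_nu(ys):
    """Largest nu for which the nu-SVC dual is feasible: 2 * min(#pos, #neg) / l."""
    positives = sum(1 for y in ys if y == 1)
    negatives = sum(1 for y in ys if y == -1)
    return 2 * min(positives, negatives) / len(ys)


def nu_is_feasible(nu, ys):
    """Check 0 < nu <= 2 * min(#pos, #neg) / l."""
    return 0 < nu <= max_feasible_nu(ys)


# Hypothetical unbalanced data set: three positive examples, one negative.
ys = [1, 1, 1, -1]
bound = max_feasible_nu(ys)
```

With three positive examples and one negative example the bound is 0.5, so a request such as $\nu = 0.6$ cannot be satisfied on this data.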
The decision function is
$$
\operatorname{sgn}\left(\sum_{i=1}^{l} y_i \alpha_i K(x_i, x) + b\right)
$$
The constraint $e^T \alpha \ge \nu$ can be replaced by $e^T \alpha = \nu$.
LIBSVM solves a scaled version of the problem, because numerically the bound $\alpha_i \le 1/l$ may be too small:
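$$
\begin{aligned}
\min_{\bar{\alpha}} \quad & \frac{1}{2} \bar{\alpha}^T Q \bar{\alpha} \\
\text{subject to} \quad & 0 \le \bar{\alpha}_i \le 1, \quad i = 1, \dots, l, \\
& e^T \bar{\alpha} = \nu l, \quad y^T \bar{\alpha} = 0
\end{aligned}
$$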
If $\alpha$ is optimal for the dual problem (5) and $\rho$ is optimal for the primal problem (4), then $\alpha / \rho$ is an optimal solution of $C$-SVM with $C = 1/(\rho l)$. LIBSVM therefore writes $(\alpha / \rho, b / \rho)$ to the model.
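The rescaling that relates a $\nu$-SVC solution to a $C$-SVM solution is plain arithmetic, sketched below. The function name and the numbers are hypothetical: the $\alpha$, $b$, and $\rho$ values are made up to show the mapping and do not come from solving an actual problem.

```python
def rescale_nu_svc_solution(alpha, b, rho):
    """Map a nu-SVC solution (alpha, b, rho) to (alpha / rho, b / rho, C = 1 / (rho * l))."""
    if rho <= 0:
        raise ValueError("rho must be positive to rescale the solution")
    l = len(alpha)
    scaled_alpha = [a / rho for a in alpha]
    return scaled_alpha, b / rho, 1.0 / (rho * l)


# Hypothetical values with l = 4, so every alpha_i respects alpha_i <= 1/l = 0.25.
alpha = [0.25, 0.25, 0.0, 0.0]
scaled_alpha, scaled_b, c_equivalent = rescale_nu_svc_solution(alpha, b=0.4, rho=0.5)
```

Because $\alpha_i \le 1/l$, the rescaled coefficients satisfy $\alpha_i / \rho \le 1/(\rho l) = C$, which is the box constraint of the corresponding $C$-SVM dual.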
Distribution Estimation (One-class SVM)
One-class SVM
The training vectors carry no class information. The primal problem is:
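$$
\begin{aligned}
\min_{w, \xi, \rho} \quad & \frac{1}{2} w^T w - \rho + \frac{1}{\nu l} \sum_{i=1}^{l} \xi_i \\
\text{subject to} \quad & w^T \phi(x_i) \ge \rho - \xi_i, \\
& \xi_i \ge 0, \quad i = 1, \dots, l
\end{aligned}
$$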
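The dual problem is
$$
\begin{aligned}
\min_{\alpha} \quad & \frac{1}{2} \alpha^T Q \alpha \\
\text{subject to} \quad & 0 \le \alpha_i \le 1/(\nu l), \quad i = 1, \dots, l, \\
& e^T \alpha = 1
\end{aligned}
$$
where $Q_{ij} = K(x_i, x_j) = \phi(x_i)^T \phi(x_j)$.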
The decision function is
$$
\operatorname{sgn}\left(\sum_{i=1}^{l} \alpha_i K(x_i, x) - \rho\right)
$$
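The one-class decision function $\operatorname{sgn}\left(\sum_i \alpha_i K(x_i, x) - \rho\right)$ has no labels and no bias term $b$; the offset is $\rho$. A minimal sketch with an RBF kernel follows. The kernel choice, `gamma`, the function names, and the model values (two support vectors with $e^T \alpha = 1$ and $\rho = 0.5$) are assumptions for illustration, not the result of training.

```python
import math


def rbf_kernel(u, v, gamma=1.0):
    """K(u, v) = exp(-gamma * ||u - v||^2); gamma=1.0 is an arbitrary example value."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v)))


def one_class_decision(x, support_vectors, alphas, rho, kernel=rbf_kernel):
    """Return +1 if sum_i alpha_i * K(x_i, x) - rho >= 0, otherwise -1."""
    value = sum(a * kernel(sv, x) for sv, a in zip(support_vectors, alphas)) - rho
    return 1 if value >= 0 else -1


# Hypothetical model: alphas sum to 1, matching the constraint e^T alpha = 1.
support_vectors = [(0.0, 0.0), (1.0, 0.0)]
alphas = [0.5, 0.5]
rho = 0.5

near = one_class_decision((0.5, 0.0), support_vectors, alphas, rho)
far = one_class_decision((5.0, 5.0), support_vectors, alphas, rho)
```

A point between the two support vectors gets +1, while a distant point gets -1 because its kernel values are close to zero.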
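Scaled version
$$
\begin{aligned}
\min_{\alpha} \quad & \frac{1}{2} \alpha^T Q \alpha \\
\text{subject to} \quad & 0 \le \alpha_i \le 1, \quad i = 1, \dots, l, \\
& e^T \alpha = \nu l
\end{aligned}
$$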
$\epsilon$-Support Vector Regression ($\epsilon$-SVR)
Training points: $\{(x_1, z_1), \dots, (x_l, z_l)\}$
$x_i \in R^n$: feature vector
$z_i \in R^1$: target output
Given parameters $C > 0$ and $\epsilon > 0$, the standard form of support vector regression is:
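$$
\begin{aligned}
\min_{w, b, \xi, \xi^*} \quad & \frac{1}{2} w^T w + C \sum_{i=1}^{l} \xi_i + C \sum_{i=1}^{l} \xi_i^* \\
\text{subject to} \quad & w^T \phi(x_i) + b - z_i \le \epsilon + \xi_i, \\
& z_i - w^T \phi(x_i) - b \le \epsilon + \xi_i^*, \\
& \xi_i, \xi_i^* \ge 0, \quad i = 1, \dots, l
\end{aligned}
$$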
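The dual problem is
$$
\begin{aligned}
\min_{\alpha, \alpha^*} \quad & \frac{1}{2} (\alpha - \alpha^*)^T Q (\alpha - \alpha^*) + \epsilon \sum_{i=1}^{l} (\alpha_i + \alpha_i^*) + \sum_{i=1}^{l} z_i (\alpha_i - \alpha_i^*) \\
\text{subject to} \quad & e^T (\alpha - \alpha^*) = 0, \\
& 0 \le \alpha_i, \alpha_i^* \le C, \quad i = 1, \dots, l
\end{aligned}
\tag{9}
$$
where $Q_{ij} = K(x_i, x_j) \equiv \phi(x_i)^T \phi(x_j)$.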
After problem (9) is solved, the approximate function is
$$
\sum_{i=1}^{l} (-\alpha_i + \alpha_i^*) K(x_i, x) + b
$$
The model output is $\alpha^* - \alpha$.
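Since the model stores $\alpha^* - \alpha$, the regression estimate $\sum_i (-\alpha_i + \alpha_i^*) K(x_i, x) + b$ can be evaluated directly from those coefficients. The sketch below uses a linear kernel on one-dimensional inputs; the function names and the model values are hypothetical and were picked so that the estimate reduces to $f(x) = x + 1$.

```python
def linear_kernel(u, v):
    """K(u, v) = u^T v."""
    return sum(a * b for a, b in zip(u, v))


def svr_estimate(x, support_vectors, coefs, b, kernel=linear_kernel):
    """Compute sum_i coefs[i] * K(x_i, x) + b, where coefs[i] = alpha*_i - alpha_i."""
    return sum(c * kernel(sv, x) for sv, c in zip(support_vectors, coefs)) + b


# Hypothetical model; the coefficients sum to zero, matching e^T (alpha - alpha*) = 0.
support_vectors = [(1.0,), (3.0,)]
coefs = [-0.5, 0.5]
b = 1.0

estimate = svr_estimate((2.0,), support_vectors, coefs, b)
```

Here $w = -0.5 \cdot 1 + 0.5 \cdot 3 = 1$, so the estimate at $x = 2$ is $2 + 1 = 3$.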
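$\nu$-Support Vector Regression ($\nu$-SVR)
The primal problem is
$$
\begin{aligned}
\min_{w, b, \xi, \xi^*, \epsilon} \quad & \frac{1}{2} w^T w + C \left( \nu \epsilon + \frac{1}{l} \sum_{i=1}^{l} (\xi_i + \xi_i^*) \right) \\
\text{subject to} \quad & (w^T \phi(x_i) + b) - z_i \le \epsilon + \xi_i, \\
& z_i - (w^T \phi(x_i) + b) \le \epsilon + \xi_i^*, \\
& \xi_i, \xi_i^* \ge 0, \quad i = 1, \dots, l, \quad \epsilon \ge 0
\end{aligned}
$$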
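The dual problem is
$$
\begin{aligned}
\min_{\alpha, \alpha^*} \quad & \frac{1}{2} (\alpha - \alpha^*)^T Q (\alpha - \alpha^*) + z^T (\alpha - \alpha^*) \\
\text{subject to} \quad & e^T (\alpha - \alpha^*) = 0, \quad e^T (\alpha + \alpha^*) \le C \nu, \\
& 0 \le \alpha_i, \alpha_i^* \le C / l, \quad i = 1, \dots, l
\end{aligned}
$$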
The approximate function is
$$
\sum_{i=1}^{l} (-\alpha_i + \alpha_i^*) K(x_i, x) + b
$$
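The inequality $e^T (\alpha + \alpha^*) \le C \nu$ can be replaced by an equality.
Writing $\bar{C} = C / l$, the problem becomes
$$
\begin{aligned}
\min_{\alpha, \alpha^*} \quad & \frac{1}{2} (\alpha - \alpha^*)^T Q (\alpha - \alpha^*) + z^T (\alpha - \alpha^*) \\
\text{subject to} \quad & e^T (\alpha - \alpha^*) = 0, \quad e^T (\alpha + \alpha^*) = \bar{C} l \nu, \\
& 0 \le \alpha_i, \alpha_i^* \le \bar{C}, \quad i = 1, \dots, l
\end{aligned}
$$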
The following two problems have the same solution:
1. $\epsilon$-SVR with parameters $(\bar{C}, \epsilon)$
2. $\nu$-SVR with parameters $(l \bar{C}, \nu)$
Performance measures
Classification
$$
\text{Accuracy} = \frac{\#\ \text{correctly predicted data}}{\#\ \text{total testing data}} \times 100\%
$$
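Classification accuracy is the percentage of testing examples whose predicted label matches the true label. A minimal sketch, with a hypothetical function name and made-up labels:

```python
def accuracy(predicted, actual):
    """Percentage of positions where the predicted label equals the true label."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(1 for p, a in zip(predicted, actual) if p == a)
    return 100.0 * correct / len(actual)


# Hypothetical predictions on four testing examples; the third one is wrong.
score = accuracy([1, -1, 1, 1], [1, -1, -1, 1])
```

Three of the four predictions are correct, so the score is 75.0.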
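Regression
For $\bar{l}$ testing examples with target values $z_i$ and predicted values $f(x_i)$, the measures are the mean squared error and the squared correlation coefficient:
$$
\text{MSE} = \frac{1}{\bar{l}} \sum_{i=1}^{\bar{l}} (f(x_i) - z_i)^2
$$
$$
r^2 = \frac{\left( \bar{l} \sum_{i=1}^{\bar{l}} f(x_i) z_i - \sum_{i=1}^{\bar{l}} f(x_i) \sum_{i=1}^{\bar{l}} z_i \right)^2}{\left( \bar{l} \sum_{i=1}^{\bar{l}} f(x_i)^2 - \left( \sum_{i=1}^{\bar{l}} f(x_i) \right)^2 \right) \left( \bar{l} \sum_{i=1}^{\bar{l}} z_i^2 - \left( \sum_{i=1}^{\bar{l}} z_i \right)^2 \right)}
$$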
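The two regression measures, mean squared error and squared correlation coefficient $r^2$, can be computed from the predictions and targets alone. The sketch below assumes the usual definitions, $\text{MSE} = \frac{1}{n} \sum_i (f(x_i) - z_i)^2$ and $r^2$ as the squared Pearson correlation between predictions and targets; the function names and data are hypothetical.

```python
def mean_squared_error(predicted, target):
    """Average of squared differences between predictions and targets."""
    n = len(target)
    return sum((f - z) ** 2 for f, z in zip(predicted, target)) / n


def squared_correlation(predicted, target):
    """Squared Pearson correlation between predictions and targets.

    Raises ZeroDivisionError if either sequence is constant.
    """
    n = len(target)
    sum_f = sum(predicted)
    sum_z = sum(target)
    sum_fz = sum(f * z for f, z in zip(predicted, target))
    sum_ff = sum(f * f for f in predicted)
    sum_zz = sum(z * z for z in target)
    numerator = (n * sum_fz - sum_f * sum_z) ** 2
    denominator = (n * sum_ff - sum_f ** 2) * (n * sum_zz - sum_z ** 2)
    return numerator / denominator


# Hypothetical data: predictions are exactly half of the targets.
predicted = [1.0, 2.0, 3.0]
target = [2.0, 4.0, 6.0]
mse = mean_squared_error(predicted, target)
r2 = squared_correlation(predicted, target)
```

The example shows why both measures are reported: the predictions are perfectly correlated with the targets ($r^2 = 1$) even though the mean squared error is far from zero.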
Overall organization
[Figure: overall organization of the LIBSVM code]
Related reading