目标是生成服从代码块中的p(x)密度函数分布的样本,用kq(x)密度函数覆盖住p(x),使得p(x)<=kq(x)。具体原理参考随机采样方法整理与讲解(MCMC、Gibbs Sampling等) 不好的地方是很难找到合适的q分布,k的确定也麻烦。
def p(x): return (0.3 * np.exp(-(x - 0.3) ** 2) + 0.7 * np.exp(-(x - 2.) ** 2 / 0.3)) / 1.2113 def q(x, loc=0, scale=1): return 1 / (np.sqrt(2 * np.pi) * scale) * np.exp(-0.5 * (x - loc) ** 2 / scale ** 2) def reject_sampling(pdf, k, loc, scale, num=1000): normal_sampling = box_muller(num, loc=loc, scale=scale) origin_pdf = pdf(normal_sampling) cover_pdf = q(normal_sampling, loc, scale) * k u = random_uniform(num) * cover_pdf res = normal_sampling[origin_pdf > u] return res res = reject_sampling(p, 10, 1.4, 1.2, int(1e+7)) plot_hist(res)MCMC采样是从马氏链的平稳分布原理出发,构造转移概率矩阵使得马氏链的平稳分布为恰好为你想要生成的抽样分布 p ( x ) p(x) p(x)。Metropolis算法对MCMC改进的地方在于对接受率 α ( x t , y ) \alpha(x_t,y) α(xt,y)做了改进,具体可参考LDA数学八卦。 代码中取 q ( x ∣ x t ) = q ( x ) q(x|x_t)=q(x) q(x∣xt)=q(x), q ( x ) q(x) q(x)为均匀分布,因此 α = m i n { p ( y ) p ( x t ) , 1 } \alpha=min\{\frac {p(y)}{p(x_t)},1\} α=min{p(xt)p(y),1}。另外一个值得注意的地方是 q ( x ) q(x) q(x)的选取要根据你想生成的分布 p ( x ) p(x) p(x)来改变, q ( x ) q(x) q(x)至少要覆盖住 p ( x ) p(x) p(x)那些概率密度比较高的点。
def p(x): return 1 / (2 * np.pi) * np.exp(-x[0] ** 2 - x[1] ** 2) def uniform_gen(): # 计算定义域一个随机值 return np.random.random() * 4 - 2 def metropolis(x0): x = (uniform_gen(), uniform_gen()) u = np.random.uniform() alpha = min(1, p(x) / p(x0)) if u < alpha: return x else: return x0 def gen_px(num): x = (uniform_gen(), uniform_gen()) res = [x] for i in range(num): x = metropolis(x) res.append(x) return np.array(res) def plt_scatter(x, path=None): x = np.array(x) plt.scatter(x[:, 0], x[:, 1]) if path: plt.savefig(path) plt.show() res = gen_px(5000) plt_scatter(res, 'MCMC.jpg')即使对MCMC抽样的接受率 α \alpha α进行改进,高维情况下Metropolis算法的效率还是有点低。所以闲着无聊的数学家们就在考虑能不能找到一个概率转移矩阵使得 α = 1 \alpha=1 α=1,然后就发现在单个维度进行概率转移的话就能满足这个条件。原理参考LDA数学八卦,代码参考采样方法(二)MCMC相关算法介绍及代码实现
def one_dim_transit(x, dim): candidates = [] probs = [] for i in range(10): tmp = x[:] tmp[dim] = uniform_gen() candidates.append(tmp[dim]) probs.append(p(tmp)) norm_probs = np.array(probs) / sum(probs) u = np.random.uniform() sum_probs = 0 for i in range(10): sum_probs += norm_probs[i] if sum_probs >= u: return candidates[i] def gibbs(x): dim = len(x) new = x[:] for i in range(dim): new[i] = one_dim_transit(new, i) return new def gibbs_sampling(num=1000): x = [uniform_gen(), uniform_gen()] res = [x] for i in range(num): res.append(gibbs(x)) return res res = gibbs_sampling(2000) plt_scatter(res, path='gibbs.jpg')参考资料 LDA数学八卦 随机采样方法整理与讲解(MCMC、Gibbs Sampling等) 采样方法(二)MCMC相关算法介绍及代码实现