10月 | 2013 | Pacocat's Life

任意の分布に関して乱数を作りたかったのでメモ。
分布の逆関数を容易に求められる場合は、逆関数法を使ってしまうのが簡単なのだけど、

どんな（一価の）関数にも適用出来る
アルゴリズムが作りやすい

といった理由でここでは棄却法を使います。

以下のページを参考にしています。
数値計算法 2011/6/29 (pdf)
15 乱数とモンテカルロ法 (pdf)

von Neumannの棄却法

考え方はとてもシンプルで、

分布関数を含む範囲内で一様乱数のペア ( $x_{i}, y_{i}$ )を生成

具体的には分布関数の範囲を $[x_{min}, x_{max}], [0, y_{max}]$ 、
確率変数 $x_{rand}, y_{rand}$ を $0 \leq x_{rand}, y_{rand} \leq 1$ に従う一様分布と仮定した時に、

$x_{i} = x_{min} + (x_{max} - x_{min})x_{rand}\\ y_{i} = y_{max} y_{rand}\\$

を作る。

生成した乱数ペア( $x_{i}, y_{i}$ ) が分布関数内に入っていれば、 $x_{i}$ を分布に従う乱数として採用（下図でいうと赤点が採用、×が不採用）

アルゴリズム的にはこんな感じ？（コード中にあるテストファイルはこちらから→”random_dist.txt“）

結果のヒストグラム：

手計算でチェックしたかったので過程をメモ。

以下の二つの関数を畳み込み積分した場合、

$f(x) = \begin{cases} \lambda \exp(-\lambda x) & (x\ge 0) \\ 0 & (x<0) \end{cases}$
$\displaystyle g(x) = \frac{1}{\sqrt{2\pi}\sigma}\exp\left\{-\frac{(x-\mu)^2}{2\sigma^2}\right\}$

Exponentially modified Gaussian distribution
にあるようにex-Gaussian distributionと呼ばれる分布が出来る。
$\displaystyle f(x)\ast g(x) = \frac{\lambda}{2}e^{\lambda \mu + \frac{1}{2}\lambda^2\sigma^2}e^{-\lambda x}\mbox{erfc}\left(\frac{\mu+\lambda \sigma^2 -x}{\sqrt{2}\sigma}\right)$
ここでerfcはガウスの相補誤差関数（complementary error function）で以下のように定義されている。erfはガウスの誤差関数。
$\displaystyle \mbox{erfc}(x) = 1-\mbox{erf}(x) = \frac{2}{\sqrt{\pi}} \int ^{+\infty}_{x} e^{-t^2}dt$

Convolution of the normal and exponential probability density functions

以下計算、
$\displaystyle f(x)\ast g(x) = \int^{+\infty}_{-\infty} dx'f(x')g(x-x')\\[1.0ex] =\int^{+\infty}_{0} dx' \lambda \exp(-\lambda x') \frac{1}{\sqrt{2\pi}\sigma}\exp\left\{-\frac{(x-x'-\mu)^2}{2\sigma^2}\right\}\\[1.0ex] = \frac{\lambda}{\sqrt{2\pi}\sigma} \int^{+\infty}_{0} dx' \exp\left[-\frac{1}{2\sigma^2} \left\{x'^2+2x'(\mu-x+\lambda \sigma^2)+(x-\mu)^2 \right\} \right]\\[1.0ex] = \frac{\lambda}{\sqrt{2\pi}\sigma} \int^{+\infty}_{0} dx'\exp\left\{-\frac{1}{2\sigma^2}(x'-x+\mu+\lambda \sigma^2)^2 + \frac{1}{2}\lambda^2 \sigma^2 -\lambda x + \lambda \mu \right\}\\[1.0ex] = \frac{\lambda}{\sqrt{2\pi}\sigma} e^{\lambda \mu + \frac{1}{2}\lambda^2\sigma^2}e^{-\lambda x}\int^{+\infty}_{0} dx'\exp\left\{-\frac{1}{2\sigma^2}(x'-x+\mu+\lambda \sigma^2)^2 \right\}$

ここで、変数変換
$\displaystyle z = \frac{1}{\sqrt{2}\sigma}(x'-x+\mu+\lambda \sigma^2), \; dz' = \frac{dx'}{\sqrt{2}\sigma}$
とガウスの相補誤差関数の定義を使えば、
$\displaystyle f(x)\ast g(x) = \frac{\lambda}{\sqrt{\pi}} e^{\lambda \mu + \frac{1}{2}\lambda^2\sigma^2}e^{-\lambda x}\int^{+\infty}_{\frac{\mu+\lambda \sigma^2-x}{\sqrt{2}\sigma}} dz e^{-z^2}\\[1.0ex] = \frac{\lambda}{2}e^{\lambda \mu + \frac{1}{2}\lambda^2\sigma^2}e^{-\lambda x}\mbox{erfc}\left(\frac{\mu+\lambda \sigma^2 -x}{\sqrt{2}\sigma}\right)\\$
が得られる。

Python code

Pythonに用意されてるモジュールについてはnumpy.convolveも参照

#!/opt/local/bin/python import numpy as np import scipy.special # ex-Gaussian distribution def convolve_exp_norm(alpha, mu, sigma, x): co = alpha/2.0 * np.exp( alpha*mu+ alpha*alpha*sigma*sigma/2.0) x_erf = (mu + alpha*sigma*sigma - x)/(np.sqrt(2.0)*sigma) y = co * np.exp(-alpha*x) * (1.0 - scipy.special.erf(x_erf)) return y # input parameters x = np.arange(-5.0,5.0,0.1) alpha = 1.0 # index of the exponential function sigma = 0.5 # dispersion of the normal distribution mu = -0.5 # center of the normal distribution exponential = [] for i in range(len(x)): value = alpha * np.exp(-alpha*x[i]) if (x[i]>=0.0) else 0.0 exponential.append(value) np.array(exponential) normal = 1.0/(sqrt(2.0*np.pi)*sigma) * np.exp(-(x-mu)*(x-mu)/(2.0*sigma*sigma)) # convolve convolution = convolve_exp_norm(alpha, mu, sigma, x) # plot result plot(x,convolution,color='red',linewidth=2.0) plot(x,exponential,color='blue') plot(x,normal,color='green') legend(('Exponentially modified Gaussian','Exponential Distribution (input)','Normal Distribution (input)'),frameon=False) xlim(-2,5) text(3.0,0.7,r'$\lambda=1.5$',fontsize=20,verticalalignment='center') text(3.0,0.6,r'$\mu=-0.5$',fontsize=20,verticalalignment='center') text(3.0,0.5,r'$\sigma=0.5$',fontsize=20,verticalalignment='center')

結果はこんな感じ。

Pacocat's Life

Every day is a new day

月別アーカイブ: 2013年10月

任意の確率密度分布に従う乱数の生成（von Neumannの棄却法）

von Neumannの棄却法

正規分布（normal distribution）と指数関数（exponential distribution）の畳み込み積分

Convolution of the normal and exponential probability density functions

Python code