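matplotlib.pyplot – Displaying multiple graphs with axes and subplot

2020-01-01 / tau

Overview

There are two ways to place multiple axes in a single figure: the add_subplot() method and the subplots() method.

Using `add_subplot()`

Figure.add_subplot() adds a subplot to an existing figure object and returns the resulting Axes object.

add_subplot(): creates a single Axes object in the figure.
add_subplot(pos): pos is a 3-digit integer encoding the number of rows, the number of columns, and the position. For example, 234 means the 4th plot in a grid of 2 rows and 3 columns. Each of the three numbers must therefore be a single digit (1 to 9).
add_subplot(nrows, ncols, index): the same information as pos, given as separate arguments. Use this form when there are many rows or columns.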

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-np.pi, np.pi, num=100, endpoint=True)
y1 = np.sin(x*2)
y2 = np.cos(x*2)

fig1 = plt.figure()
ax1 = fig1.add_subplot()

ax1.plot(y1)

fig2 = plt.figure()
fig2.subplots_adjust(hspace = 0.4)
ax2 = fig2.add_subplot(211)
ax3 = fig2.add_subplot(2, 1, 2)

ax2.set_title("sin 2x curve")
ax2.grid(True)
ax2.plot(x, y1)

ax3.set_title("cos 2x curve")
ax3.grid(True)
ax3.plot(x, y2)

plt.show()

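Using `subplots()`

pyplot.subplots() creates a figure and, given the numbers of rows and columns, generates the axes for every position at once as an array. (Figure.subplots() does the same on an existing figure.)

pyplot.subplots(nrows=1, ncols=1, figsize=(6.4, 4.8)): nrows and ncols are the numbers of subplot rows and columns; figsize is passed on to the figure.

The axes are returned as an array, but note that the dimensionality of that array depends on the numbers of rows and columns.

Both rows and columns are 1 (or nrows and ncols are omitted)	A single Axes object is returned, not an array.
Exactly one of rows and columns is 1	A 1-dimensional array of Axes objects is returned.
Both rows and columns are greater than 1	A 2-dimensional array of Axes objects of shape nrows × ncols is returned.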
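Because the shape of the returned axes changes with the grid size, code that indexes them can break when nrows or ncols changes. The following is a minimal sketch of one way around that, assuming the squeeze keyword of pyplot.subplots() (it is in matplotlib's documented signature but is not covered in this post) and the non-interactive Agg backend so that the snippet runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend, no window is opened
import matplotlib.pyplot as plt
import numpy as np

# Default (squeeze=True): the return type depends on the grid size
_, ax_single = plt.subplots()        # a single Axes object
_, ax_column = plt.subplots(3, 1)    # 1-D array of 3 Axes
_, ax_grid = plt.subplots(2, 2)      # 2-D array of 2 x 2 Axes

shape_column = ax_column.shape
shape_grid = ax_grid.shape

# squeeze=False: always a 2-D array, even for a single Axes
_, ax_always_2d = plt.subplots(1, 1, squeeze=False)
shape_always_2d = ax_always_2d.shape

# .flat visits every Axes regardless of the grid shape
x = np.linspace(-np.pi, np.pi, 100)
for n, ax in enumerate(ax_grid.flat):
    ax.plot(x, np.sin(x * (n + 1)))

plt.close("all")
```

With squeeze=False the same indexing, ax[i, j], works for every grid size, at the cost of writing ax[0, 0] for a single graph.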

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-np.pi, np.pi, num=100, endpoint=True)

fig1, ax1 = plt.subplots()

ax1.plot(np.sin(x))

fig2, ax2 = plt.subplots(3, 1)

for n in range(3):
    ax2[n].plot(np.sin(x * (n + 1)))

fig3, ax3 = plt.subplots(2, 2, figsize=(6.4, 4.8))

ax3[0, 0].plot(np.sin(x))
ax3[0, 1].plot(np.cos(x))
ax3[1, 0].plot(np.sin(x * 2))
ax3[1, 1].plot(np.cos(x * 2))

plt.show()

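Graphs spanning several rows or columns

How to draw a graph that spans several rows or columns is covered in a separate article.

Adjusting `subplot` spacing and position

To adjust the spacing and margins of graphs placed as subplots in a figure, use the subplots_adjust() method.

Detailed usage is covered in a separate article.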

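matplotlib.pyplot – Plotting with axes (display elements of the graph area)

2020-01-01 / tau

Overview

When drawing a graph with pyplot, calling pyplot functions such as plot and xlim is the simplest route, but it is also possible to create an axes object under a figure object and work on that. This post organizes the axes-object approach.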

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-np.pi, np.pi, num=100, endpoint=True)
y = np.sin(x*2)

fig = plt.figure()
ax = fig.add_subplot(111)

ax.plot(x, y, label="sin 2x")

# Title
ax.set_title("sin 2x curve")
# x-axis and y-axis labels
ax.set_xlabel("angle")
ax.set_ylabel("sin")
# x-axis and y-axis ranges
ax.set_xlim(-np.pi, np.pi)
ax.set_ylim(-1, 1)
# x-axis and y-axis ticks
ax.set_xticks([-np.pi, -np.pi/2, 0, np.pi/2, np.pi])
ax.set_xticklabels(["-pi", "-pi/2", "0", "pi/2", "pi"])
ax.set_yticks([-1, -0.5, 0, 0.5, 1])
# Legend
ax.legend(loc="upper right")
# Grid
ax.grid(True)
# Horizontal and vertical lines
ax.hlines([-0.5, 0.5], -np.pi, np.pi, linestyles="dotted")
ax.vlines([-np.pi/2, np.pi/2], -1, 1, linestyles="dashdot")

plt.show()

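Method reference

Creating the axes object

ax = fig.add_subplot(arg)

Creates an axes object from a figure object. The argument arg of add_subplot() can be given in two ways: as a single 3-digit integer such as 211, or as three separate integers such as (2, 1, 1).

When the figure holds only one axes object, the following form can also be used.

fig, ax = plt.subplots()

From here on, plotting and the display options of the graph area are handled through the methods of the resulting axes object.

Graph title

axes.set_title(label[, loc])

Displays the string label above the graph. The position is set with loc="left"/"center"/"right" (the default is "center").

Axis settings

Axis labels

axes.set_xlabel(label)
axes.set_ylabel(label)

Sets the string label as the x-axis / y-axis label.

See also: set_xlabel()/set_ylabel() – axis labels

Axis scale

axes.set_xscale(scale)
axes.set_yscale(scale)

Sets the scale of the axis. The available scales are as follows.

'linear': ordinary linear axis
'log': logarithmic axis
'symlog': logarithmic axis that also covers negative values (−log(−x) for x < 0), with a linear region around zero
'logit': logit axis for values in the open interval (0, 1), scaled as log(x / (1 − x))

Axis range

axes.set_xlim(left, right)
axes.set_ylim(bottom, top)

Sets the lower and upper limits of the x-axis and y-axis. The variations in how the arguments can be given are the same as for the pyplot functions.

Axis ticks

axes.set_xticks(ticks)
axes.set_yticks(ticks)

Sets the tick positions from the elements of ticks (a list or similar). To change the tick labels, call the following with labels that has the same number of elements as ticks.

axes.set_xticklabels(labels)
axes.set_yticklabels(labels)

To hide the tick labels, specify the following.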

axes.tick_params(labelbottom=False,
                 labelleft=False,
                 labelright=False,
                 labeltop=False)

To hide the tick marks, specify the following.

axes.tick_params(bottom=False, left=False, right=False, top=False)

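Adjusting the axes

The position of the axis lines is set through the Axes.spines attribute, for example ax.spines["left"].set_position(...). Note that spines is a container of Spine objects, not a method.
To show axis labels and tick labels only on the outer edge of a grid of graphs, call Axes.label_outer() on each Axes object.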
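A minimal sketch of the two adjustments above, assuming the Agg backend so that it runs without a display. The choice of ("data", 0) as the spine position and the sample curves are illustrative assumptions, not taken from the original post.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend, no window is opened
import matplotlib.pyplot as plt
import numpy as np

x = np.linspace(-np.pi, np.pi, 100)

# Move the left and bottom axis lines so that they cross at the data origin
fig1, ax = plt.subplots()
ax.plot(x, np.sin(x))
ax.spines["left"].set_position(("data", 0))
ax.spines["bottom"].set_position(("data", 0))
ax.spines["right"].set_visible(False)
ax.spines["top"].set_visible(False)

# Keep axis labels and tick labels only on the outer edge of a 2 x 2 grid
fig2, axs = plt.subplots(2, 2)
for a in axs.flat:
    a.plot(x, np.cos(x))
    a.set_xlabel("angle")
    a.set_ylabel("value")
    a.label_outer()

plt.close("all")
```

label_outer() has to be called on every Axes in the grid, which is why the loop runs over axs.flat.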

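Legend

axes.legend(loc=location)

Displays a legend at the position given by location, using the labels set on the plotted data. location is specified in the same way as for pyplot.legend.

See pyplot – legend for details.

Grid

axes.grid(True/False)

Passing True draws grid lines at the axis ticks.

Horizontal and vertical lines

axes.hlines(y, xmin, xmax, colors='k', linestyles='solid', label='')
axes.vlines(x, ymin, ymax, colors='k', linestyles='solid', label='')

Draws horizontal or vertical lines at the given positions. The arguments are specified in the same way as for pyplot.hlines and pyplot.vlines.

Text

axes.text(x, y, str, size=size, color=color)

Displays str at the given position.

References

Articles on the web use both the pyplot (plt) style and the axes (ax) style; the following article was very helpful in sorting them out. Many thanks to its author.

早く知っておきたかったmatplotlibの基礎知識・・・ ("Basic matplotlib knowledge I wish I had known sooner ...")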

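matplotlib.pyplot – figure: drawing multiple figures

2020-01-01 / tau

Overview

pyplot.figure() creates a new figure object every time it is called. When the script is run from a console, each figure object is shown in its own window and can be saved to a file separately.

The example below creates two figure objects, plots a graph on each, and saves both to files.

The size of the figure is given by figsize=(width, height), with width and height in inches. If omitted, the default size is 6.4 in × 4.8 in.

Note that when drawing on a figure object, the usual practice is not to work on the figure directly but to add an axes object to it and work on that.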

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-np.pi, np.pi)
y1 = np.sin(x)
y2 = np.cos(x)

fig1 = plt.figure(figsize=(8, 6))
fig1.suptitle("Figure-1")
plt.plot(x, y1)

fig2 = plt.figure()
fig2.suptitle("Figure-2")
plt.plot(x, y2)

fig1.savefig("matplotlib_pyplot_fig_1.png")
fig2.savefig("matplotlib_pyplot_fig_2.png")

plt.show()

[Figures: matplotlib_pyplot_fig_1.png, matplotlib_pyplot_fig_2.png]

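matplotlib.pyplot – Drawing multiple graphs with subplot

2020-01-01 / tau

Overview

pyplot.subplot() makes it possible to draw several graphs in a single window.

subplot(rows, cols, position)
subplot(rcp)

rows is the number of rows of graphs and cols the number of columns. position is a single number giving the drawing position within the rows × cols grid; it starts at 1 and increases by one in the order row 1 column 1 → row 1 column 2 → ... → row 2 column 1 → row 2 column 2.

The arguments can be given in two ways: rows, cols, and position as three separate numbers, or as one number of the form rcp. For example, to select row 2, column 2 of a grid of 2 rows and 3 columns, write (2, 3, 5) or (235).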
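The row-major numbering described above can be written as a small formula. The sketch below uses two hypothetical helper functions (subplot_position and subplot_cell are names chosen for illustration, not matplotlib functions) to convert between a 1-based (row, column) cell and the position number.

```python
def subplot_position(row, col, ncols):
    """Return the 1-based subplot position of the 1-based cell (row, col)."""
    return (row - 1) * ncols + col


def subplot_cell(position, ncols):
    """Inverse mapping: 1-based position -> 1-based (row, col)."""
    row, col = divmod(position - 1, ncols)
    return row + 1, col + 1


# Row 2, column 2 of a grid with 3 columns is position 5
print(subplot_position(2, 2, ncols=3))  # 5
print(subplot_cell(5, ncols=3))         # (2, 2)
```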

The following example draws a 2 × 2 grid of graphs.

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-np.pi, np.pi)
y1 = np.sin(x)
y2 = np.cos(x)
y3 = np.sin(x*2)
y4 = np.cos(x*2)

plt.subplots_adjust(wspace=0.4, hspace=0.4)

plt.subplot(2, 2, 1)
plt.title("sin x")
plt.plot(x, y1, label="sin x")

plt.subplot(2, 2, 2)
plt.title("cos x")
plt.plot(x, y2)

plt.subplot(223)
plt.plot(x, y3)

plt.subplot(224)
plt.title("cos 2x")
plt.plot(x, y4)

plt.show()

matplotlib.pyplot – Display elements of the graph area

2020-01-01 / tau

Overview

Examples of the basic options available when drawing a graph directly with pyplot.

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-np.pi, np.pi)
y = np.sin(x)

plt.plot(x, y, label="sin x")

# Title
plt.title("sin x")
# x-axis and y-axis labels
plt.xlabel("x")
plt.ylabel("sin x")
# x-axis and y-axis ranges
plt.xlim(-3, 3)
plt.ylim(-1, 1)
# x-axis and y-axis ticks
plt.xticks([-3, -2, -1, 0, 1, 2, 3])
plt.yticks([-1, -0.75, -0.5, -0.25, 0, 0.25, 0.5, 0.75, 1])
# Legend
plt.legend(loc="lower right")
# Grid
plt.grid(True)
# Horizontal and vertical lines
plt.hlines([-0.5, 0.5], -3, 3, linestyles="dashed", colors=["red", "blue"])
plt.vlines([-np.pi/2, np.pi/2], -1, 1, linestyles=["dashdot", "dotted"])

plt.show()

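Option reference

Graph title

pyplot.title(label[, loc])

Displays the string label above the graph. The position is set with loc="left"/"center"/"right" (the default is "center").

Axis labels

pyplot.xlabel(label)
pyplot.ylabel(label)

Displays the string label as the x-axis / y-axis label.

Axis range

pyplot.xlim(left, right)
pyplot.ylim(bottom, top)

Sets the lower and upper limits of the x-axis and y-axis. The limits can be given as two arguments or as a tuple, as below.

pyplot.xlim((left, right))

It is also possible to give only one of the two limits and change it while keeping the other.

pyplot.xlim(right=1)
pyplot.ylim(bottom=-1)

Called without arguments, the function returns the lower and upper limits as a tuple.

left, right = pyplot.xlim()

Axis ticks

pyplot.xticks(ticks[, labels])
pyplot.yticks(ticks[, labels])

The values given as a list or similar become the tick positions. If labels is given with the same number of elements as ticks, its contents replace the tick labels.

Displaying a legend

pyplot.legend(loc=location)

Displays a legend at the position given by location, using the labels set on the plotted data.

location is given as a string or as an integer code, as follows.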

location string	location code
'best'	0
'upper right'	1
'upper left'	2
'lower left'	3
'lower right'	4
'right'	5
'center left'	6
'center right'	7
'lower center'	8
'upper center'	9
'center'	10

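Displaying a grid

pyplot.grid(True/False)

Passing True draws grid lines at the axis ticks.

Displaying horizontal and vertical lines

pyplot.hlines(y, xmin, xmax, colors='k', linestyles='solid', label='')
pyplot.vlines(x, ymin, ymax, colors='k', linestyles='solid', label='')

For hlines, for example, the y values at which to draw horizontal lines are given as a list or similar, and each line is drawn from xmin to xmax. colors is the line color, and linestyles is the line style, one of 'solid', 'dashed', 'dashdot', 'dotted'.

If a single value is given for colors or linestyles, it applies to all horizontal / vertical lines; values can also be given individually, matching the number of elements in y.

The singular forms color and linestyle are also accepted in place of colors and linestyles.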

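numpy – arange, linspace: generating sequences

2020-01-01 / tau

`numpy.arange()`: sequences with a given step

Arguments and return value

numpy.arange([start, ]stop, [step, ]dtype=None)

Argument	Type	Description
`start`	`int/float`	Start of the sequence. Optional (default 0).
`stop`	`int/float`	End of the sequence. Required.
`step`	`int/float`	Step of the sequence. Optional (default 1).
`dtype`	`dtype`	Data type of the generated sequence. If omitted, it is inferred from the other arguments.

Return value: an ndarray holding the sequence that starts at start, increases by step, and stays below stop.

Argument examples

Always give the arguments in one of the following forms. A call such as numpy.arange(stop=5, step=2) is not accepted.

numpy.arange(stop): returns the sequence from 0 (inclusive) to stop (exclusive) in steps of 1. The type of the sequence follows the type of the argument.
numpy.arange(start, stop): returns the sequence from start (inclusive) to stop (exclusive) in steps of 1. The type of the sequence follows the type of the arguments.
numpy.arange(start, stop, step): returns the sequence that starts at start, increases or decreases by step, and ends just before stop. The type of the sequence follows the type of the arguments.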

import numpy as np

print(np.arange(10))
print(np.arange(3, 10))
print(np.arange(3, 10, 2))

# [0 1 2 3 4 5 6 7 8 9]
# [3 4 5 6 7 8 9]
# [3 5 7 9]

Other notes

Descending sequences

A negative step produces a descending sequence. In that case the elements n satisfy start ≥ n > stop.

print(np.arange(10, 3, -2))

# [10  8  6  4]

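Real-valued sequences

A sequence of real numbers can be produced by giving a non-integer step. Note that the result below contains 0.8 even though stop is nominally excluded: with non-integer steps, floating-point rounding error can change the number of elements.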

print(np.arange(0.2, 0.8, 0.2))

# [0.2 0.4 0.6 0.8]
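The output above includes 0.8 although stop is meant to be exclusive. The following is a minimal sketch of why, assuming (as the NumPy documentation describes) that arange determines its length as ceil((stop - start) / step); it also shows numpy.linspace as a substitute that fixes the number of elements instead of the step.

```python
import math

import numpy as np

start, stop, step = 0.2, 0.8, 0.2

# In binary floating point the quotient comes out slightly above 3,
# so rounding up gives 4 elements instead of the expected 3
quotient = (stop - start) / step
length = math.ceil(quotient)
print(quotient, length)

a = np.arange(start, stop, step)
print(len(a))

# linspace takes the number of elements directly, so the count is never in doubt
b = np.linspace(0.2, 0.6, 3)
print(b)
```

For non-integer steps, choosing the endpoints and the element count with linspace avoids depending on how the rounding happens to fall.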

However, since the defaults are start=0 (0.0) and step=1 (1.0), the behavior is as follows.

print(np.arange(2.2))
print(np.arange(2.3, 6.4))

# [0. 1. 2.]
#   starts at 0 and increases by 1.0 while staying below 2.2
# [2.3 3.3 4.3 5.3 6.3]
#   starts at 2.3 and increases by 1.0 while staying below 6.4

Specifying the type with dtype

dtype forces the type of the result.

print(np.arange(5, 10, dtype=float))
print(np.arange(2.3, 6.4, dtype=int))

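# [5. 6. 7. 8. 9.]
# [2 3 4 5 6]
#   converting the arguments to integers first would give [2 3 4 5], but that is not what happens
#   apparently the sequence [2.3 3.3 4.3 5.3 6.3] is built first and then converted to integers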

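`numpy.linspace()`: sequences with a given number of elements

Arguments and return value

numpy.linspace(start, stop, num=50, endpoint=True, retstep=False, dtype=None)

Argument	Type	Description
`start`	`int/float`	Start of the sequence. Required.
`stop`	`int/float`	End of the sequence. Required.
`num`	`int`	Number of elements. Optional (default 50).
`endpoint`	`bool`	Whether stop is included as an element: `True` includes it, `False` excludes it. Optional (default `True`, so `stop` is included).
`retstep`	`bool`	Whether to return the step in addition to the array. With `True`, a tuple of (array, step) is returned; with `False`, only the array.
`dtype`	`dtype`	Data type of the generated sequence. If omitted, `float` is used.

Return value: an ndarray of num evenly spaced values from start to stop (that is, num − 1 equal intervals when endpoint=True).

Argument examples

numpy.linspace(start, stop): returns 50 values starting at start and ending at stop.
numpy.linspace(start, stop, num): returns num values starting at start and ending at stop.
numpy.linspace(start, stop, num, endpoint=False): returns num values starting at start and ending just before stop.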

print(np.linspace(-2.4, 2.5))
print(np.linspace(-2, 2, 11))
print(np.linspace(-2, 2, 10, endpoint=False))
print(np.linspace(-1, 1, 5, retstep=True))
a, d = np.linspace(-1, 1, 5, retstep=True)
print(a)
print(d)

# [-2.4000000e+00 -2.3000000e+00 -2.2000000e+00 -2.1000000e+00
#  -2.0000000e+00 -1.9000000e+00 -1.8000000e+00 -1.7000000e+00
# .....
#   2.0000000e+00  2.1000000e+00  2.2000000e+00  2.3000000e+00
#   2.4000000e+00  2.5000000e+00]
# [-2.  -1.6 -1.2 -0.8 -0.4  0.   0.4  0.8  1.2  1.6  2. ]
# [-2.  -1.6 -1.2 -0.8 -0.4  0.   0.4  0.8  1.2  1.6]
# (array([-1. , -0.5,  0. ,  0.5,  1. ]), 0.5)
# [-1.  -0.5  0.   0.5  1. ]
# 0.5

Other notes

Specifying the type with dtype

With dtype=int, an integer sequence can be generated.

print(np.linspace(-2, 2, 5, dtype=int))

# [-2 -1  0  1  2]

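However, if the number of elements does not suit the range, the result looks odd.

print(np.linspace(-2, 2, 4, dtype=int))

# [-2  0  0  2]
#   first, the following sequence is generated as float
#   [-2.         -0.66666667  0.66666667  2.        ]
#   then the fractional parts are truncated, giving the result above
#   (NumPy 1.20 and later round toward negative infinity instead, giving [-2 -1  0  2])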

Retrieving the step with retstep

With retstep=True, a tuple is returned whose first element is the ndarray of the sequence and whose second element is the step.

print(np.linspace(-1, 1, 5, retstep=True))
a, d = np.linspace(-1, 1, 5, retstep=True)
print(a)
print(d)

# (array([-1. , -0.5,  0. ,  0.5,  1. ]), 0.5)
# [-1.  -0.5  0.   0.5  1. ]
# 0.5
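The step returned by retstep follows directly from start, stop, and num. As a short worked check (the variable names are chosen for illustration), the step is (stop − start) / (num − 1) when stop is included and (stop − start) / num when it is excluded:

```python
import numpy as np

start, stop, num = -1.0, 1.0, 5

# endpoint=True (default): num points span num - 1 intervals
_, step_closed = np.linspace(start, stop, num, retstep=True)
expected_closed = (stop - start) / (num - 1)
print(step_closed, expected_closed)

# endpoint=False: stop is excluded, so the range is cut into num intervals
_, step_open = np.linspace(start, stop, num, endpoint=False, retstep=True)
expected_open = (stop - start) / num
print(step_open, expected_open)
```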

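Python3 – Handling the first and last iterations of a for loop differently

2019-12-31 / tau

Goal

Suppose many results are to be displayed. For a list, "[" appears only at the start, ", " follows each element, and the last element alone has no comma and is followed by "]". The goal is to implement this kind of processing in various forms.

The naive approach gives an awkward result, as follows.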

lst = [10, 20, 30, 40]

print("<", end="")
for e in lst:
    print("{}, ".format(e), end="")
print(">")

# <10, 20, 30, 40, >

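The aim is to treat only the end differently, as in <10, 20, 30, 40>, and beyond that, to break the line every few elements and start the second and later lines differently from the first.

Using the index

Single-line output

In the example below, enumerate supplies the index of each element, and the processing branches on its value. The leading "<" could be printed before the for loop, but it is handled with an if statement here in preparation for the multi-line version that follows.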

for i, e in enumerate(lst):
    # For the first element, print "<" before the element
    if i == 0:
        print("<", end="")

    # Print the element
    print(e, end="")

    # Print ">" after the last element, ", " otherwise
    if i == len(lst) - 1:
        print(">")
    else:
        print(", ", end="")

# <10, 20, 30, 40>
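For the single-line case, a different technique from the index-based loop is also available: str.join places the separator only between elements, so no special handling of the last element is needed. A minimal sketch, where format_angle is a hypothetical helper name:

```python
lst = [10, 20, 30, 40]


def format_angle(items):
    """Return the items formatted as "<a, b, c>" using str.join."""
    return "<" + ", ".join(str(e) for e in items) + ">"


print(format_angle(lst))  # <10, 20, 30, 40>
print(format_angle([]))   # <>
```

This does not replace the loop when each element needs its own per-position logic, which is what the multi-line case requires.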

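Multi-line output

When a long list or a multi-dimensional array is to be displayed over several lines, the string at the end of each line also changes from line to line. The following for loop handles that case.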

lst = [2, 4, 6, 8, 10, 12, 14, 16]

num_in_line = 3
for i, e in enumerate(lst):
    # Line-head string: "<" on the first line, " " (indent) on later lines
    prefix = "<" if i // num_in_line == 0 else " "

    # String after the element: ">" for the last element, ", " otherwise
    suffix = ">" if i == len(lst) - 1 else ", "

    # For the first element of a line, print prefix before the element
    if i % num_in_line == 0:
        print(prefix, end="")

    # Print the element and suffix
    print("{:2d}{}".format(e, suffix), end="")

    # Break the line after the last element of a line or of the list
    if i % num_in_line == num_in_line - 1 or i == len(lst) - 1:
        print()

# < 2,  4,  6, 
#   8, 10, 12, 
#  14, 16>

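Using a generator

This approach uses a generator that yields three things for each element: a prefix, the element itself, and a suffix.

To treat the first and last elements differently, the collection is turned into an iterator and its first element is fetched before entering the loop. This relies on the fact that once next() has taken an item from the iterator, the following for loop continues with the remaining items. The generator therefore always holds one element back, and knows it has reached the last element when the loop ends.

The prefix is "<" for the first element only; afterwards it is "", meaning no prefix.

Single-line output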

lst = [10, 20, 30, 40]

def generator(lst):
    # String printed before the first element
    prefix = "<"

    # Turn the given collection into an iterator
    it = iter(lst)

    # Fetch the first element in advance
    next_element = next(it)

    # The first element has already been consumed, so current_element
    # starts from the second element
    for current_element in it:
        # The generator yields prefix, the element itself, and suffix
        yield prefix, next_element, ", "
        # Shift the fetched element into next_element
        next_element = current_element
        # No prefix from the second element onward
        prefix = ""
    # Only the last element gets ">" as its suffix
    yield prefix, next_element, ">"

for prefix, e, suffix in generator(lst):
    print("{}{}{}".format(prefix, e, suffix), end="")
print()

# <10, 20, 30, 40>

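Multi-line output

For multi-line output, the number of elements per line is fixed and a counter tracks the position within the line. When the counter shows that an element is the first in its line, prefix is set to " "; otherwise it is "".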

lst = [2, 4, 6, 8, 10, 12, 14, 16]

def generator(lst):
    num_in_line = 3
    prefix = "<"

    it = iter(lst)
    next_element = next(it)

    # Counter for the element position within a line
    counter = 0
    for current_element in it:
        # counter cycles through 1 .. num_in_line
        counter += 1
        # Last position in the line: break the line; otherwise continue
        if counter == num_in_line:
            suffix = ",\n"
            counter = 0
        else:
            suffix = ", "

        yield prefix, next_element, suffix
        next_element = current_element
        # At the head of the second and later lines, prefix is " "
        prefix = " " if counter == 0 else ""
    yield prefix, next_element, ">"

for prefix, e, suffix in generator(lst):
    print("{}{:2d}{}".format(prefix, e, suffix), end="")
print()

# < 2,  4,  6,
#   8, 10, 12,
#  14, 16>
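The one-element lookahead used above can be separated from the formatting. The sketch below is one possible arrangement, not the author's code: with_position and format_rows are hypothetical helper names. It also passes a default to next(), because calling next(it) on an empty collection inside a generator raises StopIteration, which Python 3.7 and later turn into a RuntimeError.

```python
def with_position(iterable):
    """Yield (is_first, is_last, item) for each item, looking one item ahead."""
    it = iter(iterable)
    sentinel = object()
    held = next(it, sentinel)  # the default avoids StopIteration on empty input
    if held is sentinel:
        return
    is_first = True
    for upcoming in it:
        yield is_first, False, held
        held = upcoming
        is_first = False
    yield is_first, True, held


def format_rows(items, num_in_line=3):
    """Format integers as "<...>" with num_in_line elements per line."""
    parts = []
    for i, (is_first, is_last, e) in enumerate(with_position(items)):
        if i % num_in_line == 0:
            parts.append("<" if is_first else " ")
        parts.append("{:2d}".format(e))
        if is_last:
            parts.append(">")
        elif i % num_in_line == num_in_line - 1:
            parts.append(",\n")
        else:
            parts.append(", ")
    return "".join(parts)


print(format_rows([2, 4, 6, 8, 10, 12, 14, 16]))
```

Building a string and printing it once also makes the result easy to check without capturing printed output.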

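Python3 – Implementing generators with the yield statement

2019-12-30 / tau

Reviewing the `return` statement

Like the return statement, the yield statement is used inside a function and specifies a value to hand back, but its behavior is completely different.

As preparation, first check how a function with an ordinary return statement behaves. Every call executes the function from its first line up to the return statement.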

def func0():
    print("called")
    return 1

print(func0())
print(func0())

# called
# 1
# called
# 1

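Using `yield` creates a generator

When the return statement is changed to a yield statement, calling the function no longer returns the value; instead the function acts as a constructor for a generator object.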

def func1():
    print("called")
    yield 2

print(func1())

# <generator object func1 at 0x047BCF30>

A generator object has (in Python 3) a __next__() method, which retrieves the generated values one after another. So the following creates a generator instance with func1() above and takes values from it directly.

generator = func1()
print(generator.__next__())
print(generator.__next__())

# called
# 2
# Traceback (most recent call last):
# .....
# StopIteration

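This generator produces only one value, so an attempt to take a second one raises a StopIteration exception.

Behavior of generators built with `yield`

The next example is a generator that yields three values; to make the flow visible, a print call shows the processing that runs before each yield statement. Here the built-in function next() is used instead of the __next__() method.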

def func3():
    print("He said:")
    yield "How are you?"

    print("She replied and asked:")
    yield "I'm fine. How are you?"

    print("He replied:")
    yield "I'm fine too."

generator = func3()
print(next(generator))
print(next(generator))
print(next(generator))

# He said:
# How are you?
# She replied and asked:
# I'm fine. How are you?
# He replied:
# I'm fine too.

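What the function produces is a generator, which signals that it is exhausted by raising StopIteration, so it can be used in a for statement as follows.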

for x in func3():
    print(x)

# He said:
# How are you?
# She replied and asked:
# I'm fine. How are you?
# He replied:
# I'm fine too.

Mixing `yield` and `return`

If a function contains a yield statement, a generator is created even when the function also contains a return statement.

def func4():
    yield 1
    yield 2
    yield 3
    return 4

print(func4())
for x in func4():
    print(x)

# <generator object func4 at 0x03B5CF30>
# 1
# 2
# 3

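However, when execution reaches the return statement, StopIteration is raised at that point. The value given to return is attached to the exception (as its value attribute), but iteration itself, for example in a for loop, ignores it.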

def func5():
    yield 1
    yield 2
    return 3
    yield 4

g = func5()
print(next(g))
print(next(g))
print(next(g))

# 1
# 2
# Traceback (most recent call last):
# .....
# StopIteration: 3
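The value shown after "StopIteration:" in the traceback can be read from the exception object. A minimal sketch of two ways to obtain it: catching StopIteration and reading its value attribute, and delegating with yield from, whose result is that same value. The wrapper function is a hypothetical name added for illustration.

```python
def func5():
    yield 1
    yield 2
    return 3


# 1) Catch StopIteration and read the value passed to return
g = func5()
values = []
result = None
while True:
    try:
        values.append(next(g))
    except StopIteration as stop:
        result = stop.value
        break

print(values, result)  # [1, 2] 3


# 2) yield from evaluates to the return value of the inner generator
def wrapper():
    returned = yield from func5()
    yield "returned {}".format(returned)


print(list(wrapper()))  # [1, 2, 'returned 3']
```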

When this generator is used in a for statement, it runs up to just before the return statement, as follows.

for x in func5():
    print(x)

# 1
# 2

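Implementation example

As an example, consider a generator that takes an argument and yields the Fibonacci numbers smaller than that number.

def fibonacci(limit):
    # limit: exclusive upper bound (named limit to avoid shadowing the built-in max)
    a = 1
    b = 1
    # The first two terms are always yielded, whatever limit is
    yield a
    yield b
    next_value = a + b
    while next_value < limit:
        yield next_value
        a = b
        b = next_value
        next_value = a + b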

for x in fibonacci(100):
    print(x)

# 1
# 1
# 2
# 3
# 5
# 8
# 13
# 21
# 34
# 55
# 89

Python3 – Iterators and generators

2019-12-30 / tau

Iterators

Implementing an iterator with __iter__()

itertools

Iterators cannot be reused

Generators

Implementing generators with the yield statement

Implementing generators
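The point in the outline above that an iterator cannot be reused can be checked in a few lines. A minimal sketch; countdown is a hypothetical generator function used only for illustration:

```python
lst = [1, 2, 3]

# An iterator is consumed by the first pass and yields nothing afterwards
it = iter(lst)
first_pass = list(it)
second_pass = list(it)
print(first_pass, second_pass)  # [1, 2, 3] []

# The list itself is iterable, not an iterator: each iter() call starts over
print(list(iter(lst)), list(iter(lst)))  # [1, 2, 3] [1, 2, 3]


def countdown(n):
    """Yield n, n - 1, ..., 1."""
    while n > 0:
        yield n
        n -= 1


# Generators are iterators, so the same applies to them
gen = countdown(3)
gen_first = list(gen)
gen_second = list(gen)
print(gen_first, gen_second)  # [3, 2, 1] []
```

To iterate again, create a new iterator or call the generator function again.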

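k-means clustering

2019-11-10 / tau

Overview

k-means clustering is a clustering method that partitions data into flat (non-hierarchical) clusters, based on the features of the given data and on initial values.

Here its behavior is examined with a KMeansClustering class that implements a simple version of k-means.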
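The KMeansClustering class itself is not shown in this post. As a stand-in, the following is a minimal pure-Python sketch of the standard k-means iteration (Lloyd's algorithm) for 2-D points; the function name k_means, its parameters, and the sample data are assumptions made for illustration and are not the author's implementation.

```python
import math


def k_means(points, initial_means, max_iter=100):
    """Run k-means on 2-D points; return (means, labels, iterations)."""
    means = [tuple(m) for m in initial_means]
    labels = None
    for iteration in range(1, max_iter + 1):
        # Assignment step: each point joins its nearest representative point
        new_labels = [
            min(range(len(means)), key=lambda k: math.dist(p, means[k]))
            for p in points
        ]
        # Converged when no point changes cluster
        if new_labels == labels:
            return means, labels, iteration
        labels = new_labels
        # Update step: move each representative point to the mean of its cluster
        for k in range(len(means)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # an empty cluster keeps its previous position
                means[k] = (
                    sum(x for x, _ in members) / len(members),
                    sum(y for _, y in members) / len(members),
                )
    return means, labels, max_iter


# Two well-separated groups of three points each (made-up sample data)
points = [(1, 1), (1, 2), (2, 1), (8, 8), (8, 9), (9, 8)]
means, labels, n_iter = k_means(points, initial_means=[(0, 0), (5, 5)])
print(means)
print(labels, n_iter)
```

The result depends on initial_means, which is the sensitivity the test cases below explore.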

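Test cases

Basic case

First, a case in which two clusters are fairly distinct: points are generated at random inside a circle, and two such groups are placed close together.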

x_means[0], y_means[0] = 15, 15
x_means[1], y_means[1] = 20, 15

plot_steps = [0, 1, 2, 4, 6]

As shown below, the overlapping region cannot be separated, but the classification is quite close to the original groups.

convergion times = 7
[14.567003574164632, 15.215775486294216]
[25.31190419286806, 25.871321239241027]

Changing the initial values

Next, the same run with different initial values for the representative points.

x_means[0], y_means[0] = 25, 25
x_means[1], y_means[1] = 25, 30

plot_steps = [0, 1, 2, 4, 8]

Even with initial values far from those above, the solution is the same.

The converged values are exactly the same as above.

convergion times = 9
[14.567003574164632, 15.215775486294216]
[25.31190419286806, 25.871321239241027]

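When the clusters are not distinct

Judging from the results so far alone, the classification appears stable even when the initial values shift considerably.

So next, consider splitting the data into three clusters when the original distribution shows no clear grouping.

Initial values 1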

x_means[0], y_means[0] = 10, 18
x_means[1], y_means[1] = 18, 18
x_means[2], y_means[2] = 25, 18

plot_steps = [0, 1, 4, 8, 12]

convergion times = 13
[11.359345006841108, 15.215511281942952]
[21.33481062455269, 13.05930376628775]
[19.777630465074534, 23.850681586725297]

Initial values 2

The initial values are changed from the above.

x_means[0], y_means[0] = 18, 10
x_means[1], y_means[1] = 18, 18
x_means[2], y_means[2] = 18, 25

plot_steps = [0, 1, 4, 8, 10]

The data are the same, but the clustering comes out differently.

convergion times = 11
[18.43160436224161, 11.596987217637075]
[12.210575780454143, 19.032933149086162]
[22.418606246186307, 23.158839350112153]

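An extreme example

Next, consider an extreme case in which no clusters can be discerned in the original distribution.

Initial values 1

The initial representative points are lined up vertically, and the data are split into clusters in the vertical direction as well.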

x_means[0], y_means[0] = 15, 15
x_means[1], y_means[1] = 15, 20

plot_steps = [0, 1, 3, 5, 8]

convergion times = 9
[19.1770479839392, 13.104941433078057]
[20.361677232475646, 26.9785132838531]

Initial values 2

With exactly the same data but the initial representative points lined up horizontally, the clustering differs greatly.

x_means[0], y_means[0] = 15, 15
x_means[1], y_means[1] = 20, 15

x_lim = 40
y_lim = 40

plot_steps = [0, 1, 2, 4]

convergion times = 5
[12.933983667295676, 20.22594500107436]
[25.46963894609307, 21.15223763731067]

Three clusters

Finally, a case in which the clusters in the original data are quite distinct.

Initial values 1

x_means[0], y_means[0] = 10, 15
x_means[1], y_means[1] = 15, 15
x_means[2], y_means[2] = 20, 15

plot_steps = [0, 2, 4, 6, 9]

Even though the initial values start near a corner, the data separate cleanly into three clusters.

convergion times = 10
[14.610459971900003, 15.428114490958492]
[25.252313933530775, 30.84369233062165]
[34.43506243558404, 14.084017148769334]

Initial values 2

x_means[0], y_means[0] = 25, 25
x_means[1], y_means[1] = 25, 30
x_means[2], y_means[2] = 25, 35

plot_steps = [0, 1, 3, 6, 10]

Even when the location and arrangement of the initial values differ considerably, the clustering is stable.

convergion times = 11
[34.43506243558404, 14.084017148769334]
[14.665546643275666, 15.615970568228104]
[25.413807412324008, 30.963999916166244]

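Summary

k-means is said to give solutions that vary with the initial values, but when the clusters are clearly distinct the solution is stable.

Such cases, however, correspond to data with few features whose distribution is obvious at a glance. When there are many features and the clusters are hard to see, the result is likely to depend strongly on how the initial values are chosen.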
