Histogram¶
Used to calculate histogram counts (no plotting).
Example using DataFrame
wrapper¶
>>> df=DataFrame()
>>> df.read_tbl('data/words~ageXcondition.csv')
>>> D = df.histogram('WORDS')
>>> print(D)
Cumulative Histogram for WORDS
Bins Values
================
3.000 4.000
5.000 18.000
7.000 35.000
9.000 47.000
11.000 62.000
13.000 72.000
15.000 81.000
17.000 86.000
19.000 92.000
21.000 100.000
23.000
Example using Histogram
directly¶
>>> from pyvttbl.stats import Histogram
>>> form random import normalvariate
>>> data = [normalvariate(mu=0,sigma=1) for i in xrange(1000)]
>>> hist = Histogram()
>>> hist.run(data, bins=20)
>>> print(hist)
Histogram for
Bins Values
================
-2.562 4.000
-2.280 11.000
-1.999 25.000
-1.717 21.000
-1.435 40.000
-1.153 90.000
-0.872 93.000
-0.590 84.000
-0.308 107.000
-0.027 121.000
0.255 101.000
0.537 88.000
0.819 87.000
1.100 39.000
1.382 38.000
1.664 26.000
1.945 10.000
2.227 8.000
2.509 4.000
2.791 3.000
3.072
>>> hist.run(data, bins=20, cumulative=True)
>>> print(hist)
Cumulative Histogram for
Bins Values
=================
-2.562 4.000
-2.280 15.000
-1.999 40.000
-1.717 61.000
-1.435 101.000
-1.153 191.000
-0.872 284.000
-0.590 368.000
-0.308 475.000
-0.027 596.000
0.255 697.000
0.537 785.000
0.819 872.000
1.100 911.000
1.382 949.000
1.664 975.000
1.945 985.000
2.227 993.000
2.509 997.000
2.791 1000.000
3.072