Plot line graph from histogram data in matplotlib
Question:
I have a numpy array of ints representing time periods, which I’m currently plotting in a histogram to get a nice distribution graph, using the following code:
ax.hist(data,bins=100,range=(minimum,maximum),facecolor="r")
However I’m trying to modify this graph to represent the exact same data using a line instead of bars, so I can overlay more samples to the same plot and have them be clear (otherwise the bars overlap each other). What I’ve tried so far is to collate the data array into an array of tuples containing (time, count), and then plot it using
ax.plot(data[:,0],data[:,1],color="red",lw=2)
However that’s not giving me anything close, as I can’t accurately simulate the bins option of the histogram in my plot. Is there a better way to do this?
Answers:
Try ax.plot(zip(*data)[:][0],zip(*data)[:][1],color="red",lw=2)
You can save the output of hist
and then plot it.
import numpy as np
import pylab as p
data=np.array(np.random.rand(1000))
y,binEdges=np.histogram(data,bins=100)
bincenters = 0.5*(binEdges[1:]+binEdges[:-1])
p.plot(bincenters,y,'-')
p.show()
I am very late to the party – but maybe this will be useful to someone else. I think what you need to do is set the histtype parameter to ‘step’, i.e.
ax.hist(data,bins=100,range=(minimum,maximum),facecolor="r", histtype = 'step')
See also http://matplotlib.sourceforge.net/examples/pylab_examples/histogram_demo_extended.html
Seaborn had what I needed:
import pandas as pd
import seaborn as sb
from matplotlib import pyplot as plt
sb.distplot(data, hist=False)
plt.show()
If using seaborn library, since distplot function has been deprecated:
import seaborn
seaborn.histplot(data, element = 'poly', fill= False)
I have a numpy array of ints representing time periods, which I’m currently plotting in a histogram to get a nice distribution graph, using the following code:
ax.hist(data,bins=100,range=(minimum,maximum),facecolor="r")
However I’m trying to modify this graph to represent the exact same data using a line instead of bars, so I can overlay more samples to the same plot and have them be clear (otherwise the bars overlap each other). What I’ve tried so far is to collate the data array into an array of tuples containing (time, count), and then plot it using
ax.plot(data[:,0],data[:,1],color="red",lw=2)
However that’s not giving me anything close, as I can’t accurately simulate the bins option of the histogram in my plot. Is there a better way to do this?
Try ax.plot(zip(*data)[:][0],zip(*data)[:][1],color="red",lw=2)
You can save the output of hist
and then plot it.
import numpy as np
import pylab as p
data=np.array(np.random.rand(1000))
y,binEdges=np.histogram(data,bins=100)
bincenters = 0.5*(binEdges[1:]+binEdges[:-1])
p.plot(bincenters,y,'-')
p.show()
I am very late to the party – but maybe this will be useful to someone else. I think what you need to do is set the histtype parameter to ‘step’, i.e.
ax.hist(data,bins=100,range=(minimum,maximum),facecolor="r", histtype = 'step')
See also http://matplotlib.sourceforge.net/examples/pylab_examples/histogram_demo_extended.html
Seaborn had what I needed:
import pandas as pd
import seaborn as sb
from matplotlib import pyplot as plt
sb.distplot(data, hist=False)
plt.show()
If using seaborn library, since distplot function has been deprecated:
import seaborn
seaborn.histplot(data, element = 'poly', fill= False)