Create a subset of a DataFrame depending on column name

Question:

I have a pandas DataFrame called timedata with different column names, some of which contain the word Vibration, some eccentricity. Is is possible to create a dataframe of just the columns containing the word Vibration?

I have tried using

vib=[]
for i in timedata:
    if 'Vibration' in i:
        vib=vib.append(i)

to then create a DataFrame based on the indicies of these columns. This really does not seem like the most efficient way to do it and I’m sure there must be something simple to do with list comprehension.

EDIT

Dataframe of form:

df = DataFrame({'Ch 1:Load': randn(10), 'Ch 2:Vibration Brg 1T ': randn(10), 'Ch 3:Eccentricity Brg 1H ': randn(10), 'Ch 4:Vibration Brg 2T ': randn(10)})

Sorry I’m having a slow day! thanks for any help

Asked By: user2761786

||

Answers:

 newDf    = Df.loc[:,['Vibration']]

or

newDf    = Df.loc[:,['Vibration','eccentricity']]

to get more collumns

to search for a value in a collumn:

newDf    =  Df[Df["CollumnName"] == "vibration"]    
Answered By: Svend Feldt

Something like this to manually select all columns with the word “Vibration” in it:

df[[col for col in df.columns if "Vibration" in col]]

You can also do the same with the filter method:

df.filter(like="Vibration")

If you want to do a more flexible filter, you can use the regex option. E.g. to look if “Vibration” or “Ecc” is in the column name:

df.filter(regex='Ecc|Vibration')
Answered By: joris
Categories: questions Tags: , , ,
Answers are sorted by their score. The answer accepted by the question owner as the best is marked with
at the top-right corner.