Python: load words from file into a set

Question:

I have a simple text file with several thousands of words, each in its own line, e.g.

aardvark
hello
piper

I use the following code to load the words into a set (I need the list of words to test membership, so set is the data structure I chose):

my_set = set(open('filename.txt'))

The above code produces a set with the following entries (each word is followed by a space and new-line character:

("aardvark n", "hello n", "piper n")

What’s the simplest way to load the file into a set but get rid of the space and n?

Thanks

Asked By: Roee Adler

||

Answers:

my_set = set(map(str.strip, open('filename.txt')))
Answered By: RichieHindle

The strip() method of strings removes whitespace from both ends.

set(line.strip() for line in open('filename.txt'))
Answered By: user97370

To remove only the right hand spaces.

set(map(str.rstrip, open('filename.txt')))
Answered By: Unknown

Just load all file data and split it, it will take care of one word per line or multiple words per line separated by spaces, also it will be faster to load whole file at once unless your file is in GBs

words =  set(open('filename.txt').read().split())
Answered By: Anurag Uniyal
with open("filename.txt") as f:
    mySet = map(str.rstrip, f)

If you want to use this in Python 2.5, you need

from __future__ import with_statement
Answered By: Matt G
with open("filename.txt") as f:
    s = set([line.rstrip('n') for line in f])
Answered By: mtasic85
Categories: questions Tags: ,
Answers are sorted by their score. The answer accepted by the question owner as the best is marked with
at the top-right corner.