Dump in PyYaml as utf-8

Question:

I’m trying to load a bunch of utf-8 encoded strings and dump them again with PyYaml. This is the code for loading/dumping:

lang_yml = yaml.load(codecs.open(lang + ".yml.old", "r", "utf-8"))
test_file_path = lang + '.yml'
stream = file(test_file_path, 'w')
yaml.dump(lang_yml, stream,  default_flow_style=False, encoding=('utf-8'))

But a strings that start as “En arrière” ends up being saved as “En arrixE8re”.
What am I doing wrong?

Asked By: panmari

||

Answers:

Found the answer myself. I just had to dump it with the argument

allow_unicode=True

Source: http://dpinte.wordpress.com/2008/10/31/pyaml-dump-option/

Answered By: panmari
Categories: questions Tags: , ,
Answers are sorted by their score. The answer accepted by the question owner as the best is marked with
at the top-right corner.