How to add Headers to read output from Python CSV File?

Question:

I have a csv file, with simply looks like this:

CSV File

I have the following code, which reads the csv file, and then can then print/ access information in the CSV file.

import csv

class CsvReader:
    
    with open("Items.csv") as fp:
        reader = csv.reader(fp, delimiter=",", quotechar='"')
        next(reader, None)  # skip the headers
        data_read = [row for row in reader]

    print(data_read[0])

I get this as the output from the print

['1', '5.99$, '1', 'Blueberry Muffin']

How would I format this into a dictionary, with the headers as the keys and the information as the elements?

For example, the code would output:

{Item #: 1, Price: 5.99, Quantity: 1, Name: Blueberry Muffin}

I referenced and saw a lot of similarities in this post: How do I read and write CSV files with Python?

but couldnt find any more specifics on how to format the output specifically this way, without using something such as pandas, which I am not looking to use.

Asked By: Fear_Schwier

||

Answers:

If you want dict keys as fields i.e columns then why are you skipping them.
Here the easy solution.

import csv

class CsvReader:
    with open("Item.csv") as fp:
        reader = csv.reader(fp, delimiter=",", quotechar='"')
        fields = next(reader)
        data_read = []
        for row in reader:
            data_read.append(dict(zip(fields, row)))

    print(data_read[0])

First store the column names and map them with each row element.

Answered By: Harsh Narwariya

There’s a couple ways to do this… And I agree that using Pandas is likely overkill for reading simple files. You could argue that even using csv_reader is overkill. 🙂

Anyhow, here are 3 variations. All you need to do is capture the labels and use them as the keys in the dictionary. Realize that the methods below will get you a "list of dictionaries" (or "records" type format in pandas-speak). An alternative would be a "dictionary of dictionaries" using the item number as the first key, but in essence that is the same as a list index…so about the same. You could also probably forgo capturing the item number as that is just the index in the resultant list of dicts, but that is nuance.

You might also be interested in capturing them in a named tuple which is shown in the last variant. Very easy to work with…

# Grocery Reader

import csv
from collections import namedtuple


with open("data.csv") as fp:
    reader = csv.reader(fp, delimiter=",", quotechar='"')
    labels = next(reader, None)  # capture the headers
    result = []
    for row in reader:  # iterate the remaining rows
        pairs = zip(labels, row)
        result.append(dict(pairs))

print(result)

# the above isn't real satisfying as the numeric objects are captured as strings.
# so...
with open("data.csv") as fp:
    reader = csv.reader(fp, delimiter=",", quotechar='"')
    labels = next(reader, None)  # capture the headers
    result = []
    for row in reader:  # iterate the remaining rows
        row[0] = int(row[0])
        row[1] = float(row[1])
        row[2] = int(row[2])
        pairs = zip(labels, row)
        result.append(dict(pairs))

print(result)

with open("data.csv") as fp:
    reader = csv.reader(fp, delimiter=",", quotechar='"')
    labels = next(reader, None)  # capture the headers
    # make lowercase...just for standardization
    labels = [t.lower() for t in labels]
    Grocery = namedtuple('Grocery', labels)
    result = []
    for row in reader:  # iterate the remaining rows
        row[0] = int(row[0])
        row[1] = float(row[1])
        row[2] = int(row[2])
        grocery = Grocery(*row)
        result.append(grocery)

for grocery in result:
    # the below presumes you know the names inside the named tuple...
    print(f'a {grocery.name} costs {grocery.price}')

Yields (data.csv can be inferred)

[{'Item': '1', 'Price': '4.99', 'Qty': '2', 'Name': 'Muffin'}, {'Item': '2', 'Price': '1.25', 'Qty': '6', 'Name': 'Gum'}, {'Item': '3', 'Price': '2.50', 'Qty': '8', 'Name': 'Cookie'}]
[{'Item': 1, 'Price': 4.99, 'Qty': 2, 'Name': 'Muffin'}, {'Item': 2, 'Price': 1.25, 'Qty': 6, 'Name': 'Gum'}, {'Item': 3, 'Price': 2.5, 'Qty': 8, 'Name': 'Cookie'}]
a Muffin costs 4.99
a Gum costs 1.25
a Cookie costs 2.5
Answered By: AirSquid

Use DictReader from here csv.

cat food.csv                                                                                                                                                                                                           
Item #,Price,Quantity,Name
1, 5.99$,1,Blueberry Muffin


import csv
with open('food.csv') as csv_file:
    reader = csv.DictReader(csv_file,delimiter=",", quotechar='"')
    for row in reader:
        print(dict(row))

{'Item #': '1', 'Price': ' 5.99$', 'Quantity': '1', 'Name': 'Blueberry Muffin'}

Answered By: Adrian Klaver

Use csv.DictReader and just skip the first row:

import csv

with open("tmp/1.csv", "r") as f:
    reader = csv.DictReader(
        f, fieldnames=["Item", "Price", "Quantity", "Name"]
    )

    # skipping existing header
    it = iter(reader)
    next(it, None)

    for row in it:
        print(row)
Answered By: westandskif
Categories: questions Tags: , ,
Answers are sorted by their score. The answer accepted by the question owner as the best is marked with
at the top-right corner.