data-extraction

Why when extracting products data shows me that they are duplicate?

Why when extracting products data shows me that they are duplicate? Question: When using bs4 the products show me that they are frequent despite all attempts to know solving this problem. I have failed to inform me to solve this problem and where is it? import requests from bs4 import BeautifulSoup import csv import pandas …

Total answers: 2

Sorting Data from a JSON file by date and add up the values for each day

Sorting Data from a JSON file by date and add up the values for each day Question: I have the following problem: My JSON file that I get from my online shop is structured like this: { "orders":[ { "total_price":"100.10", "updated_at":"2022-12-29T12:10:39+01:00" }, { "total_price":"50.20", "updated_at":"2022-12-29T12:05:02+01:00" }, { "total_price":"53.20", "updated_at":"2022-12-29T12:00:39+01:00" }, { "total_price":"50.00", "updated_at":"2022-12-28T11:54:29+01:00" }, { …

Total answers: 1

How to extract numbers from a string that has no spaces into a list

How to extract numbers from a string that has no spaces into a list Question: I have an assignment for which my script should be able to receive a string for input (e.g. "c27bdj3jddj45g" ) and extract the numbers into a list (not just the digits, it should be able to detect full numbers). I …

Total answers: 4

Tensorflow Object Detection and Image Cropping for Data Extraction

Tensorflow Object Detection and Image Cropping for Data Extraction Question: Below is my code that detect the regions (tables, paragraphs) from invoice and and crop the detected region from the invoice. I am facing issues while performing data extraction on cropped images as images are very small. I am passing cropped image to tesseract for …

Total answers: 1

How to extract specific lines from a text file and then from these extracted line, extract the values between parantheses and put them in another file

How to extract specific lines from a text file and then from these extracted line, extract the values between parantheses and put them in another file Question: infile = open(‘results1’, ‘r’) lines = infile.readlines() import re for line in lines: if re.match("track: 1,", line): print(line) question solved by using python regex below Asked By: cosmicdust …

Total answers: 2

Camelot not detecting table within table

Camelot not detecting table within table Question: I have observed that camelot is not detecting nested tables in the sample document I have. In the image attached, I’m getting only one table extracted as whole. Is there anyway using which we can detect the inner tables as well? Asked By: Megha Sirisilla || Source Answers: …

Total answers: 2

Python extract multiple lat/lon from NETCDF files using xarray

Python extract multiple lat/lon from NETCDF files using xarray Question: I have a NC file (time, lat, lon) Download from here and I am trying to extracting time series of multiple stations (lat/lon points Download from here). So I tried it this way to read the coordinates and extract the nearest values from the NC …

Total answers: 1

How to extract a floating number from a string

How to extract a floating number from a string Question: I have a number of strings similar to Current Level: 13.4 db. and I would like to extract just the floating point number. I say floating and not decimal as it’s sometimes whole. Can RegEx do this or is there a better way? Asked By: …

Total answers: 7