aws-glue

Using Pandas AWS Glue Python Shell Jobs

Using Pandas AWS Glue Python Shell Jobs Question: The AWS Documentation https://docs.aws.amazon.com/glue/latest/dg/add-job-python.html mentions that The environment for running a Python shell job supports the following libraries: … pandas (required to be installed via the python setuptools configuration, setup.py) But it does not mention how to make the install. How can I use Pandas in a …

Total answers: 5

How to determine what are the columns I need since ApplyMapping is'nt case sensitive?

How to determine what are the columns I need since ApplyMapping is'nt case sensitive? Question: I’m updating a Pyspark script with a new Database model and I’ve encountered some problems calling/updating columns since PySpark apparently brings all columns in uppercase but when I use ApplyMapping it is not case sensitive BUT when I join(By left) …

Total answers: 2

aws Glue job: how to merge multiple output .csv files in s3

aws Glue job: how to merge multiple output .csv files in s3 Question: I created an aws Glue Crawler and job. The purpose is to transfer data from a postgres RDS database table to one single .csv file in S3. Everything is working, but I get a total of 19 files in S3. Every file …

Total answers: 2

AWS Glue python install – Could not find a version

AWS Glue python install – Could not find a version Question: I am trying to use the AWSGlue module in Python, but cannot install the module in the terminal. sh-4.2$ pip install awsglue Collecting awsglue Could not find a version that satisfies the requirement awsglue (from versions: ) No matching distribution found for awsglue Is …

Total answers: 1