MareArts Computer Vision Study.: paginate

Showing posts with label paginate. Show all posts

3/30/2023

The list_objects_v2 function returns up to 1000 objects by default. To read all the contents in the bucket, you can use pagination.

refer to code:

You can modify '.json' for you case.

import boto3

def get_origin_fn_list(ORIGIN_DATA_S3, ORIGIN_DATA_S3_prefix):
    s3 = boto3.client('s3')
    paginator = s3.get_paginator('list_objects_v2')
    origin_path = {}

    for response in paginator.paginate(Bucket=ORIGIN_DATA_S3, Prefix=ORIGIN_DATA_S3_prefix):
        for obj in response['Contents']:
            if obj['Key'][-4:] == '.json':
                path = obj['Key']
                uid = path.split('/')[-2]
                origin_path[uid] = path

    print(f"get kv.json list: {len(origin_path)}/{sum(1 for _ in paginator.paginate(Bucket=ORIGIN_DATA_S3, Prefix=ORIGIN_DATA_S3_prefix))}")
    return origin_path

Thank you.

🙇🏻‍♂️

www.marearts.com

10/12/2021

aws dynamodb scan all item using paginate

Refer to code, Thank you!

import boto3

ACCESS_KEY = '***'
SECRET_KEY = '***'
dynamo_client = boto3.client(
    'dynamodb',
    aws_access_key_id=ACCESS_KEY,
    aws_secret_access_key=SECRET_KEY,
    region_name = 'eu-west-1'
)


def Search_Item(TABLE_NAME):
    paginator = dynamo_client.get_paginator('scan')
    operation_parameters = {'TableName': TABLE_NAME,}
    page_iterator = paginator.paginate(**operation_parameters)    
    return page_iterator


TABLE_NAME ='****'

page_iterator = Search_Item(TABLE_NAME)
for page in page_iterator:
    for item in page['Items']:
        print(item['UID']['S'])

www.marearts.com

🙇🏻‍♂️

Pages

3/30/2023

The list_objects_v2 function returns up to 1000 objects by default. To read all the contents in the bucket, you can use pagination.

10/12/2021

aws dynamodb scan all item using paginate