add: Support for ESM v2 partial batch failure handling (Kinesis & DynamoDB) #9

lizard-boy · 2024-09-20T00:49:08Z

Motivation

This PR adds support for FunctionResponseTypes to ESM v2, allowing for partial batch failures to be reported and handled. Failed items of a batch can now be retried in accordance with a MaximumRetryAttempts policy.

Changes

Event Source Mapping: SQS

Fixed failing test at tests/aws/services/lambda_/event_source_mapping/test_lambda_integration_sqs.py::test_report_batch_item_failures_invalid_result_json_batch_fails by propagating error and payload information to the raised PartialBatchFailure.

Event Source Mapping: Kinesis & DynamoDB

ESM v2 supports setting FunctionResponseTypes to [ReportBatchItemFailures] and handling partial batch failures when batchItemFailures is correctly returned by Lambda invocation via Kinesis or DynamoDB ESM.

Testing

Partial Failure with ReportBatchItemFailures

Both tests simulates a partial batch failure, capturing failure information in an OnFailure destination config using a DLQ:

DynamoDB: Added TestDynamoDBEventSourceMapping::test_dynamodb_report_batch_item_failures
Kinesis: Added TestKinesisSource::test_kinesis_report_batch_item_failures which simulates a batch failure and captures failure information in a DLQ

Success and failure conditions

All success and failure cases outlined in the ESM docs have test coverage (where conditions are identical for all Kinesis, DynamoDB, and SQS). These are covered in the following parametrized tests:

Total Batch Successes:

test_kinesis_report_batch_item_success_scenarios
test_dynamodb_report_batch_item_success_scenarios

Total Batch Failures:

test_kinesis_report_batch_item_failure_scenarios
test_dynamodb_report_batch_item_failure_scenarios

greptile-apps

PR Summary

This PR adds support for partial batch failure handling in Event Source Mapping (ESM) v2 for Kinesis, DynamoDB, and SQS in LocalStack's Lambda service.

Implemented FunctionResponseTypes support in EsmConfigFactory for SQS, Kinesis, and DynamoDB Streams
Enhanced StreamPoller to handle PartialBatchFailureError, including retry logic for failed items
Added report_batch_item_failures flag in LambdaSender to enable partial success/failure processing
Introduced new test cases in test_lambda_integration_dynamodbstreams.py and test_lambda_integration_kinesis.py for partial batch failure scenarios
Updated utils.py with new Lambda function templates for DynamoDB and Kinesis batch item failure handling

_{16 file(s) reviewed, 9 comment(s)}
_{Edit PR Review Bot Settings}

greptile-apps · 2024-09-20T00:49:57Z

localstack-core/localstack/services/lambda_/event_source_mapping/esm_config_factory.py

-        )
+
+        # TODO What happens when FunctionResponseTypes value or target service is invalid?
+        if service in ["sqs", "kinesis", "dynamodbstreams"]:


style: Use a constant or enum for the list of services that support FunctionResponseTypes

greptile-apps · 2024-09-20T00:53:46Z

localstack-core/localstack/services/lambda_/event_source_mapping/pollers/stream_poller.py

+    def bisect_events(
+        self, sequence_number: str, events: list[dict]
+    ) -> tuple[list[dict], list[dict]]:
+        for i, event in enumerate(events):
+            if self.get_sequence_number(event) == sequence_number:
+                return events[:i], events[i:]
+
+        return events, []


style: Consider optimizing bisect_events for large event lists, potentially using binary search

greptile-apps · 2024-09-20T00:54:25Z

localstack-core/localstack/services/lambda_/event_source_mapping/senders/lambda_sender.py

@@ -19,9 +19,20 @@ class LambdaSender(Sender):
    # Flag to enable the payload dict using the "Records" key used for Lambda event source mapping
    payload_dict: bool

-    def __init__(self, target_arn, target_parameters=None, target_client=None, payload_dict=False):
+    # Flag to enable partial successes/failures when processing batched events through a Lambda event source mapping
+    report_batch_item_failures: bool


style: Consider adding type hints for boolean flags

greptile-apps · 2024-09-20T00:55:59Z

tests/aws/services/lambda_/event_source_mapping/test_lambda_integration_dynamodbstreams.py

+        expected_successes = 5
+        expected_failures = 1


style: Consider using a constant or configuration value for these numbers

greptile-apps · 2024-09-20T01:00:06Z

tests/aws/services/lambda_/event_source_mapping/test_lambda_integration_sqs.py

@@ -578,7 +563,7 @@ def test_report_batch_item_failures(
    )
    snapshot.match("first_invocation", first_invocation)

-    # check that the DQL is empty
+    # check that the DLQ is empty
    dlq_messages = aws_client.sqs.receive_message(QueueUrl=event_dlq_url)


syntax: DQL typo

greptile-apps · 2024-09-20T01:00:07Z

tests/aws/services/lambda_/event_source_mapping/test_lambda_integration_sqs.py

@@ -578,7 +563,7 @@ def test_report_batch_item_failures(
    )
    snapshot.match("first_invocation", first_invocation)

-    # check that the DQL is empty
+    # check that the DLQ is empty
    dlq_messages = aws_client.sqs.receive_message(QueueUrl=event_dlq_url)


syntax: DLQ misspelled as DQL

Suggested change

dlq_messages = aws_client.sqs.receive_message(QueueUrl=event_dlq_url)

# check that the DLQ is empty

greptile-apps · 2024-09-20T01:00:46Z

tests/aws/services/lambda_/event_source_mapping/utils.py

+        if new_image.get("should_fail", {}).get("BOOL", False):
+            batch_item_failures.append({"itemIdentifier": record["dynamodb"]["SequenceNumber"]})


style: Consider using .get('BOOL') instead of .get('BOOL', False) for consistency with the Kinesis handler

greptile-apps · 2024-09-20T01:00:47Z

tests/aws/services/lambda_/event_source_mapping/utils.py

+        if payload.get("should_fail", False):
+            batch_item_failures.append({"itemIdentifier": record["kinesis"]["sequenceNumber"]})
+
+    return {"batchItemFailures" : batch_item_failures}


syntax: Extra space before the colon in 'batchItemFailures :'

greptile-apps · 2024-09-20T01:00:48Z

tests/aws/services/lambda_/event_source_mapping/utils.py

+def create_lambda_with_response(response: str) -> str:
+    """Creates a lambda with pre-defined response"""
+    return _LAMBDA_WITH_RESPONSE.format(response=response)


style: Add type hinting for the 'response' parameter

gregfurman added 3 commits September 13, 2024 14:59

ESM-v2: Add ReportBatchItemFailure support (SQS, Kinesis, DynamoDB)

2cbf8ed

ESM-v2: Add tests for ReportBatchItemFailures cases (Kinesis, DynamoDB)

3bb56ab

ESM-v2: Add snapshots for new parity tests

3e49acd

greptile-apps bot reviewed Sep 20, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add: Support for ESM v2 partial batch failure handling (Kinesis & DynamoDB) #9

add: Support for ESM v2 partial batch failure handling (Kinesis & DynamoDB) #9

lizard-boy commented Sep 20, 2024

greptile-apps bot left a comment

greptile-apps bot Sep 20, 2024

greptile-apps bot Sep 20, 2024

greptile-apps bot Sep 20, 2024

greptile-apps bot Sep 20, 2024

greptile-apps bot Sep 20, 2024

greptile-apps bot Sep 20, 2024

greptile-apps bot Sep 20, 2024

greptile-apps bot Sep 20, 2024

greptile-apps bot Sep 20, 2024

	dlq_messages = aws_client.sqs.receive_message(QueueUrl=event_dlq_url)
	# check that the DLQ is empty

		if new_image.get("should_fail", {}).get("BOOL", False):
		batch_item_failures.append({"itemIdentifier": record["dynamodb"]["SequenceNumber"]})

add: Support for ESM v2 partial batch failure handling (Kinesis & DynamoDB) #9

Are you sure you want to change the base?

add: Support for ESM v2 partial batch failure handling (Kinesis & DynamoDB) #9

Conversation

lizard-boy commented Sep 20, 2024

Motivation

Changes

Event Source Mapping: SQS

Event Source Mapping: Kinesis & DynamoDB

Testing

Partial Failure with ReportBatchItemFailures

Success and failure conditions

Total Batch Successes:

Total Batch Failures:

greptile-apps bot left a comment

Choose a reason for hiding this comment

PR Summary

greptile-apps bot Sep 20, 2024

Choose a reason for hiding this comment

greptile-apps bot Sep 20, 2024

Choose a reason for hiding this comment

greptile-apps bot Sep 20, 2024

Choose a reason for hiding this comment

greptile-apps bot Sep 20, 2024

Choose a reason for hiding this comment

greptile-apps bot Sep 20, 2024

Choose a reason for hiding this comment

greptile-apps bot Sep 20, 2024

Choose a reason for hiding this comment

greptile-apps bot Sep 20, 2024

Choose a reason for hiding this comment

greptile-apps bot Sep 20, 2024

Choose a reason for hiding this comment

greptile-apps bot Sep 20, 2024

Choose a reason for hiding this comment