Harden frame processing on large data by uranusjr · Pull Request #69125 · apache/airflow

uranusjr · 2026-06-29T11:34:57Z

Previously, frame encoding and decoding were done against an in-memory byte array. This is simple, but may cause issues with very large amounts of data: the frame protocol allows 2^32 bytes of data per frame, which has the potential to clog the entire JVM.

This change uses the MessagePack library's MessageBuffer helper to encode a MessagePack message into, and decode it from, multiple lazy buffers, converting each buffer to a byte array separately on demand to reduce peak memory usage.

I also cleaned up some abstractions, since they were already pretty empty prior to this change.
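As a rough illustration of the chunk-by-chunk idea described above, here is a minimal sketch that uses only the Kotlin standard library rather than the MessagePack API. The helpers `chunked` and `writeChunks` are hypothetical names made up for this example and are not part of this PR; the point is only that each chunk is materialized as a byte array when it is consumed, so peak memory stays near one chunk instead of the whole payload.

```kotlin
import java.io.ByteArrayOutputStream
import java.io.OutputStream

// Hypothetical helper: lazily yields a payload of `totalBytes` bytes in
// chunks of at most `chunkSize` bytes. Each ByteArray is only allocated
// when the consumer asks for the next element of the sequence.
fun chunked(totalBytes: Int, chunkSize: Int): Sequence<ByteArray> = sequence {
    var offset = 0
    while (offset < totalBytes) {
        val size = minOf(chunkSize, totalBytes - offset)
        val start = offset
        // Filler content; a real encoder would produce message bytes here.
        yield(ByteArray(size) { i -> ((start + i) % 251).toByte() })
        offset += size
    }
}

// Hypothetical helper: writes chunks one at a time and returns the total
// number of bytes written. Only one chunk is referenced at any moment.
fun writeChunks(chunks: Sequence<ByteArray>, out: OutputStream): Long {
    var written = 0L
    for (chunk in chunks) {
        out.write(chunk)
        written += chunk.size
    }
    return written
}
```

Calling `writeChunks(chunked(10, 4), ByteArrayOutputStream())` would write three chunks of 4, 4, and 2 bytes; in a real channel the sink would be the socket or pipe rather than an in-memory stream.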


phanikumv · 2026-06-30T06:13:53Z

+      try {
+        Frame.decode(ChannelFrameInput(reader, declaredLength))
+      } catch (e: Exception) {
+        logger.error(
+          "Failed to read or decode frame",
+          mapOf("length" to declaredLength, "exception" to e),
+        )
+        shutDownRequested = true
+        return
+      }


Suggested change

-      try {
-        Frame.decode(ChannelFrameInput(reader, declaredLength))
-      } catch (e: Exception) {
-        logger.error(
-          "Failed to read or decode frame",
-          mapOf("length" to declaredLength, "exception" to e),
-        )
-        shutDownRequested = true
-        return
-      }
+      try {
+        Frame.decode(ChannelFrameInput(reader, declaredLength))
+      } catch (e: CancellationException) {
+        throw e
+      } catch (e: Exception) {
+        logger.error(
+          "Failed to read or decode frame",
+          mapOf("length" to declaredLength, "exception" to e),
+        )
+        shutDownRequested = true
+        return
+      }

(needs import kotlinx.coroutines.CancellationException)

catch (e: Exception) also catches CancellationException, so a cancellation during decode becomes a clean shutdown instead of propagating. Minor and pre-existing in this file, but the new broad catch makes it worth rethrowing.
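To make the difference concrete, here is a small sketch of the rethrow pattern outside the PR's code. `DecodeOutcome` and `decodeOrShutDown` are hypothetical names used only for illustration. The example imports the standard library's `kotlin.coroutines.cancellation.CancellationException` so that it needs no extra dependency; on the JVM this and `kotlinx.coroutines.CancellationException` are both type aliases for `java.util.concurrent.CancellationException`.

```kotlin
import kotlin.coroutines.cancellation.CancellationException

// Hypothetical stand-in for what the read loop does after a decode attempt.
sealed interface DecodeOutcome {
    data class Decoded(val value: String) : DecodeOutcome
    data class ShutDown(val cause: Exception) : DecodeOutcome
}

// Runs `decode`. Cancellation is rethrown so it keeps propagating to the
// caller; any other exception is converted into a shutdown request.
fun decodeOrShutDown(decode: () -> String): DecodeOutcome =
    try {
        DecodeOutcome.Decoded(decode())
    } catch (e: CancellationException) {
        // Must come before the broad catch, which would otherwise swallow it.
        throw e
    } catch (e: Exception) {
        DecodeOutcome.ShutDown(e)
    }
```

With this ordering, a decode failure such as an `IllegalStateException` yields `ShutDown`, while a `CancellationException` thrown inside `decode` escapes `decodeOrShutDown` unchanged. Without the first catch clause, both cases would end up as `ShutDown`.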

uranusjr marked this pull request as ready for review June 29, 2026 19:23

uranusjr requested a review from jason810496 as a code owner June 29, 2026 19:23

phanikumv reviewed Jun 30, 2026

uranusjr wants to merge 1 commit into apache:main from astronomer:chunk-java-comm
