Allow users to access IP/TCP/UDP headers #1031

lupino3 · 2019-10-30T16:49:04Z

This will allow users to access packet metadata without needing to parse
the "parsedPacket" string, and will also allow them to access the packet
data present after the headers for further computation, because the
header addresses are now exposed.

The IPHeader, TCPHeader, UDPHeader structs have been moved outside
PacketFragmentArgs to allow definition of properties with the same name.
They have been converted from structs to class to make them into
reference types and therefore have the homonymic properties return null
if the corresponding header was not successfully parsed by
ParsePacket().

I am proposing this change because I need to parse DNS request/response
in packets, and it is almost impossible to do it with the current class
structure without duplicating the code of FIndIpHeader(), and I think
that it's much cleaner to expose the headers themselves rather than just
exposing FindIpHeader() and having the users duplicate the parsing
logic.

lupino3 · 2019-10-30T16:53:34Z

@brianrob could you please review? This is the last bit I need for my own ETL parsing tool which interprets DNS data. Thanks! :)

brianrob · 2019-10-31T00:58:47Z

@lupino3 thanks for submitting. I will need a bit of time to look through this, but consider this an ack.

lupino3 · 2019-10-31T09:37:04Z

Sure, thanks @brianrob. Note that the diff algorithm is fooled by my addition of two extra properties. I didn't change a lot in the body of the existing ParsedPacket getter, which is now moved to ParsePacket() to make sure it's invoked for any of the properties that need parsing.

The only changes there are storing the parsed packet, IP/TCP/UDP headers in a private class attribute, which is then returned by the corresponding property.

I also added a parsed boolean to avoid running the parsing code multiple times for no reason (the result will always be the same).

Hope the change makes sense! Thanks.

brianrob · 2025-01-14T18:56:44Z

We're working to clean-up old open PRs in this repo. This PR is greater than 1 year old. If you would like to continue working on this PR, please add a comment within the next 7 days so that we can start discussion on next steps. Otherwise, we will close this PR. Please feel free to open a new PR or issue if you'd like to re-open this discussion at a later date.

lupino3 · 2025-01-15T10:12:53Z

@brianrob this is a very old PR, I haven't worked on this project for 5 years :)

That being said, I think this feature is useful and I think it should be merged.
Please let me know if there are any changes you'd need in order to get it in. Thanks!

brianrob · 2025-01-15T19:30:11Z

/azp run

azure-pipelines · 2025-01-15T19:30:20Z

Azure Pipelines successfully started running 1 pipeline(s).

brianrob · 2025-01-15T19:32:30Z

@lupino3 thanks for your response. I've triggered the CI for validation. Also, one additional question in the code review.

brianrob · 2025-01-15T19:27:55Z

src/TraceEvent/Parsers/Microsoft-Windows-NDIS-PacketCapture.cs

@@ -695,6 +635,119 @@ public override object PayloadValue(int index)

        protected override internal void SetState(object newState) { m_state = (MicrosoftWindowsNDISPacketCaptureTraceEventParserState)newState; }
        private MicrosoftWindowsNDISPacketCaptureTraceEventParserState m_state;
+
+        private bool parsed = false;


Do you expect parsed to be set to false for each new event? In most cases, you'll just have a single instance of this object and it will be attached to a raw payload and used as a parser. Thus, if you set parsed = true, it will remain true for the next event as well.

I see, I didn't take that into account at all! I see now the notice in the TraceEvent documentation. Do you think this PR looks good if I simply remove parsed, or would you recommend further changes?

I think the idea of exposing the data is reasonable. The biggest concern I have is the lifetime. I think the way to address this would be to clone the TraceEvent object and then reference it from the networking objects rather than passing a byte*. This would ensure that the lifetime of the referenced data is correct. When you call Clone on the TraceEvent object, it copies the buffer into separate memory that will live longer than the callback. Then it's safe to do the kind of stuff that you're doing here. So, I think the right path forward would be to have an API on the event itself that gets you an intermediate object that owns a clone of the TraceEvent and then each of the structured objects that you create that take a byte* can just keep a reference to the clone. Once they all go out of scope, the clone will be finalized. Alternatively, you could keep track and dispose of the clone when done.

Would copying the relevant portion of buffer into the *Header object itself make sense?

This way I don't have to manage the lifetime of TraceEvent, and the copy can be done when necessary (i.e., when the user accesses the corresponding property).

Yes, this would be fine as well.

This will allow users to access packet metadata without needing to parse the "parsedPacket" string, and will also allow them to access the packet data present after the headers for further computation, because the header addresses are now exposed. The IPHeader, TCPHeader, UDPHeader structs have been moved outside PacketFragmentArgs to allow definition of properties with the same name. They have been converted from structs to class to make them into reference types and therefore have the homonymic properties return null if the corresponding header was not successfully parsed by ParsePacket(). I am proposing this change because I need to parse DNS request/response in packets, and it is almost impossible to do it with the current class structure without duplicating the code of FIndIpHeader(), and I think that it's much cleaner to expose the headers themselves rather than just exposing FindIpHeader() and having the users duplicate the parsing logic.

Base automatically changed from master to main February 2, 2021 23:16

brianrob added the NotStale label Jan 15, 2025

brianrob reviewed Jan 15, 2025

View reviewed changes

brianrob removed the NotStale label Jan 27, 2025

lupino3 force-pushed the master branch from ba18036 to f6a83b3 Compare January 28, 2025 14:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow users to access IP/TCP/UDP headers #1031

Allow users to access IP/TCP/UDP headers #1031

lupino3 commented Oct 30, 2019

lupino3 commented Oct 30, 2019

brianrob commented Oct 31, 2019

lupino3 commented Oct 31, 2019

brianrob commented Jan 14, 2025

lupino3 commented Jan 15, 2025

brianrob commented Jan 15, 2025

azure-pipelines bot commented Jan 15, 2025

brianrob commented Jan 15, 2025

brianrob Jan 15, 2025

lupino3 Jan 16, 2025

brianrob Jan 21, 2025

lupino3 Jan 28, 2025

brianrob Jan 28, 2025

Allow users to access IP/TCP/UDP headers #1031

Are you sure you want to change the base?

Allow users to access IP/TCP/UDP headers #1031

Conversation

lupino3 commented Oct 30, 2019

lupino3 commented Oct 30, 2019

brianrob commented Oct 31, 2019

lupino3 commented Oct 31, 2019

brianrob commented Jan 14, 2025

lupino3 commented Jan 15, 2025

brianrob commented Jan 15, 2025

azure-pipelines bot commented Jan 15, 2025

brianrob commented Jan 15, 2025

brianrob Jan 15, 2025

Choose a reason for hiding this comment

lupino3 Jan 16, 2025

Choose a reason for hiding this comment

brianrob Jan 21, 2025

Choose a reason for hiding this comment

lupino3 Jan 28, 2025

Choose a reason for hiding this comment

brianrob Jan 28, 2025

Choose a reason for hiding this comment