-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
IPv6 address representation in WARC-IP-Address field #100
Comments
Your suggestion seems sensible. I've added it as a community recommendation. |
I'd guess that Browsertrix or other browser-based tools already generate IPv6 traffic -- what does it do with these addresses? Also, wget? |
wget calls The current version of browsertrix-crawler doesn't emit WARC-IP-Address. An older version I had lying around seemed to produce the canonical form. Heritrix doesn't support IPv6. jwarc currently relies on Java's default which is the expanded old "preferred" form. |
Thanks for the clarification! I can confirm:
|
This question is about IPv6 address representation in WARC captures.
x:x:x:x:x:x:x:x
) is the "preferred" one. However,::
notation, lowercase, and further detailed format specifications.I'd be in favor of the format specified in RFC5952. But the WARC standard refers to RFC4291 and does not say anything about RFCs superseded or updated by another RFC. Are there any recommendations?
The text was updated successfully, but these errors were encountered: