Fix incorrect line folding of RFC2047-encoded strings #77

dvalter · 2020-03-18T14:29:29Z

Use only whitespace characters as separators when folding lines, per section 2.2.3 of RFC 5322. This may increase the number of cases where the hard-limit fallback is used, but it should prevent damage to encoded subjects.

Should fix #54
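
As a rough illustration of the approach described above, the following Go sketch folds a header line only in front of a space or tab and falls back to a hard cut when no whitespace is in range. The name `foldAtWhitespace`, the `limit` parameter, and the exact fallback behaviour are assumptions made for this example; it is not the code in textproto/header.go.

```go
package main

import (
	"fmt"
	"strings"
)

// foldAtWhitespace is a hypothetical helper (not the textproto implementation).
// It splits line into CRLF-terminated lines of at most limit bytes, breaking
// only in front of a space or tab whenever one is available.
// limit is assumed to be at least 2.
func foldAtWhitespace(line string, limit int) string {
	var b strings.Builder
	rest := line
	for len(rest) > limit {
		// Index 0 is skipped: on continuation lines it holds the folding
		// whitespace itself, and breaking there would emit an empty line.
		i := strings.LastIndexAny(rest[1:limit+1], " \t")
		if i < 0 {
			// Assumed hard-limit fallback: no whitespace in range, so cut at
			// the limit and insert a space to start the continuation line.
			b.WriteString(rest[:limit] + "\r\n")
			rest = " " + rest[limit:]
			continue
		}
		i++ // convert back to an index into rest
		b.WriteString(rest[:i] + "\r\n")
		rest = rest[i:] // the whitespace at i starts the next line
	}
	b.WriteString(rest + "\r\n")
	return b.String()
}

func main() {
	subject := "Subject: =?utf-8?q?=E2=80=9CShort subject=E2=80=9D=0A?= =?utf-8?q?=0AAuthor=0A=0AOil_on...?="
	// 78 is the line length RFC 5322 recommends, excluding CRLF.
	fmt.Printf("%q\n", foldAtWhitespace(subject, 78))
}
```

With a limit of 78, the last whitespace in range for this subject is the space between the two encoded words, so the sketch breaks there rather than inside either word.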

codecov-io · 2020-03-18T14:30:11Z

Codecov Report

Merging #77 into master will decrease coverage by 0.12%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master      #77      +/-   ##
==========================================
- Coverage   80.10%   79.97%   -0.13%     
==========================================
  Files          15       15              
  Lines         995      989       -6     
==========================================
- Hits          797      791       -6     
  Misses        122      122              
  Partials       76       76

| Impacted Files | Coverage Δ | |
|---|---|---|
| textproto/header.go | `83.69% <100.00%> (-0.35%)` | ⬇️ |

emersion · 2020-03-30T15:40:58Z

textproto/header_test.go

 	{
 		k:         "Subject",
 		v:         "=?utf-8?q?=E2=80=9CShort subject=E2=80=9D=0A?= =?utf-8?q?=0AAuthor=0A=0AOil_on...?=",
-		formatted: "Subject: =?utf-8?q?=E2=80=9CShort subject=E2=80=9D=0A?= =?utf-8?q?\r\n =0AAuthor=0A=0AOil_on...?=\r\n",
+		formatted: "Subject: =?utf-8?q?=E2=80=9CShort subject=E2=80=9D=0A?=\r\n =?utf-8?q?=0AAuthor=0A=0AOil_on...?=\r\n",


Hmm. What if a space appears in an RFC 2047-encoded string and we fold at it? E.g. =?utf-8?q?=E2=80=9CShort subject=E2=80=9D=0A?= → =?utf-8?q?=E2=80=9CShort\r\n subject=E2=80=9D=0A?=

Are we allowed to do this? IIRC we had the quoted-printable regex to avoid this.

It should not happen, because a word containing a space is clearly ill-formed (sections 2 and 6.3 of RFC 2047).

When unfolded, it should return to its original form, so it should be decoded the same way (if it can be decoded at all). To my knowledge, both Go's mime.WordDecoder and the Thunderbird decoder can handle spaces in Q-encoded UTF-8 strings.
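
To make the unfolding argument concrete, here is a minimal Go sketch. The `unfold` helper is hypothetical and written for this example only; it removes each CRLF that is immediately followed by a space or tab, as RFC 5322 section 2.2.3 describes. `mime.WordDecoder` is the standard library decoder mentioned above; its result is printed rather than assumed.

```go
package main

import (
	"fmt"
	"mime"
	"strings"
)

// unfold is a hypothetical helper: it drops every CRLF that is immediately
// followed by a space or tab, keeping the whitespace character itself.
func unfold(s string) string {
	s = strings.ReplaceAll(s, "\r\n ", " ")
	return strings.ReplaceAll(s, "\r\n\t", "\t")
}

func main() {
	original := "=?utf-8?q?=E2=80=9CShort subject=E2=80=9D=0A?="
	folded := "=?utf-8?q?=E2=80=9CShort\r\n subject=E2=80=9D=0A?="

	unfolded := unfold(folded)
	fmt.Println(unfolded == original) // prints "true"

	// Decode the unfolded value with the standard library decoder.
	dec := new(mime.WordDecoder)
	decoded, err := dec.DecodeHeader(unfolded)
	fmt.Printf("%q %v\n", decoded, err)
}
```

Because unfolding restores the exact original bytes, any decoder that accepted the unfolded original should behave the same way on the folded-then-unfolded value.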

Hmm, it does seem so. I should have investigated a bit more before merging #22 (which this PR effectively reverts).

Anyway, I agree with you, this PR does the right thing.

Going through the RFC and the possible cases, I've found something potentially dangerous in the \n handling.

If the input contains \n without a corresponding \r, we will produce incorrect output (neither printable ASCII, HTAB, or SPACE, nor a \r\n line terminator). I'm not sure how likely this is in practice, though, since any encoder should take care of the \n beforehand.

Users aren't supposed to provide header keys/values with \n or \r. Maybe we should error out in this case?

(This is a separate change though - should probably be in a separate commit or PR)
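
A minimal sketch of what "erroring out" could look like, assuming a standalone check that runs before a field is formatted. The names `checkHeaderField` and `errLineBreak` are invented for this example and are not part of the library's API.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// errLineBreak is a hypothetical error value for this sketch.
var errLineBreak = errors.New("header field contains CR or LF")

// checkHeaderField is a hypothetical validator: it rejects any key or value
// that contains a carriage return or line feed.
func checkHeaderField(k, v string) error {
	if strings.ContainsAny(k, "\r\n") || strings.ContainsAny(v, "\r\n") {
		return errLineBreak
	}
	return nil
}

func main() {
	fmt.Println(checkHeaderField("Subject", "Hello world")) // prints "<nil>"
	fmt.Println(checkHeaderField("Subject", "Hello\nworld") != nil) // prints "true"
}
```

Rejecting both characters outright is the simplest policy; whether the library should instead return the error from its write path is left to the follow-up discussion.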

Sure, an error is the right thing here.
I can open a new issue to discuss possible implementations. Or, if you already have an idea or can fix it straight away, you'll likely do it better since it's your code.

Yes, please open an issue! I always prefer to help contributors send patches instead of doing it myself; that's healthier for the project in the long run.

emersion

LGTM, thanks for your patience!

emersion reviewed Mar 30, 2020

emersion reviewed Apr 6, 2020

emersion reviewed Apr 6, 2020

Fix incorrect line folding of RFC2047-encoded strings

29ab0c8

dvalter force-pushed the fix/line-folding branch from 845e4ed to 29ab0c8 on April 13, 2020 21:33

emersion approved these changes Apr 15, 2020

emersion merged commit fee642d into emersion:master Apr 15, 2020

dvalter mentioned this pull request Apr 15, 2020

RFC 5322 character use limitation #80

Closed

dvalter commented Mar 18, 2020

codecov-io commented Mar 18, 2020 (edited)

emersion Mar 30, 2020

dvalter Mar 30, 2020 (edited)

emersion Apr 6, 2020

dvalter Apr 13, 2020

emersion Apr 14, 2020

emersion Apr 14, 2020

dvalter Apr 15, 2020

emersion Apr 15, 2020

emersion left a comment
