Generic: use compat_urllib_parse_unquote to prevent utf8 mangling
of the entire page in python 2.
-requires- fixed compat_urllib_parse_unquote
example - the following will save with a mangled playlist title,
 instead of the kanji for 'tsunami'. This affects all utf8encoded
 urls as well
youtube-dl -f18 -o '%(playlist_title)s-%(title)s.%(ext)s' \
  61c14c1e3a/tsunami.html
			
			
This commit is contained in:
		@@ -1115,7 +1115,7 @@ class GenericIE(InfoExtractor):
 | 
				
			|||||||
        # Sometimes embedded video player is hidden behind percent encoding
 | 
					        # Sometimes embedded video player is hidden behind percent encoding
 | 
				
			||||||
        # (e.g. https://github.com/rg3/youtube-dl/issues/2448)
 | 
					        # (e.g. https://github.com/rg3/youtube-dl/issues/2448)
 | 
				
			||||||
        # Unescaping the whole page allows to handle those cases in a generic way
 | 
					        # Unescaping the whole page allows to handle those cases in a generic way
 | 
				
			||||||
        webpage = compat_urllib_parse.unquote(webpage)
 | 
					        webpage = compat_urllib_parse_unquote(webpage)
 | 
				
			||||||
 | 
					
 | 
				
			||||||
        # it's tempting to parse this further, but you would
 | 
					        # it's tempting to parse this further, but you would
 | 
				
			||||||
        # have to take into account all the variations like
 | 
					        # have to take into account all the variations like
 | 
				
			||||||
 
 | 
				
			|||||||
		Reference in New Issue
	
	Block a user