Can you clearly document what the actual heuristic is? It's not clear to me from reading the diff, and it's not the same as the Notepad one. Why, for instance, are you not decoding UTF-8 text?
What happens if a UnicodeDecodeError is raised from one of these functions?
Is this partly inspired by the current thread on python-dev? <http://mail.python.org/pipermail/python-dev/2010-January/094828.html>