Bazaar

lp://staging/~gagern/bzr/str-unicode

Created by Martin von Gagern on 2008-06-10 and last modified on 2008-06-17

Python automatically converts between byte strings and unicode strings, using the default character encoding ASCII. When a string contains non-ASCII characters, this conversion will fail - a fact that will most likely only occur in special circumstances. Therefore it is desirable to handle all conversions manually, and specify the correct encodiung to use explicitely.

This branch achieves this by defining a new encoding which behaves like ASCII but writes a log of its invocations. it is set as the default encoding, the one Python uses for its internal conversions.

Get this branch:: bzr branch lp://staging/~gagern/bzr/str-unicode

Only Martin von Gagern can upload to this branch. If you are Martin von Gagern please log in for upload directions.

Branch merges

No branches dependent on this one.

Related bugs

Link a bug report

Related blueprints

Branch information

Owner:: Martin von Gagern

Project:: Bazaar

Status:: Experimental

Recent revisions

3499. By Martin von Gagern on 2008-06-17: Skip literals to keep the noise down. The approach is rather heuristic, and
might yield both false positives (lines on the stack contain the string as a
literal, but the converted object comes from a variable instead) and false
negatives (e.g. different use of escapes). Most occurrences are handled
correctly, though, which increases the speeed and usability.
3498. By Martin von Gagern on 2008-06-17: merged bzr.dev
3497. By Martin von Gagern on 2008-06-17: Use indexes
3496. By Martin von Gagern on 2008-06-16: Create text file at initialization time; handle empty log
3495. By Martin von Gagern on 2008-06-16: Pattern matching for db logs
3494. By Martin von Gagern on 2008-06-16: Warn for truncated log
3493. By Martin von Gagern on 2008-06-16: Ignore generated SqLite db
3492. By Martin von Gagern on 2008-06-16: SqLite based logger and log analyzer
3491. By Martin von Gagern on 2008-06-10: Proof of concept for a logger for automatic str/unicode conversions.
3490. By Canonical.com Patch Queue Manager <email address hidden> on 2008-06-10: (mbp) Bump version to 1.6b3

This branch contains Public information

Everyone can see this information.

Subscribers

No subscribers.