lp://staging/~gagern/bzr/str-unicode

Created by Martin von Gagern and last modified

Python automatically converts between byte strings and unicode strings, using the default character encoding ASCII. When a string contains non-ASCII characters, this conversion will fail - a fact that will most likely only occur in special circumstances. Therefore it is desirable to handle all conversions manually, and specify the correct encodiung to use explicitely.

This branch achieves this by defining a new encoding which behaves like ASCII but writes a log of its invocations. it is set as the default encoding, the one Python uses for its internal conversions.

Get this branch:
bzr branch lp://staging/~gagern/bzr/str-unicode
Only Martin von Gagern can upload to this branch. If you are Martin von Gagern please log in for upload directions.

Branch merges

Related bugs

Related blueprints

Branch information

Owner:
Martin von Gagern
Project:
Bazaar
Status:
Experimental

Recent revisions

3499. By Martin von Gagern

Skip literals to keep the noise down. The approach is rather heuristic, and
might yield both false positives (lines on the stack contain the string as a
literal, but the converted object comes from a variable instead) and false
negatives (e.g. different use of escapes). Most occurrences are handled
correctly, though, which increases the speeed and usability.

3498. By Martin von Gagern

merged bzr.dev

3497. By Martin von Gagern

Use indexes

3496. By Martin von Gagern

Create text file at initialization time; handle empty log

3495. By Martin von Gagern

Pattern matching for db logs

3494. By Martin von Gagern

Warn for truncated log

3493. By Martin von Gagern

Ignore generated SqLite db

3492. By Martin von Gagern

SqLite based logger and log analyzer

3491. By Martin von Gagern

Proof of concept for a logger for automatic str/unicode conversions.

3490. By Canonical.com Patch Queue Manager <email address hidden>

(mbp) Bump version to 1.6b3

This branch contains Public information 
Everyone can see this information.

Subscribers

No subscribers.