lp://staging/perl5

Created by Registry Administrators and last modified
Get this branch:
bzr branch lp://staging/perl5

Related bugs

Related blueprints

Branch information

Owner:
Registry Administrators
Project:
perl5
Status:
Development

Import details

Import Status: Suspended

This branch is an import of the HEAD branch of the Git repository at git://perl5.git.perl.org/perl.git.

Last successful import was .

Import started on izar and finished taking 1 hour — see the log
Import started on alnitak and finished taking 1 hour — see the log
Import started on izar and finished taking 1 hour — see the log
Import started on alnitak and finished taking 1 hour — see the log

Recent revisions

56224. By Karl Williamson <email address hidden>

mktables: Correct L<> for perluniprops; rmv trail space

56223. By Karl Williamson <email address hidden>

t/porting/regen.t: Add test for new uni_keywords.h

56222. By Karl Williamson <email address hidden>

regen/mk_invlists.pl: Fix outdated comments

56221. By Karl Williamson <email address hidden>

regen/mk_invlists.pl: use re 'qr/aa'

This makes sure that all patterns in this file are compiled under /aa.
Doing this can catch bugs. The bug the previous commit fixes would have
been caught if we did this.

56220. By Karl Williamson <email address hidden>

regen/mk_invlists.pl: Fix chicken and egg problem

The problem here is that it was using a regular expression pattern to
determine if a code point is the integer 0. When a new Unicode release
comes along and adds a new block of decimals, this routine should be run
before the interpreter is compiled for real. And the pattern won't know
about the new block, so this would fail.

Solve the problem by using only Unicode::UCD to discover this info, and
not a pattern.

56219. By Karl Williamson <email address hidden>

mktables: Add, change some comments

56218. By Karl Williamson <email address hidden>

utf8.c: Use a more generic enum instead of explicit ptr

This changes, where possible, the reference to an inversion list, from
its specific name, to using an enum value (or a #define to an enum
value) which is an offset into a list of inversion lists.

This seems slightly more robust to me, as we don't have to know the
precise name of the table, but can use an enum which may have #define's
for it to create synonyms. Some versions of Unicode may not have the
precise name, but regen/mk_invlists.pl creates synonyms where possible,
so the chances of it being undefined go down.

Currently there is an inconsistency in the tables' names. Some recent
ones all begin with 'PL_'. That was when I thought these tables were
all going to be public. But then it turned out that they could just be
defined in one file (utf8.c), so the prefix is probably unnecessary.
Older tables didn't have that, and haven't changed. I'm not sure how it
will or should turn out.

56217. By Karl Williamson <email address hidden>

utf8.c: Reorder some initialization code

This puts the code into various related groups.

56216. By Karl Williamson <email address hidden>

utf8.c: Fix \p{} to work on old Unicodes

This change to use one #define instead of a synonym causes the code to
work unchanged on any Unicode version. The synonym isn't defined in
very old Unicodes, so this wouldn't compile for them.

56215. By Karl Williamson <email address hidden>

utf8.c: qr/\p{}/ Handle Unihan numeric properties

The Unihan data base is not shipped with perl due to its size. But we
allow someone to copy its files into the unicore directory and recompile
perl in order to get access to its properties. Some of those properties
are numeric, which, like the nv property, require special handling in
utf8.c. This commit adds that handling.

Branch metadata

Branch format:
Branch format 7
Repository format:
Bazaar repository format 2a (needs bzr 1.16 or later)
This branch contains Public information 
Everyone can see this information.

Subscribers

No subscribers.