lp://staging/~loic-grobol/+junk/eslotag-scripts

Created by Loïc Grobol and last modified

Scripts for training and using a CRF-based Morpho+POS tagger and formatting corpus.

Get this branch:
bzr branch lp://staging/~loic-grobol/+junk/eslotag-scripts
Only Loïc Grobol can upload to this branch. If you are Loïc Grobol please log in for upload directions.

Related bugs

Related blueprints

Branch information

Owner:
Loïc Grobol
Status:
Experimental

Recent revisions

24. By Loïc Grobol

+ strea_lexemize + misc refactoring

23. By Loïc Grobol

+ Exception support in dfutils.columns

22. By Loïc Grobol

after-refactoring stabilized

21. By Loïc Grobol

beaking typo

20. By Loïc Grobol

(untested) code refactoring tokenizer -> delimiter

19. By Loïc Grobol

Improved doc + (untested) support for custom lexemes in dflib.tokenization.lexemize

18. By Loïc Grobol

Improved doc

17. By Loïc Grobol

stable transcriber cleaning with expansions, stable transcriber format, stable tokenization.

16. By Loïc Grobol

+ support for expanding compound expressions (aux -> à les) in transcriber.clean

15. By Loïc Grobol

Fixed tokenizers misselection in dfutils.tokenization

Branch metadata

Branch format:
Branch format 7
Repository format:
Bazaar repository format 2a (needs bzr 1.16 or later)
This branch contains Public information 
Everyone can see this information.

Subscribers