Repair benchmark and automate it further
The benchmark is now compatible with the new option, and there is an easy way to generate a large amount of sentences. The file src/middle.ml was modified in a way that is not completely satisfying, but necessary for now : the .tokens file are assumed to only have a single line return per token, and dropping this assumption is complicated.