%on_error_reduce declarations now have implicit priority levels, so as to
tell Menhir what to do when two such declarations are applicable.
Also, the well-formedness checks on %type and %on_error_reduce declarations
have been reinforced.

A small change in the generated code (both in the code and table back-ends) so
as to avoid OCaml's warning 41. The warning would arise (when compiling a
generated parser with OCaml 4.03) because Menhir's exception [Error] has the
same name as the data constructor [Error] in OCaml's pervasive library.
(Reported by Bernhard Schommer.)

Anonymous rules now work also when used inside a parameterized rule.
(This did not work until now.) When an anonymous rule is hoisted out
of a parameterized rule, it may itself become parameterized. Menhir
parameterizes it only over the parameters that it actually needs.

In the Coq backend, split the largest definitions into smaller
ones. This circumvents a limitation of vm_compute on 32-bit
machines. This also enables us to perform sharing between
definitions, so that the generated files are much smaller.

When printing a grammar (which is done by the --only-preprocess options),
remove the leading bar '|', for compatibility with yacc and bison.

In the code back-end, generate type annotations when extracting a semantic
value out of the stack. When working with a semantic value of some function
type, OCaml would incorrectly warn that this function does not use its
argument. This warning should now be gone.

Makefile changes, so as to support ocamlbuild 4.03, which seems to have
stricter hygiene rules than previous versions.

Prevented an incorrect installation that would take place if USE_OCAMLFIND
was given during "make all" but not during "make install". Added a command
line directive --suggest-ocamlfind.

Fixed a severe bug in Menhir 20151110 which (when using the code back-end)
could cause a generated parser to crash. Thanks to ygrek for reporting the

The code produced by version XXXXXXXX of menhir --table can now be linked only
against a matching version of MenhirLib. If an incorrect version of MenhirLib
is installed, the OCaml compiler should complain that
[MenhirLib.StaticVersion.require_XXXXXXXX] is undefined.
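
The mechanism can be pictured with a minimal OCaml sketch. It assumes that
the library defines one value whose name embeds the version stamp and that
the generated parser merely mentions this value. The module names and the
stamp 20160526 below are placeholders, not the actual contents of MenhirLib.

```ocaml
(* Stand-in for the library side: one value per release, whose name
   carries the version stamp (the stamp used here is a placeholder). *)
module StaticVersion_sketch = struct
  let require_20160526 = ()
end

(* Stand-in for the generated parser: it only mentions the value.
   If the library defined a different stamp, this reference would be
   unbound and the build would fail instead of misbehaving at runtime. *)
module Generated_sketch = struct
  let () = StaticVersion_sketch.require_20160526
  let version_checked = true
end
```

The check costs nothing at runtime: a version mismatch surfaces as an
unbound-value error when the parser is compiled or linked.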

Optimized the computation of $symbolstartpos, based on a couple of assumptions
about the lexer. (See the manual.)
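
As a point of reference, the ocamlyacc behavior that $symbolstartpos is
meant to simulate (Parsing.symbol_start_pos) can be modeled as follows.
This is a sketch under the assumption that the result is the start position
of the first right-hand-side symbol whose start and end positions differ,
with a fallback when there is none. The names [pos] and [symbol_start_pos]
are illustrative and are not part of Menhir.

```ocaml
(* Build a position at character offset [cnum] on line 1 (helper for the
   example only). *)
let pos (cnum : int) : Lexing.position =
  { Lexing.pos_fname = ""; pos_lnum = 1; pos_bol = 0; pos_cnum = cnum }

(* [rhs] lists the (start, end) positions of the right-hand-side symbols,
   from left to right. [fallback] is returned when every symbol is empty. *)
let symbol_start_pos
    (rhs : (Lexing.position * Lexing.position) list)
    (fallback : Lexing.position) : Lexing.position =
  match List.find_opt (fun (s, e) -> s <> e) rhs with
  | Some (s, _) -> s
  | None -> fallback

(* An empty first symbol at offset 4 is skipped; the result is offset 5. *)
let example : Lexing.position =
  symbol_start_pos [ (pos 4, pos 4); (pos 5, pos 8) ] (pos 8)
```

In this example the start position of the first symbol is offset 4, while
the first symbol that actually consumes input starts at offset 5.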

Modified the treatment of %inline so that the positions that are computed are
the same, regardless of whether %inline is used. This property did not hold
until now. It now does. Of course, this means that the positions computed by
the new Menhir are not the same as those computed by older versions of Menhir.

Fixed a bug in the treatment of %inline that would lead to an incorrect
position being computed when the caller and callee had a variable by the
same name.

Modified Menhir so as to compute the start and end positions in the exact same
way as ocamlyacc. (There used to be a difference in the treatment of epsilon
productions.) Of course, this means that the positions computed by the new
Menhir are not the same as those computed by older versions of Menhir. Added
the keyword $symbolstartpos so as to simulate Parsing.symbol_start_pos()
in the ocamlyacc world. The keyword $startpos sometimes produces a position
that is too far off to the left; $symbolstartpos produces a more accurate

Incompatible change of the incremental API: instead of a unit argument, the
entry points (which are named after the start symbols) now require an initial
position, which typically should be [lexbuf.lex_curr_p].

Fix-fix-and-re-fix the Makefile in an attempt to allow installation under
opam/Windows. Thanks to Daniel Weil for patient explanations and testing.

MenhirLib is now installed in both binary and source forms.
"menhir --suggest-menhirLib" reports where MenhirLib is installed.
This can be used to retrieve a snapshot of MenhirLib in source form
and include it in your project (if you wish to use --table mode, yet
do not wish to have a dependency on MenhirLib).

Allow --list-errors to work on 32-bit machines (with low hard limits).
This should fix a problem whereby the 2015/10/23 release could not
bootstrap on a 32-bit machine.

New declaration "%on_error_reduce foo", where foo is a nonterminal symbol.
This modifies the automaton as follows. In every state where a production of
the form "foo -> ..." is ready to be reduced, every error action is replaced
with a reduction of this production. (If there is a conflict between several
productions that could be reduced in this manner, nothing is done.) This does
not affect the language that is accepted by the automaton, but delays the
detection of an error: more reductions take place before the error is

Fixed a bug whereby Menhir would warn about a useless %prec declaration,
even though it was useful. This would happen when the declaration was
duplicated (by inlining or by macro-expansion) and some but not all of
the copies were useful.

Added [has_default_reduction] to the incremental API.

Modified the meaning of --canonical to allow default reductions to take
place. This implies no loss of precision in terms of lookahead sets,
and should allow gaining more contextual information when a syntax
error is encountered. (It should also lead to a smaller automaton.)

A brand new set of tools to work on syntax errors.
New command --list-errors, which produces a list of input sentences which
are representative of all possible syntax errors. (Costly.)
New command --interpret-error, which confirms that one particular input
sentence ends in a syntax error, and prints the number of the state in
which this error occurs.
New command --compile-errors, which compiles a list of erroneous sentences
(together with error messages) to OCaml code.
New command --compare-errors, which compares two lists of erroneous sentences
to check if they cover the same error states.
New command --update-errors, which updates the auto-generated comments in
a list of erroneous sentences.
New command --echo-errors, which removes all comments and messages from
a list of erroneous sentences, and echoes just the sentences.

Additions to the incremental API.
A [supplier] is a function that produces tokens on demand.
[lexer_lexbuf_to_supplier] turns a lexer and a lexbuf into a supplier.
[loop] is a ready-made main parsing loop.
[loop_handle] is a variant that lets the user do her own error handling.
[loop_handle_undo] is a variant that additionally allows undoing the last
  few "spurious" reductions.
[number] maps a state of the LR(1) automaton to its number.
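
A sketch of how these pieces fit together is shown below. It is written as
a functor so that it compiles on its own. The signature PARSER is an
assumed excerpt of what a parser generated in --table mode exposes; the
start symbol [main], its result type [int], and the module name [Drive] are
hypothetical. The entry point is assumed to take an initial position, as in
recent versions of the incremental API.

```ocaml
(* Assumed excerpt of the interface of a generated parser. *)
module type PARSER = sig
  type token
  module MenhirInterpreter : sig
    type 'a checkpoint
    type supplier = unit -> token * Lexing.position * Lexing.position
    val lexer_lexbuf_to_supplier :
      (Lexing.lexbuf -> token) -> Lexing.lexbuf -> supplier
    val loop_handle :
      ('a -> 'answer) -> ('a checkpoint -> 'answer) ->
      supplier -> 'a checkpoint -> 'answer
  end
  module Incremental : sig
    (* Hypothetical start symbol [main] producing an [int]. *)
    val main : Lexing.position -> int MenhirInterpreter.checkpoint
  end
end

module Drive (P : PARSER) = struct
  module I = P.MenhirInterpreter

  (* Run the parser over [lexbuf]. A syntax error is returned as a
     message instead of being raised as an exception. *)
  let parse lexer (lexbuf : Lexing.lexbuf) : (int, string) result =
    let supplier = I.lexer_lexbuf_to_supplier lexer lexbuf in
    let succeed v = Ok v in
    let fail _checkpoint =
      let p = lexbuf.Lexing.lex_curr_p in
      Error
        (Printf.sprintf "syntax error at line %d, column %d"
           p.Lexing.pos_lnum (p.Lexing.pos_cnum - p.Lexing.pos_bol))
    in
    I.loop_handle succeed fail supplier
      (P.Incremental.main lexbuf.Lexing.lex_curr_p)
end
```

With a real generated parser, one would instantiate the functor along the
lines of Drive (Parser) and pass the ocamllex entry point as [lexer].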

Incompatible change of the incremental API: renamed the type ['a result]
to ['a checkpoint]. This is a better name anyway, and should help avoid
confusion with the type ['a result] introduced in OCaml 4.03.

Avoid using $(shell pwd) in Makefile, for better Windows compatibility.

Fixed a bug where inconsistent OCaml code was generated when --table
and --external-tokens were used together. (Reported by Darin Morrison.)

In --infer mode, leave the .ml file around (instead of removing it) if
ocamlc fails, so we have a chance to understand what's wrong.

Re-established some error messages concerning the mis-use of $i which
had disappeared on 2015/06/29.

Fixed the mysterious message that would appear when a nonterminal symbol
begins with an uppercase letter and --infer is turned on. Clarified the
documentation to indicate that a (non-start) nonterminal symbol can begin
with an uppercase letter, but this is not recommended.

New option --inspection (added last January, documented only now). This
generates an inspection API which allows inspecting the automaton's stack,
among other things. This API can in principle be used to write custom code
for error reporting, error recovery, etc. It is not yet mature and may
change in the future.

Added the command line options --unused-token <symbol> and --unused-tokens.

Changed the treatment of the positional keywords $i. They are now
rewritten into variables of the form '_i' where 'i' is an integer.
Users are advised not to use variables of this form inside semantic

Added support for anonymous rules. This allows writing, e.g.,
list(e = expression SEMI { e })
whereas previously one should have written
list(terminated(e, SEMI)).

Moved all of the demos to ocamlbuild (instead of make).

Incompatible change of the incremental API.
The incremental API now exposes shift events too.

Fixed a couple of bugs in Makefile and src/Makefile which would cause
compilation and installation to fail with "TARGET=byte". (Reported
by Jérémie Courrèges-Anglas and Daniel Dickman.)

Incompatible change of the incremental API.
The entry point main_incremental is now named Incremental.main.

Incompatible change of the incremental API.
The API now exposes reduction events.
The type 'a result is now private.
The type env is no longer parameterized.
[handle] is renamed to [resume].
[offer] and [resume] now expect a result, not an environment.

Documented the Coq back-end (designed and implemented by Jacques-Henri Jourdan).

New incremental API (in --table mode only), inspired by Frédéric Bour.

Menhir now reports an error if one of the start symbols produces
either the empty language or the singleton language {epsilon}.

Although some people out there actually define a start symbol that recognizes
{epsilon} (and use it as a way of initializing or re-initializing some global
state), this is considered bad style. Furthermore, by ruling out this case, we
are able to simplify the table back-end a little bit.

A speed improvement in the code back-end.

Menhir now requires OCaml 4.02 (instead of 3.09).

Removed support for the $previouserror keyword.
Removed support for --error-recovery mode.

In the Coq backend, use ' instead of _ as separator in identifiers.
Also, correct a serious bug that was inadvertently introduced on
2013/03/01 (r319).

255 256 257 258
Lexer fix so as to support an open variant type [> ...] within
a %type<...> declaration.

Updated the Makefile so that install no longer depends on all.
Updated the demos so that the lexer does not invoke "exit 0"
when encountering eof. (This should be more intuitive.)

Fixed a newline conversion problem that would prevent Menhir from
building on Windows when using ocaml 4.01.

Switched to ocamlbuild. Many thanks to Daniel Weil for offering
very useful guidance.

"menhir --depend" was broken since someone added new whitespace in the output
of ocamldep. Fixed.

Fixed a compilation problem that would arise when a file produced
by Menhir on a 64-bit platform was compiled by ocaml on a 32-bit

Performance improvements in the computation of various information
about the automaton (module [Invariant]). The improvements will be
noticeable only for very large automata.

The option --log-grammar 3 (and above) now causes the FOLLOW sets for
*terminal* symbols to be computed and displayed.

Added the flag --canonical, which causes Menhir to produce a canonical LR(1)
automaton in the style of Knuth. This means that no merging of states takes
place during the construction of the automaton, and that no default reductions
are allowed.

Fixed a bug whereby a %nonassoc declaration was not respected. This
declaration requests that a shift/reduce conflict be reduced in favor of
neither shifting nor reducing, that is, a syntax error must occur. However,
due to an unforeseen interaction with the "default reduction" mechanism, this
declaration was sometimes ignored and reduction would take place.

Changes in the (undocumented) Coq back-end so as to match the ESOP 2012

The Makefile now tests whether Unix or Windows is used (the test is performed
by evaluating Sys.os_type under ocaml) and changes a couple settings accordingly:
- the executable file name is either menhir or menhir.exe
- the object file suffix is either .o or .obj

Added --strict, which causes many warnings about the grammar and about the
automaton to be considered errors.

The # annotations that are inserted in the generated .ml file now retain their
full path. (That is, we no longer use [Filename.basename].) This implies that
the # annotations depend on how menhir is invoked -- e.g.  "menhir
foo/bar.mly" and "cd foo && menhir bar.mly" will produce different
results. Nevertheless, this seems reasonable and useful (e.g. in conjunction
with ocamlbuild and a hierarchy of files). Thanks to Daniel Weil.

With the -lg 1 switch, Menhir now indicates whether the grammar is SLR(1).

Removed the lock in ocamldep.wrapper. It is the responsibility of the user
to avoid interferences with other processes (or other instances of the script)
that create and/or remove files.

The (internal) computation of the automaton's invariant was broken and has
been fixed. Surprisingly, this does not seem to affect the generated code
(which was correct), so no observable bug is fixed. Hopefully no bug is

The grammar description files (.mly) are now read in up front and stored in
memory while they are parsed. This allows us to avoid the use of pos_in and
seek_in, which do not work correctly when CRLF conversion is being performed.
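
The idea of reading a file up front can be sketched as follows. This is an
illustration only, not Menhir's actual code: the function names are
hypothetical, and the sketch assumes OCaml 4.08 or later (for Fun.protect).
The file is opened in binary mode so that no newline conversion takes place,
and the lexer then works on an in-memory string.

```ocaml
(* Read the entire contents of [filename] into a string. Binary mode
   avoids any CRLF translation, so the length reported by
   [in_channel_length] matches the number of bytes actually read. *)
let read_whole_file (filename : string) : string =
  let ic = open_in_bin filename in
  Fun.protect
    ~finally:(fun () -> close_in ic)
    (fun () -> really_input_string ic (in_channel_length ic))

(* Create a lexbuf over the in-memory contents and record the file name,
   so that reported positions still mention the original file. *)
let lexbuf_of_file (filename : string) : Lexing.lexbuf =
  let lexbuf = Lexing.from_string (read_whole_file filename) in
  lexbuf.Lexing.lex_curr_p <-
    { lexbuf.Lexing.lex_curr_p with Lexing.pos_fname = filename };
  lexbuf
```

Once the text is held in memory, a region between two offsets can be
extracted with String.sub, with no need to seek in the channel.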

Fixed a bug in the type inference module (for parameterized non-terminals)
which would cause an infinite loop.

Fixed a bug that would cause an assertion failure in the generated parser
in some situations where the input stream was incorrect and the grammar
involved the error token. The fix might cause grammars that use the error
token to behave differently (hopefully more accurately) as of now.

Makefile changes: build and install only the bytecode version of menhirLib
when TARGET=byte is set.

Fixed ocamldep.wrapper to avoid quoting the name of the ocaml command.
This is hoped to fix a compilation problem under MinGW.

A Makefile fix to avoid a problem under Windows/Cygwin.
Renamed the ocaml-check-version script so as to avoid a warning.

OCaml summer project: added --interpret, --table, and --suggest-*.

Fixed a problem that would cause the code inliner to abort when a semantic
value and a non-terminal symbol happened to have the same name.

Removed code sharing.

Removed an incorrect assertion that caused failures (, line 134).

Disabled code sharing by default, as it is currently broken. (See Yann's
message; assertion failure at runtime.)

Added an optimization to share code among states that have identical
outgoing transition tables.

Small Makefile change: create an executable file for check-ocaml-version in
order to work around the absence of dynamic loading on some platforms.

Made a fundamental change in the construction of the LR(1) automaton
in order to eliminate a bug that could lead to spurious conflicts --
thanks to Ketti for submitting a bug report.

Added --follow-construction to help understand the construction of the
LR(1) automaton (very verbose).

Code generation: more explicit qualifications with Pervasives so as
to avoid capture when the user redefines some of the built-in operators,
such as (+).
Added a new demo (calc-param) that shows how to use %parameter.
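
The capture problem addressed by the qualification with Pervasives can be
reproduced in a few lines. The sketch below is illustrative and is not the
code that Menhir generates; the module and function names are hypothetical.
It uses Stdlib, the name under which the standard module is known since
OCaml 4.07, in place of Pervasives.

```ocaml
module User_code = struct
  (* The user redefines (+), here as string concatenation. *)
  let ( + ) = ( ^ )

  let greeting = "par" + "ser"

  (* Code inserted by a generator into the same scope must not pick up
     the user's (+). An unqualified [s + 1] would be ill-typed here;
     the qualified operator is unaffected by the redefinition. *)
  let next_state (s : int) : int = Stdlib.( + ) s 1
end
```

Here [User_code.greeting] uses the user's operator, while [next_state]
still performs integer addition.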

Makefile improvements (check for PREFIX; bootstrap in bytecode now
also available). Slight changes to OMakefile.shared.

Portability fix in Makefile and Makefile.shared (avoided "which").

Portability fix in Makefile.shared (replaced "&>" with "2>&1 >").

Made a slight restriction to Pager's criterion so as to never introduce
fake conflict tokens (see Lr0.compatible). This might help make conflict
explanations more accurate in the future.

Fixed bug that would cause positions to become invalid after %inlining.

Fixed --depend to be more lenient when analyzing ocamldep's output.
Added --raw-depend which transmits ocamldep's output unchanged (for
use in conjunction with omake).

Fixed bug that would cause --only-preprocess to print %token declarations
also for pseudo-tokens.
Fixed bug that caused some precedence declarations to be incorrectly
reported as useless.
Improved things so that useless pseudo-tokens now also cause warnings.
Fixed bug that would cause %type directives for terminal symbols to
be incorrectly accepted.
Fixed bug that would occur when a semantic action containing $i keywords
was %inlined.

Fixed problem that caused some end-of-stream conflicts not to be reported.
Fixed Pager's compatibility criterion to avoid creating end-of-stream conflicts.

Fixed problem that allowed generating incorrect but apparently well-typed
Objective Caml code when a semantic action was ill-typed and --infer was

Improved conflict reports by factoring out maximal common derivation contexts.

Fixed bug that could arise when explaining a conflict in a non-LALR(1) grammar.

Changed count of reduce/reduce conflicts to allow a comparison with ocamlyacc's diagnostics.
When refusing to resolve a conflict, report all diagnostics before dying.

Added display of FOLLOW sets when using --log-grammar 2.
Added --graph option.
Fixed behavior of --depend option.

Removed reversed lists from the standard library.