Newer Older
1 2 3
Fix the Makefile in an attempt to allow installation under opam/Cygwin.

POTTIER Francois's avatar
POTTIER Francois committed
4 5 6 7 8 9 10 11 12 13 14
New declaration "%on_error_reduce foo", where foo is a nonterminal symbol.
This modifies the automaton as follows. In every state where a production of
the form "foo -> ..." is ready to be reduced, every error action is replaced
with a reduction of this production. (If there is a conflict between several
productions that could be reduced in this manner, nothing is done.) This does
not affect the language that is accepted by the automaton, but delays the
detection of an error: more reductions take place before the error is

15 16 17 18 19
Fixed a bug whereby Menhir would warn about a useless %prec declaration,
even though it was useful. This would happen when the declaration was
duplicated (by inlining or by macro-expansion) and some but not all of
the copies were useful.

POTTIER Francois's avatar
POTTIER Francois committed
21 22
Added [has_default_reduction] to the incremental API.

POTTIER Francois's avatar
POTTIER Francois committed
24 25 26 27 28
Modified the meaning of --canonical to allow default reductions to take
place. There implies no loss of precision in terms of lookahead sets,
and should allow gaining more contextual information when a syntax
error is encountered. (It should also lead to a smaller automaton.)

POTTIER Francois's avatar
POTTIER Francois committed
29 30 31
A brand new set of tools to work on syntax errors.
New command --list-errors, which produces a list of input sentences which
POTTIER Francois's avatar
POTTIER Francois committed
are representative of all possible syntax errors. (Costly.)
POTTIER Francois's avatar
POTTIER Francois committed
New command --interpret-error, which confirms that one particular input
POTTIER Francois's avatar
POTTIER Francois committed
34 35
sentence ends in a syntax error, and prints the number of the state in
which this error occurs.
POTTIER Francois's avatar
POTTIER Francois committed
36 37 38 39 40 41 42 43
New command --compile-errors, which compiles a list of erroneous sentences
(together with error messages) to OCaml code.
New command --compare-errors, which compares two lists of erroneous sentences
to check if they cover the same error states.
New command --update-errors, which updates the auto-generated comments in
a list of erroneous sentences.
New command --echo-errors, which removes all comments and messages from
a list of erroneous sentences, and echoes just the sentences.
POTTIER Francois's avatar
POTTIER Francois committed

POTTIER Francois's avatar
POTTIER Francois committed
45 46 47 48 49 50
Additions to the incremental API.
A [supplier] is a function that produces tokens on demand.
[lexer_lexbuf_to_supplier] turns a lexer and a lexbuf into a supplier.
[loop] is a ready-made made main parsing loop.
[loop_handle] is a variant of [loop] that lets the user do her own error handling.
[number] maps a state of the LR(1) automaton to its number.
POTTIER Francois's avatar
POTTIER Francois committed

53 54 55 56 57
Incompatible change of the incremental API: renamed the type ['a result]
to ['a checkpoint]. This is a better name anyway, and should help avoid
confusion with the type ['a result] introduced in OCaml 4.03.

58 59 60
Avoid using $(shell pwd) in Makefile, for better Windows compatibility.

61 62 63 64
Fixed a bug where inconsistent OCaml code was generated when --table
and --external-tokens were used together. (Reported by Darin Morrison.)

65 66 67 68
In --infer mode, leave the .ml file around (instead of removing it) if
ocamlc fails, so we have a chance to understand what's wrong.

POTTIER Francois's avatar
POTTIER Francois committed
69 70 71 72
Re-established some error messages concerning the mis-use of $i which
had disappeared on 2015/06/29.

73 74 75 76 77 78
Fixed the mysterious message that would appear when a nonterminal symbol
begins with an uppercase letter and --infer is turned on. Clarified the
documentation to indicate that a (non-start) nonterminal symbol can begin
with an uppercase letter, but this is not recommended.

POTTIER Francois's avatar
POTTIER Francois committed
79 80 81 82 83 84 85
New option --inspection (added last January, documented only now). This
generates an inspection API which allows inspecting the automaton's stack,
among other things. This API can in principle be used to write custom code
for error reporting, error recovery, etc. It is not yet mature and may
change in the future.

86 87 88
Added the command line options --unused-token <symbol> and --unused-tokens.

Yann Régis-Gianas's avatar
Yann Régis-Gianas committed
90 91 92 93
Changed the treatment of the positional keywords $i. They are now
rewritten into variables of the form '_i' where 'i' is an integer.
Users are advised not to use variables of this form inside semantic
Yann Régis-Gianas's avatar
Yann Régis-Gianas committed

95 96 97 98 99 100
Added support for anonymous rules. This allows writing, e.g.,
list(e = expression SEMI { e })
whereas previously one should have written
list(terminated(e, SEMI)).

101 102 103
Moved all of the demos to ocamlbuild (instead of make).

104 105 106 107
Incompatible change of the incremental API.
The incremental API now exposes shift events too.

POTTIER Francois's avatar
POTTIER Francois committed
109 110 111
Fixed a couple bugs in Makefile and src/Makefile which would cause
compilation and installation to fail with "TARGET=byte". (Reported
by Jérémie Courrèges-Anglas and Daniel Dickman.)

113 114 115 116
Incompatible change of the incremental API.
The entry point main_incremental is now named Incremental.main.

117 118 119 120 121 122 123 124
Incompatible change of the incremental API.
The API now exposes reduction events.
The type 'a result is now private.
The type env is no longer parameterized.
[handle] is renamed to [resume].
[offer] and [resume] now expect a result, not an environment.

125 126 127
Documented the Coq back-end (designed and implemented by Jacques-Henri Jourdan).

POTTIER Francois's avatar
POTTIER Francois committed
128 129 130
New incremental API (in --table mode only), inspired by Frédéric Bour.

131 132 133 134 135 136 137 138 139
Menhir now reports an error if one of the start symbols produces
either the empty language or the singleton language {epsilon}.

Although some people out there actually define a start symbol that recognizes
{epsilon} (and use it as a way of initializing or re-initializing some global
state), this is considered bad style. Furthermore, by ruling out this case, we
are able to simplify the table back-end a little bit.

POTTIER Francois's avatar
POTTIER Francois committed
140 141 142
A speed improvement in the code back-end.

143 144 145
Menhir now requires OCaml 4.02 (instead of 3.09).

146 147
Removed support for the $previouserror keyword.
Removed support for --error-recovery mode.

150 151 152 153 154
In the Coq backend, use ' instead of _ as separator in identifiers.
Also, correct a serious bug that was inadvertently introduced on
2013/03/01 (r319).

155 156 157 158
Lexer fix so as to support an open variant type [> ...] within
a %type<...> declaration.

fpottier's avatar
fpottier committed
159 160 161 162 163
Updated the Makefile so that install no longer depends on all.
Updated the demos so that the lexer does not invoke "exit 0"
when encoutering eof. (This should be more intuitive.)

164 165 166 167
Fixed a newline conversion problem that would prevent Menhir from
building on Windows when using ocaml 4.01.

fpottier's avatar
fpottier committed
168 169 170 171
Switched to ocamlbuild. Many thanks to Daniel Weil for offering
very useful guidance.

172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366
"menhir --depend" was broken since someone added new whitespace in the output
of ocamldep. Fixed.

Fixed a compilation problem that would arise when a file produced
by Menhir on a 64-bit platform was compiled by ocaml on a 32-bit

Performance improvements in the computation of various information
about the automaton (module [Invariant]). The improvements will be
noticeable only for very large automata.

The option --log-grammar 3 (and above) now causes the FOLLOW sets for
*terminal* symbols to be computed and displayed.

Added the flag --canonical, which causes Menhir to produce a canonical LR(1)
automaton in the style of Knuth. This means that no merging of states takes
place during the construction of the automaton, and that no default reductions
are allowed.

Fixed a bug whereby a %nonassoc declaration was not respected. This
declaration requests that a shift/reduce conflict be reduced in favor of
neither shifting nor reducing, that is, a syntax error must occur. However,
due to an unforeseen interaction with the "default reduction" mechanism, this
declaration was sometimes ignored and reduction would take place.

Changes in the (undocumented) Coq back-end so as to match the ESOP 2012

The Makefile now tests whether Unix or Windows is used (the test is performed
by evaluating Sys.os_type under ocaml) and changes a couple settings accordingly:
- the executable file name is either menhir or menhir.exe
- the object file suffix is either .o or .obj

Added --strict, which causes many warnings about the grammar and about the
automaton to be considered errors.

The # annotations that are inserted in the generated .ml file now retain their
full path. (That is, we no longer use [Filename.basename].) This implies that
the # annotations depend on how menhir is invoked -- e.g.  "menhir
foo/bar.mly" and "cd foo && menhir bar.mly" will produce different
results. Nevertheless, this seems reasonable and useful (e.g. in conjunction
with ocamlbuild and a hierarchy of files). Thanks to Daniel Weil.

With the -lg 1 switch, Menhir now indicates whether the grammar is SLR(1).

Removed the lock in ocamldep.wrapper. It is the responsibility of the user
to avoid interferences with other processes (or other instances of the script)
that create and/or remove files.

The (internal) computation of the automaton's invariant was broken and has
been fixed. Surprisingly, this does not seem to affect the generated code,
(which was correct,) so no observable bug is fixed. Hopefully no bug is

The grammar description files (.mly) are now read in up front and stored in
memory while they are parsed. This allows us to avoid the use of pos_in and
seek_in, which do not work correctly when CRLF conversion is being performed.

Fixed a bug in the type inference module (for parameterized non-terminals)
which would cause an infinite loop.

Fixed a bug that would cause an assertion failure in the generated parser
in some situations where the input stream was incorrect and the grammar
involved the error token. The fix might cause grammars that use the error
token to behave differently (hopefully more accurately) as of now.

Makefile changes: build and install only the bytecode version of menhirLib
when TARGET=byte is set.

Fixed ocamldep.wrapper to avoid quoting the name of the ocaml command.
This is hoped to fix a compilation problem under MinGW.

A Makefile fix to avoid a problem under Windows/Cygwin.
Renamed the ocaml-check-version script so as to avoid a warning.

Ocaml summer project: added --interpret, --table, and --suggest-*.

Fixed a problem that would cause the code inliner to abort when a semantic
value and a non-terminal symbol happened to have the same name.

Removed code sharing.

Removed an incorrect assertion that caused failures (, line 134).

Disabled code sharing by default, as it is currently broken. (See Yann's
message; assertion failure at runtime.)

Added an optimization to share code among states that have identical
outgoing transition tables.

Small Makefile change: create an executable file for check-ocaml-version in
order to work around the absence of dynamic loading on some platforms.

Made a fundamental change in the construction of the LR(1) automaton
in order to eliminate a bug that could lead to spurious conflicts --
thanks to Ketti for submitting a bug report.

Added --follow-construction to help understand the construction of the
LR(1) automaton (very verbose).

Code generation: more explicit qualifications with Pervasives so as
to avoid capture when the user redefines some of the built-in operators,
such as (+).
Added a new demo (calc-param) that shows how to use %parameter.

Makefile improvements (check for PREFIX; bootstrap in bytecode now
also available). Slight changes to OMakefile.shared.

Portability fix in Makefile and Makefile.shared (avoided "which").

Portability fix in Makefile.shared (replaced "&>" with "2>&1 >").

Made a slight restriction to Pager's criterion so as to never introduce
fake conflict tokens (see Lr0.compatible). This might help make conflict
explanations more accurate in the future.

Fixed bug that would cause positions to become invalid after %inlining.

Fixed --depend to be more lenient when analyzing ocamldep's output.
Added --raw-depend which transmits ocamldep's output unchanged (for
use in conjunction with omake).

Fixed bug that would cause --only-preprocess to print %token declarations
also for pseudo-tokens.
Fixed bug that caused some precedence declarations to be incorrectly
reported as useless.
Improved things so that useless pseudo-tokens now also cause warnings.
Fixed bug that would cause %type directives for terminal symbols to
be incorrectly accepted.
Fixed bug that would occur when a semantic action containing $i keywords
was %inlined.

Fixed problem that caused some end-of-stream conflicts not to be reported.
Fixed Pager's compatibility criterion to avoid creating end-of-stream conflicts.

Fixed problem that allowed generating incorrect but apparently well-typed
Objective Caml code when a semantic action was ill-typed and --infer was

Improved conflict reports by factoring out maximal common derivation contexts.

Fixed bug that could arise when explaining a conflict in a non-LALR(1) grammar.

Changed count of reduce/reduce conflicts to allow a comparison with ocamlyacc's diagnostics.
When refusing to resolve a conflict, report all diagnostics before dying.

Added display of FOLLOW sets when using --log-grammar 2.
Added --graph option.
Fixed behavior of --depend option.

Removed reversed lists from the standard library.