ligo/src/passes/1-parser/cameligo/ParserMain.ml

(* Driver for the CameLIGO parser *)

module Region = Simple_utils.Region
module SSet   = Set.Make (String)

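(* Command-line options, read with the CameLIGO comment delimiters
   (block comments "(*" ... "*)", line comments "//") and the file
   extension ".mligo" *)

module IO =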
  struct
    let options =
      let open EvalOpt in
      let block = mk_block ~opening:"(*" ~closing:"*)"
      in read ~block ~line:"//" ".mligo"
  end

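(* The options handed to [ParserUnit.Make]: the fields of [IO.options]
   listed below (notably without [input]). [make] passes them on to
   [EvalOpt.make]. *)

module SubIO =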
  struct
    type options = <
      libs    : string list;
      verbose : SSet.t;
      offsets : bool;
      block   : EvalOpt.block_comment option;
      line    : EvalOpt.line_comment option;
      ext     : string;
      mode    : [`Byte | `Point];
      cmd     : EvalOpt.command;
      mono    : bool;
      pretty  : bool
    >

    let options : options =
      object
        method libs    = IO.options#libs
        method verbose = IO.options#verbose
        method offsets = IO.options#offsets
        method block   = IO.options#block
        method line    = IO.options#line
        method ext     = IO.options#ext
        method mode    = IO.options#mode
        method cmd     = IO.options#cmd
        method mono    = IO.options#mono
        method pretty  = IO.options#pretty
      end

    let make =
      EvalOpt.make ~libs:options#libs
                   ~verbose:options#verbose
                   ~offsets:options#offsets
                   ?block:options#block
                   ?line:options#line
                   ~ext:options#ext
                   ~mode:options#mode
                   ~cmd:options#cmd
                   ~mono:options#mono
                   ~pretty:options#pretty
  end

module Parser =
  struct
    type ast  = AST.t
    type expr = AST.expr
    include Parser
  end

module ParserLog =
  struct
    type ast  = AST.t
    type expr = AST.expr
    include ParserLog
  end

module Lexer = Lexer.Make (LexToken)

module Unit =
  ParserUnit.Make (Lexer)(AST)(Parser)(ParErr)(ParserLog)(SubIO)

(* Main *)

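(* On success, pretty-print the AST to [stdout] if the CLI option
   --pretty was given, using the terminal width when it can be
   determined (60 columns otherwise). On failure, print the error
   message in red on [stderr]. *)

let wrap = function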
  Stdlib.Ok ast ->
    if IO.options#pretty then
      begin
        let doc = Pretty.print ast in
        let width =
          match Terminal_size.get_columns () with
            None -> 60
          | Some c -> c in
        PPrint.ToChannel.pretty 1.0 width stdout doc;
        print_newline ()
      end;
    flush_all ()
| Error msg ->
    begin
      flush_all ();
      Printf.eprintf "\027[31m%s\027[0m%!" msg.Region.value
    end

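(* Parse the contract from the given file, or from standard input when
   there is no input file ([None], which is also what the CLI input
   "-" yields). *)

let () =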
  match IO.options#input with
    None -> Unit.contract_in_stdin () |> wrap
  | Some file_path -> Unit.contract_in_file file_path |> wrap