Hime - General Structure

Table of content

Home
- Downloads
Get started
- Kickstart in: C#, Java, Rust
- C# tutorials: Basics, Tree actions, Semantic actions
- Java tutorials: Basics, Tree actions, Semantic actions
- Rust tutorials: Basics, Tree actions, Semantic actions
Grammars and edition
- Editors for Hime grammars
- Library of Hime grammars
Reference
- Command Line: himecc
- Grammar Language
  - Grammar Inheritance
  - Grammar Options
  - Lexical Rules
  - Syntactic Rules
- Bibliography
API Documentation
- API Documentation v3.5.0 .Net, Java, Rust
- API Documentation v3.4.0 .Net, Java, Rust
- API Documentation v3.3.2 .Net, Java, Rust
- API Documentation v3.3.1 .Net, Java, Rust
- API Documentation v3.3.0 .Net, Java, Rust
- API Documentation v3.2.0 .Net, Java
- API Documentation v3.1.0 .Net, Java
- API Documentation v3.0.0 .Net, Java
- API Documentation v2.0.6 .Net, Java
- API Documentation v2.0.5 .Net, Java
- API Documentation v2.0.1 .Net, Java
- API Documentation v1.3.2 .Net, Java
- API Documentation v1.2.0 .Net, Java
- API Documentation v1.1.0 .Net, Java
- API Documentation v1.0.0 .Net, Java
Release Notes
- v3.5.1, 2020, August 6th.
- v3.5.0, 2020, May 11th.
- v3.4.1, 2019, January 10th.
- v3.4.0, 2018, August 9th.
- v3.3.2, 2018, May 18th.
- v3.3.1, 2018, February 18th.
- v3.3.0, 2018, January 24th.
- v3.2.2, 2017, Octobre 19th.
- v3.2.1, 2017, Octobre 15th.
- v3.2.0, 2017, Octobre 4th.
- v3.1.0, 2017, September 26th.
- v3.0.1, 2017, August 3rd.
- v3.0.0, 2017, May 4th.
- v2.0.6, 2017, February 7th.
- v2.0.5, 2016, September 10th.
- v2.0.4, 2016, March 29th.
- v2.0.3, 2016, March 1st.
- v2.0.2, 2016, January 20th.
- v2.0.1, 2015, October 25th.
- v1.3.2, 2015, January 22nd.
- v1.3.1, 2014, October 23rd.
- v1.3.0, 2014, September 16th.
- v1.2.0, 2014, August 14th.
- v1.1.0, 2014, May 29th.
- v1.0.0, 2014, May 12th.

General Structure

The Hime Parser Generator provides its own language for expressing context-free grammars. It is largely similar to the standard BNF form with some enhancements for the sake of expressivity.

// Single line comments begin with double slash
/*
 * This is a multiline comment
 */
grammar MathExp
{
    // The 'options' section in a grammar specifies various compilation options for the grammar.
    // At the very least this section may be empty.
    // For more information about this section, see: options reference
    options
    {
        // The Axiom option specifies the top rule for the grammar.
        Axiom = "exp";
        // The Separator option specifies the separator terminal (usually white space).
        Separator = "SEPARATOR";
    }
    // The 'terminals' section in a grammar specifies the lexical rules for terminals.
    // This section is optional.
    // For more information about this section, see: terminals reference
    terminals
    {
        // This is a lexical rule that defines the WHITE_SPACE terminal.
        // By convention, the name of terminals are generally UPPER_CASE.
        // U+XXXX represent the code of Unicode character.
        WHITE_SPACE -> U+0020 | U+0009 | U+000B | U+000C ;
        // This lexical rules reuses the previous definition of WHITE_SPACE.
        // Note that this rule defines the SEPARATOR terminal referred to in the Separator option above.
        // A terminal must be defined here before being used.
        SEPARATOR -> WHITE_SPACE+;
        // This set of three lexical rules defines the NUMBER terminal.
        // Their order of appearance is significant.
        INTEGER -> [1-9] [0-9]* | '0' ;
        // Now we can use INTEGER for the definition of REAL.
        REAL -> INTEGER? '.' INTEGER (('e' | 'E') ('+' | '-')? INTEGER)?
                    | INTEGER ('e' | 'E') ('+' | '-')? INTEGER ;
        // Now we can use both INTEGER and REAL for the definition of NUMBER.
        NUMBER -> INTEGER | REAL ;
    }
    
    // The 'rules' section in a grammar specifies the syntactic rules for variables.
    // At the very least this section may be empty.
    // For more information about this section, see: rules reference
    rules
    {
        // This is a syntactic rule that defines the exp_atom variable.
        // By convention, the name of variables are generally snake_case.
        // Note that the rule's definition refers to the NUMBER terminal.
        exp_atom -> NUMBER
                    | '(' exp ')' ;
        // The order of the syntactic rules is not significant.
        exp_factor -> exp_atom
                    | exp_factor '*' exp_atom
                    | exp_factor '/' exp_atom ;
        exp_term -> exp_factor
                    | exp_term '+' exp_factor
                    | exp_term '-' exp_factor ;
        exp -> exp_term ;
    }
}