MontiCore Grammars for Expressions, Literals and Types - an Overview
MontiCore is a language workbench. It uses an extended grammar format as primary mechanism to describe DSLs. This format allows to compose language components by grammar (1) inheritance, (2) extension, (3) embedding and (4) aggregating. Please refer the MontiCore handbook for more details.
MontiCore bundles a generator that can produce lots of infrastructure from MontiCore grammars. Like grammars, this infrastructure is composable. and can also be extended with handwrittten code. Most importantly, these extensions and the grammar composition are compatible which leads to optimal forms of reuse.
This documentation presents a library of language components provided by the core of the MontiCore project together with short descriptions and status information (Status of Grammars). The components are mainly defined by a primary grammar plus associated Java and template files.
The presented components are mainly based on the grammars in the
MontiCore/monticore
project. They are organized in the following
packages under the monticore-grammar/src/main/grammars/
folder hierarchy:
de.monticore
de.monticore.expressions
de.monticore.literals
de.monticore.statements
de.monticore.symbols
de.monticore.types
Additionally, the documentation presents some expression/type related language components in projects that extend MontiCore's core languages. For more languages and language components, follow this link.
General: List of Grammars in package de.monticore
MCBasics.mc4 (stable)
- This grammar defines basic rules for naming, spacing, and Java-like comments being useful in many languages.
Types: List of Grammars in package de.monticore.types
These grammars generally deal with type definitions and build on each other. Some snipets for type definitions:
grammars some examples
MCBasicTypes boolean byte short int
long char float double
void Person a.b.Person
import a.b.Foo.*;
MCCollectionTypes List<.> Set<.>
Optional<.> Map<.,.>
MCSimpleGenericTypes
Foo<.> a.b.Bar<.,..,.>
MCFullGenericTypes
Foo<? extends .>
Foo<? super .>
MCArrayTypes Person[] int[][]
SI Unit types km/h km/h<long>
RegExType R"[a-z][0-9*]"
MCBasicTypes.mc4 (stable)
- This grammar defines basic types and thus eases the reuse of type structures in languages similar to Java. However, the type definitions in the grammar are somewhat simplified, e.g., they do not comprise generics.
- The grammar contains types from Java, e.g., primitives,
void
, and classes (sometimes referred to as "reference types"). - Example type definitions:
boolean
,byte
,short
,int
,long
,char
,float
,double
,void
,Person
,a.b.Person
,import a.b.Foo.*;
.
MCCollectionTypes.mc4 (stable)
- This grammar defines four generic types:
List<A>
,Map<A,B>
,Set<A>
andOptional<A>
on top of basic types. - These four generic types correspond to a typical predefined set of generic types used, e.g., in connection with UML class diagrams or the OCL. UML associations typically have those association multiplicities which is why the generic types are of general interest.
- The grammar eases the reuse of type structures in languages similar to Java that are somewhat simplified, e.g., by omitting general generics.
- Example type definitions:
List<.>
,Set<.>
,Optional<.>
,Map<.,.>
.
MCSimpleGenericTypes.mc4 (stable)
- This grammar provides rules for the definition of custom generic types
such as
Blubb<A>
,Bla<B,C>
,Foo<Blubb<D>>
. - The grammar covers a wide range of uses for generic types. Unlike Java, it does however not cover type restrictions on arguments.
- Example type definitions:
Foo<.>
,a.b.Bar<.,..,.>
.
MCFullGenericTypes.mc4 (stable)
- This grammar completes type definitions to support the full Java type system
including wildcards on generic types like
Blubb<? extends A>
. - A general advice: When you are not sure that you need this kind of types, use a simpler version from above. Type checking is tricky.
- Example type definitions:
Foo<? extends .>
,Foo<? super .>
.
MCArrayTypes.mc4 (stable)
- The grammar provides means to define arrays.
- Arrays are orthogonal to the generic extensions and may thus be combined with any of the above grammars.
- Example type definitions:
Person[]
,int[][]
.
SIUnitTypes4Math.mc4 for Physical SI Units (stable)
The known units s, m, kg, A, K, mol, cd
from the international system of
units (SI Units) and their combinations, such as km/h
or mg
, etc. can
be used as ordinary types (instead of only numbers).
The typecheck is extended to prevent e.g. assignment of a weight to a length
variable or to add appropriate conversion, e.g. when a km/h
-based velocity is
e.g. stored in a m/s
-based variable.
The grammar resides in the MontiCore/SIunits project.
- Example type definitions:
km/h
SIUnitTypes4Computing.mc4 for Physical SI Units (stable)
Includes the types from SIUnitTypes4Math
(see above), like km/h
, but also allows to add a
resolution, such as km/h<int>
. Here SI Unit types,
like km/h<.>
, are used as generic type constructor that may take a number type,
such as int
, long
, double
, float
as argument.
The grammar resides in the MontiCore/SIunits project.
- Example type definitions:
km/h<long>
RegExType.mc4 (stable)
Embedded in R"..."
a regular expressions
can be used as ordinary type to constrain the values allowed for stored variables, attributes,
parameters. Types are e.g. , such as R"[a-z]"
(single character) or R"^([01][0-9]|2[0-3])$"
(hours).
A typecheck for these types can only be executed at runtime and e.g. issue
exceptions (or trigger repair functions) if violated. The static typecheck only uses String
as
underlying carrier type.
This grammar resides in the MontiCore/RegEx project.
Symbols: List of Grammars in package de.monticore.symbols
These two grammars do not provide syntax themselves, but characterize important forms of symbols, that will be used in the type and the expression grammars to define shared kinds of symbols.
BasicSymbols.mc4 (stable)
- This grammar defines symbols for Types (of all kinds), Functions, Variables and TypeVariables.
- The defined symbols are of general form and can be used in functional, OO and other contexts. They do not preculde a concrete syntax and do not yet embody OO specifics.
- Remark: This grammar is not intended to define concrete or abstract syntax, but the infrastructure for symbols.
OOSymbols.mc4 (stable)
- This grammar defines symbols for objectoriented Types, Methods, and
Fields by mainly extending the symbols defined in
BasicTypeSymbols
. - The newly defined symbols extend the general ones by typical objectoriented features, such as private, static, etc. Again they do not preculde a concrete syntax.
- Remark: This grammar is not intended to define concrete or abstract syntax, but the infrastructure for symbols in objectoriented context.
Expressions: List of Grammars in package de.monticore.expressions
Expressions are defined in several grammars forming a (nonlinear) hierarchy, so that developers can choose the optimal grammar they want to build on for their language and combine these with the appropriate typing infrastructure.
This modularity of expressions and associated types greatly eases the reuse of type structures in languages similar to Java. Some snipets for operators defined in expressions:
grammar operators and examples in this grammar
CommonExp: / % + - <= >= == > < != ~. !. .?.:.
&& || ~.
AssigementExp: ++ -- = += -= *= /= &= |= ^= >>= >>>= <<= %=
BitExp: & | ^ << >> >>>
OclExp: implies <=> | & forall exists let.in. .@pre .[.] .**
Set{.|.}
SetExp: .isin. .in. union intersect setand setor
{ item | specifier }
OptionalOps: ?: ?<= ?>= ?< ?> ?== ?!= ?~~ ?!~
LambdaExp: i->i (a,b)->a+b
SIUnits: 5km 3,2m/s 22l 2.400J
JavaClass: this .[.] (.). super .instanceof.
ExpressionsBasis.mc4 (stable)
- This grammar defines core interfaces for expressions and imports the kinds of symbols necessary.
- The symbols are taken over from the TypeSymbols grammar (see below).
- A hierarchy of conservative extensions to this grammar realize these interfaces in various forms.
CommonExpressions.mc4 (stable)
- This grammar defines a typical standard set of operations for expressions.
- This is a subset of Java as well as OCL/P, mainly for arithmetic, comparisons, variable use (v), attribute use (o.att), method call (foo(arg,arg2)) and brackets (exp).
AssignmentExpressions.mc4 (stable)
- This grammar defines all Java expressions that have side effects.
- This includes assignment expressions like =, +=, etc. and suffix and prefix expressions like ++, --, etc.
BitExpressions.mc4 (stable)
- This grammar defines a typical standard set of operations for expressions.
- This is a subset of Java for binary expressions like <<, >>, >>>, &, ^ and |
OCLExpressions.mc4 (stable)
- This grammar defines expressions typical to UMLs OCL .
OCL expressions can savely be composed if with other forms of expressions
given in the MontiCore core project (i.e. as conservative extension). - It contains various logical operations, such as quantifiers,
the
let
and the@pre
construct, and a transitive closure for associations, as discussed in [Rum17,Rum17]. - This grammar resides in the MontiCore/OCL project.
SetExpressions.mc4 (stable)
- This grammar defines set expressions like set union, intersection etc. these operations are typical for a logic with set operations, like UML's OCL. These operators are usually infix and are thus more intuitive as they allow math oriented style of specification.
- Most of these operators are in principle executable, so it might be interesting to include them in a high level programming language (see e.g. Haskell)
- This grammar resides in the MontiCore/OCL project.
OptionalOperators.mc4 (stable)
- This grammar defines nine operators dealing with optional values, e.g. defined by
java.lang.Optional
. The operators are also called Elvis operators. - E.g.:
val ?: 42
equals toval.isPresent() ? val.get() : 42
x ?>= y
equalsx.isPresent() ? x.get() >= y : false
- This grammar resides in the MontiCore/OCL project.
LambdaExpressions.mc4 (beta)
- This grammar defines lambda expressions. Lambda expression define anonymous functions that can be passed around as values e.g. to higher-order functions and that can be evaluated on arguments of appropriate types.
- Lambda expressions are fully typed (see e.g. in Haskell) and can be nested.
- This is only the subset of Java's lambda expressions that allows to define a functional programming style (thus preventing side effects).
SIUnits.mc4 for Physical SI Units (stable)
- This grammar the international system of units (SI units), based on
the basis units
s, m, kg, A, K, mol, cd
, provides a variety of derived units, and can be refined using prefixes such asm
(milli),k
(kilo), etc. - The SI Unit grammar provides an extension to expressions, but also to the
typing system, e.g. types such as
km/h
orkm/h<long>
, and literals, such as e.g.5.3 km/h
. - The grammars reside in the MontiCore/SIunits project
JavaClassExpressions.mc4 (stable)
- This grammar defines Java specific class expressions like super, this, type cast, etc.
- This grammar should only be included, when a mapping to Java is intended and the full power of Java should be available in the modeling language.
Literals: List of Grammars in package de.monticore.literals
Literals are the basic elements of expressions, such as numbers, strings, truth values. Some snipets:
grammar examples of this grammar
MCCommonLit 3 -3 2.17 -4 true false 'c'
3L 2.17d 2.17f 0xAF "string"
"str\uAF01\u0001" null
MCJavaLiterals 999_999 0x3F2A 0b0001_0101 0567 1.2e-7F
SIUnitLiterals 5.3km/h 7mg
MCLiteralsBasis.mc4 (stable)
- This grammar defines core interface for literals.
- Several conservative extensions to this grammar realize various forms of literals.
MCCommonLiterals.mc4 (stable)
- This grammar defines the typical literals for an expression language, such as characters: 'c', Strings "text", booleans: "true", "null", or numbers 10, -23, 48l, 23.1f.
- Strings and characters use the Java-like escapes like "\n".
- Each defined nonterminal is extended by a conversion function
getValue()
of appropriate type and a retrieve functiongetSource()
for a text representation of the literal.
MCJavaLiterals.mc4 (stable)
- This grammar defines Java compliant literals and builds on MCCommonLiterals.
- The scope of this grammar is to ease the reuse of literals structures in Java-like sublanguages.
- The grammar contains literals from Java, e.g., Boolean, Char, String, ....
- Please note that Java (and this grammar) has an extended syntax e.g. for integers using underscores or other kinds of encodings. They parse e.g. 999_999, 0x3F2A, or 0b10100.
- Like above
getValue()
andgetSource()
allow to retrive the content as value resp. as text string.
SIUnitLiterals.mc4 for Physical SI Units (stable)
Provides concrete values, such as 5.3 km/h
or 7 mg
for the international system of
units (SI Units).
The grammar resides in the MontiCore/SIunits project.
Statements: List of Grammars in package de.monticore.statements
Statements are the constructive part of programs: They allow to change variables, call functions, send messages etc. The following hierarchy of statement definitions should allow the developers to choose needed forms of statements and extend it by their own additional needs. The provided list of statements is inspired by Java (actually subset of Java). Some example statements:
int i; int j = 2; Person p[] = { foo(3+7), p2, ...}
if (.) then . else . for ( i = .; .; .) {.}
while (.) . do . while (.)
switch (.) { case .: .; default: .}
foo(1,2,3) return .
assert . : "..."
try {.} catch (.) {.} finally {.} throw .
break . continue .
label: private static final native ...
MCStatementsBasis.mc4 (stable)
- This grammar defines the core interface for statements.
- A hierarchy of conservative extensions to this grammar is provided below.
MCCommonStatements.mc4 (stable)
- This grammar defines typical statements, such as method calls (which are actually expressions), assignment of variables, if, for, while, switch statements, and blocks.
- This embodies a complete structured statement language, however does not provide return, assert, exceptions, and low-level constructs like break.
MCAssertStatements.mc4 (stable)
- This grammar defines exactly the assert statement as known from Java.
- It can be used independently of other Java statements.
MCExceptionStatements.mc4 (stable)
- This grammar defines the exception statements.
- This includes Java try with catch and finally, as well as throw.
MCSynchronizedStatements.mc4 (stable)
- This grammar defines the Java-like synchronized statement.
MCLowLevelStatements.mc4 (stable)
- This grammar defines three low-level statements that Java provides.
- It contains the break and continue statements and the possibility to label a statement.
MCReturnStatements.mc4 (stable)
- This grammar defines the Java-like return statement.
MCFullJavaStatements.mc4 (stable)
- This grammar defines all Java statements.
- This is neither a generalized approximation nor a restricted overapproximation, but exact.
Further grammars in package de.monticore
several other grammars are also available:
RegularExpressions.mc4 (stable)
- This grammar defines regular expressions (RegEx) as used in Java (see e.g.
java.util.regex.Pattern
). - It provides common regex tokens such as
- character classes, e.g., lowercase letters (
[a-z]
), the letters a, b, and c ([abc]
) - anchors, e.g., start of line (
^
), end of line ($
), word boundary (\b
), - quantifiers, e.g., zero or one (
?
), zero or more (*
), exactly 3 ({3}
), - RegEx also supports to capture groups and referencing these captured groups in replacements.
- For example,
^([01][0-9]|2[0-3]):[0-5][0-9]$
matches all valid timestamps inHH:MM
format. - The main nonterminal
RegularExpression
is not part of the expression hierarchy and thus regular expressions are not used as ordinary values. Instead the nonterminalRegularExpression
is can be used in aother places of a language e.g. we do that as additional restriction for String values in input/output channels in architectural langages. - This grammar resides in the MontiCore/RegEx project
Cardinality.mc4 (stable)
- This grammar defines UML Cardinalities of forms
*
,[n..m]
or[n..*]
.
Completeness.mc4 (stable)
- This grammar defines completeness information in UML
like
...
,(c)
, but also(...,c)
.
UMLModifier.mc4 (stable)
- The grammar contains the modifiers that UML provides.
- This includes
public
private
,protected
,final
,abstract
,local
,derived
,readonly
, andstatic
, but also the compact syntactic versions+
,#
,-
,/
and?
(for readonly). - UML modifiers are not identical to Java modifiers (e.g.
native
orthreadsafe
are missing.)
UMLStereotype.mc4 (stable)
- This grammars defines Stereotypes like
<<val1,val2="text",...>>
- Methods contains(name), getValue(name) assist Stereotype retrieval.
- Values may only be of type String. The real value unfortunately in UML is only encoded as String.
- We suggest to use a tagging infrastructure that even allows to type the possible forms of tags.
MCCommon.mc4 (stable)
- This grammar composes typical UML-like grammar components.
- This includes Cardinality, Completeness, UMLModifier, and UMLStereotype.
JavaLight.mc4 (stable)
int age = 3+x;
List<Person> myParents;
@Override
public int print(String name, Set<Person> p) {
int a = 2 + name.length();
if(a < p.size()) {
System.out.println("Hello " + name);
}
return a;
}
- JavaLight is a subset of Java that MontiCore itself uses as intermediate language for the code generation process.
- JavaLight doesn't provide all forms of classes (e.g., inner classes)
and reduces the type system to normal generic types.
However, that is sufficient for all code generated by MontiCore. - JavaLight supports Java expressions (including anonymous classes), Java statements as relevant to MontiCore code generation, method declaration, constructors, constants, interface methods, and annotations.
- JavaLight composes the grammars
CommonExpressions
,AssignmentExpressions
,JavaClassExpressions
,MCCommonStatements
,MCBasicTypes
, andOOSymbols
. - JavaLight can be used for other generator tools than MontiCore, especially as its core templates are reusable and new templates for customized method bodies can be added using MontiCore's Hook-Mechanisms.
MontiCore Example Grammars for the Interested Reader
The monticore-grammar/src/main/examples
folder hosts the following example
grammars: