ocean-lang.org Git - ocean/blob - csrc/oceani.mdc

   1 # Ocean Interpreter - Stoney Creek version
   2
   3 Ocean is intended to be a compiled language, so this interpreter is
   4 not targeted at being the final product.  It is, rather, an intermediate
   5 stage and fills that role in two distinct ways.
   6
   7 Firstly, it exists as a platform to experiment with the early language
   8 design.  An interpreter is easy to write and easy to get working, so
   9 the barrier for entry is lower if I aim to start with an interpreter.
  10
  11 Secondly, the plan for the Ocean compiler is to write it in the
  12 [Ocean language](http://ocean-lang.org).  To achieve this we naturally
  13 need some sort of boot-strap process and this interpreter - written in
  14 portable C - will fill that role.  It will be used to bootstrap the
  15 Ocean compiler.
  16
  17 Two features that are not needed to fill either of these roles are
  18 performance and completeness.  The interpreter only needs to be fast
  19 enough to run small test programs and occasionally to run the compiler
  20 on itself.  It only needs to be complete enough to test aspects of the
  21 design which are developed before the compiler is working, and to run
  22 the compiler on itself.  Any features not used by the compiler when
  23 compiling itself are superfluous.  They may be included anyway, but
  24 they may not.
  25
  26 Nonetheless, the interpreter should end up being reasonably complete,
  27 and any performance bottlenecks which appear and are easily fixed, will
  28 be.
  29
  30 ## Current version
  31
  32 This second version of the interpreter exists to test out the
  33 structured statement providing conditions and iteration, and simple
  34 variable scoping.  Clearly we need some minimal other functionality so
  35 that values can be tested and instructions iterated over.  All that
  36 functionality is clearly not normative at this stage (not that
  37 anything is **really** normative yet) and will change, so early test
  38 code will certainly break in later versions.
  39
  40 The under-test parts of the language are:
  41
  42  - conditional/looping structured statements
  43  - the `use` statement which is needed for that
  44  - Variable binding using ":=" and "::=", and assignment using "=".
  45
  46 Elements which are present to make a usable language are:
  47
  48  - "blocks" of multiple statements.
  49  - `pass`: a statement which does nothing.
  50  - expressions: `+`, `-`, `*`, `/`, `%` can apply to numbers and `++` can
  51    catenate strings.  `and`, `or`, `not` manipulate Booleans, and
  52    normal comparison operators can work on all three types.
  53  - `print`: will print the values in a list of expressions.
  54  - `program`: is given a list of identifiers to initialize from
  55    arguments.
  56
  57 ## Naming
  58
  59 Versions of the interpreter which obviously do not support a complete
  60 language will be named after creeks and streams.  This one is Stoney
  61 Creek.
  62
  63 Once we have something reasonably resembling a complete language, the
  64 names of rivers will be used.
  65 Early versions of the compiler will be named after seas.  Major
  66 releases of the compiler will be named after oceans.  Hopefully I will
  67 be finished once I get to the Pacific Ocean release.
  68
  69 ## Outline
  70
  71 As well as parsing and executing a program, the interpreter can print
  72 out the program from the parsed internal structure.  This is useful
  73 for validating the parsing.
  74 So the main requirements of the interpreter are:
  75
  76 - Parse the program, possibly with tracing,
  77 - Analyse the parsed program to ensure consistency,
  78 - Print the program,
  79 - Execute the program.
  80
  81 This is all performed by a single C program extracted with
  82 `parsergen`.
  83
  84 There will be two formats for printing the program: a default and one
  85 that uses bracketing.  So a `--bracket` command line option is needed
  86 for that.  Normally the first code section found is used, however an
  87 alternate section can be requested so that a file (such as this one)
  88 can contain multiple programs This is effected with the `--section`
  89 option.
  90
  91 This code must be compiled with `-fplan9-extensions` so that anonymous
  92 structures can be used.
  93
  94 ###### File: oceani.mk
  95
  96         myCFLAGS := -Wall -g -fplan9-extensions
  97         CFLAGS := $(filter-out $(myCFLAGS),$(CFLAGS)) $(myCFLAGS)
  98         myLDLIBS:= libparser.o libscanner.o libmdcode.o -licuuc
  99         LDLIBS := $(filter-out $(myLDLIBS),$(LDLIBS)) $(myLDLIBS)
 100         ## libs
 101         all :: $(LDLIBS) oceani
 102         oceani.c oceani.h : oceani.mdc parsergen
 103                 ./parsergen -o oceani --LALR --tag Parser oceani.mdc
 104         oceani.mk: oceani.mdc md2c
 105                 ./md2c oceani.mdc
 106
 107         oceani: oceani.o $(LDLIBS)
 108                 $(CC) $(CFLAGS) -o oceani oceani.o $(LDLIBS)
 109
 110 ###### Parser: header
 111         ## macros
 112         ## ast
 113         struct parse_context {
 114                 struct token_config config;
 115                 char *file_name;
 116                 int parse_error;
 117                 ## parse context
 118         };
 119
 120 ###### macros
 121
 122         #define container_of(ptr, type, member) ({                      \
 123                 const typeof( ((type *)0)->member ) *__mptr = (ptr);    \
 124                 (type *)( (char *)__mptr - offsetof(type,member) );})
 125
 126         #define config2context(_conf) container_of(_conf, struct parse_context, \
 127                 config)
 128
 129 ###### Parser: code
 130
 131         #include <unistd.h>
 132         #include <stdlib.h>
 133         #include <fcntl.h>
 134         #include <errno.h>
 135         #include <sys/mman.h>
 136         #include <string.h>
 137         #include <stdio.h>
 138         #include <locale.h>
 139         #include <malloc.h>
 140         #include "mdcode.h"
 141         #include "scanner.h"
 142         #include "parser.h"
 143
 144         ## includes
 145
 146         #include "oceani.h"
 147
 148         ## forward decls
 149         ## value functions
 150         ## ast functions
 151         ## core functions
 152
 153         #include <getopt.h>
 154         static char Usage[] = "Usage: oceani --trace --print --noexec --brackets"
 155                               "--section=SectionName prog.ocn\n";
 156         static const struct option long_options[] = {
 157                 {"trace",     0, NULL, 't'},
 158                 {"print",     0, NULL, 'p'},
 159                 {"noexec",    0, NULL, 'n'},
 160                 {"brackets",  0, NULL, 'b'},
 161                 {"section",   1, NULL, 's'},
 162                 {NULL,        0, NULL, 0},
 163         };
 164         const char *options = "tpnbs";
 165         int main(int argc, char *argv[])
 166         {
 167                 int fd;
 168                 int len;
 169                 char *file;
 170                 struct section *s;
 171                 char *section = NULL;
 172                 struct parse_context context = {
 173                         .config = {
 174                                 .ignored = (1 << TK_line_comment)
 175                                          | (1 << TK_block_comment),
 176                                 .number_chars = ".,_+-",
 177                                 .word_start = "_",
 178                                 .word_cont = "_",
 179                         },
 180                 };
 181                 int doprint=0, dotrace=0, doexec=1, brackets=0;
 182                 struct exec **prog;
 183                 int opt;
 184                 while ((opt = getopt_long(argc, argv, options, long_options, NULL))
 185                        != -1) {
 186                         switch(opt) {
 187                         case 't': dotrace=1; break;
 188                         case 'p': doprint=1; break;
 189                         case 'n': doexec=0; break;
 190                         case 'b': brackets=1; break;
 191                         case 's': section = optarg; break;
 192                         default: fprintf(stderr, Usage);
 193                                 exit(1);
 194                         }
 195                 }
 196                 if (optind >= argc) {
 197                         fprintf(stderr, "oceani: no input file given\n");
 198                         exit(1);
 199                 }
 200                 fd = open(argv[optind], O_RDONLY);
 201                 if (fd < 0) {
 202                         fprintf(stderr, "oceani: cannot open %s\n", argv[optind]);
 203                         exit(1);
 204                 }
 205                 context.file_name = argv[optind];
 206                 len = lseek(fd, 0, 2);
 207                 file = mmap(NULL, len, PROT_READ, MAP_SHARED, fd, 0);
 208                 s = code_extract(file, file+len, NULL);
 209                 if (!s) {
 210                         fprintf(stderr, "oceani: could not find any code in %s\n",
 211                                 argv[optind]);
 212                         exit(1);
 213                 }
 214
 215                 ## context initialization
 216
 217                 if (section) {
 218                         struct section *ss;
 219                         for (ss = s; ss; ss = ss->next) {
 220                                 struct text sec = ss->section;
 221                                 if (sec.len == strlen(section) &&
 222                                     strncmp(sec.txt, section, sec.len) == 0)
 223                                         break;
 224                         }
 225                         if (ss)
 226                                 prog = parse_oceani(ss->code, &context.config,
 227                                                     dotrace ? stderr : NULL);
 228                         else {
 229                                 fprintf(stderr, "oceani: cannot find section %s\n",
 230                                         section);
 231                                 exit(1);
 232                         }
 233                 } else
 234                         prog = parse_oceani(s->code, &context.config,
 235                                     dotrace ? stderr : NULL);
 236                 if (!prog) {
 237                         fprintf(stderr, "oceani: fatal parser error.\n");
 238                         context.parse_error = 1;
 239                 }
 240                 if (prog && doprint)
 241                         print_exec(*prog, 0, brackets);
 242                 if (prog && doexec && !context.parse_error) {
 243                         if (!analyse_prog(*prog, &context)) {
 244                                 fprintf(stderr, "oceani: type error in program - not running.\n");
 245                                 exit(1);
 246                         }
 247                         interp_prog(*prog, argv+optind+1);
 248                 }
 249                 if (prog) {
 250                         free_exec(*prog);
 251                         free(prog);
 252                 }
 253                 while (s) {
 254                         struct section *t = s->next;
 255                         code_free(s->code);
 256                         free(s);
 257                         s = t;
 258                 }
 259                 ## free context vars
 260                 ## free context types
 261                 exit(context.parse_error ? 1 : 0);
 262         }
 263
 264 ### Analysis
 265
 266 The four requirements of parse, analyse, print, interpret apply to
 267 each language element individually so that is how most of the code
 268 will be structured.
 269
 270 Three of the four are fairly self explanatory.  The one that requires
 271 a little explanation is the analysis step.
 272
 273 The current language design does not require the types of variables to
 274 be declared, but they must still have a single type.  Different
 275 operations impose different requirements on the variables, for example
 276 addition requires both arguments to be numeric, and assignment
 277 requires the variable on the left to have the same type as the
 278 expression on the right.
 279
 280 Analysis involves propagating these type requirements around and
 281 consequently setting the type of each variable.  If any requirements
 282 are violated (e.g. a string is compared with a number) or if a
 283 variable needs to have two different types, then an error is raised
 284 and the program will not run.
 285
 286 If the same variable is declared in both branchs of an 'if/else', or
 287 in all cases of a 'switch' then the multiple instances may be merged
 288 into just one variable if the variable is references after the
 289 conditional statement.  When this happens, the types must naturally be
 290 consistent across all the branches.  When the variable is not used
 291 outside the if, the variables in the different branches are distinct
 292 and can be of different types.
 293
 294 Determining the types of all variables early is important for
 295 processing command line arguments.  These can be assigned to any type
 296 of variable, but we must first know the correct type so any required
 297 conversion can happen.  If a variable is associated with a command
 298 line argument but no type can be interpreted (e.g. the variable is
 299 only ever used in a `print` statement), then the type is set to
 300 'string'.
 301
 302 Undeclared names may only appear in "use" statements and "case" expressions.
 303 These names are given a type of "label" and a unique value.
 304 This allows them to fill the role of a name in an enumerated type, which
 305 is useful for testing the `switch` statement.
 306
 307 As we will see, the condition part of a `while` statement can return
 308 either a Boolean or some other type.  This requires that the expect
 309 type that gets passed around comprises a type (`enum vtype`) and a
 310 flag to indicate that `Vbool` is also permitted.
 311
 312 As there are, as yet, no distinct types that are compatible, there
 313 isn't much subtlety in the analysis.  When we have distinct number
 314 types, this will become more interesting.
 315
 316 #### Error reporting
 317
 318 When analysis discovers an inconsistency it needs to report an error;
 319 just refusing to run the code ensures that the error doesn't cascade,
 320 but by itself it isn't very useful.  A clear understand of the sort of
 321 error message that are useful will help guide the process of analysis.
 322
 323 At a simplistic level, the only sort of error that type analysis can
 324 report is that the type of some construct doesn't match a contextual
 325 requirement.  For example, in `4 + "hello"` the addition provides a
 326 contextual requirement for numbers, but `"hello"` is not a number.  In
 327 this particular example no further information is needed as the types
 328 are obvious from local information.  When a variable is involved that
 329 isn't the case.  It may be helpful to explain why the variable has a
 330 particular type, by indicating the location where the type was set,
 331 whether by declaration or usage.
 332
 333 Using a recursive-descent analysis we can easily detect a problem at
 334 multiple locations. In "`hello:= "there"; 4 + hello`" the addition
 335 will detect that one argument is not a number and the usage of `hello`
 336 will detect that a number was wanted, but not provided.  In this
 337 (early) version of the language, we will generate error reports at
 338 multiple locations, so the use of `hello` will report an error and
 339 explain were the value was set, and the addition will report an error
 340 and say why numbers are needed.  To be able to report locations for
 341 errors, each language element will need to record a file location
 342 (line and column) and each variable will need to record the language
 343 element where its type was set.  For now we will assume that each line
 344 of an error message indicates one location in the file, and up to 2
 345 types.  So we provide a `printf`-like function which takes a format, a
 346 language (a `struct exec` which has not yet been introduced), and 2
 347 types. "`%1`" reports the first type, "`%2`" reports the second.  We
 348 will need a function to print the location, once we know how that is
 349 stored.  As will be explained later, there are sometimes extra rules for
 350 type matching and they might affect error messages, we need to pass those
 351 in too.
 352
 353 As well as type errors, we sometimes need to report problems with
 354 tokens, which might be unexpected or might name a type that has not
 355 been defined.  For these we have `tok_err()` which reports an error
 356 with a given token.  Each of the error functions sets the flag in the
 357 context so indicate that parsing failed.
 358
 359 ###### forward decls
 360
 361         static void fput_loc(struct exec *loc, FILE *f);
 362
 363 ###### core functions
 364
 365         static void type_err(struct parse_context *c,
 366                              char *fmt, struct exec *loc,
 367                              struct type *t1, int rules, struct type *t2)
 368         {
 369                 fprintf(stderr, "%s:", c->file_name);
 370                 fput_loc(loc, stderr);
 371                 for (; *fmt ; fmt++) {
 372                         if (*fmt != '%') {
 373                                 fputc(*fmt, stderr);
 374                                 continue;
 375                         }
 376                         fmt++;
 377                         switch (*fmt) {
 378                         case '%': fputc(*fmt, stderr); break;   // NOTEST
 379                         default: fputc('?', stderr); break;     // NOTEST
 380                         case '1':
 381                                 type_print(t1, stderr);
 382                                 break;
 383                         case '2':
 384                                 type_print(t2, stderr);
 385                                 break;
 386                         ## format cases
 387                         }
 388                 }
 389                 fputs("\n", stderr);
 390                 c->parse_error = 1;
 391         }
 392
 393         static void tok_err(struct parse_context *c, char *fmt, struct token *t)
 394         {
 395                 fprintf(stderr, "%s:%d:%d: %s: %.*s\n", c->file_name, t->line, t->col, fmt,
 396                         t->txt.len, t->txt.txt);
 397                 c->parse_error = 1;
 398         }
 399
 400 ## Data Structures
 401
 402 One last introductory step before detailing the language elements and
 403 providing their four requirements is to establish the data structures
 404 to store these elements.
 405
 406 There are two key objects that we need to work with: executable
 407 elements which comprise the program, and values which the program
 408 works with.  Between these are the variables in their various scopes
 409 which hold the values, and types which classify the values stored and
 410 manipulatd by executables.
 411
 412 ### Types
 413
 414 Values come in a wide range of types, with more likely to be added.
 415 Each type needs to be able to parse and print its own values (for
 416 convenience at least) as well as to compare two values, at least for
 417 equality and possibly for order.  For now, values might need to be
 418 duplicated and freed, though eventually such manipulations will be
 419 better integrated into the language.
 420
 421 Rather than requiring every numeric type to support all numeric
 422 operations (add, multiple, etc), we allow types to be able to present
 423 as one of a few standard types: integer, float, and fraction.  The
 424 existance of these conversion functions enable types to determine if
 425 they are compatible with other types.
 426
 427 Named type are stored in a simple linked list.  Objects of each type are "values"
 428 which are often passed around by value.
 429
 430 ###### ast
 431
 432         struct value {
 433                 struct type *type;
 434                 union {
 435                         ## value union fields
 436                 };
 437         };
 438
 439         struct type {
 440                 struct text name;
 441                 struct type *next;
 442                 struct value (*init)(struct type *type);
 443                 struct value (*prepare)(struct type *type);
 444                 struct value (*parse)(struct type *type, char *str);
 445                 void (*print)(struct value val);
 446                 void (*print_type)(struct type *type, FILE *f);
 447                 int (*cmp_order)(struct value v1, struct value v2);
 448                 int (*cmp_eq)(struct value v1, struct value v2);
 449                 struct value (*dup)(struct value val);
 450                 void (*free)(struct value val);
 451                 int (*compat)(struct type *this, struct type *other);
 452                 long long (*to_int)(struct value *v);
 453                 double (*to_float)(struct value *v);
 454                 int (*to_mpq)(mpq_t *q, struct value *v);
 455                 union {
 456                         ## type union fields
 457                 };
 458         };
 459
 460 ###### parse context
 461
 462         struct type *typelist;
 463
 464 ###### ast functions
 465
 466         static struct type *find_type(struct parse_context *c, struct text s)
 467         {
 468                 struct type *l = c->typelist;
 469
 470                 while (l &&
 471                        text_cmp(l->name, s) != 0)
 472                                 l = l->next;
 473                 return l;
 474         }
 475
 476         static struct type *add_type(struct parse_context *c, struct text s,
 477                                      struct type *proto)
 478         {
 479                 struct type *n;
 480
 481                 n = calloc(1, sizeof(*n));
 482                 *n = *proto;
 483                 n->name = s;
 484                 n->next = c->typelist;
 485                 c->typelist = n;
 486                 return n;
 487         }
 488
 489         static void free_type(struct type *t)
 490         {
 491                 /* The type is always a reference to something in the
 492                  * context, so we don't need to free anything.
 493                  */
 494         }
 495
 496         static void free_value(struct value v)
 497         {
 498                 if (v.type)
 499                         v.type->free(v);
 500         }
 501
 502         static int type_compat(struct type *require, struct type *have, int rules)
 503         {
 504                 if ((rules & Rboolok) && have == Tbool)
 505                         return 1;
 506                 if ((rules & Rnolabel) && have == Tlabel)
 507                         return 0;
 508                 if (!require || !have)
 509                         return 1;
 510
 511                 if (require->compat)
 512                         return require->compat(require, have);
 513
 514                 return require == have;
 515         }
 516
 517         static void type_print(struct type *type, FILE *f)
 518         {
 519                 if (!type)
 520                         fputs("*unknown*type*", f);
 521                 else if (type->name.len)
 522                         fprintf(f, "%.*s", type->name.len, type->name.txt);
 523                 else if (type->print_type)
 524                         type->print_type(type, f);
 525                 else
 526                         fputs("*invalid*type*", f);     // NOTEST
 527         }
 528
 529         static struct value val_prepare(struct type *type)
 530         {
 531                 struct value rv;
 532
 533                 if (type)
 534                         return type->prepare(type);
 535                 rv.type = type;
 536                 return rv;
 537         }
 538
 539         static struct value val_init(struct type *type)
 540         {
 541                 struct value rv;
 542
 543                 if (type)
 544                         return type->init(type);
 545                 rv.type = type;
 546                 return rv;
 547         }
 548
 549         static struct value dup_value(struct value v)
 550         {
 551                 if (v.type)
 552                         return v.type->dup(v);
 553                 return v;
 554         }
 555
 556         static int value_cmp(struct value left, struct value right)
 557         {
 558                 if (left.type && left.type->cmp_order)
 559                         return left.type->cmp_order(left, right);
 560                 if (left.type && left.type->cmp_eq)
 561                         return left.type->cmp_eq(left, right);
 562                 return -1;
 563         }
 564
 565         static void print_value(struct value v)
 566         {
 567                 if (v.type && v.type->print)
 568                         v.type->print(v);
 569                 else
 570                         printf("*Unknown*");            // NOTEST
 571         }
 572
 573         static struct value parse_value(struct type *type, char *arg)
 574         {
 575                 struct value rv;
 576
 577                 if (type && type->parse)
 578                         return type->parse(type, arg);
 579                 rv.type = NULL;                         // NOTEST
 580                 return rv;                              // NOTEST
 581         }
 582
 583 ###### forward decls
 584
 585         static void free_value(struct value v);
 586         static int type_compat(struct type *require, struct type *have, int rules);
 587         static void type_print(struct type *type, FILE *f);
 588         static struct value val_init(struct type *type);
 589         static struct value dup_value(struct value v);
 590         static int value_cmp(struct value left, struct value right);
 591         static void print_value(struct value v);
 592         static struct value parse_value(struct type *type, char *arg);
 593
 594 ###### free context types
 595
 596         while (context.typelist) {
 597                 struct type *t = context.typelist;
 598
 599                 context.typelist = t->next;
 600                 free(t);
 601         }
 602
 603 #### Base Types
 604
 605 Values of the base types can be numbers, which we represent as
 606 multi-precision fractions, strings, Booleans and labels.  When
 607 analysing the program we also need to allow for places where no value
 608 is meaningful (type `Tnone`) and where we don't know what type to
 609 expect yet (type is `NULL`).
 610
 611 Values are never shared, they are always copied when used, and freed
 612 when no longer needed.
 613
 614 When propagating type information around the program, we need to
 615 determine if two types are compatible, where type `NULL` is compatible
 616 with anything.  There are two special cases with type compatibility,
 617 both related to the Conditional Statement which will be described
 618 later.  In some cases a Boolean can be accepted as well as some other
 619 primary type, and in others any type is acceptable except a label (`Vlabel`).
 620 A separate function encode these cases will simplify some code later.
 621
 622 When assigning command line arguments to variables, we need to be able
 623 to parse each type from a string.
 624
 625 ###### includes
 626         #include <gmp.h>
 627         #include "string.h"
 628         #include "number.h"
 629
 630 ###### libs
 631         myLDLIBS := libnumber.o libstring.o -lgmp
 632         LDLIBS := $(filter-out $(myLDLIBS),$(LDLIBS)) $(myLDLIBS)
 633
 634 ###### type union fields
 635         enum vtype {Vnone, Vstr, Vnum, Vbool, Vlabel} vtype;
 636
 637 ###### value union fields
 638         struct text str;
 639         mpq_t num;
 640         int bool;
 641         void *label;
 642
 643 ###### ast functions
 644         static void _free_value(struct value v)
 645         {
 646                 switch (v.type->vtype) {
 647                 case Vnone: break;
 648                 case Vstr: free(v.str.txt); break;
 649                 case Vnum: mpq_clear(v.num); break;
 650                 case Vlabel:
 651                 case Vbool: break;
 652                 }
 653         }
 654
 655 ###### value functions
 656
 657         static struct value _val_prepare(struct type *type)
 658         {
 659                 struct value rv;
 660
 661                 rv.type = type;
 662                 switch(type->vtype) {
 663                 case Vnone:
 664                         break;
 665                 case Vnum:
 666                         memset(&rv.num, 0, sizeof(rv.num));
 667                         break;
 668                 case Vstr:
 669                         rv.str.txt = NULL;
 670                         rv.str.len = 0;
 671                         break;
 672                 case Vbool:
 673                         rv.bool = 0;
 674                         break;
 675                 case Vlabel:
 676                         rv.label = NULL;
 677                         break;
 678                 }
 679                 return rv;
 680         }
 681
 682         static struct value _val_init(struct type *type)
 683         {
 684                 struct value rv;
 685
 686                 rv.type = type;
 687                 switch(type->vtype) {
 688                 case Vnone:             // NOTEST
 689                         break;          // NOTEST
 690                 case Vnum:
 691                         mpq_init(rv.num); break;
 692                 case Vstr:
 693                         rv.str.txt = malloc(1);
 694                         rv.str.len = 0;
 695                         break;
 696                 case Vbool:
 697                         rv.bool = 0;
 698                         break;
 699                 case Vlabel:                    // NOTEST
 700                         rv.label = NULL;        // NOTEST
 701                         break;                  // NOTEST
 702                 }
 703                 return rv;
 704         }
 705
 706         static struct value _dup_value(struct value v)
 707         {
 708                 struct value rv;
 709                 rv.type = v.type;
 710                 switch (rv.type->vtype) {
 711                 case Vnone:             // NOTEST
 712                         break;          // NOTEST
 713                 case Vlabel:
 714                         rv.label = v.label;
 715                         break;
 716                 case Vbool:
 717                         rv.bool = v.bool;
 718                         break;
 719                 case Vnum:
 720                         mpq_init(rv.num);
 721                         mpq_set(rv.num, v.num);
 722                         break;
 723                 case Vstr:
 724                         rv.str.len = v.str.len;
 725                         rv.str.txt = malloc(rv.str.len);
 726                         memcpy(rv.str.txt, v.str.txt, v.str.len);
 727                         break;
 728                 }
 729                 return rv;
 730         }
 731
 732         static int _value_cmp(struct value left, struct value right)
 733         {
 734                 int cmp;
 735                 if (left.type != right.type)
 736                         return left.type - right.type;  // NOTEST
 737                 switch (left.type->vtype) {
 738                 case Vlabel: cmp = left.label == right.label ? 0 : 1; break;
 739                 case Vnum: cmp = mpq_cmp(left.num, right.num); break;
 740                 case Vstr: cmp = text_cmp(left.str, right.str); break;
 741                 case Vbool: cmp = left.bool - right.bool; break;
 742                 case Vnone: cmp = 0;                    // NOTEST
 743                 }
 744                 return cmp;
 745         }
 746
 747         static void _print_value(struct value v)
 748         {
 749                 switch (v.type->vtype) {
 750                 case Vnone:                             // NOTEST
 751                         printf("*no-value*"); break;    // NOTEST
 752                 case Vlabel:                            // NOTEST
 753                         printf("*label-%p*", v.label); break; // NOTEST
 754                 case Vstr:
 755                         printf("%.*s", v.str.len, v.str.txt); break;
 756                 case Vbool:
 757                         printf("%s", v.bool ? "True":"False"); break;
 758                 case Vnum:
 759                         {
 760                         mpf_t fl;
 761                         mpf_init2(fl, 20);
 762                         mpf_set_q(fl, v.num);
 763                         gmp_printf("%Fg", fl);
 764                         mpf_clear(fl);
 765                         break;
 766                         }
 767                 }
 768         }
 769
 770         static struct value _parse_value(struct type *type, char *arg)
 771         {
 772                 struct value val;
 773                 struct text tx;
 774                 int neg = 0;
 775                 char tail[3] = "";
 776
 777                 val.type = type;
 778                 switch(type->vtype) {
 779                 case Vlabel:                            // NOTEST
 780                 case Vnone:                             // NOTEST
 781                         val.type = NULL;                // NOTEST
 782                         break;                          // NOTEST
 783                 case Vstr:
 784                         val.str.len = strlen(arg);
 785                         val.str.txt = malloc(val.str.len);
 786                         memcpy(val.str.txt, arg, val.str.len);
 787                         break;
 788                 case Vnum:
 789                         if (*arg == '-') {
 790                                 neg = 1;
 791                                 arg++;
 792                         }
 793                         tx.txt = arg; tx.len = strlen(tx.txt);
 794                         if (number_parse(val.num, tail, tx) == 0)
 795                                 mpq_init(val.num);
 796                         else if (neg)
 797                                 mpq_neg(val.num, val.num);
 798                         if (tail[0]) {
 799                                 printf("Unsupported suffix: %s\n", arg);
 800                                 val.type = NULL;
 801                         }
 802                         break;
 803                 case Vbool:
 804                         if (strcasecmp(arg, "true") == 0 ||
 805                             strcmp(arg, "1") == 0)
 806                                 val.bool = 1;
 807                         else if (strcasecmp(arg, "false") == 0 ||
 808                                  strcmp(arg, "0") == 0)
 809                                 val.bool = 0;
 810                         else {
 811                                 printf("Bad bool: %s\n", arg);
 812                                 val.type = NULL;
 813                         }
 814                         break;
 815                 }
 816                 return val;
 817         }
 818
 819         static void _free_value(struct value v);
 820
 821         static struct type base_prototype = {
 822                 .init = _val_init,
 823                 .prepare = _val_prepare,
 824                 .parse = _parse_value,
 825                 .print = _print_value,
 826                 .cmp_order = _value_cmp,
 827                 .cmp_eq = _value_cmp,
 828                 .dup = _dup_value,
 829                 .free = _free_value,
 830         };
 831
 832         static struct type *Tbool, *Tstr, *Tnum, *Tnone, *Tlabel;
 833
 834 ###### ast functions
 835         static struct type *add_base_type(struct parse_context *c, char *n, enum vtype vt)
 836         {
 837                 struct text txt = { n, strlen(n) };
 838                 struct type *t;
 839
 840                 t = add_type(c, txt, &base_prototype);
 841                 t->vtype = vt;
 842                 return t;
 843         }
 844
 845 ###### context initialization
 846
 847         Tbool  = add_base_type(&context, "Boolean", Vbool);
 848         Tstr   = add_base_type(&context, "string", Vstr);
 849         Tnum   = add_base_type(&context, "number", Vnum);
 850         Tnone  = add_base_type(&context, "none", Vnone);
 851         Tlabel = add_base_type(&context, "label", Vlabel);
 852
 853 ### Variables
 854
 855 Variables are scoped named values.  We store the names in a linked
 856 list of "bindings" sorted lexically, and use sequential search and
 857 insertion sort.
 858
 859 ###### ast
 860
 861         struct binding {
 862                 struct text name;
 863                 struct binding *next;   // in lexical order
 864                 ## binding fields
 865         };
 866
 867 This linked list is stored in the parse context so that "reduce"
 868 functions can find or add variables, and so the analysis phase can
 869 ensure that every variable gets a type.
 870
 871 ###### parse context
 872
 873         struct binding *varlist;  // In lexical order
 874
 875 ###### ast functions
 876
 877         static struct binding *find_binding(struct parse_context *c, struct text s)
 878         {
 879                 struct binding **l = &c->varlist;
 880                 struct binding *n;
 881                 int cmp = 1;
 882
 883                 while (*l &&
 884                         (cmp = text_cmp((*l)->name, s)) < 0)
 885                                 l = & (*l)->next;
 886                 if (cmp == 0)
 887                         return *l;
 888                 n = calloc(1, sizeof(*n));
 889                 n->name = s;
 890                 n->next = *l;
 891                 *l = n;
 892                 return n;
 893         }
 894
 895 Each name can be linked to multiple variables defined in different
 896 scopes.  Each scope starts where the name is declared and continues
 897 until the end of the containing code block.  Scopes of a given name
 898 cannot nest, so a declaration while a name is in-scope is an error.
 899
 900 ###### binding fields
 901         struct variable *var;
 902
 903 ###### ast
 904         struct variable {
 905                 struct variable *previous;
 906                 struct value val;
 907                 struct binding *name;
 908                 struct exec *where_decl;// where name was declared
 909                 struct exec *where_set; // where type was set
 910                 ## variable fields
 911         };
 912
 913 While the naming seems strange, we include local constants in the
 914 definition of variables.  A name declared `var := value` can
 915 subsequently be changed, but a name declared `var ::= value` cannot -
 916 it is constant
 917
 918 ###### variable fields
 919         int constant;
 920
 921 Scopes in parallel branches can be partially merged.  More
 922 specifically, if a given name is declared in both branches of an
 923 if/else then its scope is a candidate for merging.  Similarly if
 924 every branch of an exhaustive switch (e.g. has an "else" clause)
 925 declares a given name, then the scopes from the branches are
 926 candidates for merging.
 927
 928 Note that names declared inside a loop (which is only parallel to
 929 itself) are never visible after the loop.  Similarly names defined in
 930 scopes which are not parallel, such as those started by `for` and
 931 `switch`, are never visible after the scope.  Only variables defined in
 932 both `then` and `else` (including the implicit then after an `if`, and
 933 excluding `then` used with `for`) and in all `case`s and `else` of a
 934 `switch` or `while` can be visible beyond the `if`/`switch`/`while`.
 935
 936 Labels, which are a bit like variables, follow different rules.
 937 Labels are not explicitly declared, but if an undeclared name appears
 938 in a context where a label is legal, that effectively declares the
 939 name as a label.  The declaration remains in force (or in scope) at
 940 least to the end of the immediately containing block and conditionally
 941 in any larger containing block which does not declare the name in some
 942 other way.  Importantly, the conditional scope extension happens even
 943 if the label is only used in one parallel branch of a conditional --
 944 when used in one branch it is treated as having been declared in all
 945 branches.
 946
 947 Merge candidates are tentatively visible beyond the end of the
 948 branching statement which creates them.  If the name is used, the
 949 merge is affirmed and they become a single variable visible at the
 950 outer layer.  If not - if it is redeclared first - the merge lapses.
 951
 952 To track scopes we have an extra stack, implemented as a linked list,
 953 which roughly parallels the parse stack and which is used exclusively
 954 for scoping.  When a new scope is opened, a new frame is pushed and
 955 the child-count of the parent frame is incremented.  This child-count
 956 is used to distinguish between the first of a set of parallel scopes,
 957 in which declared variables must not be in scope, and subsequent
 958 branches, whether they must already be conditionally scoped.
 959
 960 To push a new frame *before* any code in the frame is parsed, we need a
 961 grammar reduction.  This is most easily achieved with a grammar
 962 element which derives the empty string, and creates the new scope when
 963 it is recognized.  This can be placed, for example, between a keyword
 964 like "if" and the code following it.
 965
 966 ###### ast
 967         struct scope {
 968                 struct scope *parent;
 969                 int child_count;
 970         };
 971
 972 ###### parse context
 973         int scope_depth;
 974         struct scope *scope_stack;
 975
 976 ###### ast functions
 977         static void scope_pop(struct parse_context *c)
 978         {
 979                 struct scope *s = c->scope_stack;
 980
 981                 c->scope_stack = s->parent;
 982                 free(s);
 983                 c->scope_depth -= 1;
 984         }
 985
 986         static void scope_push(struct parse_context *c)
 987         {
 988                 struct scope *s = calloc(1, sizeof(*s));
 989                 if (c->scope_stack)
 990                         c->scope_stack->child_count += 1;
 991                 s->parent = c->scope_stack;
 992                 c->scope_stack = s;
 993                 c->scope_depth += 1;
 994         }
 995
 996 ###### Grammar
 997
 998         $void
 999         OpenScope -> ${ scope_push(config2context(config)); }$
1000
1001
1002 Each variable records a scope depth and is in one of four states:
1003
1004 - "in scope".  This is the case between the declaration of the
1005   variable and the end of the containing block, and also between
1006   the usage with affirms a merge and the end of that block.
1007
1008   The scope depth is not greater than the current parse context scope
1009   nest depth.  When the block of that depth closes, the state will
1010   change.  To achieve this, all "in scope" variables are linked
1011   together as a stack in nesting order.
1012
1013 - "pending".  The "in scope" block has closed, but other parallel
1014   scopes are still being processed.  So far, every parallel block at
1015   the same level that has closed has declared the name.
1016
1017   The scope depth is the depth of the last parallel block that
1018   enclosed the declaration, and that has closed.
1019
1020 - "conditionally in scope".  The "in scope" block and all parallel
1021   scopes have closed, and no further mention of the name has been
1022   seen.  This state includes a secondary nest depth which records the
1023   outermost scope seen since the variable became conditionally in
1024   scope.  If a use of the name is found, the variable becomes "in
1025   scope" and that secondary depth becomes the recorded scope depth.
1026   If the name is declared as a new variable, the old variable becomes
1027   "out of scope" and the recorded scope depth stays unchanged.
1028
1029 - "out of scope".  The variable is neither in scope nor conditionally
1030   in scope.  It is permanently out of scope now and can be removed from
1031   the "in scope" stack.
1032
1033
1034 ###### variable fields
1035         int depth, min_depth;
1036         enum { OutScope, PendingScope, CondScope, InScope } scope;
1037         struct variable *in_scope;
1038
1039 ###### parse context
1040
1041         struct variable *in_scope;
1042
1043 All variables with the same name are linked together using the
1044 'previous' link.  Those variable that have
1045 been affirmatively merged all have a 'merged' pointer that points to
1046 one primary variable - the most recently declared instance. When
1047 merging variables, we need to also adjust the 'merged' pointer on any
1048 other variables that had previously been merged with the one that will
1049 no longer be primary.
1050
1051 ###### variable fields
1052         struct variable *merged;
1053
1054 ###### ast functions
1055
1056         static void variable_merge(struct variable *primary, struct variable *secondary)
1057         {
1058                 struct variable *v;
1059
1060                 if (primary->merged)
1061                         // shouldn't happen
1062                         primary = primary->merged;
1063
1064                 for (v = primary->previous; v; v=v->previous)
1065                         if (v == secondary || v == secondary->merged ||
1066                             v->merged == secondary ||
1067                             (v->merged && v->merged == secondary->merged)) {
1068                                 v->scope = OutScope;
1069                                 v->merged = primary;
1070                         }
1071         }
1072
1073 ###### free context vars
1074
1075         while (context.varlist) {
1076                 struct binding *b = context.varlist;
1077                 struct variable *v = b->var;
1078                 context.varlist = b->next;
1079                 free(b);
1080                 while (v) {
1081                         struct variable *t = v;
1082
1083                         v = t->previous;
1084                         free_value(t->val);
1085                         free(t);
1086                 }
1087         }
1088
1089 #### Manipulating Bindings
1090
1091 When a name is conditionally visible, a new declaration discards the
1092 old binding - the condition lapses.  Conversely a usage of the name
1093 affirms the visibility and extends it to the end of the containing
1094 block - i.e. the block that contains both the original declaration and
1095 the latest usage.  This is determined from `min_depth`.  When a
1096 conditionally visible variable gets affirmed like this, it is also
1097 merged with other conditionally visible variables with the same name.
1098
1099 When we parse a variable declaration we either signal an error if the
1100 name is currently bound, or create a new variable at the current nest
1101 depth if the name is unbound or bound to a conditionally scoped or
1102 pending-scope variable.  If the previous variable was conditionally
1103 scoped, it and its homonyms becomes out-of-scope.
1104
1105 When we parse a variable reference (including non-declarative
1106 assignment) we signal an error if the name is not bound or is bound to
1107 a pending-scope variable; update the scope if the name is bound to a
1108 conditionally scoped variable; or just proceed normally if the named
1109 variable is in scope.
1110
1111 When we exit a scope, any variables bound at this level are either
1112 marked out of scope or pending-scoped, depending on whether the
1113 scope was sequential or parallel.
1114
1115 When exiting a parallel scope we check if there are any variables that
1116 were previously pending and are still visible. If there are, then
1117 there weren't redeclared in the most recent scope, so they cannot be
1118 merged and must become out-of-scope.  If it is not the first of
1119 parallel scopes (based on `child_count`), we check that there was a
1120 previous binding that is still pending-scope.  If there isn't, the new
1121 variable must now be out-of-scope.
1122
1123 When exiting a sequential scope that immediately enclosed parallel
1124 scopes, we need to resolve any pending-scope variables.  If there was
1125 no `else` clause, and we cannot determine that the `switch` was exhaustive,
1126 we need to mark all pending-scope variable as out-of-scope.  Otherwise
1127 all pending-scope variables become conditionally scoped.
1128
1129 ###### ast
1130         enum closetype { CloseSequential, CloseParallel, CloseElse };
1131
1132 ###### ast functions
1133
1134         static struct variable *var_decl(struct parse_context *c, struct text s)
1135         {
1136                 struct binding *b = find_binding(c, s);
1137                 struct variable *v = b->var;
1138
1139                 switch (v ? v->scope : OutScope) {
1140                 case InScope:
1141                         /* Caller will report the error */
1142                         return NULL;
1143                 case CondScope:
1144                         for (;
1145                              v && v->scope == CondScope;
1146                              v = v->previous)
1147                                 v->scope = OutScope;
1148                         break;
1149                 default: break;
1150                 }
1151                 v = calloc(1, sizeof(*v));
1152                 v->previous = b->var;
1153                 b->var = v;
1154                 v->name = b;
1155                 v->min_depth = v->depth = c->scope_depth;
1156                 v->scope = InScope;
1157                 v->in_scope = c->in_scope;
1158                 c->in_scope = v;
1159                 v->val = val_prepare(NULL);
1160                 return v;
1161         }
1162
1163         static struct variable *var_ref(struct parse_context *c, struct text s)
1164         {
1165                 struct binding *b = find_binding(c, s);
1166                 struct variable *v = b->var;
1167                 struct variable *v2;
1168
1169                 switch (v ? v->scope : OutScope) {
1170                 case OutScope:
1171                 case PendingScope:
1172                         /* Signal an error - once that is possible */
1173                         return NULL;
1174                 case CondScope:
1175                         /* All CondScope variables of this name need to be merged
1176                          * and become InScope
1177                          */
1178                         v->depth = v->min_depth;
1179                         v->scope = InScope;
1180                         for (v2 = v->previous;
1181                              v2 && v2->scope == CondScope;
1182                              v2 = v2->previous)
1183                                 variable_merge(v, v2);
1184                         break;
1185                 case InScope:
1186                         break;
1187                 }
1188                 return v;
1189         }
1190
1191         static void var_block_close(struct parse_context *c, enum closetype ct)
1192         {
1193                 /* close of all variables that are in_scope */
1194                 struct variable *v, **vp, *v2;
1195
1196                 scope_pop(c);
1197                 for (vp = &c->in_scope;
1198                      v = *vp, v && v->depth > c->scope_depth && v->min_depth > c->scope_depth;
1199                      ) {
1200                         switch (ct) {
1201                         case CloseElse:
1202                         case CloseParallel: /* handle PendingScope */
1203                                 switch(v->scope) {
1204                                 case InScope:
1205                                 case CondScope:
1206                                         if (c->scope_stack->child_count == 1)
1207                                                 v->scope = PendingScope;
1208                                         else if (v->previous &&
1209                                                  v->previous->scope == PendingScope)
1210                                                 v->scope = PendingScope;
1211                                         else if (v->val.type == Tlabel)
1212                                                 v->scope = PendingScope;
1213                                         else if (v->name->var == v)
1214                                                 v->scope = OutScope;
1215                                         if (ct == CloseElse) {
1216                                                 /* All Pending variables with this name
1217                                                  * are now Conditional */
1218                                                 for (v2 = v;
1219                                                      v2 && v2->scope == PendingScope;
1220                                                      v2 = v2->previous)
1221                                                         v2->scope = CondScope;
1222                                         }
1223                                         break;
1224                                 case PendingScope:
1225                                         for (v2 = v;
1226                                              v2 && v2->scope == PendingScope;
1227                                              v2 = v2->previous)
1228                                                 if (v2->val.type != Tlabel)
1229                                                         v2->scope = OutScope;
1230                                         break;
1231                                 case OutScope: break;
1232                                 }
1233                                 break;
1234                         case CloseSequential:
1235                                 if (v->val.type == Tlabel)
1236                                         v->scope = PendingScope;
1237                                 switch (v->scope) {
1238                                 case InScope:
1239                                         v->scope = OutScope;
1240                                         break;
1241                                 case PendingScope:
1242                                         /* There was no 'else', so we can only become
1243                                          * conditional if we know the cases were exhaustive,
1244                                          * and that doesn't mean anything yet.
1245                                          * So only labels become conditional..
1246                                          */
1247                                         for (v2 = v;
1248                                              v2 && v2->scope == PendingScope;
1249                                              v2 = v2->previous)
1250                                                 if (v2->val.type == Tlabel) {
1251                                                         v2->scope = CondScope;
1252                                                         v2->min_depth = c->scope_depth;
1253                                                 } else
1254                                                         v2->scope = OutScope;
1255                                         break;
1256                                 case CondScope:
1257                                 case OutScope: break;
1258                                 }
1259                                 break;
1260                         }
1261                         if (v->scope == OutScope)
1262                                 *vp = v->in_scope;
1263                         else
1264                                 vp = &v->in_scope;
1265                 }
1266         }
1267
1268 ### Executables
1269
1270 Executables can be lots of different things.  In many cases an
1271 executable is just an operation combined with one or two other
1272 executables.  This allows for expressions and lists etc.  Other times
1273 an executable is something quite specific like a constant or variable
1274 name.  So we define a `struct exec` to be a general executable with a
1275 type, and a `struct binode` which is a subclass of `exec`, forms a
1276 node in a binary tree, and holds an operation. There will be other
1277 subclasses, and to access these we need to be able to `cast` the
1278 `exec` into the various other types.
1279
1280 ###### macros
1281         #define cast(structname, pointer) ({            \
1282                 const typeof( ((struct structname *)0)->type) *__mptr = &(pointer)->type; \
1283                 if (__mptr && *__mptr != X##structname) abort();                \
1284                 (struct structname *)( (char *)__mptr);})
1285
1286         #define new(structname) ({                                              \
1287                 struct structname *__ptr = ((struct structname *)calloc(1,sizeof(struct structname))); \
1288                 __ptr->type = X##structname;                                            \
1289                 __ptr->line = -1; __ptr->column = -1;                                   \
1290                 __ptr;})
1291
1292         #define new_pos(structname, token) ({                                           \
1293                 struct structname *__ptr = ((struct structname *)calloc(1,sizeof(struct structname))); \
1294                 __ptr->type = X##structname;                                            \
1295                 __ptr->line = token.line; __ptr->column = token.col;                    \
1296                 __ptr;})
1297
1298 ###### ast
1299         enum exec_types {
1300                 Xbinode,
1301                 ## exec type
1302         };
1303         struct exec {
1304                 enum exec_types type;
1305                 int line, column;
1306         };
1307         struct binode {
1308                 struct exec;
1309                 enum Btype {
1310                         ## Binode types
1311                 } op;
1312                 struct exec *left, *right;
1313         };
1314
1315 ###### ast functions
1316
1317         static int __fput_loc(struct exec *loc, FILE *f)
1318         {
1319                 if (!loc)
1320                         return 0;               // NOTEST
1321                 if (loc->line >= 0) {
1322                         fprintf(f, "%d:%d: ", loc->line, loc->column);
1323                         return 1;
1324                 }
1325                 if (loc->type == Xbinode)
1326                         return __fput_loc(cast(binode,loc)->left, f) ||
1327                                __fput_loc(cast(binode,loc)->right, f);
1328                 return 0;
1329         }
1330         static void fput_loc(struct exec *loc, FILE *f)
1331         {
1332                 if (!__fput_loc(loc, f))
1333                         fprintf(f, "??:??: ");  // NOTEST
1334         }
1335
1336 Each different type of `exec` node needs a number of functions
1337 defined, a bit like methods.  We must be able to be able to free it,
1338 print it, analyse it and execute it.  Once we have specific `exec`
1339 types we will need to parse them too.  Let's take this a bit more
1340 slowly.
1341
1342 #### Freeing
1343
1344 The parser generator requires a `free_foo` function for each struct
1345 that stores attributes and they will be `exec`s and subtypes there-of.
1346 So we need `free_exec` which can handle all the subtypes, and we need
1347 `free_binode`.
1348
1349 ###### ast functions
1350
1351         static void free_binode(struct binode *b)
1352         {
1353                 if (!b)
1354                         return;
1355                 free_exec(b->left);
1356                 free_exec(b->right);
1357                 free(b);
1358         }
1359
1360 ###### core functions
1361         static void free_exec(struct exec *e)
1362         {
1363                 if (!e)
1364                         return;
1365                 switch(e->type) {
1366                         ## free exec cases
1367                 }
1368         }
1369
1370 ###### forward decls
1371
1372         static void free_exec(struct exec *e);
1373
1374 ###### free exec cases
1375         case Xbinode: free_binode(cast(binode, e)); break;
1376
1377 #### Printing
1378
1379 Printing an `exec` requires that we know the current indent level for
1380 printing line-oriented components.  As will become clear later, we
1381 also want to know what sort of bracketing to use.
1382
1383 ###### ast functions
1384
1385         static void do_indent(int i, char *str)
1386         {
1387                 while (i--)
1388                         printf("    ");
1389                 printf("%s", str);
1390         }
1391
1392 ###### core functions
1393         static void print_binode(struct binode *b, int indent, int bracket)
1394         {
1395                 struct binode *b2;
1396                 switch(b->op) {
1397                 ## print binode cases
1398                 }
1399         }
1400
1401         static void print_exec(struct exec *e, int indent, int bracket)
1402         {
1403                 if (!e)
1404                         return;         // NOTEST
1405                 switch (e->type) {
1406                 case Xbinode:
1407                         print_binode(cast(binode, e), indent, bracket); break;
1408                 ## print exec cases
1409                 }
1410         }
1411
1412 ###### forward decls
1413
1414         static void print_exec(struct exec *e, int indent, int bracket);
1415
1416 #### Analysing
1417
1418 As discussed, analysis involves propagating type requirements around
1419 the program and looking for errors.
1420
1421 So `propagate_types` is passed an expected type (being a `struct type`
1422 pointer together with some `val_rules` flags) that the `exec` is
1423 expected to return, and returns the type that it does return, either
1424 of which can be `NULL` signifying "unknown".  An `ok` flag is passed
1425 by reference. It is set to `0` when an error is found, and `2` when
1426 any change is made.  If it remains unchanged at `1`, then no more
1427 propagation is needed.
1428
1429 ###### ast
1430
1431         enum val_rules {Rnolabel = 1<<0, Rboolok = 1<<1, Rnoconstant = 2<<1};
1432
1433 ###### format cases
1434         case 'r':
1435                 if (rules & Rnolabel)
1436                         fputs(" (labels not permitted)", stderr);
1437                 break;
1438
1439 ###### core functions
1440
1441         static struct type *propagate_types(struct exec *prog, struct parse_context *c, int *ok,
1442                                             struct type *type, int rules)
1443         {
1444                 struct type *t;
1445
1446                 if (!prog)
1447                         return Tnone;
1448
1449                 switch (prog->type) {
1450                 case Xbinode:
1451                 {
1452                         struct binode *b = cast(binode, prog);
1453                         switch (b->op) {
1454                         ## propagate binode cases
1455                         }
1456                         break;
1457                 }
1458                 ## propagate exec cases
1459                 }
1460                 return Tnone;
1461         }
1462
1463 #### Interpreting
1464
1465 Interpreting an `exec` doesn't require anything but the `exec`.  State
1466 is stored in variables and each variable will be directly linked from
1467 within the `exec` tree.  The exception to this is the whole `program`
1468 which needs to look at command line arguments.  The `program` will be
1469 interpreted separately.
1470
1471 Each `exec` can return a value, which may be `Tnone` but must be non-NULL;
1472
1473 ###### core functions
1474
1475         struct lrval {
1476                 struct value val, *lval;
1477         };
1478
1479         static struct lrval _interp_exec(struct exec *e);
1480
1481         static struct value interp_exec(struct exec *e)
1482         {
1483                 struct lrval ret = _interp_exec(e);
1484
1485                 if (ret.lval)
1486                         return dup_value(*ret.lval);
1487                 else
1488                         return ret.val;
1489         }
1490
1491         static struct value *linterp_exec(struct exec *e)
1492         {
1493                 struct lrval ret = _interp_exec(e);
1494
1495                 return ret.lval;
1496         }
1497
1498         static struct lrval _interp_exec(struct exec *e)
1499         {
1500                 struct lrval ret;
1501                 struct value rv, *lrv = NULL;
1502                 rv.type = Tnone;
1503                 if (!e) {
1504                         ret.lval = lrv;
1505                         ret.val = rv;
1506                         return ret;
1507                 }
1508
1509                 switch(e->type) {
1510                 case Xbinode:
1511                 {
1512                         struct binode *b = cast(binode, e);
1513                         struct value left, right, *lleft;
1514                         left.type = right.type = Tnone;
1515                         switch (b->op) {
1516                         ## interp binode cases
1517                         }
1518                         free_value(left); free_value(right);
1519                         break;
1520                 }
1521                 ## interp exec cases
1522                 }
1523                 ret.lval = lrv;
1524                 ret.val = rv;
1525                 return ret;
1526         }
1527
1528 ### Complex types
1529
1530 Now that we have the shape of the interpreter in place we can add some
1531 complex types and connected them in to the data structures and the
1532 different phases of parse, analyse, print, interpret.
1533
1534 For now, just arrays.
1535
1536 #### Arrays
1537
1538 Arrays can be declared by giving a size and a type, as `[size]type' so
1539 `freq:[26]number` declares `freq` to be an array of 26 numbers.  The
1540 size can be an arbitrary expression which is evaluated when the name
1541 comes into scope.
1542
1543 Arrays cannot be assigned.  When pointers are introduced we will also
1544 introduce array slices which can refer to part or all of an array -
1545 the assignment syntax will create a slice.  For now, an array can only
1546 ever be referenced by the name it is declared with.  It is likely that
1547 a "`copy`" primitive will eventually be define which can be used to
1548 make a copy of an array with controllable depth.
1549
1550 ###### type union fields
1551
1552         struct {
1553                 int size;
1554                 struct variable *vsize;
1555                 struct type *member;
1556         } array;
1557
1558 ###### value union fields
1559         struct {
1560                 struct value *elmnts;
1561         } array;
1562
1563 ###### value functions
1564
1565         static struct value array_prepare(struct type *type)
1566         {
1567                 struct value ret;
1568
1569                 ret.type = type;
1570                 ret.array.elmnts = NULL;
1571                 return ret;
1572         }
1573
1574         static struct value array_init(struct type *type)
1575         {
1576                 struct value ret;
1577                 int i;
1578
1579                 ret.type = type;
1580                 if (type->array.vsize) {
1581                         mpz_t q;
1582                         mpz_init(q);
1583                         mpz_tdiv_q(q, mpq_numref(type->array.vsize->val.num),
1584                                    mpq_denref(type->array.vsize->val.num));
1585                         type->array.size = mpz_get_si(q);
1586                         mpz_clear(q);
1587                 }
1588                 ret.array.elmnts = calloc(type->array.size,
1589                                           sizeof(ret.array.elmnts[0]));
1590                 for (i = 0; ret.array.elmnts && i < type->array.size; i++)
1591                         ret.array.elmnts[i] = val_init(type->array.member);
1592                 return ret;
1593         }
1594
1595         static void array_free(struct value val)
1596         {
1597                 int i;
1598
1599                 if (val.array.elmnts)
1600                         for (i = 0; i < val.type->array.size; i++)
1601                                 free_value(val.array.elmnts[i]);
1602                 free(val.array.elmnts);
1603         }
1604
1605         static int array_compat(struct type *require, struct type *have)
1606         {
1607                 if (have->compat != require->compat)
1608                         return 0;
1609                 /* Both are arrays, so we can look at details */
1610                 if (!type_compat(require->array.member, have->array.member, 0))
1611                         return 0;
1612                 if (require->array.vsize == NULL && have->array.vsize == NULL)
1613                         return require->array.size == have->array.size;
1614
1615                 return require->array.vsize == have->array.vsize;
1616         }
1617
1618         static void array_print_type(struct type *type, FILE *f)
1619         {
1620                 fputs("[", f);
1621                 if (type->array.vsize) {
1622                         struct binding *b = type->array.vsize->name;
1623                         fprintf(f, "%.*s]", b->name.len, b->name.txt);
1624                 } else
1625                         fprintf(f, "%d]", type->array.size);
1626                 type_print(type->array.member, f);
1627         }
1628
1629         static struct type array_prototype = {
1630                 .prepare = array_prepare,
1631                 .init = array_init,
1632                 .print_type = array_print_type,
1633                 .compat = array_compat,
1634                 .free = array_free,
1635         };
1636
1637 ###### type grammar
1638
1639         | [ NUMBER ] Type ${
1640                 $0 = calloc(1, sizeof(struct type));
1641                 *($0) = array_prototype;
1642                 $0->array.member = $<4;
1643                 $0->array.vsize = NULL;
1644                 {
1645                 struct parse_context *c = config2context(config);
1646                 char tail[3];
1647                 mpq_t num;
1648                 if (number_parse(num, tail, $2.txt) == 0)
1649                         tok_err(c, "error: unrecognised number", &$2);
1650                 else if (tail[0])
1651                         tok_err(c, "error: unsupported number suffix", &$2);
1652                 else {
1653                         $0->array.size = mpz_get_ui(mpq_numref(num));
1654                         if (mpz_cmp_ui(mpq_denref(num), 1) != 0) {
1655                                 tok_err(c, "error: array size must be an integer",
1656                                         &$2);
1657                         } else if (mpz_cmp_ui(mpq_numref(num), 1UL << 30) >= 0)
1658                                 tok_err(c, "error: array size is too large",
1659                                         &$2);
1660                         mpq_clear(num);
1661                 }
1662                 $0->next= c->anon_typelist;
1663                 c->anon_typelist = $0;
1664                 }
1665         }$
1666
1667         | [ IDENTIFIER ] Type ${ {
1668                 struct parse_context *c = config2context(config);
1669                 struct variable *v = var_ref(c, $2.txt);
1670
1671                 if (!v)
1672                         tok_err(config2context(config), "error: name undeclared", &$2);
1673                 else if (!v->constant)
1674                         tok_err(config2context(config), "error: array size must be a constant", &$2);
1675
1676                 $0 = calloc(1, sizeof(struct type));
1677                 *($0) = array_prototype;
1678                 $0->array.member = $<4;
1679                 $0->array.size = 0;
1680                 $0->array.vsize = v;
1681                 $0->next= c->anon_typelist;
1682                 c->anon_typelist = $0;
1683         } }$
1684
1685 ###### parse context
1686
1687         struct type *anon_typelist;
1688
1689 ###### free context types
1690
1691         while (context.anon_typelist) {
1692                 struct type *t = context.anon_typelist;
1693
1694                 context.anon_typelist = t->next;
1695                 free(t);
1696         }
1697
1698 ###### Binode types
1699         Index,
1700
1701 ###### variable grammar
1702
1703         | Variable [ Expression ] ${ {
1704                 struct binode *b = new(binode);
1705                 b->op = Index;
1706                 b->left = $<1;
1707                 b->right = $<3;
1708                 $0 = b;
1709         } }$
1710
1711 ###### print binode cases
1712         case Index:
1713                 print_exec(b->left, -1, 0);
1714                 printf("[");
1715                 print_exec(b->right, -1, 0);
1716                 printf("]");
1717                 break;
1718
1719 ###### propagate binode cases
1720         case Index:
1721                 /* left must be an array, right must be a number,
1722                  * result is the member type of the array
1723                  */
1724                 propagate_types(b->right, c, ok, Tnum, 0);
1725                 t = propagate_types(b->left, c, ok, NULL, rules & Rnoconstant);
1726                 if (!t || t->compat != array_compat) {
1727                         type_err(c, "error: %1 cannot be indexed", prog, t, 0, NULL);
1728                         *ok = 0;
1729                         return NULL;
1730                 } else {
1731                         if (!type_compat(type, t->array.member, rules)) {
1732                                 type_err(c, "error: have %1 but need %2", prog,
1733                                          t->array.member, rules, type);
1734                                 *ok = 0;
1735                         }
1736                         return t->array.member;
1737                 }
1738                 break;
1739
1740 ###### interp binode cases
1741         case Index: {
1742                 mpz_t q;
1743                 long i;
1744
1745                 lleft = linterp_exec(b->left);
1746                 right = interp_exec(b->right);
1747                 mpz_init(q);
1748                 mpz_tdiv_q(q, mpq_numref(right.num), mpq_denref(right.num));
1749                 i = mpz_get_si(q);
1750                 mpz_clear(q);
1751
1752                 if (i >= 0 && i < lleft->type->array.size)
1753                         lrv = &lleft->array.elmnts[i];
1754                 else
1755                         rv = val_init(lleft->type->array.member);
1756                 break;
1757         }
1758
1759 ## Language elements
1760
1761 Each language element needs to be parsed, printed, analysed,
1762 interpreted, and freed.  There are several, so let's just start with
1763 the easy ones and work our way up.
1764
1765 ### Values
1766
1767 We have already met values as separate objects.  When manifest
1768 constants appear in the program text, that must result in an executable
1769 which has a constant value.  So the `val` structure embeds a value in
1770 an executable.
1771
1772 ###### exec type
1773         Xval,
1774
1775 ###### ast
1776         struct val {
1777                 struct exec;
1778                 struct value val;
1779         };
1780
1781 ###### Grammar
1782
1783         $*val
1784         Value ->  True ${
1785                         $0 = new_pos(val, $1);
1786                         $0->val.type = Tbool;
1787                         $0->val.bool = 1;
1788                         }$
1789                 | False ${
1790                         $0 = new_pos(val, $1);
1791                         $0->val.type = Tbool;
1792                         $0->val.bool = 0;
1793                         }$
1794                 | NUMBER ${
1795                         $0 = new_pos(val, $1);
1796                         $0->val.type = Tnum;
1797                         {
1798                         char tail[3];
1799                         if (number_parse($0->val.num, tail, $1.txt) == 0)
1800                                 mpq_init($0->val.num);
1801                                 if (tail[0])
1802                                         tok_err(config2context(config), "error: unsupported number suffix",
1803                                                 &$1);
1804                         }
1805                         }$
1806                 | STRING ${
1807                         $0 = new_pos(val, $1);
1808                         $0->val.type = Tstr;
1809                         {
1810                         char tail[3];
1811                         string_parse(&$1, '\\', &$0->val.str, tail);
1812                         if (tail[0])
1813                                 tok_err(config2context(config), "error: unsupported string suffix",
1814                                         &$1);
1815                         }
1816                         }$
1817                 | MULTI_STRING ${
1818                         $0 = new_pos(val, $1);
1819                         $0->val.type = Tstr;
1820                         {
1821                         char tail[3];
1822                         string_parse(&$1, '\\', &$0->val.str, tail);
1823                         if (tail[0])
1824                                 tok_err(config2context(config), "error: unsupported string suffix",
1825                                         &$1);
1826                         }
1827                         }$
1828
1829 ###### print exec cases
1830         case Xval:
1831         {
1832                 struct val *v = cast(val, e);
1833                 if (v->val.type == Tstr)
1834                         printf("\"");
1835                 print_value(v->val);
1836                 if (v->val.type == Tstr)
1837                         printf("\"");
1838                 break;
1839         }
1840
1841 ###### propagate exec cases
1842                 case Xval:
1843                 {
1844                         struct val *val = cast(val, prog);
1845                         if (!type_compat(type, val->val.type, rules)) {
1846                                 type_err(c, "error: expected %1%r found %2",
1847                                            prog, type, rules, val->val.type);
1848                                 *ok = 0;
1849                         }
1850                         return val->val.type;
1851                 }
1852
1853 ###### interp exec cases
1854         case Xval:
1855                 rv = dup_value(cast(val, e)->val);
1856                 break;
1857
1858 ###### ast functions
1859         static void free_val(struct val *v)
1860         {
1861                 if (!v)
1862                         return;
1863                 free_value(v->val);
1864                 free(v);
1865         }
1866
1867 ###### free exec cases
1868         case Xval: free_val(cast(val, e)); break;
1869
1870 ###### ast functions
1871         // Move all nodes from 'b' to 'rv', reversing the order.
1872         // In 'b' 'left' is a list, and 'right' is the last node.
1873         // In 'rv', left' is the first node and 'right' is a list.
1874         static struct binode *reorder_bilist(struct binode *b)
1875         {
1876                 struct binode *rv = NULL;
1877
1878                 while (b) {
1879                         struct exec *t = b->right;
1880                         b->right = rv;
1881                         rv = b;
1882                         if (b->left)
1883                                 b = cast(binode, b->left);
1884                         else
1885                                 b = NULL;
1886                         rv->left = t;
1887                 }
1888                 return rv;
1889         }
1890
1891 ### Variables
1892
1893 Just as we used a `val` to wrap a value into an `exec`, we similarly
1894 need a `var` to wrap a `variable` into an exec.  While each `val`
1895 contained a copy of the value, each `var` hold a link to the variable
1896 because it really is the same variable no matter where it appears.
1897 When a variable is used, we need to remember to follow the `->merged`
1898 link to find the primary instance.
1899
1900 ###### exec type
1901         Xvar,
1902
1903 ###### ast
1904         struct var {
1905                 struct exec;
1906                 struct variable *var;
1907         };
1908
1909 ###### Grammar
1910
1911         $*var
1912         VariableDecl -> IDENTIFIER : ${ {
1913                 struct variable *v = var_decl(config2context(config), $1.txt);
1914                 $0 = new_pos(var, $1);
1915                 $0->var = v;
1916                 if (v)
1917                         v->where_decl = $0;
1918                 else {
1919                         v = var_ref(config2context(config), $1.txt);
1920                         $0->var = v;
1921                         type_err(config2context(config), "error: variable '%v' redeclared",
1922                                  $0, Tnone, 0, Tnone);
1923                         type_err(config2context(config), "info: this is where '%v' was first declared",
1924                                  v->where_decl, Tnone, 0, Tnone);
1925                 }
1926         } }$
1927             | IDENTIFIER :: ${ {
1928                 struct variable *v = var_decl(config2context(config), $1.txt);
1929                 $0 = new_pos(var, $1);
1930                 $0->var = v;
1931                 if (v) {
1932                         v->where_decl = $0;
1933                         v->constant = 1;
1934                 } else {
1935                         v = var_ref(config2context(config), $1.txt);
1936                         $0->var = v;
1937                         type_err(config2context(config), "error: variable '%v' redeclared",
1938                                  $0, Tnone, 0, Tnone);
1939                         type_err(config2context(config), "info: this is where '%v' was first declared",
1940                                  v->where_decl, Tnone, 0, Tnone);
1941                 }
1942         } }$
1943             | IDENTIFIER : Type ${ {
1944                 struct variable *v = var_decl(config2context(config), $1.txt);
1945                 $0 = new_pos(var, $1);
1946                 $0->var = v;
1947                 if (v) {
1948                         v->where_decl = $0;
1949                         v->where_set = $0;
1950                         v->val = val_prepare($<3);
1951                 } else {
1952                         v = var_ref(config2context(config), $1.txt);
1953                         $0->var = v;
1954                         type_err(config2context(config), "error: variable '%v' redeclared",
1955                                  $0, Tnone, 0, Tnone);
1956                         type_err(config2context(config), "info: this is where '%v' was first declared",
1957                                  v->where_decl, Tnone, 0, Tnone);
1958                 }
1959         } }$
1960             | IDENTIFIER :: Type ${ {
1961                 struct variable *v = var_decl(config2context(config), $1.txt);
1962                 $0 = new_pos(var, $1);
1963                 $0->var = v;
1964                 if (v) {
1965                         v->where_decl = $0;
1966                         v->where_set = $0;
1967                         v->val = val_prepare($<3);
1968                         v->constant = 1;
1969                 } else {
1970                         v = var_ref(config2context(config), $1.txt);
1971                         $0->var = v;
1972                         type_err(config2context(config), "error: variable '%v' redeclared",
1973                                  $0, Tnone, 0, Tnone);
1974                         type_err(config2context(config), "info: this is where '%v' was first declared",
1975                                  v->where_decl, Tnone, 0, Tnone);
1976                 }
1977         } }$
1978
1979         $*exec
1980         Variable -> IDENTIFIER ${ {
1981                 struct variable *v = var_ref(config2context(config), $1.txt);
1982                 $0 = new_pos(var, $1);
1983                 if (v == NULL) {
1984                         /* This might be a label - allocate a var just in case */
1985                         v = var_decl(config2context(config), $1.txt);
1986                         if (v) {
1987                                 v->val = val_prepare(Tlabel);
1988                                 v->val.label = &v->val;
1989                                 v->where_set = $0;
1990                         }
1991                 }
1992                 cast(var, $0)->var = v;
1993         } }$
1994         ## variable grammar
1995
1996         $*type
1997         Type -> IDENTIFIER ${
1998                 $0 = find_type(config2context(config), $1.txt);
1999                 if (!$0) {
2000                         tok_err(config2context(config),
2001                                 "error: undefined type", &$1);
2002
2003                         $0 = Tnone;
2004                 }
2005         }$
2006         ## type grammar
2007
2008 ###### print exec cases
2009         case Xvar:
2010         {
2011                 struct var *v = cast(var, e);
2012                 if (v->var) {
2013                         struct binding *b = v->var->name;
2014                         printf("%.*s", b->name.len, b->name.txt);
2015                 }
2016                 break;
2017         }
2018
2019 ###### format cases
2020         case 'v':
2021                 if (loc->type == Xvar) {
2022                         struct var *v = cast(var, loc);
2023                         if (v->var) {
2024                                 struct binding *b = v->var->name;
2025                                 fprintf(stderr, "%.*s", b->name.len, b->name.txt);
2026                         } else
2027                                 fputs("???", stderr);   // NOTEST
2028                 } else
2029                         fputs("NOTVAR", stderr);        // NOTEST
2030                 break;
2031
2032 ###### propagate exec cases
2033
2034         case Xvar:
2035         {
2036                 struct var *var = cast(var, prog);
2037                 struct variable *v = var->var;
2038                 if (!v) {
2039                         type_err(c, "%d:BUG: no variable!!", prog, Tnone, 0, Tnone); // NOTEST
2040                         *ok = 0;                                        // NOTEST
2041                         return Tnone;                                   // NOTEST
2042                 }
2043                 if (v->merged)
2044                         v = v->merged;
2045                 if (v->constant && (rules & Rnoconstant)) {
2046                         type_err(c, "error: Cannot assign to a constant: %v",
2047                                  prog, NULL, 0, NULL);
2048                         type_err(c, "info: name was defined as a constant here",
2049                                  v->where_decl, NULL, 0, NULL);
2050                         *ok = 0;
2051                         return v->val.type;
2052                 }
2053                 if (v->val.type == NULL) {
2054                         if (type && *ok != 0) {
2055                                 v->val = val_prepare(type);
2056                                 v->where_set = prog;
2057                                 *ok = 2;
2058                         }
2059                         return type;
2060                 }
2061                 if (!type_compat(type, v->val.type, rules)) {
2062                         type_err(c, "error: expected %1%r but variable '%v' is %2", prog,
2063                                  type, rules, v->val.type);
2064                         type_err(c, "info: this is where '%v' was set to %1", v->where_set,
2065                                  v->val.type, rules, Tnone);
2066                         *ok = 0;
2067                 }
2068                 if (!type)
2069                         return v->val.type;
2070                 return type;
2071         }
2072
2073 ###### interp exec cases
2074         case Xvar:
2075         {
2076                 struct var *var = cast(var, e);
2077                 struct variable *v = var->var;
2078
2079                 if (v->merged)
2080                         v = v->merged;
2081                 lrv = &v->val;
2082                 break;
2083         }
2084
2085 ###### ast functions
2086
2087         static void free_var(struct var *v)
2088         {
2089                 free(v);
2090         }
2091
2092 ###### free exec cases
2093         case Xvar: free_var(cast(var, e)); break;
2094
2095 ### Expressions: Conditional
2096
2097 Our first user of the `binode` will be conditional expressions, which
2098 is a bit odd as they actually have three components.  That will be
2099 handled by having 2 binodes for each expression.  The conditional
2100 expression is the lowest precedence operatior, so it gets to define
2101 what an "Expression" is.  The next level up is "BoolExpr", which
2102 comes next.
2103
2104 Conditional expressions are of the form "value `if` condition `else`
2105 other_value".  There is no associativite with this operator: the
2106 values and conditions can only be other conditional expressions if
2107 they are enclosed in parentheses.  Allowing nesting without
2108 parentheses would be too confusing.
2109
2110 ###### Binode types
2111         CondExpr,
2112
2113 ###### Grammar
2114
2115         $*exec
2116         Expression -> BoolExpr if BoolExpr else BoolExpr ${ {
2117                         struct binode *b1 = new(binode);
2118                         struct binode *b2 = new(binode);
2119                         b1->op = CondExpr;
2120                         b1->left = $<3;
2121                         b1->right = b2;
2122                         b2->op = CondExpr;
2123                         b2->left = $<1;
2124                         b2->right = $<5;
2125                         $0 = b1;
2126                 } }$
2127                 | BoolExpr ${ $0 = $<1; }$
2128
2129 ###### print binode cases
2130
2131         case CondExpr:
2132                 b2 = cast(binode, b->right);
2133                 print_exec(b2->left, -1, 0);
2134                 printf(" if ");
2135                 print_exec(b->left, -1, 0);
2136                 printf(" else ");
2137                 print_exec(b2->right, -1, 0);
2138                 break;
2139
2140 ###### propagate binode cases
2141
2142         case CondExpr: {
2143                 /* cond must be Tbool, others must match */
2144                 struct binode *b2 = cast(binode, b->right);
2145                 struct type *t2;
2146
2147                 propagate_types(b->left, c, ok, Tbool, 0);
2148                 t = propagate_types(b2->left, c, ok, type, Rnolabel);
2149                 t2 = propagate_types(b2->right, c, ok, type ?: t, Rnolabel);
2150                 return t ?: t2;
2151         }
2152
2153 ###### interp binode cases
2154
2155         case CondExpr: {
2156                 struct binode *b2 = cast(binode, b->right);
2157                 left = interp_exec(b->left);
2158                 if (left.bool)
2159                         rv = interp_exec(b2->left);
2160                 else
2161                         rv = interp_exec(b2->right);
2162                 }
2163                 break;
2164
2165 ### Expressions: Boolean
2166
2167 The next class of expressions to use the `binode` will be Boolean
2168 expressions.  As I haven't implemented precedence in the parser
2169 generator yet, we need different names for each precedence level used
2170 by expressions.  The outer most or lowest level precedence are
2171 conditional expressions are Boolean operators which form an `BoolExpr`
2172 out of `BTerm`s and `BFact`s.  As well as `or` `and`, and `not` we
2173 have `and then` and `or else` which only evaluate the second operand
2174 if the result would make a difference.
2175
2176 ###### Binode types
2177         And,
2178         AndThen,
2179         Or,
2180         OrElse,
2181         Not,
2182
2183 ###### Grammar
2184
2185         $*exec
2186         BoolExpr -> BoolExpr or BTerm ${ {
2187                         struct binode *b = new(binode);
2188                         b->op = Or;
2189                         b->left = $<1;
2190                         b->right = $<3;
2191                         $0 = b;
2192                 } }$
2193                 | BoolExpr or else BTerm ${ {
2194                         struct binode *b = new(binode);
2195                         b->op = OrElse;
2196                         b->left = $<1;
2197                         b->right = $<4;
2198                         $0 = b;
2199                 } }$
2200                 | BTerm ${ $0 = $<1; }$
2201
2202         BTerm -> BTerm and BFact ${ {
2203                         struct binode *b = new(binode);
2204                         b->op = And;
2205                         b->left = $<1;
2206                         b->right = $<3;
2207                         $0 = b;
2208                 } }$
2209                 | BTerm and then BFact ${ {
2210                         struct binode *b = new(binode);
2211                         b->op = AndThen;
2212                         b->left = $<1;
2213                         b->right = $<4;
2214                         $0 = b;
2215                 } }$
2216                 | BFact ${ $0 = $<1; }$
2217
2218         BFact -> not BFact ${ {
2219                         struct binode *b = new(binode);
2220                         b->op = Not;
2221                         b->right = $<2;
2222                         $0 = b;
2223                 } }$
2224                 ## other BFact
2225
2226 ###### print binode cases
2227         case And:
2228                 print_exec(b->left, -1, 0);
2229                 printf(" and ");
2230                 print_exec(b->right, -1, 0);
2231                 break;
2232         case AndThen:
2233                 print_exec(b->left, -1, 0);
2234                 printf(" and then ");
2235                 print_exec(b->right, -1, 0);
2236                 break;
2237         case Or:
2238                 print_exec(b->left, -1, 0);
2239                 printf(" or ");
2240                 print_exec(b->right, -1, 0);
2241                 break;
2242         case OrElse:
2243                 print_exec(b->left, -1, 0);
2244                 printf(" or else ");
2245                 print_exec(b->right, -1, 0);
2246                 break;
2247         case Not:
2248                 printf("not ");
2249                 print_exec(b->right, -1, 0);
2250                 break;
2251
2252 ###### propagate binode cases
2253         case And:
2254         case AndThen:
2255         case Or:
2256         case OrElse:
2257         case Not:
2258                 /* both must be Tbool, result is Tbool */
2259                 propagate_types(b->left, c, ok, Tbool, 0);
2260                 propagate_types(b->right, c, ok, Tbool, 0);
2261                 if (type && type != Tbool) {
2262                         type_err(c, "error: %1 operation found where %2 expected", prog,
2263                                    Tbool, 0, type);
2264                         *ok = 0;
2265                 }
2266                 return Tbool;
2267
2268 ###### interp binode cases
2269         case And:
2270                 rv = interp_exec(b->left);
2271                 right = interp_exec(b->right);
2272                 rv.bool = rv.bool && right.bool;
2273                 break;
2274         case AndThen:
2275                 rv = interp_exec(b->left);
2276                 if (rv.bool)
2277                         rv = interp_exec(b->right);
2278                 break;
2279         case Or:
2280                 rv = interp_exec(b->left);
2281                 right = interp_exec(b->right);
2282                 rv.bool = rv.bool || right.bool;
2283                 break;
2284         case OrElse:
2285                 rv = interp_exec(b->left);
2286                 if (!rv.bool)
2287                         rv = interp_exec(b->right);
2288                 break;
2289         case Not:
2290                 rv = interp_exec(b->right);
2291                 rv.bool = !rv.bool;
2292                 break;
2293
2294 ### Expressions: Comparison
2295
2296 Of slightly higher precedence that Boolean expressions are
2297 Comparisons.
2298 A comparison takes arguments of any type, but the two types must be
2299 the same.
2300
2301 To simplify the parsing we introduce an `eop` which can record an
2302 expression operator.
2303
2304 ###### ast
2305         struct eop {
2306                 enum Btype op;
2307         };
2308
2309 ###### ast functions
2310         static void free_eop(struct eop *e)
2311         {
2312                 if (e)
2313                         free(e);
2314         }
2315
2316 ###### Binode types
2317         Less,
2318         Gtr,
2319         LessEq,
2320         GtrEq,
2321         Eql,
2322         NEql,
2323
2324 ###### other BFact
2325         | Expr CMPop Expr ${ {
2326                         struct binode *b = new(binode);
2327                         b->op = $2.op;
2328                         b->left = $<1;
2329                         b->right = $<3;
2330                         $0 = b;
2331         } }$
2332         | Expr ${ $0 = $<1; }$
2333
2334 ###### Grammar
2335
2336         $eop
2337         CMPop ->   < ${ $0.op = Less; }$
2338                 |  > ${ $0.op = Gtr; }$
2339                 |  <= ${ $0.op = LessEq; }$
2340                 |  >= ${ $0.op = GtrEq; }$
2341                 |  == ${ $0.op = Eql; }$
2342                 |  != ${ $0.op = NEql; }$
2343
2344 ###### print binode cases
2345
2346         case Less:
2347         case LessEq:
2348         case Gtr:
2349         case GtrEq:
2350         case Eql:
2351         case NEql:
2352                 print_exec(b->left, -1, 0);
2353                 switch(b->op) {
2354                 case Less:   printf(" < "); break;
2355                 case LessEq: printf(" <= "); break;
2356                 case Gtr:    printf(" > "); break;
2357                 case GtrEq:  printf(" >= "); break;
2358                 case Eql:    printf(" == "); break;
2359                 case NEql:   printf(" != "); break;
2360                 default: abort();               // NOTEST
2361                 }
2362                 print_exec(b->right, -1, 0);
2363                 break;
2364
2365 ###### propagate binode cases
2366         case Less:
2367         case LessEq:
2368         case Gtr:
2369         case GtrEq:
2370         case Eql:
2371         case NEql:
2372                 /* Both must match but not be labels, result is Tbool */
2373                 t = propagate_types(b->left, c, ok, NULL, Rnolabel);
2374                 if (t)
2375                         propagate_types(b->right, c, ok, t, 0);
2376                 else {
2377                         t = propagate_types(b->right, c, ok, NULL, Rnolabel);
2378                         if (t)
2379                                 t = propagate_types(b->left, c, ok, t, 0);
2380                 }
2381                 if (!type_compat(type, Tbool, 0)) {
2382                         type_err(c, "error: Comparison returns %1 but %2 expected", prog,
2383                                     Tbool, rules, type);
2384                         *ok = 0;
2385                 }
2386                 return Tbool;
2387
2388 ###### interp binode cases
2389         case Less:
2390         case LessEq:
2391         case Gtr:
2392         case GtrEq:
2393         case Eql:
2394         case NEql:
2395         {
2396                 int cmp;
2397                 left = interp_exec(b->left);
2398                 right = interp_exec(b->right);
2399                 cmp = value_cmp(left, right);
2400                 rv.type = Tbool;
2401                 switch (b->op) {
2402                 case Less:      rv.bool = cmp <  0; break;
2403                 case LessEq:    rv.bool = cmp <= 0; break;
2404                 case Gtr:       rv.bool = cmp >  0; break;
2405                 case GtrEq:     rv.bool = cmp >= 0; break;
2406                 case Eql:       rv.bool = cmp == 0; break;
2407                 case NEql:      rv.bool = cmp != 0; break;
2408                 default: rv.bool = 0; break;    // NOTEST
2409                 }
2410                 break;
2411         }
2412
2413 ### Expressions: The rest
2414
2415 The remaining expressions with the highest precedence are arithmetic
2416 and string concatenation.  They are `Expr`, `Term`, and `Factor`.
2417 The `Factor` is where the `Value` and `Variable` that we already have
2418 are included.
2419
2420 `+` and `-` are both infix and prefix operations (where they are
2421 absolute value and negation).  These have different operator names.
2422
2423 We also have a 'Bracket' operator which records where parentheses were
2424 found.  This makes it easy to reproduce these when printing.  Once
2425 precedence is handled better I might be able to discard this.
2426
2427 ###### Binode types
2428         Plus, Minus,
2429         Times, Divide, Rem,
2430         Concat,
2431         Absolute, Negate,
2432         Bracket,
2433
2434 ###### Grammar
2435
2436         $*exec
2437         Expr -> Expr Eop Term ${ {
2438                         struct binode *b = new(binode);
2439                         b->op = $2.op;
2440                         b->left = $<1;
2441                         b->right = $<3;
2442                         $0 = b;
2443                 } }$
2444                 | Term ${ $0 = $<1; }$
2445
2446         Term -> Term Top Factor ${ {
2447                         struct binode *b = new(binode);
2448                         b->op = $2.op;
2449                         b->left = $<1;
2450                         b->right = $<3;
2451                         $0 = b;
2452                 } }$
2453                 | Factor ${ $0 = $<1; }$
2454
2455         Factor -> ( Expression ) ${ {
2456                         struct binode *b = new_pos(binode, $1);
2457                         b->op = Bracket;
2458                         b->right = $<2;
2459                         $0 = b;
2460                 } }$
2461                 | Uop Factor ${ {
2462                         struct binode *b = new(binode);
2463                         b->op = $1.op;
2464                         b->right = $<2;
2465                         $0 = b;
2466                 } }$
2467                 | Value ${ $0 = $<1; }$
2468                 | Variable ${ $0 = $<1; }$
2469
2470         $eop
2471         Eop ->    + ${ $0.op = Plus; }$
2472                 | - ${ $0.op = Minus; }$
2473
2474         Uop ->    + ${ $0.op = Absolute; }$
2475                 | - ${ $0.op = Negate; }$
2476
2477         Top ->    * ${ $0.op = Times; }$
2478                 | / ${ $0.op = Divide; }$
2479                 | % ${ $0.op = Rem; }$
2480                 | ++ ${ $0.op = Concat; }$
2481
2482 ###### print binode cases
2483         case Plus:
2484         case Minus:
2485         case Times:
2486         case Divide:
2487         case Concat:
2488         case Rem:
2489                 print_exec(b->left, indent, 0);
2490                 switch(b->op) {
2491                 case Plus:   fputs(" + ", stdout); break;
2492                 case Minus:  fputs(" - ", stdout); break;
2493                 case Times:  fputs(" * ", stdout); break;
2494                 case Divide: fputs(" / ", stdout); break;
2495                 case Rem:    fputs(" % ", stdout); break;
2496                 case Concat: fputs(" ++ ", stdout); break;
2497                 default: abort();       // NOTEST
2498                 }                       // NOTEST
2499                 print_exec(b->right, indent, 0);
2500                 break;
2501         case Absolute:
2502                 printf("+");
2503                 print_exec(b->right, indent, 0);
2504                 break;
2505         case Negate:
2506                 printf("-");
2507                 print_exec(b->right, indent, 0);
2508                 break;
2509         case Bracket:
2510                 printf("(");
2511                 print_exec(b->right, indent, 0);
2512                 printf(")");
2513                 break;
2514
2515 ###### propagate binode cases
2516         case Plus:
2517         case Minus:
2518         case Times:
2519         case Rem:
2520         case Divide:
2521                 /* both must be numbers, result is Tnum */
2522         case Absolute:
2523         case Negate:
2524                 /* as propagate_types ignores a NULL,
2525                  * unary ops fit here too */
2526                 propagate_types(b->left, c, ok, Tnum, 0);
2527                 propagate_types(b->right, c, ok, Tnum, 0);
2528                 if (!type_compat(type, Tnum, 0)) {
2529                         type_err(c, "error: Arithmetic returns %1 but %2 expected", prog,
2530                                    Tnum, rules, type);
2531                         *ok = 0;
2532                 }
2533                 return Tnum;
2534
2535         case Concat:
2536                 /* both must be Tstr, result is Tstr */
2537                 propagate_types(b->left, c, ok, Tstr, 0);
2538                 propagate_types(b->right, c, ok, Tstr, 0);
2539                 if (!type_compat(type, Tstr, 0)) {
2540                         type_err(c, "error: Concat returns %1 but %2 expected", prog,
2541                                    Tstr, rules, type);
2542                         *ok = 0;
2543                 }
2544                 return Tstr;
2545
2546         case Bracket:
2547                 return propagate_types(b->right, c, ok, type, 0);
2548
2549 ###### interp binode cases
2550
2551         case Plus:
2552                 rv = interp_exec(b->left);
2553                 right = interp_exec(b->right);
2554                 mpq_add(rv.num, rv.num, right.num);
2555                 break;
2556         case Minus:
2557                 rv = interp_exec(b->left);
2558                 right = interp_exec(b->right);
2559                 mpq_sub(rv.num, rv.num, right.num);
2560                 break;
2561         case Times:
2562                 rv = interp_exec(b->left);
2563                 right = interp_exec(b->right);
2564                 mpq_mul(rv.num, rv.num, right.num);
2565                 break;
2566         case Divide:
2567                 rv = interp_exec(b->left);
2568                 right = interp_exec(b->right);
2569                 mpq_div(rv.num, rv.num, right.num);
2570                 break;
2571         case Rem: {
2572                 mpz_t l, r, rem;
2573
2574                 left = interp_exec(b->left);
2575                 right = interp_exec(b->right);
2576                 mpz_init(l); mpz_init(r); mpz_init(rem);
2577                 mpz_tdiv_q(l, mpq_numref(left.num), mpq_denref(left.num));
2578                 mpz_tdiv_q(r, mpq_numref(right.num), mpq_denref(right.num));
2579                 mpz_tdiv_r(rem, l, r);
2580                 rv = val_init(Tnum);
2581                 mpq_set_z(rv.num, rem);
2582                 mpz_clear(r); mpz_clear(l); mpz_clear(rem);
2583                 break;
2584         }
2585         case Negate:
2586                 rv = interp_exec(b->right);
2587                 mpq_neg(rv.num, rv.num);
2588                 break;
2589         case Absolute:
2590                 rv = interp_exec(b->right);
2591                 mpq_abs(rv.num, rv.num);
2592                 break;
2593         case Bracket:
2594                 rv = interp_exec(b->right);
2595                 break;
2596         case Concat:
2597                 left = interp_exec(b->left);
2598                 right = interp_exec(b->right);
2599                 rv.type = Tstr;
2600                 rv.str = text_join(left.str, right.str);
2601                 break;
2602
2603
2604 ###### value functions
2605
2606         static struct text text_join(struct text a, struct text b)
2607         {
2608                 struct text rv;
2609                 rv.len = a.len + b.len;
2610                 rv.txt = malloc(rv.len);
2611                 memcpy(rv.txt, a.txt, a.len);
2612                 memcpy(rv.txt+a.len, b.txt, b.len);
2613                 return rv;
2614         }
2615
2616
2617 ### Blocks, Statements, and Statement lists.
2618
2619 Now that we have expressions out of the way we need to turn to
2620 statements.  There are simple statements and more complex statements.
2621 Simple statements do not contain newlines, complex statements do.
2622
2623 Statements often come in sequences and we have corresponding simple
2624 statement lists and complex statement lists.
2625 The former comprise only simple statements separated by semicolons.
2626 The later comprise complex statements and simple statement lists.  They are
2627 separated by newlines.  Thus the semicolon is only used to separate
2628 simple statements on the one line.  This may be overly restrictive,
2629 but I'm not sure I ever want a complex statement to share a line with
2630 anything else.
2631
2632 Note that a simple statement list can still use multiple lines if
2633 subsequent lines are indented, so
2634
2635 ###### Example: wrapped simple statement list
2636
2637         a = b; c = d;
2638            e = f; print g
2639
2640 is a single simple statement list.  This might allow room for
2641 confusion, so I'm not set on it yet.
2642
2643 A simple statement list needs no extra syntax.  A complex statement
2644 list has two syntactic forms.  It can be enclosed in braces (much like
2645 C blocks), or it can be introduced by a colon and continue until an
2646 unindented newline (much like Python blocks).  With this extra syntax
2647 it is referred to as a block.
2648
2649 Note that a block does not have to include any newlines if it only
2650 contains simple statements.  So both of:
2651
2652         if condition: a=b; d=f
2653
2654         if condition { a=b; print f }
2655
2656 are valid.
2657
2658 In either case the list is constructed from a `binode` list with
2659 `Block` as the operator.  When parsing the list it is most convenient
2660 to append to the end, so a list is a list and a statement.  When using
2661 the list it is more convenient to consider a list to be a statement
2662 and a list.  So we need a function to re-order a list.
2663 `reorder_bilist` serves this purpose.
2664
2665 The only stand-alone statement we introduce at this stage is `pass`
2666 which does nothing and is represented as a `NULL` pointer in a `Block`
2667 list.  Other stand-alone statements will follow once the infrastructure
2668 is in-place.
2669
2670 ###### Binode types
2671         Block,
2672
2673 ###### Grammar
2674
2675         $void
2676         OptNL -> Newlines
2677                 |
2678
2679         Newlines -> NEWLINE
2680                 | Newlines NEWLINE
2681
2682         $*binode
2683         Open -> {
2684                 | NEWLINE {
2685         Close -> }
2686                 | NEWLINE }
2687         Block -> Open Statementlist Close ${ $0 = $<2; }$
2688                 | Open Newlines Statementlist Close ${ $0 = $<3; }$
2689                 | Open SimpleStatements } ${ $0 = reorder_bilist($<2); }$
2690                 | Open Newlines SimpleStatements } ${ $0 = reorder_bilist($<3); }$
2691                 | : Statementlist ${ $0 = $<2; }$
2692                 | : SimpleStatements ${ $0 = reorder_bilist($<2); }$
2693
2694         Statementlist -> ComplexStatements ${ $0 = reorder_bilist($<1); }$
2695
2696         ComplexStatements -> ComplexStatements ComplexStatement ${
2697                 $0 = new(binode);
2698                 $0->op = Block;
2699                 $0->left = $<1;
2700                 $0->right = $<2;
2701                 }$
2702                 | ComplexStatements NEWLINE ${ $0 = $<1; }$
2703                 | ComplexStatement ${
2704                 $0 = new(binode);
2705                 $0->op = Block;
2706                 $0->left = NULL;
2707                 $0->right = $<1;
2708                 }$
2709
2710         $*exec
2711         ComplexStatement -> SimpleStatements NEWLINE ${
2712                         $0 = reorder_bilist($<1);
2713                         }$
2714                 ## ComplexStatement Grammar
2715
2716         $*binode
2717         SimpleStatements -> SimpleStatements ; SimpleStatement ${
2718                         $0 = new(binode);
2719                         $0->op = Block;
2720                         $0->left = $<1;
2721                         $0->right = $<3;
2722                         }$
2723                 | SimpleStatement ${
2724                         $0 = new(binode);
2725                         $0->op = Block;
2726                         $0->left = NULL;
2727                         $0->right = $<1;
2728                         }$
2729                 | SimpleStatements ; ${ $0 = $<1; }$
2730
2731         SimpleStatement -> pass ${ $0 = NULL; }$
2732                 ## SimpleStatement Grammar
2733
2734 ###### print binode cases
2735         case Block:
2736                 if (indent < 0) {
2737                         // simple statement
2738                         if (b->left == NULL)
2739                                 printf("pass");
2740                         else
2741                                 print_exec(b->left, indent, 0);
2742                         if (b->right) {
2743                                 printf("; ");
2744                                 print_exec(b->right, indent, 0);
2745                         }
2746                 } else {
2747                         // block, one per line
2748                         if (b->left == NULL)
2749                                 do_indent(indent, "pass\n");
2750                         else
2751                                 print_exec(b->left, indent, bracket);
2752                         if (b->right)
2753                                 print_exec(b->right, indent, bracket);
2754                 }
2755                 break;
2756
2757 ###### propagate binode cases
2758         case Block:
2759         {
2760                 /* If any statement returns something other than Tnone
2761                  * or Tbool then all such must return same type.
2762                  * As each statement may be Tnone or something else,
2763                  * we must always pass NULL (unknown) down, otherwise an incorrect
2764                  * error might occur.  We never return Tnone unless it is
2765                  * passed in.
2766                  */
2767                 struct binode *e;
2768
2769                 for (e = b; e; e = cast(binode, e->right)) {
2770                         t = propagate_types(e->left, c, ok, NULL, rules);
2771                         if ((rules & Rboolok) && t == Tbool)
2772                                 t = NULL;
2773                         if (t && t != Tnone && t != Tbool) {
2774                                 if (!type)
2775                                         type = t;
2776                                 else if (t != type) {
2777                                         type_err(c, "error: expected %1%r, found %2",
2778                                                  e->left, type, rules, t);
2779                                         *ok = 0;
2780                                 }
2781                         }
2782                 }
2783                 return type;
2784         }
2785
2786 ###### interp binode cases
2787         case Block:
2788                 while (rv.type == Tnone &&
2789                        b) {
2790                         if (b->left)
2791                                 rv = interp_exec(b->left);
2792                         b = cast(binode, b->right);
2793                 }
2794                 break;
2795
2796 ### The Print statement
2797
2798 `print` is a simple statement that takes a comma-separated list of
2799 expressions and prints the values separated by spaces and terminated
2800 by a newline.  No control of formatting is possible.
2801
2802 `print` faces the same list-ordering issue as blocks, and uses the
2803 same solution.
2804
2805 ###### Binode types
2806         Print,
2807
2808 ###### SimpleStatement Grammar
2809
2810         | print ExpressionList ${
2811                 $0 = reorder_bilist($<2);
2812         }$
2813         | print ExpressionList , ${
2814                 $0 = new(binode);
2815                 $0->op = Print;
2816                 $0->right = NULL;
2817                 $0->left = $<2;
2818                 $0 = reorder_bilist($0);
2819         }$
2820         | print ${
2821                 $0 = new(binode);
2822                 $0->op = Print;
2823                 $0->right = NULL;
2824         }$
2825
2826 ###### Grammar
2827
2828         $*binode
2829         ExpressionList -> ExpressionList , Expression ${
2830                 $0 = new(binode);
2831                 $0->op = Print;
2832                 $0->left = $<1;
2833                 $0->right = $<3;
2834                 }$
2835                 | Expression ${
2836                         $0 = new(binode);
2837                         $0->op = Print;
2838                         $0->left = NULL;
2839                         $0->right = $<1;
2840                 }$
2841
2842 ###### print binode cases
2843
2844         case Print:
2845                 do_indent(indent, "print");
2846                 while (b) {
2847                         if (b->left) {
2848                                 printf(" ");
2849                                 print_exec(b->left, -1, 0);
2850                                 if (b->right)
2851                                         printf(",");
2852                         }
2853                         b = cast(binode, b->right);
2854                 }
2855                 if (indent >= 0)
2856                         printf("\n");
2857                 break;
2858
2859 ###### propagate binode cases
2860
2861         case Print:
2862                 /* don't care but all must be consistent */
2863                 propagate_types(b->left, c, ok, NULL, Rnolabel);
2864                 propagate_types(b->right, c, ok, NULL, Rnolabel);
2865                 break;
2866
2867 ###### interp binode cases
2868
2869         case Print:
2870         {
2871                 char sep = 0;
2872                 int eol = 1;
2873                 for ( ; b; b = cast(binode, b->right))
2874                         if (b->left) {
2875                                 if (sep)
2876                                         putchar(sep);
2877                                 left = interp_exec(b->left);
2878                                 print_value(left);
2879                                 free_value(left);
2880                                 if (b->right)
2881                                         sep = ' ';
2882                         } else if (sep)
2883                                 eol = 0;
2884                 left.type = Tnone;
2885                 if (eol)
2886                         printf("\n");
2887                 break;
2888         }
2889
2890 ###### Assignment statement
2891
2892 An assignment will assign a value to a variable, providing it hasn't
2893 be declared as a constant.  The analysis phase ensures that the type
2894 will be correct so the interpreter just needs to perform the
2895 calculation.  There is a form of assignment which declares a new
2896 variable as well as assigning a value.  If a name is assigned before
2897 it is declared, and error will be raised as the name is created as
2898 `Tlabel` and it is illegal to assign to such names.
2899
2900 ###### Binode types
2901         Assign,
2902         Declare,
2903
2904 ###### SimpleStatement Grammar
2905         | Variable = Expression ${
2906                         $0 = new(binode);
2907                         $0->op = Assign;
2908                         $0->left = $<1;
2909                         $0->right = $<3;
2910                 }$
2911         | VariableDecl = Expression ${
2912                         $0 = new(binode);
2913                         $0->op = Declare;
2914                         $0->left = $<1;
2915                         $0->right =$<3;
2916                 }$
2917
2918         | VariableDecl ${
2919                         if ($1->var->where_set == NULL) {
2920                                 type_err(config2context(config), "Variable declared with no type or value: %v",
2921                                          $1, NULL, 0, NULL);
2922                         } else {
2923                                 $0 = new(binode);
2924                                 $0->op = Declare;
2925                                 $0->left = $<1;
2926                                 $0->right = NULL;
2927                         }
2928                 }$
2929
2930 ###### print binode cases
2931
2932         case Assign:
2933                 do_indent(indent, "");
2934                 print_exec(b->left, indent, 0);
2935                 printf(" = ");
2936                 print_exec(b->right, indent, 0);
2937                 if (indent >= 0)
2938                         printf("\n");
2939                 break;
2940
2941         case Declare:
2942                 {
2943                 struct variable *v = cast(var, b->left)->var;
2944                 do_indent(indent, "");
2945                 print_exec(b->left, indent, 0);
2946                 if (cast(var, b->left)->var->constant) {
2947                         if (v->where_decl == v->where_set) {
2948                                 printf("::");
2949                                 type_print(v->val.type, stdout);
2950                                 printf(" ");
2951                         } else
2952                                 printf(" ::");
2953                 } else {
2954                         if (v->where_decl == v->where_set) {
2955                                 printf(":");
2956                                 type_print(v->val.type, stdout);
2957                                 printf(" ");
2958                         } else
2959                                 printf(" :");
2960                 }
2961                 if (b->right) {
2962                         printf("= ");
2963                         print_exec(b->right, indent, 0);
2964                 }
2965                 if (indent >= 0)
2966                         printf("\n");
2967                 }
2968                 break;
2969
2970 ###### propagate binode cases
2971
2972         case Assign:
2973         case Declare:
2974                 /* Both must match and not be labels,
2975                  * Type must support 'dup',
2976                  * For Assign, left must not be constant.
2977                  * result is Tnone
2978                  */
2979                 t = propagate_types(b->left, c, ok, NULL,
2980                                     Rnolabel | (b->op == Assign ? Rnoconstant : 0));
2981                 if (!b->right)
2982                         return Tnone;
2983
2984                 if (t) {
2985                         if (propagate_types(b->right, c, ok, t, 0) != t)
2986                                 if (b->left->type == Xvar)
2987                                         type_err(c, "info: variable '%v' was set as %1 here.",
2988                                                  cast(var, b->left)->var->where_set, t, rules, Tnone);
2989                 } else {
2990                         t = propagate_types(b->right, c, ok, NULL, Rnolabel);
2991                         if (t)
2992                                 propagate_types(b->left, c, ok, t,
2993                                                 (b->op == Assign ? Rnoconstant : 0));
2994                 }
2995                 if (t && t->dup == NULL) {
2996                         type_err(c, "error: cannot assign value of type %1", b, t, 0, NULL);
2997                         *ok = 0;
2998                 }
2999                 return Tnone;
3000
3001                 break;
3002
3003 ###### interp binode cases
3004
3005         case Assign:
3006                 lleft = linterp_exec(b->left);
3007                 right = interp_exec(b->right);
3008                 if (lleft) {
3009                         free_value(*lleft);
3010                         *lleft = right;
3011                 } else
3012                         free_value(right);      // NOTEST
3013                 right.type = NULL;
3014                 break;
3015
3016         case Declare:
3017         {
3018                 struct variable *v = cast(var, b->left)->var;
3019                 if (v->merged)
3020                         v = v->merged;
3021                 if (b->right)
3022                         right = interp_exec(b->right);
3023                 else
3024                         right = val_init(v->val.type);
3025                 free_value(v->val);
3026                 v->val = right;
3027                 right.type = NULL;
3028                 break;
3029         }
3030
3031 ### The `use` statement
3032
3033 The `use` statement is the last "simple" statement.  It is needed when
3034 the condition in a conditional statement is a block.  `use` works much
3035 like `return` in C, but only completes the `condition`, not the whole
3036 function.
3037
3038 ###### Binode types
3039         Use,
3040
3041 ###### SimpleStatement Grammar
3042         | use Expression ${
3043                 $0 = new_pos(binode, $1);
3044                 $0->op = Use;
3045                 $0->right = $<2;
3046         }$
3047
3048 ###### print binode cases
3049
3050         case Use:
3051                 do_indent(indent, "use ");
3052                 print_exec(b->right, -1, 0);
3053                 if (indent >= 0)
3054                         printf("\n");
3055                 break;
3056
3057 ###### propagate binode cases
3058
3059         case Use:
3060                 /* result matches value */
3061                 return propagate_types(b->right, c, ok, type, 0);
3062
3063 ###### interp binode cases
3064
3065         case Use:
3066                 rv = interp_exec(b->right);
3067                 break;
3068
3069 ### The Conditional Statement
3070
3071 This is the biggy and currently the only complex statement.  This
3072 subsumes `if`, `while`, `do/while`, `switch`, and some parts of `for`.
3073 It is comprised of a number of parts, all of which are optional though
3074 set combinations apply.  Each part is (usually) a key word (`then` is
3075 sometimes optional) followed by either an expression or a code block,
3076 except the `casepart` which is a "key word and an expression" followed
3077 by a code block.  The code-block option is valid for all parts and,
3078 where an expression is also allowed, the code block can use the `use`
3079 statement to report a value.  If the code block does not report a value
3080 the effect is similar to reporting `True`.
3081
3082 The `else` and `case` parts, as well as `then` when combined with
3083 `if`, can contain a `use` statement which will apply to some
3084 containing conditional statement. `for` parts, `do` parts and `then`
3085 parts used with `for` can never contain a `use`, except in some
3086 subordinate conditional statement.
3087
3088 If there is a `forpart`, it is executed first, only once.
3089 If there is a `dopart`, then it is executed repeatedly providing
3090 always that the `condpart` or `cond`, if present, does not return a non-True
3091 value.  `condpart` can fail to return any value if it simply executes
3092 to completion.  This is treated the same as returning `True`.
3093
3094 If there is a `thenpart` it will be executed whenever the `condpart`
3095 or `cond` returns True (or does not return any value), but this will happen
3096 *after* `dopart` (when present).
3097
3098 If `elsepart` is present it will be executed at most once when the
3099 condition returns `False` or some value that isn't `True` and isn't
3100 matched by any `casepart`.  If there are any `casepart`s, they will be
3101 executed when the condition returns a matching value.
3102
3103 The particular sorts of values allowed in case parts has not yet been
3104 determined in the language design, so nothing is prohibited.
3105
3106 The various blocks in this complex statement potentially provide scope
3107 for variables as described earlier.  Each such block must include the
3108 "OpenScope" nonterminal before parsing the block, and must call
3109 `var_block_close()` when closing the block.
3110
3111 The code following "`if`", "`switch`" and "`for`" does not get its own
3112 scope, but is in a scope covering the whole statement, so names
3113 declared there cannot be redeclared elsewhere.  Similarly the
3114 condition following "`while`" is in a scope the covers the body
3115 ("`do`" part) of the loop, and which does not allow conditional scope
3116 extension.  Code following "`then`" (both looping and non-looping),
3117 "`else`" and "`case`" each get their own local scope.
3118
3119 The type requirements on the code block in a `whilepart` are quite
3120 unusal.  It is allowed to return a value of some identifiable type, in
3121 which case the loop aborts and an appropriate `casepart` is run, or it
3122 can return a Boolean, in which case the loop either continues to the
3123 `dopart` (on `True`) or aborts and runs the `elsepart` (on `False`).
3124 This is different both from the `ifpart` code block which is expected to
3125 return a Boolean, or the `switchpart` code block which is expected to
3126 return the same type as the casepart values.  The correct analysis of
3127 the type of the `whilepart` code block is the reason for the
3128 `Rboolok` flag which is passed to `propagate_types()`.
3129
3130 The `cond_statement` cannot fit into a `binode` so a new `exec` is
3131 defined.
3132
3133 ###### exec type
3134         Xcond_statement,
3135
3136 ###### ast
3137         struct casepart {
3138                 struct exec *value;
3139                 struct exec *action;
3140                 struct casepart *next;
3141         };
3142         struct cond_statement {
3143                 struct exec;
3144                 struct exec *forpart, *condpart, *dopart, *thenpart, *elsepart;
3145                 struct casepart *casepart;
3146         };
3147
3148 ###### ast functions
3149
3150         static void free_casepart(struct casepart *cp)
3151         {
3152                 while (cp) {
3153                         struct casepart *t;
3154                         free_exec(cp->value);
3155                         free_exec(cp->action);
3156                         t = cp->next;
3157                         free(cp);
3158                         cp = t;
3159                 }
3160         }
3161
3162         static void free_cond_statement(struct cond_statement *s)
3163         {
3164                 if (!s)
3165                         return;
3166                 free_exec(s->forpart);
3167                 free_exec(s->condpart);
3168                 free_exec(s->dopart);
3169                 free_exec(s->thenpart);
3170                 free_exec(s->elsepart);
3171                 free_casepart(s->casepart);
3172                 free(s);
3173         }
3174
3175 ###### free exec cases
3176         case Xcond_statement: free_cond_statement(cast(cond_statement, e)); break;
3177
3178 ###### ComplexStatement Grammar
3179         | CondStatement ${ $0 = $<1; }$
3180
3181 ###### Grammar
3182
3183         $*cond_statement
3184         // both ForThen and Whilepart open scopes, and CondSuffix only
3185         // closes one - so in the first branch here we have another to close.
3186         CondStatement -> ForThen WhilePart CondSuffix ${
3187                         $0 = $<3;
3188                         $0->forpart = $1.forpart; $1.forpart = NULL;
3189                         $0->thenpart = $1.thenpart; $1.thenpart = NULL;
3190                         $0->condpart = $2.condpart; $2.condpart = NULL;
3191                         $0->dopart = $2.dopart; $2.dopart = NULL;
3192                         var_block_close(config2context(config), CloseSequential);
3193                         }$
3194                 | WhilePart CondSuffix ${
3195                         $0 = $<2;
3196                         $0->condpart = $1.condpart; $1.condpart = NULL;
3197                         $0->dopart = $1.dopart; $1.dopart = NULL;
3198                         }$
3199                 | SwitchPart CondSuffix ${
3200                         $0 = $<2;
3201                         $0->condpart = $<1;
3202                         }$
3203                 | IfPart IfSuffix ${
3204                         $0 = $<2;
3205                         $0->condpart = $1.condpart; $1.condpart = NULL;
3206                         $0->thenpart = $1.thenpart; $1.thenpart = NULL;
3207                         // This is where we close an "if" statement
3208                         var_block_close(config2context(config), CloseSequential);
3209                         }$
3210
3211         CondSuffix -> IfSuffix ${
3212                         $0 = $<1;
3213                         // This is where we close scope of the whole
3214                         // "for" or "while" statement
3215                         var_block_close(config2context(config), CloseSequential);
3216                 }$
3217                 | CasePart CondSuffix ${
3218                         $0 = $<2;
3219                         $1->next = $0->casepart;
3220                         $0->casepart = $<1;
3221                 }$
3222
3223         $*casepart
3224         CasePart -> Newlines case Expression OpenScope Block ${
3225                         $0 = calloc(1,sizeof(struct casepart));
3226                         $0->value = $<3;
3227                         $0->action = $<5;
3228                         var_block_close(config2context(config), CloseParallel);
3229                 }$
3230                 | case Expression OpenScope Block ${
3231                         $0 = calloc(1,sizeof(struct casepart));
3232                         $0->value = $<2;
3233                         $0->action = $<4;
3234                         var_block_close(config2context(config), CloseParallel);
3235                 }$
3236
3237         $*cond_statement
3238         IfSuffix -> Newlines ${ $0 = new(cond_statement); }$
3239                 | Newlines else OpenScope Block ${
3240                         $0 = new(cond_statement);
3241                         $0->elsepart = $<4;
3242                         var_block_close(config2context(config), CloseElse);
3243                 }$
3244                 | else OpenScope Block ${
3245                         $0 = new(cond_statement);
3246                         $0->elsepart = $<3;
3247                         var_block_close(config2context(config), CloseElse);
3248                 }$
3249                 | Newlines else OpenScope CondStatement ${
3250                         $0 = new(cond_statement);
3251                         $0->elsepart = $<4;
3252                         var_block_close(config2context(config), CloseElse);
3253                 }$
3254                 | else OpenScope CondStatement ${
3255                         $0 = new(cond_statement);
3256                         $0->elsepart = $<3;
3257                         var_block_close(config2context(config), CloseElse);
3258                 }$
3259
3260
3261         $*exec
3262         // These scopes are closed in CondSuffix
3263         ForPart -> for OpenScope SimpleStatements ${
3264                         $0 = reorder_bilist($<3);
3265                 }$
3266                 |  for OpenScope Block ${
3267                         $0 = $<3;
3268                 }$
3269
3270         ThenPart -> then OpenScope SimpleStatements ${
3271                         $0 = reorder_bilist($<3);
3272                         var_block_close(config2context(config), CloseSequential);
3273                 }$
3274                 |  then OpenScope Block ${
3275                         $0 = $<3;
3276                         var_block_close(config2context(config), CloseSequential);
3277                 }$
3278
3279         ThenPartNL -> ThenPart OptNL ${
3280                         $0 = $<1;
3281                 }$
3282
3283         // This scope is closed in CondSuffix
3284         WhileHead -> while OpenScope Block ${
3285                 $0 = $<3;
3286                 }$
3287
3288         $cond_statement
3289         ForThen -> ForPart OptNL ThenPartNL ${
3290                         $0.forpart = $<1;
3291                         $0.thenpart = $<3;
3292                 }$
3293                 | ForPart OptNL ${
3294                         $0.forpart = $<1;
3295                 }$
3296
3297         // This scope is closed in CondSuffix
3298         WhilePart -> while OpenScope Expression Block ${
3299                         $0.type = Xcond_statement;
3300                         $0.condpart = $<3;
3301                         $0.dopart = $<4;
3302                 }$
3303                 | WhileHead OptNL do Block ${
3304                         $0.type = Xcond_statement;
3305                         $0.condpart = $<1;
3306                         $0.dopart = $<4;
3307                 }$
3308
3309         IfPart -> if OpenScope Expression OpenScope Block ${
3310                         $0.type = Xcond_statement;
3311                         $0.condpart = $<3;
3312                         $0.thenpart = $<5;
3313                         var_block_close(config2context(config), CloseParallel);
3314                 }$
3315                 | if OpenScope Block OptNL then OpenScope Block ${
3316                         $0.type = Xcond_statement;
3317                         $0.condpart = $<3;
3318                         $0.thenpart = $<7;
3319                         var_block_close(config2context(config), CloseParallel);
3320                 }$
3321
3322         $*exec
3323         // This scope is closed in CondSuffix
3324         SwitchPart -> switch OpenScope Expression ${
3325                         $0 = $<3;
3326                 }$
3327                 | switch OpenScope Block ${
3328                         $0 = $<3;
3329                 }$
3330
3331 ###### print exec cases
3332
3333         case Xcond_statement:
3334         {
3335                 struct cond_statement *cs = cast(cond_statement, e);
3336                 struct casepart *cp;
3337                 if (cs->forpart) {
3338                         do_indent(indent, "for");
3339                         if (bracket) printf(" {\n"); else printf(":\n");
3340                         print_exec(cs->forpart, indent+1, bracket);
3341                         if (cs->thenpart) {
3342                                 if (bracket)
3343                                         do_indent(indent, "} then {\n");
3344                                 else
3345                                         do_indent(indent, "then:\n");
3346                                 print_exec(cs->thenpart, indent+1, bracket);
3347                         }
3348                         if (bracket) do_indent(indent, "}\n");
3349                 }
3350                 if (cs->dopart) {
3351                         // a loop
3352                         if (cs->condpart && cs->condpart->type == Xbinode &&
3353                             cast(binode, cs->condpart)->op == Block) {
3354                                 if (bracket)
3355                                         do_indent(indent, "while {\n");
3356                                 else
3357                                         do_indent(indent, "while:\n");
3358                                 print_exec(cs->condpart, indent+1, bracket);
3359                                 if (bracket)
3360                                         do_indent(indent, "} do {\n");
3361                                 else
3362                                         do_indent(indent, "do:\n");
3363                                 print_exec(cs->dopart, indent+1, bracket);
3364                                 if (bracket)
3365                                         do_indent(indent, "}\n");
3366                         } else {
3367                                 do_indent(indent, "while ");
3368                                 print_exec(cs->condpart, 0, bracket);
3369                                 if (bracket)
3370                                         printf(" {\n");
3371                                 else
3372                                         printf(":\n");
3373                                 print_exec(cs->dopart, indent+1, bracket);
3374                                 if (bracket)
3375                                         do_indent(indent, "}\n");
3376                         }
3377                 } else {
3378                         // a condition
3379                         if (cs->casepart)
3380                                 do_indent(indent, "switch");
3381                         else
3382                                 do_indent(indent, "if");
3383                         if (cs->condpart && cs->condpart->type == Xbinode &&
3384                             cast(binode, cs->condpart)->op == Block) {
3385                                 if (bracket)
3386                                         printf(" {\n");
3387                                 else
3388                                         printf(":\n");
3389                                 print_exec(cs->condpart, indent+1, bracket);
3390                                 if (bracket)
3391                                         do_indent(indent, "}\n");
3392                                 if (cs->thenpart) {
3393                                         do_indent(indent, "then:\n");
3394                                         print_exec(cs->thenpart, indent+1, bracket);
3395                                 }
3396                         } else {
3397                                 printf(" ");
3398                                 print_exec(cs->condpart, 0, bracket);
3399                                 if (cs->thenpart) {
3400                                         if (bracket)
3401                                                 printf(" {\n");
3402                                         else
3403                                                 printf(":\n");
3404                                         print_exec(cs->thenpart, indent+1, bracket);
3405                                         if (bracket)
3406                                                 do_indent(indent, "}\n");
3407                                 } else
3408                                         printf("\n");
3409                         }
3410                 }
3411                 for (cp = cs->casepart; cp; cp = cp->next) {
3412                         do_indent(indent, "case ");
3413                         print_exec(cp->value, -1, 0);
3414                         if (bracket)
3415                                 printf(" {\n");
3416                         else
3417                                 printf(":\n");
3418                         print_exec(cp->action, indent+1, bracket);
3419                         if (bracket)
3420                                 do_indent(indent, "}\n");
3421                 }
3422                 if (cs->elsepart) {
3423                         do_indent(indent, "else");
3424                         if (bracket)
3425                                 printf(" {\n");
3426                         else
3427                                 printf(":\n");
3428                         print_exec(cs->elsepart, indent+1, bracket);
3429                         if (bracket)
3430                                 do_indent(indent, "}\n");
3431                 }
3432                 break;
3433         }
3434
3435 ###### propagate exec cases
3436         case Xcond_statement:
3437         {
3438                 // forpart and dopart must return Tnone
3439                 // thenpart must return Tnone if there is a dopart,
3440                 // otherwise it is like elsepart.
3441                 // condpart must:
3442                 //    be bool if there is no casepart
3443                 //    match casepart->values if there is a switchpart
3444                 //    either be bool or match casepart->value if there
3445                 //             is a whilepart
3446                 // elsepart and casepart->action must match the return type
3447                 //   expected of this statement.
3448                 struct cond_statement *cs = cast(cond_statement, prog);
3449                 struct casepart *cp;
3450
3451                 t = propagate_types(cs->forpart, c, ok, Tnone, 0);
3452                 if (!type_compat(Tnone, t, 0))
3453                         *ok = 0;
3454                 t = propagate_types(cs->dopart, c, ok, Tnone, 0);
3455                 if (!type_compat(Tnone, t, 0))
3456                         *ok = 0;
3457                 if (cs->dopart) {
3458                         t = propagate_types(cs->thenpart, c, ok, Tnone, 0);
3459                         if (!type_compat(Tnone, t, 0))
3460                                 *ok = 0;
3461                 }
3462                 if (cs->casepart == NULL)
3463                         propagate_types(cs->condpart, c, ok, Tbool, 0);
3464                 else {
3465                         /* Condpart must match case values, with bool permitted */
3466                         t = NULL;
3467                         for (cp = cs->casepart;
3468                              cp && !t; cp = cp->next)
3469                                 t = propagate_types(cp->value, c, ok, NULL, 0);
3470                         if (!t && cs->condpart)
3471                                 t = propagate_types(cs->condpart, c, ok, NULL, Rboolok);
3472                         // Now we have a type (I hope) push it down
3473                         if (t) {
3474                                 for (cp = cs->casepart; cp; cp = cp->next)
3475                                         propagate_types(cp->value, c, ok, t, 0);
3476                                 propagate_types(cs->condpart, c, ok, t, Rboolok);
3477                         }
3478                 }
3479                 // (if)then, else, and case parts must return expected type.
3480                 if (!cs->dopart && !type)
3481                         type = propagate_types(cs->thenpart, c, ok, NULL, rules);
3482                 if (!type)
3483                         type = propagate_types(cs->elsepart, c, ok, NULL, rules);
3484                 for (cp = cs->casepart;
3485                      cp && !type;
3486                      cp = cp->next)
3487                         type = propagate_types(cp->action, c, ok, NULL, rules);
3488                 if (type) {
3489                         if (!cs->dopart)
3490                                 propagate_types(cs->thenpart, c, ok, type, rules);
3491                         propagate_types(cs->elsepart, c, ok, type, rules);
3492                         for (cp = cs->casepart; cp ; cp = cp->next)
3493                                 propagate_types(cp->action, c, ok, type, rules);
3494                         return type;
3495                 } else
3496                         return NULL;
3497         }
3498
3499 ###### interp exec cases
3500         case Xcond_statement:
3501         {
3502                 struct value v, cnd;
3503                 struct casepart *cp;
3504                 struct cond_statement *c = cast(cond_statement, e);
3505
3506                 if (c->forpart)
3507                         interp_exec(c->forpart);
3508                 do {
3509                         if (c->condpart)
3510                                 cnd = interp_exec(c->condpart);
3511                         else
3512                                 cnd.type = Tnone;
3513                         if (!(cnd.type == Tnone ||
3514                               (cnd.type == Tbool && cnd.bool != 0)))
3515                                 break;
3516                         // cnd is Tnone or Tbool, doesn't need to be freed
3517                         if (c->dopart)
3518                                 interp_exec(c->dopart);
3519
3520                         if (c->thenpart) {
3521                                 rv = interp_exec(c->thenpart);
3522                                 if (rv.type != Tnone || !c->dopart)
3523                                         goto Xcond_done;
3524                                 free_value(rv);
3525                         }
3526                 } while (c->dopart);
3527
3528                 for (cp = c->casepart; cp; cp = cp->next) {
3529                         v = interp_exec(cp->value);
3530                         if (value_cmp(v, cnd) == 0) {
3531                                 free_value(v);
3532                                 free_value(cnd);
3533                                 rv = interp_exec(cp->action);
3534                                 goto Xcond_done;
3535                         }
3536                         free_value(v);
3537                 }
3538                 free_value(cnd);
3539                 if (c->elsepart)
3540                         rv = interp_exec(c->elsepart);
3541                 else
3542                         rv.type = Tnone;
3543         Xcond_done:
3544                 break;
3545         }
3546
3547 ### Finally the whole program.
3548
3549 Somewhat reminiscent of Pascal a (current) Ocean program starts with
3550 the keyword "program" and a list of variable names which are assigned
3551 values from command line arguments.  Following this is a `block` which
3552 is the code to execute.
3553
3554 As this is the top level, several things are handled a bit
3555 differently.
3556 The whole program is not interpreted by `interp_exec` as that isn't
3557 passed the argument list which the program requires.  Similarly type
3558 analysis is a bit more interesting at this level.
3559
3560 ###### Binode types
3561         Program,
3562
3563 ###### Parser: grammar
3564
3565         $*binode
3566         Program -> program OpenScope Varlist Block OptNL ${
3567                 $0 = new(binode);
3568                 $0->op = Program;
3569                 $0->left = reorder_bilist($<3);
3570                 $0->right = $<4;
3571                 var_block_close(config2context(config), CloseSequential);
3572                 if (config2context(config)->scope_stack) abort();
3573                 }$
3574                 | ERROR ${
3575                         tok_err(config2context(config),
3576                                 "error: unhandled parse error", &$1);
3577                 }$
3578
3579         Varlist -> Varlist ArgDecl ${
3580                         $0 = new(binode);
3581                         $0->op = Program;
3582                         $0->left = $<1;
3583                         $0->right = $<2;
3584                 }$
3585                 | ${ $0 = NULL; }$
3586
3587         $*var
3588         ArgDecl -> IDENTIFIER ${ {
3589                 struct variable *v = var_decl(config2context(config), $1.txt);
3590                 $0 = new(var);
3591                 $0->var = v;
3592         } }$
3593
3594         ## Grammar
3595
3596 ###### print binode cases
3597         case Program:
3598                 do_indent(indent, "program");
3599                 for (b2 = cast(binode, b->left); b2; b2 = cast(binode, b2->right)) {
3600                         printf(" ");
3601                         print_exec(b2->left, 0, 0);
3602                 }
3603                 if (bracket)
3604                         printf(" {\n");
3605                 else
3606                         printf(":\n");
3607                 print_exec(b->right, indent+1, bracket);
3608                 if (bracket)
3609                         do_indent(indent, "}\n");
3610                 break;
3611
3612 ###### propagate binode cases
3613         case Program: abort();          // NOTEST
3614
3615 ###### core functions
3616
3617         static int analyse_prog(struct exec *prog, struct parse_context *c)
3618         {
3619                 struct binode *b = cast(binode, prog);
3620                 int ok = 1;
3621
3622                 if (!b)
3623                         return 0;       // NOTEST
3624                 do {
3625                         ok = 1;
3626                         propagate_types(b->right, c, &ok, Tnone, 0);
3627                 } while (ok == 2);
3628                 if (!ok)
3629                         return 0;
3630
3631                 for (b = cast(binode, b->left); b; b = cast(binode, b->right)) {
3632                         struct var *v = cast(var, b->left);
3633                         if (!v->var->val.type) {
3634                                 v->var->where_set = b;
3635                                 v->var->val = val_prepare(Tstr);
3636                         }
3637                 }
3638                 b = cast(binode, prog);
3639                 do {
3640                         ok = 1;
3641                         propagate_types(b->right, c, &ok, Tnone, 0);
3642                 } while (ok == 2);
3643                 if (!ok)
3644                         return 0;
3645
3646                 /* Make sure everything is still consistent */
3647                 propagate_types(b->right, c, &ok, Tnone, 0);
3648                 return !!ok;
3649         }
3650
3651         static void interp_prog(struct exec *prog, char **argv)
3652         {
3653                 struct binode *p = cast(binode, prog);
3654                 struct binode *al;
3655                 struct value v;
3656
3657                 if (!prog)
3658                         return;         // NOTEST
3659                 al = cast(binode, p->left);
3660                 while (al) {
3661                         struct var *v = cast(var, al->left);
3662                         struct value *vl = &v->var->val;
3663
3664                         if (argv[0] == NULL) {
3665                                 printf("Not enough args\n");
3666                                 exit(1);
3667                         }
3668                         al = cast(binode, al->right);
3669                         free_value(*vl);
3670                         *vl = parse_value(vl->type, argv[0]);
3671                         if (vl->type == NULL)
3672                                 exit(1);
3673                         argv++;
3674                 }
3675                 v = interp_exec(p->right);
3676                 free_value(v);
3677         }
3678
3679 ###### interp binode cases
3680         case Program: abort();  // NOTEST
3681
3682 ## And now to test it out.
3683
3684 Having a language requires having a "hello world" program. I'll
3685 provide a little more than that: a program that prints "Hello world"
3686 finds the GCD of two numbers, prints the first few elements of
3687 Fibonacci, and performs a binary search for a number.
3688
3689 ###### File: oceani.mk
3690         tests :: sayhello
3691         sayhello : oceani
3692                 @echo "===== TEST ====="
3693                 ./oceani --section "test: hello" oceani.mdc 55 33
3694
3695 ###### test: hello
3696
3697         program A B:
3698                 print "Hello World, what lovely oceans you have!"
3699                 /* When a variable is defined in both branches of an 'if',
3700                  * and used afterwards, the variables are merged.
3701                  */
3702                 if A > B:
3703                         bigger := "yes"
3704                 else:
3705                         bigger := "no"
3706                 print "Is", A, "bigger than", B,"? ", bigger
3707                 /* If a variable is not used after the 'if', no
3708                  * merge happens, so types can be different
3709                  */
3710                 if A > B * 2:
3711                         double:string = "yes"
3712                         print A, "is more than twice", B, "?", double
3713                 else:
3714                         double := B*2
3715                         print "double", B, "is", double
3716
3717                 a : number
3718                 a = A;
3719                 b:number = B
3720                 if a > 0 and then b > 0:
3721                         while a != b:
3722                                 if a < b:
3723                                         b = b - a
3724                                 else:
3725                                         a = a - b
3726                         print "GCD of", A, "and", B,"is", a
3727                 else if a <= 0:
3728                         print a, "is not positive, cannot calculate GCD"
3729                 else:
3730                         print b, "is not positive, cannot calculate GCD"
3731
3732                 for:
3733                         togo := 10
3734                         f1 := 1; f2 := 1
3735                         print "Fibonacci:", f1,f2,
3736                 then togo = togo - 1
3737                 while togo > 0:
3738                         f3 := f1 + f2
3739                         print "", f3,
3740                         f1 = f2
3741                         f2 = f3
3742                 print ""
3743
3744                 /* Binary search... */
3745                 for:
3746                         lo:= 0; hi := 100
3747                         target := 77
3748                 while:
3749                         mid := (lo + hi) / 2
3750                         if mid == target:
3751                                 use Found
3752                         if mid < target:
3753                                 lo = mid
3754                         else:
3755                                 hi = mid
3756                         if hi - lo < 1:
3757                                 use GiveUp
3758                         use True
3759                 do: pass
3760                 case Found:
3761                         print "Yay, I found", target
3762                 case GiveUp:
3763                         print "Closest I found was", mid
3764
3765                 size::=55
3766                 list:[size]number
3767                 list[0] = 1234
3768                 for i:=1; then i = i + 1; while i < size:
3769                         n := list[i-1] * list[i-1]
3770                         list[i] = (n / 100) % 10000
3771
3772                 print "Before sort:"
3773                 for i:=0; then i = i + 1; while i < size:
3774                         print "list[",i,"]=",list[i]
3775
3776                 for i := 1; then i=i+1; while i < size:
3777                         for j:=i-1; then j=j-1; while j >= 0:
3778                                 if list[j] > list[j+1]:
3779                                         t:= list[j]
3780                                         list[j] = list[j+1]
3781                                         list[j+1] = t
3782                 print "After sort:"
3783                 for i:=0; then i = i + 1; while i < size:
3784                         print "list[",i,"]=",list[i]