ocean-lang.org Git - ocean/blob - csrc/oceani.mdc

   1 # Ocean Interpreter - Stoney Creek version
   2
   3 Ocean is intended to be a compiled language, so this interpreter is
   4 not targeted at being the final product.  It is, rather, an intermediate
   5 stage and fills that role in two distinct ways.
   6
   7 Firstly, it exists as a platform to experiment with the early language
   8 design.  An interpreter is easy to write and easy to get working, so
   9 the barrier for entry is lower if I aim to start with an interpreter.
  10
  11 Secondly, the plan for the Ocean compiler is to write it in the
  12 [Ocean language](http://ocean-lang.org).  To achieve this we naturally
  13 need some sort of boot-strap process and this interpreter - written in
  14 portable C - will fill that role.  It will be used to bootstrap the
  15 Ocean compiler.
  16
  17 Two features that are not needed to fill either of these roles are
  18 performance and completeness.  The interpreter only needs to be fast
  19 enough to run small test programs and occasionally to run the compiler
  20 on itself.  It only needs to be complete enough to test aspects of the
  21 design which are developed before the compiler is working, and to run
  22 the compiler on itself.  Any features not used by the compiler when
  23 compiling itself are superfluous.  They may be included anyway, but
  24 they may not.
  25
  26 Nonetheless, the interpreter should end up being reasonably complete,
  27 and any performance bottlenecks which appear and are easily fixed, will
  28 be.
  29
  30 ## Current version
  31
  32 This second version of the interpreter exists to test out the
  33 structured statement providing conditions and iteration, and simple
  34 variable scoping.  Clearly we need some minimal other functionality so
  35 that values can be tested and instructions iterated over.  All that
  36 functionality is clearly not normative at this stage (not that
  37 anything is **really** normative yet) and will change, so early test
  38 code will certainly break in later versions.
  39
  40 The under-test parts of the language are:
  41
  42  - conditional/looping structured statements
  43  - the `use` statement which is needed for that
  44  - Variable binding using ":=" and "::=", and assignment using "=".
  45
  46 Elements which are present to make a usable language are:
  47
  48  - "blocks" of multiple statements.
  49  - `pass`: a statement which does nothing.
  50  - expressions: `+`, `-`, `*`, `/`, `%` can apply to numbers and `++` can
  51    catenate strings.  `and`, `or`, `not` manipulate Booleans, and
  52    normal comparison operators can work on all three types.
  53  - `print`: will print the values in a list of expressions.
  54  - `program`: is given a list of identifiers to initialize from
  55    arguments.
  56
  57 ## Naming
  58
  59 Versions of the interpreter which obviously do not support a complete
  60 language will be named after creeks and streams.  This one is Stoney
  61 Creek.
  62
  63 Once we have something reasonably resembling a complete language, the
  64 names of rivers will be used.
  65 Early versions of the compiler will be named after seas.  Major
  66 releases of the compiler will be named after oceans.  Hopefully I will
  67 be finished once I get to the Pacific Ocean release.
  68
  69 ## Outline
  70
  71 As well as parsing and executing a program, the interpreter can print
  72 out the program from the parsed internal structure.  This is useful
  73 for validating the parsing.
  74 So the main requirements of the interpreter are:
  75
  76 - Parse the program, possibly with tracing,
  77 - Analyse the parsed program to ensure consistency,
  78 - Print the program,
  79 - Execute the program.
  80
  81 This is all performed by a single C program extracted with
  82 `parsergen`.
  83
  84 There will be two formats for printing the program: a default and one
  85 that uses bracketing.  So a `--bracket` command line option is needed
  86 for that.  Normally the first code section found is used, however an
  87 alternate section can be requested so that a file (such as this one)
  88 can contain multiple programs This is effected with the `--section`
  89 option.
  90
  91 This code must be compiled with `-fplan9-extensions` so that anonymous
  92 structures can be used.
  93
  94 ###### File: oceani.mk
  95
  96         myCFLAGS := -Wall -g -fplan9-extensions
  97         CFLAGS := $(filter-out $(myCFLAGS),$(CFLAGS)) $(myCFLAGS)
  98         myLDLIBS:= libparser.o libscanner.o libmdcode.o -licuuc
  99         LDLIBS := $(filter-out $(myLDLIBS),$(LDLIBS)) $(myLDLIBS)
 100         ## libs
 101         all :: $(LDLIBS) oceani
 102         oceani.c oceani.h : oceani.mdc parsergen
 103                 ./parsergen -o oceani --LALR --tag Parser oceani.mdc
 104         oceani.mk: oceani.mdc md2c
 105                 ./md2c oceani.mdc
 106
 107         oceani: oceani.o $(LDLIBS)
 108                 $(CC) $(CFLAGS) -o oceani oceani.o $(LDLIBS)
 109
 110 ###### Parser: header
 111         ## macros
 112         ## ast
 113         struct parse_context {
 114                 struct token_config config;
 115                 char *file_name;
 116                 int parse_error;
 117                 ## parse context
 118         };
 119
 120 ###### macros
 121
 122         #define container_of(ptr, type, member) ({                      \
 123                 const typeof( ((type *)0)->member ) *__mptr = (ptr);    \
 124                 (type *)( (char *)__mptr - offsetof(type,member) );})
 125
 126         #define config2context(_conf) container_of(_conf, struct parse_context, \
 127                 config)
 128
 129 ###### Parser: code
 130
 131         #include <unistd.h>
 132         #include <stdlib.h>
 133         #include <fcntl.h>
 134         #include <errno.h>
 135         #include <sys/mman.h>
 136         #include <string.h>
 137         #include <stdio.h>
 138         #include <locale.h>
 139         #include <malloc.h>
 140         #include "mdcode.h"
 141         #include "scanner.h"
 142         #include "parser.h"
 143
 144         ## includes
 145
 146         #include "oceani.h"
 147
 148         ## forward decls
 149         ## value functions
 150         ## ast functions
 151         ## core functions
 152
 153         #include <getopt.h>
 154         static char Usage[] = "Usage: oceani --trace --print --noexec --brackets"
 155                               "--section=SectionName prog.ocn\n";
 156         static const struct option long_options[] = {
 157                 {"trace",     0, NULL, 't'},
 158                 {"print",     0, NULL, 'p'},
 159                 {"noexec",    0, NULL, 'n'},
 160                 {"brackets",  0, NULL, 'b'},
 161                 {"section",   1, NULL, 's'},
 162                 {NULL,        0, NULL, 0},
 163         };
 164         const char *options = "tpnbs";
 165         int main(int argc, char *argv[])
 166         {
 167                 int fd;
 168                 int len;
 169                 char *file;
 170                 struct section *s;
 171                 char *section = NULL;
 172                 struct parse_context context = {
 173                         .config = {
 174                                 .ignored = (1 << TK_line_comment)
 175                                          | (1 << TK_block_comment),
 176                                 .number_chars = ".,_+-",
 177                                 .word_start = "_",
 178                                 .word_cont = "_",
 179                         },
 180                 };
 181                 int doprint=0, dotrace=0, doexec=1, brackets=0;
 182                 struct exec **prog;
 183                 int opt;
 184                 while ((opt = getopt_long(argc, argv, options, long_options, NULL))
 185                        != -1) {
 186                         switch(opt) {
 187                         case 't': dotrace=1; break;
 188                         case 'p': doprint=1; break;
 189                         case 'n': doexec=0; break;
 190                         case 'b': brackets=1; break;
 191                         case 's': section = optarg; break;
 192                         default: fprintf(stderr, Usage);
 193                                 exit(1);
 194                         }
 195                 }
 196                 if (optind >= argc) {
 197                         fprintf(stderr, "oceani: no input file given\n");
 198                         exit(1);
 199                 }
 200                 fd = open(argv[optind], O_RDONLY);
 201                 if (fd < 0) {
 202                         fprintf(stderr, "oceani: cannot open %s\n", argv[optind]);
 203                         exit(1);
 204                 }
 205                 context.file_name = argv[optind];
 206                 len = lseek(fd, 0, 2);
 207                 file = mmap(NULL, len, PROT_READ, MAP_SHARED, fd, 0);
 208                 s = code_extract(file, file+len, NULL);
 209                 if (!s) {
 210                         fprintf(stderr, "oceani: could not find any code in %s\n",
 211                                 argv[optind]);
 212                         exit(1);
 213                 }
 214
 215                 ## context initialization
 216
 217                 if (section) {
 218                         struct section *ss;
 219                         for (ss = s; ss; ss = ss->next) {
 220                                 struct text sec = ss->section;
 221                                 if (sec.len == strlen(section) &&
 222                                     strncmp(sec.txt, section, sec.len) == 0)
 223                                         break;
 224                         }
 225                         if (ss)
 226                                 prog = parse_oceani(ss->code, &context.config,
 227                                                     dotrace ? stderr : NULL);
 228                         else {
 229                                 fprintf(stderr, "oceani: cannot find section %s\n",
 230                                         section);
 231                                 exit(1);
 232                         }
 233                 } else
 234                         prog = parse_oceani(s->code, &context.config,
 235                                     dotrace ? stderr : NULL);
 236                 if (!prog) {
 237                         fprintf(stderr, "oceani: fatal parser error.\n");
 238                         context.parse_error = 1;
 239                 }
 240                 if (prog && doprint)
 241                         print_exec(*prog, 0, brackets);
 242                 if (prog && doexec && !context.parse_error) {
 243                         if (!analyse_prog(*prog, &context)) {
 244                                 fprintf(stderr, "oceani: type error in program - not running.\n");
 245                                 exit(1);
 246                         }
 247                         interp_prog(*prog, argv+optind+1);
 248                 }
 249                 if (prog) {
 250                         free_exec(*prog);
 251                         free(prog);
 252                 }
 253                 while (s) {
 254                         struct section *t = s->next;
 255                         code_free(s->code);
 256                         free(s);
 257                         s = t;
 258                 }
 259                 ## free context vars
 260                 ## free context types
 261                 exit(context.parse_error ? 1 : 0);
 262         }
 263
 264 ### Analysis
 265
 266 The four requirements of parse, analyse, print, interpret apply to
 267 each language element individually so that is how most of the code
 268 will be structured.
 269
 270 Three of the four are fairly self explanatory.  The one that requires
 271 a little explanation is the analysis step.
 272
 273 The current language design does not require the types of variables to
 274 be declared, but they must still have a single type.  Different
 275 operations impose different requirements on the variables, for example
 276 addition requires both arguments to be numeric, and assignment
 277 requires the variable on the left to have the same type as the
 278 expression on the right.
 279
 280 Analysis involves propagating these type requirements around and
 281 consequently setting the type of each variable.  If any requirements
 282 are violated (e.g. a string is compared with a number) or if a
 283 variable needs to have two different types, then an error is raised
 284 and the program will not run.
 285
 286 If the same variable is declared in both branchs of an 'if/else', or
 287 in all cases of a 'switch' then the multiple instances may be merged
 288 into just one variable if the variable is references after the
 289 conditional statement.  When this happens, the types must naturally be
 290 consistent across all the branches.  When the variable is not used
 291 outside the if, the variables in the different branches are distinct
 292 and can be of different types.
 293
 294 Determining the types of all variables early is important for
 295 processing command line arguments.  These can be assigned to any type
 296 of variable, but we must first know the correct type so any required
 297 conversion can happen.  If a variable is associated with a command
 298 line argument but no type can be interpreted (e.g. the variable is
 299 only ever used in a `print` statement), then the type is set to
 300 'string'.
 301
 302 Undeclared names may only appear in "use" statements and "case" expressions.
 303 These names are given a type of "label" and a unique value.
 304 This allows them to fill the role of a name in an enumerated type, which
 305 is useful for testing the `switch` statement.
 306
 307 As we will see, the condition part of a `while` statement can return
 308 either a Boolean or some other type.  This requires that the expect
 309 type that gets passed around comprises a type (`enum vtype`) and a
 310 flag to indicate that `Vbool` is also permitted.
 311
 312 As there are, as yet, no distinct types that are compatible, there
 313 isn't much subtlety in the analysis.  When we have distinct number
 314 types, this will become more interesting.
 315
 316 #### Error reporting
 317
 318 When analysis discovers an inconsistency it needs to report an error;
 319 just refusing to run the code ensures that the error doesn't cascade,
 320 but by itself it isn't very useful.  A clear understand of the sort of
 321 error message that are useful will help guide the process of analysis.
 322
 323 At a simplistic level, the only sort of error that type analysis can
 324 report is that the type of some construct doesn't match a contextual
 325 requirement.  For example, in `4 + "hello"` the addition provides a
 326 contextual requirement for numbers, but `"hello"` is not a number.  In
 327 this particular example no further information is needed as the types
 328 are obvious from local information.  When a variable is involved that
 329 isn't the case.  It may be helpful to explain why the variable has a
 330 particular type, by indicating the location where the type was set,
 331 whether by declaration or usage.
 332
 333 Using a recursive-descent analysis we can easily detect a problem at
 334 multiple locations. In "`hello:= "there"; 4 + hello`" the addition
 335 will detect that one argument is not a number and the usage of `hello`
 336 will detect that a number was wanted, but not provided.  In this
 337 (early) version of the language, we will generate error reports at
 338 multiple locations, so the use of `hello` will report an error and
 339 explain were the value was set, and the addition will report an error
 340 and say why numbers are needed.  To be able to report locations for
 341 errors, each language element will need to record a file location
 342 (line and column) and each variable will need to record the language
 343 element where its type was set.  For now we will assume that each line
 344 of an error message indicates one location in the file, and up to 2
 345 types.  So we provide a `printf`-like function which takes a format, a
 346 language (a `struct exec` which has not yet been introduced), and 2
 347 types. "`%1`" reports the first type, "`%2`" reports the second.  We
 348 will need a function to print the location, once we know how that is
 349 stored.  As will be explained later, there are sometimes extra rules for
 350 type matching and they might affect error messages, we need to pass those
 351 in too.
 352
 353 As well as type errors, we sometimes need to report problems with
 354 tokens, which might be unexpected or might name a type that has not
 355 been defined.  For these we have `tok_err()` which reports an error
 356 with a given token.  Each of the error functions sets the flag in the
 357 context so indicate that parsing failed.
 358
 359 ###### forward decls
 360
 361         static void fput_loc(struct exec *loc, FILE *f);
 362
 363 ###### core functions
 364
 365         static void type_err(struct parse_context *c,
 366                              char *fmt, struct exec *loc,
 367                              struct type *t1, int rules, struct type *t2)
 368         {
 369                 fprintf(stderr, "%s:", c->file_name);
 370                 fput_loc(loc, stderr);
 371                 for (; *fmt ; fmt++) {
 372                         if (*fmt != '%') {
 373                                 fputc(*fmt, stderr);
 374                                 continue;
 375                         }
 376                         fmt++;
 377                         switch (*fmt) {
 378                         case '%': fputc(*fmt, stderr); break;
 379                         default: fputc('?', stderr); break;
 380                         case '1':
 381                                 type_print(t1, stderr);
 382                                 break;
 383                         case '2':
 384                                 type_print(t2, stderr);
 385                                 break;
 386                         ## format cases
 387                         }
 388                 }
 389                 fputs("\n", stderr);
 390                 c->parse_error = 1;
 391         }
 392
 393         static void tok_err(struct parse_context *c, char *fmt, struct token *t)
 394         {
 395                 fprintf(stderr, "%s:%d:%d: %s: %.*s\n", c->file_name, t->line, t->col, fmt,
 396                         t->txt.len, t->txt.txt);
 397                 c->parse_error = 1;
 398         }
 399
 400 ## Data Structures
 401
 402 One last introductory step before detailing the language elements and
 403 providing their four requirements is to establish the data structures
 404 to store these elements.
 405
 406 There are two key objects that we need to work with: executable
 407 elements which comprise the program, and values which the program
 408 works with.  Between these are the variables in their various scopes
 409 which hold the values, and types which classify the values stored and
 410 manipulatd by executables.
 411
 412 ### Types
 413
 414 Values come in a wide range of types, with more likely to be added.
 415 Each type needs to be able to parse and print its own values (for
 416 convenience at least) as well as to compare two values, at least for
 417 equality and possibly for order.  For now, values might need to be
 418 duplicated and freed, though eventually such manipulations will be
 419 better integrated into the language.
 420
 421 Rather than requiring every numeric type to support all numeric
 422 operations (add, multiple, etc), we allow types to be able to present
 423 as one of a few standard types: integer, float, and fraction.  The
 424 existance of these conversion functions enable types to determine if
 425 they are compatible with other types.
 426
 427 Named type are stored in a simple linked list.  Objects of each type are "values"
 428 which are often passed around by value.
 429
 430 ###### ast
 431
 432         struct value {
 433                 struct type *type;
 434                 union {
 435                         ## value union fields
 436                 };
 437         };
 438
 439         struct type {
 440                 struct text name;
 441                 struct type *next;
 442                 struct value (*init)(struct type *type);
 443                 struct value (*prepare)(struct type *type);
 444                 struct value (*parse)(struct type *type, char *str);
 445                 void (*print)(struct value val);
 446                 void (*print_type)(struct type *type, FILE *f);
 447                 int (*cmp_order)(struct value v1, struct value v2);
 448                 int (*cmp_eq)(struct value v1, struct value v2);
 449                 struct value (*dup)(struct value val);
 450                 void (*free)(struct value val);
 451                 int (*compat)(struct type *this, struct type *other);
 452                 long long (*to_int)(struct value *v);
 453                 double (*to_float)(struct value *v);
 454                 int (*to_mpq)(mpq_t *q, struct value *v);
 455                 union {
 456                         ## type union fields
 457                 };
 458         };
 459
 460 ###### parse context
 461
 462         struct type *typelist;
 463
 464 ###### ast functions
 465
 466         static struct type *find_type(struct parse_context *c, struct text s)
 467         {
 468                 struct type *l = c->typelist;
 469
 470                 while (l &&
 471                        text_cmp(l->name, s) != 0)
 472                                 l = l->next;
 473                 return l;
 474         }
 475
 476         static struct type *add_type(struct parse_context *c, struct text s,
 477                                      struct type *proto)
 478         {
 479                 struct type *n;
 480
 481                 n = calloc(1, sizeof(*n));
 482                 *n = *proto;
 483                 n->name = s;
 484                 n->next = c->typelist;
 485                 c->typelist = n;
 486                 return n;
 487         }
 488
 489         static void free_type(struct type *t)
 490         {
 491                 /* The type is always a reference to something in the
 492                  * context, so we don't need to free anything.
 493                  */
 494         }
 495
 496         static void free_value(struct value v)
 497         {
 498                 if (v.type)
 499                         v.type->free(v);
 500         }
 501
 502         static int type_compat(struct type *require, struct type *have, int rules)
 503         {
 504                 if ((rules & Rboolok) && have == Tbool)
 505                         return 1;
 506                 if ((rules & Rnolabel) && have == Tlabel)
 507                         return 0;
 508                 if (!require || !have)
 509                         return 1;
 510
 511                 if (require->compat)
 512                         return require->compat(require, have);
 513
 514                 return require == have;
 515         }
 516
 517         static void type_print(struct type *type, FILE *f)
 518         {
 519                 if (!type)
 520                         fputs("*unknown*type*", f);
 521                 else if (type->name.len)
 522                         fprintf(f, "%.*s", type->name.len, type->name.txt);
 523                 else if (type->print_type)
 524                         type->print_type(type, f);
 525                 else
 526                         fputs("*invalid*type*", f);
 527         }
 528
 529         static struct value val_prepare(struct type *type)
 530         {
 531                 struct value rv;
 532
 533                 if (type)
 534                         return type->prepare(type);
 535                 rv.type = type;
 536                 return rv;
 537         }
 538
 539         static struct value val_init(struct type *type)
 540         {
 541                 struct value rv;
 542
 543                 if (type)
 544                         return type->init(type);
 545                 rv.type = type;
 546                 return rv;
 547         }
 548
 549         static struct value dup_value(struct value v)
 550         {
 551                 if (v.type)
 552                         return v.type->dup(v);
 553                 return v;
 554         }
 555
 556         static int value_cmp(struct value left, struct value right)
 557         {
 558                 if (left.type && left.type->cmp_order)
 559                         return left.type->cmp_order(left, right);
 560                 if (left.type && left.type->cmp_eq)
 561                         return left.type->cmp_eq(left, right);
 562                 return -1;
 563         }
 564
 565         static void print_value(struct value v)
 566         {
 567                 if (v.type && v.type->print)
 568                         v.type->print(v);
 569                 else
 570                         printf("*Unknown*");
 571         }
 572
 573         static struct value parse_value(struct type *type, char *arg)
 574         {
 575                 struct value rv;
 576
 577                 if (type && type->parse)
 578                         return type->parse(type, arg);
 579                 rv.type = NULL;
 580                 return rv;
 581         }
 582
 583 ###### forward decls
 584
 585         static void free_value(struct value v);
 586         static int type_compat(struct type *require, struct type *have, int rules);
 587         static void type_print(struct type *type, FILE *f);
 588         static struct value val_init(struct type *type);
 589         static struct value dup_value(struct value v);
 590         static int value_cmp(struct value left, struct value right);
 591         static void print_value(struct value v);
 592         static struct value parse_value(struct type *type, char *arg);
 593
 594 ###### free context types
 595
 596         while (context.typelist) {
 597                 struct type *t = context.typelist;
 598
 599                 context.typelist = t->next;
 600                 free(t);
 601         }
 602
 603 #### Base Types
 604
 605 Values of the base types can be numbers, which we represent as
 606 multi-precision fractions, strings, Booleans and labels.  When
 607 analysing the program we also need to allow for places where no value
 608 is meaningful (type `Tnone`) and where we don't know what type to
 609 expect yet (type is `NULL`).
 610
 611 Values are never shared, they are always copied when used, and freed
 612 when no longer needed.
 613
 614 When propagating type information around the program, we need to
 615 determine if two types are compatible, where type `NULL` is compatible
 616 with anything.  There are two special cases with type compatibility,
 617 both related to the Conditional Statement which will be described
 618 later.  In some cases a Boolean can be accepted as well as some other
 619 primary type, and in others any type is acceptable except a label (`Vlabel`).
 620 A separate function encode these cases will simplify some code later.
 621
 622 When assigning command line arguments to variables, we need to be able
 623 to parse each type from a string.
 624
 625 ###### includes
 626         #include <gmp.h>
 627         #include "string.h"
 628         #include "number.h"
 629
 630 ###### libs
 631         myLDLIBS := libnumber.o libstring.o -lgmp
 632         LDLIBS := $(filter-out $(myLDLIBS),$(LDLIBS)) $(myLDLIBS)
 633
 634 ###### type union fields
 635         enum vtype {Vnone, Vstr, Vnum, Vbool, Vlabel} vtype;
 636
 637 ###### value union fields
 638         struct text str;
 639         mpq_t num;
 640         int bool;
 641         void *label;
 642
 643 ###### ast functions
 644         static void _free_value(struct value v)
 645         {
 646                 switch (v.type->vtype) {
 647                 case Vnone: break;
 648                 case Vstr: free(v.str.txt); break;
 649                 case Vnum: mpq_clear(v.num); break;
 650                 case Vlabel:
 651                 case Vbool: break;
 652                 }
 653         }
 654
 655 ###### value functions
 656
 657         static struct value _val_prepare(struct type *type)
 658         {
 659                 struct value rv;
 660
 661                 rv.type = type;
 662                 switch(type->vtype) {
 663                 case Vnone:
 664                         break;
 665                 case Vnum:
 666                         memset(&rv.num, 0, sizeof(rv.num));
 667                         break;
 668                 case Vstr:
 669                         rv.str.txt = NULL;
 670                         rv.str.len = 0;
 671                         break;
 672                 case Vbool:
 673                         rv.bool = 0;
 674                         break;
 675                 case Vlabel:
 676                         rv.label = NULL;
 677                         break;
 678                 }
 679                 return rv;
 680         }
 681
 682         static struct value _val_init(struct type *type)
 683         {
 684                 struct value rv;
 685
 686                 rv.type = type;
 687                 switch(type->vtype) {
 688                 case Vnone:
 689                         break;
 690                 case Vnum:
 691                         mpq_init(rv.num); break;
 692                 case Vstr:
 693                         rv.str.txt = malloc(1);
 694                         rv.str.len = 0;
 695                         break;
 696                 case Vbool:
 697                         rv.bool = 0;
 698                         break;
 699                 case Vlabel:
 700                         rv.label = NULL;
 701                         break;
 702                 }
 703                 return rv;
 704         }
 705
 706         static struct value _dup_value(struct value v)
 707         {
 708                 struct value rv;
 709                 rv.type = v.type;
 710                 switch (rv.type->vtype) {
 711                 case Vnone:
 712                         break;
 713                 case Vlabel:
 714                         rv.label = v.label;
 715                         break;
 716                 case Vbool:
 717                         rv.bool = v.bool;
 718                         break;
 719                 case Vnum:
 720                         mpq_init(rv.num);
 721                         mpq_set(rv.num, v.num);
 722                         break;
 723                 case Vstr:
 724                         rv.str.len = v.str.len;
 725                         rv.str.txt = malloc(rv.str.len);
 726                         memcpy(rv.str.txt, v.str.txt, v.str.len);
 727                         break;
 728                 }
 729                 return rv;
 730         }
 731
 732         static int _value_cmp(struct value left, struct value right)
 733         {
 734                 int cmp;
 735                 if (left.type != right.type)
 736                         return left.type - right.type;
 737                 switch (left.type->vtype) {
 738                 case Vlabel: cmp = left.label == right.label ? 0 : 1; break;
 739                 case Vnum: cmp = mpq_cmp(left.num, right.num); break;
 740                 case Vstr: cmp = text_cmp(left.str, right.str); break;
 741                 case Vbool: cmp = left.bool - right.bool; break;
 742                 case Vnone: cmp = 0;
 743                 }
 744                 return cmp;
 745         }
 746
 747         static void _print_value(struct value v)
 748         {
 749                 switch (v.type->vtype) {
 750                 case Vnone:
 751                         printf("*no-value*"); break;
 752                 case Vlabel:
 753                         printf("*label-%p*", v.label); break;
 754                 case Vstr:
 755                         printf("%.*s", v.str.len, v.str.txt); break;
 756                 case Vbool:
 757                         printf("%s", v.bool ? "True":"False"); break;
 758                 case Vnum:
 759                         {
 760                         mpf_t fl;
 761                         mpf_init2(fl, 20);
 762                         mpf_set_q(fl, v.num);
 763                         gmp_printf("%Fg", fl);
 764                         mpf_clear(fl);
 765                         break;
 766                         }
 767                 }
 768         }
 769
 770         static struct value _parse_value(struct type *type, char *arg)
 771         {
 772                 struct value val;
 773                 struct text tx;
 774                 int neg = 0;
 775                 char tail[3] = "";
 776
 777                 val.type = type;
 778                 switch(type->vtype) {
 779                 case Vlabel:
 780                 case Vnone:
 781                         val.type = NULL;
 782                         break;
 783                 case Vstr:
 784                         val.str.len = strlen(arg);
 785                         val.str.txt = malloc(val.str.len);
 786                         memcpy(val.str.txt, arg, val.str.len);
 787                         break;
 788                 case Vnum:
 789                         if (*arg == '-') {
 790                                 neg = 1;
 791                                 arg++;
 792                         }
 793                         tx.txt = arg; tx.len = strlen(tx.txt);
 794                         if (number_parse(val.num, tail, tx) == 0)
 795                                 mpq_init(val.num);
 796                         else if (neg)
 797                                 mpq_neg(val.num, val.num);
 798                         if (tail[0]) {
 799                                 printf("Unsupported suffix: %s\n", arg);
 800                                 val.type = NULL;
 801                         }
 802                         break;
 803                 case Vbool:
 804                         if (strcasecmp(arg, "true") == 0 ||
 805                             strcmp(arg, "1") == 0)
 806                                 val.bool = 1;
 807                         else if (strcasecmp(arg, "false") == 0 ||
 808                                  strcmp(arg, "0") == 0)
 809                                 val.bool = 0;
 810                         else {
 811                                 printf("Bad bool: %s\n", arg);
 812                                 val.type = NULL;
 813                         }
 814                         break;
 815                 }
 816                 return val;
 817         }
 818
 819         static void _free_value(struct value v);
 820
 821         static struct type base_prototype = {
 822                 .init = _val_init,
 823                 .prepare = _val_prepare,
 824                 .parse = _parse_value,
 825                 .print = _print_value,
 826                 .cmp_order = _value_cmp,
 827                 .cmp_eq = _value_cmp,
 828                 .dup = _dup_value,
 829                 .free = _free_value,
 830         };
 831
 832         static struct type *Tbool, *Tstr, *Tnum, *Tnone, *Tlabel;
 833
 834 ###### ast functions
 835         static struct type *add_base_type(struct parse_context *c, char *n, enum vtype vt)
 836         {
 837                 struct text txt = { n, strlen(n) };
 838                 struct type *t;
 839
 840                 t = add_type(c, txt, &base_prototype);
 841                 t->vtype = vt;
 842                 return t;
 843         }
 844
 845 ###### context initialization
 846
 847         Tbool  = add_base_type(&context, "Boolean", Vbool);
 848         Tstr   = add_base_type(&context, "string", Vstr);
 849         Tnum   = add_base_type(&context, "number", Vnum);
 850         Tnone  = add_base_type(&context, "none", Vnone);
 851         Tlabel = add_base_type(&context, "label", Vlabel);
 852
 853 ### Variables
 854
 855 Variables are scoped named values.  We store the names in a linked
 856 list of "bindings" sorted lexically, and use sequential search and
 857 insertion sort.
 858
 859 ###### ast
 860
 861         struct binding {
 862                 struct text name;
 863                 struct binding *next;   // in lexical order
 864                 ## binding fields
 865         };
 866
 867 This linked list is stored in the parse context so that "reduce"
 868 functions can find or add variables, and so the analysis phase can
 869 ensure that every variable gets a type.
 870
 871 ###### parse context
 872
 873         struct binding *varlist;  // In lexical order
 874
 875 ###### ast functions
 876
 877         static struct binding *find_binding(struct parse_context *c, struct text s)
 878         {
 879                 struct binding **l = &c->varlist;
 880                 struct binding *n;
 881                 int cmp = 1;
 882
 883                 while (*l &&
 884                         (cmp = text_cmp((*l)->name, s)) < 0)
 885                                 l = & (*l)->next;
 886                 if (cmp == 0)
 887                         return *l;
 888                 n = calloc(1, sizeof(*n));
 889                 n->name = s;
 890                 n->next = *l;
 891                 *l = n;
 892                 return n;
 893         }
 894
 895 Each name can be linked to multiple variables defined in different
 896 scopes.  Each scope starts where the name is declared and continues
 897 until the end of the containing code block.  Scopes of a given name
 898 cannot nest, so a declaration while a name is in-scope is an error.
 899
 900 ###### binding fields
 901         struct variable *var;
 902
 903 ###### ast
 904         struct variable {
 905                 struct variable *previous;
 906                 struct value val;
 907                 struct binding *name;
 908                 struct exec *where_decl;// where name was declared
 909                 struct exec *where_set; // where type was set
 910                 ## variable fields
 911         };
 912
 913 While the naming seems strange, we include local constants in the
 914 definition of variables.  A name declared `var := value` can
 915 subsequently be changed, but a name declared `var ::= value` cannot -
 916 it is constant
 917
 918 ###### variable fields
 919         int constant;
 920
 921 Scopes in parallel branches can be partially merged.  More
 922 specifically, if a given name is declared in both branches of an
 923 if/else then its scope is a candidate for merging.  Similarly if
 924 every branch of an exhaustive switch (e.g. has an "else" clause)
 925 declares a given name, then the scopes from the branches are
 926 candidates for merging.
 927
 928 Note that names declared inside a loop (which is only parallel to
 929 itself) are never visible after the loop.  Similarly names defined in
 930 scopes which are not parallel, such as those started by `for` and
 931 `switch`, are never visible after the scope.  Only variables defined in
 932 both `then` and `else` (including the implicit then after an `if`, and
 933 excluding `then` used with `for`) and in all `case`s and `else` of a
 934 `switch` or `while` can be visible beyond the `if`/`switch`/`while`.
 935
 936 Labels, which are a bit like variables, follow different rules.
 937 Labels are not explicitly declared, but if an undeclared name appears
 938 in a context where a label is legal, that effectively declares the
 939 name as a label.  The declaration remains in force (or in scope) at
 940 least to the end of the immediately containing block and conditionally
 941 in any larger containing block which does not declare the name in some
 942 other way.  Importantly, the conditional scope extension happens even
 943 if the label is only used in one parallel branch of a conditional --
 944 when used in one branch it is treated as having been declared in all
 945 branches.
 946
 947 Merge candidates are tentatively visible beyond the end of the
 948 branching statement which creates them.  If the name is used, the
 949 merge is affirmed and they become a single variable visible at the
 950 outer layer.  If not - if it is redeclared first - the merge lapses.
 951
 952 To track scopes we have an extra stack, implemented as a linked list,
 953 which roughly parallels the parse stack and which is used exclusively
 954 for scoping.  When a new scope is opened, a new frame is pushed and
 955 the child-count of the parent frame is incremented.  This child-count
 956 is used to distinguish between the first of a set of parallel scopes,
 957 in which declared variables must not be in scope, and subsequent
 958 branches, whether they must already be conditionally scoped.
 959
 960 To push a new frame *before* any code in the frame is parsed, we need a
 961 grammar reduction.  This is most easily achieved with a grammar
 962 element which derives the empty string, and creates the new scope when
 963 it is recognized.  This can be placed, for example, between a keyword
 964 like "if" and the code following it.
 965
 966 ###### ast
 967         struct scope {
 968                 struct scope *parent;
 969                 int child_count;
 970         };
 971
 972 ###### parse context
 973         int scope_depth;
 974         struct scope *scope_stack;
 975
 976 ###### ast functions
 977         static void scope_pop(struct parse_context *c)
 978         {
 979                 struct scope *s = c->scope_stack;
 980
 981                 c->scope_stack = s->parent;
 982                 free(s);
 983                 c->scope_depth -= 1;
 984         }
 985
 986         static void scope_push(struct parse_context *c)
 987         {
 988                 struct scope *s = calloc(1, sizeof(*s));
 989                 if (c->scope_stack)
 990                         c->scope_stack->child_count += 1;
 991                 s->parent = c->scope_stack;
 992                 c->scope_stack = s;
 993                 c->scope_depth += 1;
 994         }
 995
 996 ###### Grammar
 997
 998         $void
 999         OpenScope -> ${ scope_push(config2context(config)); }$
1000
1001
1002 Each variable records a scope depth and is in one of four states:
1003
1004 - "in scope".  This is the case between the declaration of the
1005   variable and the end of the containing block, and also between
1006   the usage with affirms a merge and the end of that block.
1007
1008   The scope depth is not greater than the current parse context scope
1009   nest depth.  When the block of that depth closes, the state will
1010   change.  To achieve this, all "in scope" variables are linked
1011   together as a stack in nesting order.
1012
1013 - "pending".  The "in scope" block has closed, but other parallel
1014   scopes are still being processed.  So far, every parallel block at
1015   the same level that has closed has declared the name.
1016
1017   The scope depth is the depth of the last parallel block that
1018   enclosed the declaration, and that has closed.
1019
1020 - "conditionally in scope".  The "in scope" block and all parallel
1021   scopes have closed, and no further mention of the name has been
1022   seen.  This state includes a secondary nest depth which records the
1023   outermost scope seen since the variable became conditionally in
1024   scope.  If a use of the name is found, the variable becomes "in
1025   scope" and that secondary depth becomes the recorded scope depth.
1026   If the name is declared as a new variable, the old variable becomes
1027   "out of scope" and the recorded scope depth stays unchanged.
1028
1029 - "out of scope".  The variable is neither in scope nor conditionally
1030   in scope.  It is permanently out of scope now and can be removed from
1031   the "in scope" stack.
1032
1033
1034 ###### variable fields
1035         int depth, min_depth;
1036         enum { OutScope, PendingScope, CondScope, InScope } scope;
1037         struct variable *in_scope;
1038
1039 ###### parse context
1040
1041         struct variable *in_scope;
1042
1043 All variables with the same name are linked together using the
1044 'previous' link.  Those variable that have
1045 been affirmatively merged all have a 'merged' pointer that points to
1046 one primary variable - the most recently declared instance. When
1047 merging variables, we need to also adjust the 'merged' pointer on any
1048 other variables that had previously been merged with the one that will
1049 no longer be primary.
1050
1051 ###### variable fields
1052         struct variable *merged;
1053
1054 ###### ast functions
1055
1056         static void variable_merge(struct variable *primary, struct variable *secondary)
1057         {
1058                 struct variable *v;
1059
1060                 if (primary->merged)
1061                         // shouldn't happen
1062                         primary = primary->merged;
1063
1064                 for (v = primary->previous; v; v=v->previous)
1065                         if (v == secondary || v == secondary->merged ||
1066                             v->merged == secondary ||
1067                             (v->merged && v->merged == secondary->merged)) {
1068                                 v->scope = OutScope;
1069                                 v->merged = primary;
1070                         }
1071         }
1072
1073 ###### free context vars
1074
1075         while (context.varlist) {
1076                 struct binding *b = context.varlist;
1077                 struct variable *v = b->var;
1078                 context.varlist = b->next;
1079                 free(b);
1080                 while (v) {
1081                         struct variable *t = v;
1082
1083                         v = t->previous;
1084                         free_value(t->val);
1085                         free(t);
1086                 }
1087         }
1088
1089 #### Manipulating Bindings
1090
1091 When a name is conditionally visible, a new declaration discards the
1092 old binding - the condition lapses.  Conversely a usage of the name
1093 affirms the visibility and extends it to the end of the containing
1094 block - i.e. the block that contains both the original declaration and
1095 the latest usage.  This is determined from `min_depth`.  When a
1096 conditionally visible variable gets affirmed like this, it is also
1097 merged with other conditionally visible variables with the same name.
1098
1099 When we parse a variable declaration we either signal an error if the
1100 name is currently bound, or create a new variable at the current nest
1101 depth if the name is unbound or bound to a conditionally scoped or
1102 pending-scope variable.  If the previous variable was conditionally
1103 scoped, it and its homonyms becomes out-of-scope.
1104
1105 When we parse a variable reference (including non-declarative
1106 assignment) we signal an error if the name is not bound or is bound to
1107 a pending-scope variable; update the scope if the name is bound to a
1108 conditionally scoped variable; or just proceed normally if the named
1109 variable is in scope.
1110
1111 When we exit a scope, any variables bound at this level are either
1112 marked out of scope or pending-scoped, depending on whether the
1113 scope was sequential or parallel.
1114
1115 When exiting a parallel scope we check if there are any variables that
1116 were previously pending and are still visible. If there are, then
1117 there weren't redeclared in the most recent scope, so they cannot be
1118 merged and must become out-of-scope.  If it is not the first of
1119 parallel scopes (based on `child_count`), we check that there was a
1120 previous binding that is still pending-scope.  If there isn't, the new
1121 variable must now be out-of-scope.
1122
1123 When exiting a sequential scope that immediately enclosed parallel
1124 scopes, we need to resolve any pending-scope variables.  If there was
1125 no `else` clause, and we cannot determine that the `switch` was exhaustive,
1126 we need to mark all pending-scope variable as out-of-scope.  Otherwise
1127 all pending-scope variables become conditionally scoped.
1128
1129 ###### ast
1130         enum closetype { CloseSequential, CloseParallel, CloseElse };
1131
1132 ###### ast functions
1133
1134         static struct variable *var_decl(struct parse_context *c, struct text s)
1135         {
1136                 struct binding *b = find_binding(c, s);
1137                 struct variable *v = b->var;
1138
1139                 switch (v ? v->scope : OutScope) {
1140                 case InScope:
1141                         /* Caller will report the error */
1142                         return NULL;
1143                 case CondScope:
1144                         for (;
1145                              v && v->scope == CondScope;
1146                              v = v->previous)
1147                                 v->scope = OutScope;
1148                         break;
1149                 default: break;
1150                 }
1151                 v = calloc(1, sizeof(*v));
1152                 v->previous = b->var;
1153                 b->var = v;
1154                 v->name = b;
1155                 v->min_depth = v->depth = c->scope_depth;
1156                 v->scope = InScope;
1157                 v->in_scope = c->in_scope;
1158                 c->in_scope = v;
1159                 v->val = val_prepare(NULL);
1160                 return v;
1161         }
1162
1163         static struct variable *var_ref(struct parse_context *c, struct text s)
1164         {
1165                 struct binding *b = find_binding(c, s);
1166                 struct variable *v = b->var;
1167                 struct variable *v2;
1168
1169                 switch (v ? v->scope : OutScope) {
1170                 case OutScope:
1171                 case PendingScope:
1172                         /* Signal an error - once that is possible */
1173                         return NULL;
1174                 case CondScope:
1175                         /* All CondScope variables of this name need to be merged
1176                          * and become InScope
1177                          */
1178                         v->depth = v->min_depth;
1179                         v->scope = InScope;
1180                         for (v2 = v->previous;
1181                              v2 && v2->scope == CondScope;
1182                              v2 = v2->previous)
1183                                 variable_merge(v, v2);
1184                         break;
1185                 case InScope:
1186                         break;
1187                 }
1188                 return v;
1189         }
1190
1191         static void var_block_close(struct parse_context *c, enum closetype ct)
1192         {
1193                 /* close of all variables that are in_scope */
1194                 struct variable *v, **vp, *v2;
1195
1196                 scope_pop(c);
1197                 for (vp = &c->in_scope;
1198                      v = *vp, v && v->depth > c->scope_depth && v->min_depth > c->scope_depth;
1199                      ) {
1200                         switch (ct) {
1201                         case CloseElse:
1202                         case CloseParallel: /* handle PendingScope */
1203                                 switch(v->scope) {
1204                                 case InScope:
1205                                 case CondScope:
1206                                         if (c->scope_stack->child_count == 1)
1207                                                 v->scope = PendingScope;
1208                                         else if (v->previous &&
1209                                                  v->previous->scope == PendingScope)
1210                                                 v->scope = PendingScope;
1211                                         else if (v->val.type == Tlabel)
1212                                                 v->scope = PendingScope;
1213                                         else if (v->name->var == v)
1214                                                 v->scope = OutScope;
1215                                         if (ct == CloseElse) {
1216                                                 /* All Pending variables with this name
1217                                                  * are now Conditional */
1218                                                 for (v2 = v;
1219                                                      v2 && v2->scope == PendingScope;
1220                                                      v2 = v2->previous)
1221                                                         v2->scope = CondScope;
1222                                         }
1223                                         break;
1224                                 case PendingScope:
1225                                         for (v2 = v;
1226                                              v2 && v2->scope == PendingScope;
1227                                              v2 = v2->previous)
1228                                                 if (v2->val.type != Tlabel)
1229                                                         v2->scope = OutScope;
1230                                         break;
1231                                 case OutScope: break;
1232                                 }
1233                                 break;
1234                         case CloseSequential:
1235                                 if (v->val.type == Tlabel)
1236                                         v->scope = PendingScope;
1237                                 switch (v->scope) {
1238                                 case InScope:
1239                                         v->scope = OutScope;
1240                                         break;
1241                                 case PendingScope:
1242                                         /* There was no 'else', so we can only become
1243                                          * conditional if we know the cases were exhaustive,
1244                                          * and that doesn't mean anything yet.
1245                                          * So only labels become conditional..
1246                                          */
1247                                         for (v2 = v;
1248                                              v2 && v2->scope == PendingScope;
1249                                              v2 = v2->previous)
1250                                                 if (v2->val.type == Tlabel) {
1251                                                         v2->scope = CondScope;
1252                                                         v2->min_depth = c->scope_depth;
1253                                                 } else
1254                                                         v2->scope = OutScope;
1255                                         break;
1256                                 case CondScope:
1257                                 case OutScope: break;
1258                                 }
1259                                 break;
1260                         }
1261                         if (v->scope == OutScope)
1262                                 *vp = v->in_scope;
1263                         else
1264                                 vp = &v->in_scope;
1265                 }
1266         }
1267
1268 ### Executables
1269
1270 Executables can be lots of different things.  In many cases an
1271 executable is just an operation combined with one or two other
1272 executables.  This allows for expressions and lists etc.  Other times
1273 an executable is something quite specific like a constant or variable
1274 name.  So we define a `struct exec` to be a general executable with a
1275 type, and a `struct binode` which is a subclass of `exec`, forms a
1276 node in a binary tree, and holds an operation. There will be other
1277 subclasses, and to access these we need to be able to `cast` the
1278 `exec` into the various other types.
1279
1280 ###### macros
1281         #define cast(structname, pointer) ({            \
1282                 const typeof( ((struct structname *)0)->type) *__mptr = &(pointer)->type; \
1283                 if (__mptr && *__mptr != X##structname) abort();                \
1284                 (struct structname *)( (char *)__mptr);})
1285
1286         #define new(structname) ({                                              \
1287                 struct structname *__ptr = ((struct structname *)calloc(1,sizeof(struct structname))); \
1288                 __ptr->type = X##structname;                                            \
1289                 __ptr->line = -1; __ptr->column = -1;                                   \
1290                 __ptr;})
1291
1292         #define new_pos(structname, token) ({                                           \
1293                 struct structname *__ptr = ((struct structname *)calloc(1,sizeof(struct structname))); \
1294                 __ptr->type = X##structname;                                            \
1295                 __ptr->line = token.line; __ptr->column = token.col;                    \
1296                 __ptr;})
1297
1298 ###### ast
1299         enum exec_types {
1300                 Xbinode,
1301                 ## exec type
1302         };
1303         struct exec {
1304                 enum exec_types type;
1305                 int line, column;
1306         };
1307         struct binode {
1308                 struct exec;
1309                 enum Btype {
1310                         ## Binode types
1311                 } op;
1312                 struct exec *left, *right;
1313         };
1314
1315 ###### ast functions
1316
1317         static int __fput_loc(struct exec *loc, FILE *f)
1318         {
1319                 if (!loc)
1320                         return 0;
1321                 if (loc->line >= 0) {
1322                         fprintf(f, "%d:%d: ", loc->line, loc->column);
1323                         return 1;
1324                 }
1325                 if (loc->type == Xbinode)
1326                         return __fput_loc(cast(binode,loc)->left, f) ||
1327                                __fput_loc(cast(binode,loc)->right, f);
1328                 return 0;
1329         }
1330         static void fput_loc(struct exec *loc, FILE *f)
1331         {
1332                 if (!__fput_loc(loc, f))
1333                         fprintf(f, "??:??: ");
1334         }
1335
1336 Each different type of `exec` node needs a number of functions
1337 defined, a bit like methods.  We must be able to be able to free it,
1338 print it, analyse it and execute it.  Once we have specific `exec`
1339 types we will need to parse them too.  Let's take this a bit more
1340 slowly.
1341
1342 #### Freeing
1343
1344 The parser generator requires a `free_foo` function for each struct
1345 that stores attributes and they will be `exec`s and subtypes there-of.
1346 So we need `free_exec` which can handle all the subtypes, and we need
1347 `free_binode`.
1348
1349 ###### ast functions
1350
1351         static void free_binode(struct binode *b)
1352         {
1353                 if (!b)
1354                         return;
1355                 free_exec(b->left);
1356                 free_exec(b->right);
1357                 free(b);
1358         }
1359
1360 ###### core functions
1361         static void free_exec(struct exec *e)
1362         {
1363                 if (!e)
1364                         return;
1365                 switch(e->type) {
1366                         ## free exec cases
1367                 }
1368         }
1369
1370 ###### forward decls
1371
1372         static void free_exec(struct exec *e);
1373
1374 ###### free exec cases
1375         case Xbinode: free_binode(cast(binode, e)); break;
1376
1377 #### Printing
1378
1379 Printing an `exec` requires that we know the current indent level for
1380 printing line-oriented components.  As will become clear later, we
1381 also want to know what sort of bracketing to use.
1382
1383 ###### ast functions
1384
1385         static void do_indent(int i, char *str)
1386         {
1387                 while (i--)
1388                         printf("    ");
1389                 printf("%s", str);
1390         }
1391
1392 ###### core functions
1393         static void print_binode(struct binode *b, int indent, int bracket)
1394         {
1395                 struct binode *b2;
1396                 switch(b->op) {
1397                 ## print binode cases
1398                 }
1399         }
1400
1401         static void print_exec(struct exec *e, int indent, int bracket)
1402         {
1403                 if (!e)
1404                         return;
1405                 switch (e->type) {
1406                 case Xbinode:
1407                         print_binode(cast(binode, e), indent, bracket); break;
1408                 ## print exec cases
1409                 }
1410         }
1411
1412 ###### forward decls
1413
1414         static void print_exec(struct exec *e, int indent, int bracket);
1415
1416 #### Analysing
1417
1418 As discussed, analysis involves propagating type requirements around
1419 the program and looking for errors.
1420
1421 So `propagate_types` is passed an expected type (being a `struct type`
1422 pointer together with some `val_rules` flags) that the `exec` is
1423 expected to return, and returns the type that it does return, either
1424 of which can be `NULL` signifying "unknown".  An `ok` flag is passed
1425 by reference. It is set to `0` when an error is found, and `2` when
1426 any change is made.  If it remains unchanged at `1`, then no more
1427 propagation is needed.
1428
1429 ###### ast
1430
1431         enum val_rules {Rnolabel = 1<<0, Rboolok = 1<<1, Rnoconstant = 2<<1};
1432
1433 ###### format cases
1434         case 'r':
1435                 if (rules & Rnolabel)
1436                         fputs(" (labels not permitted)", stderr);
1437                 break;
1438
1439 ###### core functions
1440
1441         static struct type *propagate_types(struct exec *prog, struct parse_context *c, int *ok,
1442                                             struct type *type, int rules)
1443         {
1444                 struct type *t;
1445
1446                 if (!prog)
1447                         return Tnone;
1448
1449                 switch (prog->type) {
1450                 case Xbinode:
1451                 {
1452                         struct binode *b = cast(binode, prog);
1453                         switch (b->op) {
1454                         ## propagate binode cases
1455                         }
1456                         break;
1457                 }
1458                 ## propagate exec cases
1459                 }
1460                 return Tnone;
1461         }
1462
1463 #### Interpreting
1464
1465 Interpreting an `exec` doesn't require anything but the `exec`.  State
1466 is stored in variables and each variable will be directly linked from
1467 within the `exec` tree.  The exception to this is the whole `program`
1468 which needs to look at command line arguments.  The `program` will be
1469 interpreted separately.
1470
1471 Each `exec` can return a value, which may be `Tnone` but must be non-NULL;
1472
1473 ###### core functions
1474
1475         struct lrval {
1476                 struct value val, *lval;
1477         };
1478
1479         static struct lrval _interp_exec(struct exec *e);
1480
1481         static struct value interp_exec(struct exec *e)
1482         {
1483                 struct lrval ret = _interp_exec(e);
1484
1485                 if (ret.lval)
1486                         return dup_value(*ret.lval);
1487                 else
1488                         return ret.val;
1489         }
1490
1491         static struct value *linterp_exec(struct exec *e)
1492         {
1493                 struct lrval ret = _interp_exec(e);
1494
1495                 return ret.lval;
1496         }
1497
1498         static struct lrval _interp_exec(struct exec *e)
1499         {
1500                 struct lrval ret;
1501                 struct value rv, *lrv = NULL;
1502                 rv.type = Tnone;
1503                 if (!e) {
1504                         ret.lval = lrv;
1505                         ret.val = rv;
1506                         return ret;
1507                 }
1508
1509                 switch(e->type) {
1510                 case Xbinode:
1511                 {
1512                         struct binode *b = cast(binode, e);
1513                         struct value left, right, *lleft;
1514                         left.type = right.type = Tnone;
1515                         switch (b->op) {
1516                         ## interp binode cases
1517                         }
1518                         free_value(left); free_value(right);
1519                         break;
1520                 }
1521                 ## interp exec cases
1522                 }
1523                 ret.lval = lrv;
1524                 ret.val = rv;
1525                 return ret;
1526         }
1527
1528 ## Language elements
1529
1530 Each language element needs to be parsed, printed, analysed,
1531 interpreted, and freed.  There are several, so let's just start with
1532 the easy ones and work our way up.
1533
1534 ### Values
1535
1536 We have already met values as separate objects.  When manifest
1537 constants appear in the program text, that must result in an executable
1538 which has a constant value.  So the `val` structure embeds a value in
1539 an executable.
1540
1541 ###### exec type
1542         Xval,
1543
1544 ###### ast
1545         struct val {
1546                 struct exec;
1547                 struct value val;
1548         };
1549
1550 ###### Grammar
1551
1552         $*val
1553         Value ->  True ${
1554                         $0 = new_pos(val, $1);
1555                         $0->val.type = Tbool;
1556                         $0->val.bool = 1;
1557                         }$
1558                 | False ${
1559                         $0 = new_pos(val, $1);
1560                         $0->val.type = Tbool;
1561                         $0->val.bool = 0;
1562                         }$
1563                 | NUMBER ${
1564                         $0 = new_pos(val, $1);
1565                         $0->val.type = Tnum;
1566                         {
1567                         char tail[3];
1568                         if (number_parse($0->val.num, tail, $1.txt) == 0)
1569                                 mpq_init($0->val.num);
1570                                 if (tail[0])
1571                                         tok_err(config2context(config), "error: unsupported number suffix",
1572                                                 &$1);
1573                         }
1574                         }$
1575                 | STRING ${
1576                         $0 = new_pos(val, $1);
1577                         $0->val.type = Tstr;
1578                         {
1579                         char tail[3];
1580                         string_parse(&$1, '\\', &$0->val.str, tail);
1581                         if (tail[0])
1582                                 tok_err(config2context(config), "error: unsupported string suffix",
1583                                         &$1);
1584                         }
1585                         }$
1586                 | MULTI_STRING ${
1587                         $0 = new_pos(val, $1);
1588                         $0->val.type = Tstr;
1589                         {
1590                         char tail[3];
1591                         string_parse(&$1, '\\', &$0->val.str, tail);
1592                         if (tail[0])
1593                                 tok_err(config2context(config), "error: unsupported string suffix",
1594                                         &$1);
1595                         }
1596                         }$
1597
1598 ###### print exec cases
1599         case Xval:
1600         {
1601                 struct val *v = cast(val, e);
1602                 if (v->val.type == Tstr)
1603                         printf("\"");
1604                 print_value(v->val);
1605                 if (v->val.type == Tstr)
1606                         printf("\"");
1607                 break;
1608         }
1609
1610 ###### propagate exec cases
1611                 case Xval:
1612                 {
1613                         struct val *val = cast(val, prog);
1614                         if (!type_compat(type, val->val.type, rules)) {
1615                                 type_err(c, "error: expected %1%r found %2",
1616                                            prog, type, rules, val->val.type);
1617                                 *ok = 0;
1618                         }
1619                         return val->val.type;
1620                 }
1621
1622 ###### interp exec cases
1623         case Xval:
1624                 rv = dup_value(cast(val, e)->val);
1625                 break;
1626
1627 ###### ast functions
1628         static void free_val(struct val *v)
1629         {
1630                 if (!v)
1631                         return;
1632                 free_value(v->val);
1633                 free(v);
1634         }
1635
1636 ###### free exec cases
1637         case Xval: free_val(cast(val, e)); break;
1638
1639 ###### ast functions
1640         // Move all nodes from 'b' to 'rv', reversing the order.
1641         // In 'b' 'left' is a list, and 'right' is the last node.
1642         // In 'rv', left' is the first node and 'right' is a list.
1643         static struct binode *reorder_bilist(struct binode *b)
1644         {
1645                 struct binode *rv = NULL;
1646
1647                 while (b) {
1648                         struct exec *t = b->right;
1649                         b->right = rv;
1650                         rv = b;
1651                         if (b->left)
1652                                 b = cast(binode, b->left);
1653                         else
1654                                 b = NULL;
1655                         rv->left = t;
1656                 }
1657                 return rv;
1658         }
1659
1660 ### Variables
1661
1662 Just as we used a `val` to wrap a value into an `exec`, we similarly
1663 need a `var` to wrap a `variable` into an exec.  While each `val`
1664 contained a copy of the value, each `var` hold a link to the variable
1665 because it really is the same variable no matter where it appears.
1666 When a variable is used, we need to remember to follow the `->merged`
1667 link to find the primary instance.
1668
1669 ###### exec type
1670         Xvar,
1671
1672 ###### ast
1673         struct var {
1674                 struct exec;
1675                 struct variable *var;
1676         };
1677
1678 ###### Grammar
1679
1680         $*var
1681         VariableDecl -> IDENTIFIER : ${ {
1682                 struct variable *v = var_decl(config2context(config), $1.txt);
1683                 $0 = new_pos(var, $1);
1684                 $0->var = v;
1685                 if (v)
1686                         v->where_decl = $0;
1687                 else {
1688                         v = var_ref(config2context(config), $1.txt);
1689                         $0->var = v;
1690                         type_err(config2context(config), "error: variable '%v' redeclared",
1691                                  $0, Tnone, 0, Tnone);
1692                         type_err(config2context(config), "info: this is where '%v' was first declared",
1693                                  v->where_decl, Tnone, 0, Tnone);
1694                 }
1695         } }$
1696             | IDENTIFIER :: ${ {
1697                 struct variable *v = var_decl(config2context(config), $1.txt);
1698                 $0 = new_pos(var, $1);
1699                 $0->var = v;
1700                 if (v) {
1701                         v->where_decl = $0;
1702                         v->constant = 1;
1703                 } else {
1704                         v = var_ref(config2context(config), $1.txt);
1705                         $0->var = v;
1706                         type_err(config2context(config), "error: variable '%v' redeclared",
1707                                  $0, Tnone, 0, Tnone);
1708                         type_err(config2context(config), "info: this is where '%v' was first declared",
1709                                  v->where_decl, Tnone, 0, Tnone);
1710                 }
1711         } }$
1712             | IDENTIFIER : Type ${ {
1713                 struct variable *v = var_decl(config2context(config), $1.txt);
1714                 $0 = new_pos(var, $1);
1715                 $0->var = v;
1716                 if (v) {
1717                         v->where_decl = $0;
1718                         v->where_set = $0;
1719                         v->val = val_prepare($<3);
1720                 } else {
1721                         v = var_ref(config2context(config), $1.txt);
1722                         $0->var = v;
1723                         type_err(config2context(config), "error: variable '%v' redeclared",
1724                                  $0, Tnone, 0, Tnone);
1725                         type_err(config2context(config), "info: this is where '%v' was first declared",
1726                                  v->where_decl, Tnone, 0, Tnone);
1727                 }
1728         } }$
1729             | IDENTIFIER :: Type ${ {
1730                 struct variable *v = var_decl(config2context(config), $1.txt);
1731                 $0 = new_pos(var, $1);
1732                 $0->var = v;
1733                 if (v) {
1734                         v->where_decl = $0;
1735                         v->where_set = $0;
1736                         v->val = val_prepare($<3);
1737                         v->constant = 1;
1738                 } else {
1739                         v = var_ref(config2context(config), $1.txt);
1740                         $0->var = v;
1741                         type_err(config2context(config), "error: variable '%v' redeclared",
1742                                  $0, Tnone, 0, Tnone);
1743                         type_err(config2context(config), "info: this is where '%v' was first declared",
1744                                  v->where_decl, Tnone, 0, Tnone);
1745                 }
1746         } }$
1747
1748         $*exec
1749         Variable -> IDENTIFIER ${ {
1750                 struct variable *v = var_ref(config2context(config), $1.txt);
1751                 $0 = new_pos(var, $1);
1752                 if (v == NULL) {
1753                         /* This might be a label - allocate a var just in case */
1754                         v = var_decl(config2context(config), $1.txt);
1755                         if (v) {
1756                                 v->val = val_prepare(Tlabel);
1757                                 v->val.label = &v->val;
1758                                 v->where_set = $0;
1759                         }
1760                 }
1761                 cast(var, $0)->var = v;
1762         } }$
1763         ## variable grammar
1764
1765         $*type
1766         Type -> IDENTIFIER ${
1767                 $0 = find_type(config2context(config), $1.txt);
1768                 if (!$0) {
1769                         tok_err(config2context(config),
1770                                 "error: undefined type", &$1);
1771
1772                         $0 = Tnone;
1773                 }
1774         }$
1775         ## type grammar
1776
1777 ###### print exec cases
1778         case Xvar:
1779         {
1780                 struct var *v = cast(var, e);
1781                 if (v->var) {
1782                         struct binding *b = v->var->name;
1783                         printf("%.*s", b->name.len, b->name.txt);
1784                 }
1785                 break;
1786         }
1787
1788 ###### format cases
1789         case 'v':
1790                 if (loc->type == Xvar) {
1791                         struct var *v = cast(var, loc);
1792                         if (v->var) {
1793                                 struct binding *b = v->var->name;
1794                                 fprintf(stderr, "%.*s", b->name.len, b->name.txt);
1795                         } else
1796                                 fputs("???", stderr);
1797                 } else
1798                         fputs("NOTVAR", stderr);
1799                 break;
1800
1801 ###### propagate exec cases
1802
1803         case Xvar:
1804         {
1805                 struct var *var = cast(var, prog);
1806                 struct variable *v = var->var;
1807                 if (!v) {
1808                         type_err(c, "%d:BUG: no variable!!", prog, Tnone, 0, Tnone);
1809                         *ok = 0;
1810                         return Tnone;
1811                 }
1812                 if (v->merged)
1813                         v = v->merged;
1814                 if (v->constant && (rules & Rnoconstant)) {
1815                         type_err(c, "error: Cannot assign to a constant: %v",
1816                                  prog, NULL, 0, NULL);
1817                         type_err(c, "info: name was defined as a constant here",
1818                                  v->where_decl, NULL, 0, NULL);
1819                         *ok = 0;
1820                         return v->val.type;
1821                 }
1822                 if (v->val.type == NULL) {
1823                         if (type && *ok != 0) {
1824                                 v->val = val_prepare(type);
1825                                 v->where_set = prog;
1826                                 *ok = 2;
1827                         }
1828                         return type;
1829                 }
1830                 if (!type_compat(type, v->val.type, rules)) {
1831                         type_err(c, "error: expected %1%r but variable '%v' is %2", prog,
1832                                  type, rules, v->val.type);
1833                         type_err(c, "info: this is where '%v' was set to %1", v->where_set,
1834                                  v->val.type, rules, Tnone);
1835                         *ok = 0;
1836                 }
1837                 if (!type)
1838                         return v->val.type;
1839                 return type;
1840         }
1841
1842 ###### interp exec cases
1843         case Xvar:
1844         {
1845                 struct var *var = cast(var, e);
1846                 struct variable *v = var->var;
1847
1848                 if (v->merged)
1849                         v = v->merged;
1850                 lrv = &v->val;
1851                 break;
1852         }
1853
1854 ###### ast functions
1855
1856         static void free_var(struct var *v)
1857         {
1858                 free(v);
1859         }
1860
1861 ###### free exec cases
1862         case Xvar: free_var(cast(var, e)); break;
1863
1864 ### Expressions: Conditional
1865
1866 Our first user of the `binode` will be conditional expressions, which
1867 is a bit odd as they actually have three components.  That will be
1868 handled by having 2 binodes for each expression.  The conditional
1869 expression is the lowest precedence operatior, so it gets to define
1870 what an "Expression" is.  The next level up is "BoolExpr", which
1871 comes next.
1872
1873 Conditional expressions are of the form "value `if` condition `else`
1874 other_value".  There is no associativite with this operator: the
1875 values and conditions can only be other conditional expressions if
1876 they are enclosed in parentheses.  Allowing nesting without
1877 parentheses would be too confusing.
1878
1879 ###### Binode types
1880         CondExpr,
1881
1882 ###### Grammar
1883
1884         $*exec
1885         Expression -> BoolExpr if BoolExpr else BoolExpr ${ {
1886                         struct binode *b1 = new(binode);
1887                         struct binode *b2 = new(binode);
1888                         b1->op = CondExpr;
1889                         b1->left = $<3;
1890                         b1->right = b2;
1891                         b2->op = CondExpr;
1892                         b2->left = $<1;
1893                         b2->right = $<5;
1894                         $0 = b1;
1895                 } }$
1896                 | BoolExpr ${ $0 = $<1; }$
1897
1898 ###### print binode cases
1899
1900         case CondExpr:
1901                 b2 = cast(binode, b->right);
1902                 print_exec(b2->left, -1, 0);
1903                 printf(" if ");
1904                 print_exec(b->left, -1, 0);
1905                 printf(" else ");
1906                 print_exec(b2->right, -1, 0);
1907                 break;
1908
1909 ###### propagate binode cases
1910
1911         case CondExpr: {
1912                 /* cond must be Tbool, others must match */
1913                 struct binode *b2 = cast(binode, b->right);
1914                 struct type *t2;
1915
1916                 propagate_types(b->left, c, ok, Tbool, 0);
1917                 t = propagate_types(b2->left, c, ok, type, Rnolabel);
1918                 t2 = propagate_types(b2->right, c, ok, type ?: t, Rnolabel);
1919                 return t ?: t2;
1920         }
1921
1922 ###### interp binode cases
1923
1924         case CondExpr: {
1925                 struct binode *b2 = cast(binode, b->right);
1926                 left = interp_exec(b->left);
1927                 if (left.bool)
1928                         rv = interp_exec(b2->left);
1929                 else
1930                         rv = interp_exec(b2->right);
1931                 }
1932                 break;
1933
1934 ### Expressions: Boolean
1935
1936 The next class of expressions to use the `binode` will be Boolean
1937 expressions.  As I haven't implemented precedence in the parser
1938 generator yet, we need different names for each precedence level used
1939 by expressions.  The outer most or lowest level precedence are
1940 conditional expressions are Boolean operators which form an `BoolExpr`
1941 out of `BTerm`s and `BFact`s.  As well as `or` `and`, and `not` we
1942 have `and then` and `or else` which only evaluate the second operand
1943 if the result would make a difference.
1944
1945 ###### Binode types
1946         And,
1947         AndThen,
1948         Or,
1949         OrElse,
1950         Not,
1951
1952 ###### Grammar
1953
1954         $*exec
1955         BoolExpr -> BoolExpr or BTerm ${ {
1956                         struct binode *b = new(binode);
1957                         b->op = Or;
1958                         b->left = $<1;
1959                         b->right = $<3;
1960                         $0 = b;
1961                 } }$
1962                 | BoolExpr or else BTerm ${ {
1963                         struct binode *b = new(binode);
1964                         b->op = OrElse;
1965                         b->left = $<1;
1966                         b->right = $<4;
1967                         $0 = b;
1968                 } }$
1969                 | BTerm ${ $0 = $<1; }$
1970
1971         BTerm -> BTerm and BFact ${ {
1972                         struct binode *b = new(binode);
1973                         b->op = And;
1974                         b->left = $<1;
1975                         b->right = $<3;
1976                         $0 = b;
1977                 } }$
1978                 | BTerm and then BFact ${ {
1979                         struct binode *b = new(binode);
1980                         b->op = AndThen;
1981                         b->left = $<1;
1982                         b->right = $<4;
1983                         $0 = b;
1984                 } }$
1985                 | BFact ${ $0 = $<1; }$
1986
1987         BFact -> not BFact ${ {
1988                         struct binode *b = new(binode);
1989                         b->op = Not;
1990                         b->right = $<2;
1991                         $0 = b;
1992                 } }$
1993                 ## other BFact
1994
1995 ###### print binode cases
1996         case And:
1997                 print_exec(b->left, -1, 0);
1998                 printf(" and ");
1999                 print_exec(b->right, -1, 0);
2000                 break;
2001         case AndThen:
2002                 print_exec(b->left, -1, 0);
2003                 printf(" and then ");
2004                 print_exec(b->right, -1, 0);
2005                 break;
2006         case Or:
2007                 print_exec(b->left, -1, 0);
2008                 printf(" or ");
2009                 print_exec(b->right, -1, 0);
2010                 break;
2011         case OrElse:
2012                 print_exec(b->left, -1, 0);
2013                 printf(" or else ");
2014                 print_exec(b->right, -1, 0);
2015                 break;
2016         case Not:
2017                 printf("not ");
2018                 print_exec(b->right, -1, 0);
2019                 break;
2020
2021 ###### propagate binode cases
2022         case And:
2023         case AndThen:
2024         case Or:
2025         case OrElse:
2026         case Not:
2027                 /* both must be Tbool, result is Tbool */
2028                 propagate_types(b->left, c, ok, Tbool, 0);
2029                 propagate_types(b->right, c, ok, Tbool, 0);
2030                 if (type && type != Tbool) {
2031                         type_err(c, "error: %1 operation found where %2 expected", prog,
2032                                    Tbool, 0, type);
2033                         *ok = 0;
2034                 }
2035                 return Tbool;
2036
2037 ###### interp binode cases
2038         case And:
2039                 rv = interp_exec(b->left);
2040                 right = interp_exec(b->right);
2041                 rv.bool = rv.bool && right.bool;
2042                 break;
2043         case AndThen:
2044                 rv = interp_exec(b->left);
2045                 if (rv.bool)
2046                         rv = interp_exec(b->right);
2047                 break;
2048         case Or:
2049                 rv = interp_exec(b->left);
2050                 right = interp_exec(b->right);
2051                 rv.bool = rv.bool || right.bool;
2052                 break;
2053         case OrElse:
2054                 rv = interp_exec(b->left);
2055                 if (!rv.bool)
2056                         rv = interp_exec(b->right);
2057                 break;
2058         case Not:
2059                 rv = interp_exec(b->right);
2060                 rv.bool = !rv.bool;
2061                 break;
2062
2063 ### Expressions: Comparison
2064
2065 Of slightly higher precedence that Boolean expressions are
2066 Comparisons.
2067 A comparison takes arguments of any type, but the two types must be
2068 the same.
2069
2070 To simplify the parsing we introduce an `eop` which can record an
2071 expression operator.
2072
2073 ###### ast
2074         struct eop {
2075                 enum Btype op;
2076         };
2077
2078 ###### ast functions
2079         static void free_eop(struct eop *e)
2080         {
2081                 if (e)
2082                         free(e);
2083         }
2084
2085 ###### Binode types
2086         Less,
2087         Gtr,
2088         LessEq,
2089         GtrEq,
2090         Eql,
2091         NEql,
2092
2093 ###### other BFact
2094         | Expr CMPop Expr ${ {
2095                         struct binode *b = new(binode);
2096                         b->op = $2.op;
2097                         b->left = $<1;
2098                         b->right = $<3;
2099                         $0 = b;
2100         } }$
2101         | Expr ${ $0 = $<1; }$
2102
2103 ###### Grammar
2104
2105         $eop
2106         CMPop ->   < ${ $0.op = Less; }$
2107                 |  > ${ $0.op = Gtr; }$
2108                 |  <= ${ $0.op = LessEq; }$
2109                 |  >= ${ $0.op = GtrEq; }$
2110                 |  == ${ $0.op = Eql; }$
2111                 |  != ${ $0.op = NEql; }$
2112
2113 ###### print binode cases
2114
2115         case Less:
2116         case LessEq:
2117         case Gtr:
2118         case GtrEq:
2119         case Eql:
2120         case NEql:
2121                 print_exec(b->left, -1, 0);
2122                 switch(b->op) {
2123                 case Less:   printf(" < "); break;
2124                 case LessEq: printf(" <= "); break;
2125                 case Gtr:    printf(" > "); break;
2126                 case GtrEq:  printf(" >= "); break;
2127                 case Eql:    printf(" == "); break;
2128                 case NEql:   printf(" != "); break;
2129                 default: abort();
2130                 }
2131                 print_exec(b->right, -1, 0);
2132                 break;
2133
2134 ###### propagate binode cases
2135         case Less:
2136         case LessEq:
2137         case Gtr:
2138         case GtrEq:
2139         case Eql:
2140         case NEql:
2141                 /* Both must match but not be labels, result is Tbool */
2142                 t = propagate_types(b->left, c, ok, NULL, Rnolabel);
2143                 if (t)
2144                         propagate_types(b->right, c, ok, t, 0);
2145                 else {
2146                         t = propagate_types(b->right, c, ok, NULL, Rnolabel);
2147                         if (t)
2148                                 t = propagate_types(b->left, c, ok, t, 0);
2149                 }
2150                 if (!type_compat(type, Tbool, 0)) {
2151                         type_err(c, "error: Comparison returns %1 but %2 expected", prog,
2152                                     Tbool, rules, type);
2153                         *ok = 0;
2154                 }
2155                 return Tbool;
2156
2157 ###### interp binode cases
2158         case Less:
2159         case LessEq:
2160         case Gtr:
2161         case GtrEq:
2162         case Eql:
2163         case NEql:
2164         {
2165                 int cmp;
2166                 left = interp_exec(b->left);
2167                 right = interp_exec(b->right);
2168                 cmp = value_cmp(left, right);
2169                 rv.type = Tbool;
2170                 switch (b->op) {
2171                 case Less:      rv.bool = cmp <  0; break;
2172                 case LessEq:    rv.bool = cmp <= 0; break;
2173                 case Gtr:       rv.bool = cmp >  0; break;
2174                 case GtrEq:     rv.bool = cmp >= 0; break;
2175                 case Eql:       rv.bool = cmp == 0; break;
2176                 case NEql:      rv.bool = cmp != 0; break;
2177                 default: rv.bool = 0; break;
2178                 }
2179                 break;
2180         }
2181
2182 ### Expressions: The rest
2183
2184 The remaining expressions with the highest precedence are arithmetic
2185 and string concatenation.  They are `Expr`, `Term`, and `Factor`.
2186 The `Factor` is where the `Value` and `Variable` that we already have
2187 are included.
2188
2189 `+` and `-` are both infix and prefix operations (where they are
2190 absolute value and negation).  These have different operator names.
2191
2192 We also have a 'Bracket' operator which records where parentheses were
2193 found.  This makes it easy to reproduce these when printing.  Once
2194 precedence is handled better I might be able to discard this.
2195
2196 ###### Binode types
2197         Plus, Minus,
2198         Times, Divide, Rem,
2199         Concat,
2200         Absolute, Negate,
2201         Bracket,
2202
2203 ###### Grammar
2204
2205         $*exec
2206         Expr -> Expr Eop Term ${ {
2207                         struct binode *b = new(binode);
2208                         b->op = $2.op;
2209                         b->left = $<1;
2210                         b->right = $<3;
2211                         $0 = b;
2212                 } }$
2213                 | Term ${ $0 = $<1; }$
2214
2215         Term -> Term Top Factor ${ {
2216                         struct binode *b = new(binode);
2217                         b->op = $2.op;
2218                         b->left = $<1;
2219                         b->right = $<3;
2220                         $0 = b;
2221                 } }$
2222                 | Factor ${ $0 = $<1; }$
2223
2224         Factor -> ( Expression ) ${ {
2225                         struct binode *b = new_pos(binode, $1);
2226                         b->op = Bracket;
2227                         b->right = $<2;
2228                         $0 = b;
2229                 } }$
2230                 | Uop Factor ${ {
2231                         struct binode *b = new(binode);
2232                         b->op = $1.op;
2233                         b->right = $<2;
2234                         $0 = b;
2235                 } }$
2236                 | Value ${ $0 = $<1; }$
2237                 | Variable ${ $0 = $<1; }$
2238
2239         $eop
2240         Eop ->    + ${ $0.op = Plus; }$
2241                 | - ${ $0.op = Minus; }$
2242
2243         Uop ->    + ${ $0.op = Absolute; }$
2244                 | - ${ $0.op = Negate; }$
2245
2246         Top ->    * ${ $0.op = Times; }$
2247                 | / ${ $0.op = Divide; }$
2248                 | % ${ $0.op = Rem; }$
2249                 | ++ ${ $0.op = Concat; }$
2250
2251 ###### print binode cases
2252         case Plus:
2253         case Minus:
2254         case Times:
2255         case Divide:
2256         case Concat:
2257         case Rem:
2258                 print_exec(b->left, indent, 0);
2259                 switch(b->op) {
2260                 case Plus:   fputs(" + ", stdout); break;
2261                 case Minus:  fputs(" - ", stdout); break;
2262                 case Times:  fputs(" * ", stdout); break;
2263                 case Divide: fputs(" / ", stdout); break;
2264                 case Rem:    fputs(" % ", stdout); break;
2265                 case Concat: fputs(" ++ ", stdout); break;
2266                 default: abort();
2267                 }
2268                 print_exec(b->right, indent, 0);
2269                 break;
2270         case Absolute:
2271                 printf("+");
2272                 print_exec(b->right, indent, 0);
2273                 break;
2274         case Negate:
2275                 printf("-");
2276                 print_exec(b->right, indent, 0);
2277                 break;
2278         case Bracket:
2279                 printf("(");
2280                 print_exec(b->right, indent, 0);
2281                 printf(")");
2282                 break;
2283
2284 ###### propagate binode cases
2285         case Plus:
2286         case Minus:
2287         case Times:
2288         case Rem:
2289         case Divide:
2290                 /* both must be numbers, result is Tnum */
2291         case Absolute:
2292         case Negate:
2293                 /* as propagate_types ignores a NULL,
2294                  * unary ops fit here too */
2295                 propagate_types(b->left, c, ok, Tnum, 0);
2296                 propagate_types(b->right, c, ok, Tnum, 0);
2297                 if (!type_compat(type, Tnum, 0)) {
2298                         type_err(c, "error: Arithmetic returns %1 but %2 expected", prog,
2299                                    Tnum, rules, type);
2300                         *ok = 0;
2301                 }
2302                 return Tnum;
2303
2304         case Concat:
2305                 /* both must be Tstr, result is Tstr */
2306                 propagate_types(b->left, c, ok, Tstr, 0);
2307                 propagate_types(b->right, c, ok, Tstr, 0);
2308                 if (!type_compat(type, Tstr, 0)) {
2309                         type_err(c, "error: Concat returns %1 but %2 expected", prog,
2310                                    Tstr, rules, type);
2311                         *ok = 0;
2312                 }
2313                 return Tstr;
2314
2315         case Bracket:
2316                 return propagate_types(b->right, c, ok, type, 0);
2317
2318 ###### interp binode cases
2319
2320         case Plus:
2321                 rv = interp_exec(b->left);
2322                 right = interp_exec(b->right);
2323                 mpq_add(rv.num, rv.num, right.num);
2324                 break;
2325         case Minus:
2326                 rv = interp_exec(b->left);
2327                 right = interp_exec(b->right);
2328                 mpq_sub(rv.num, rv.num, right.num);
2329                 break;
2330         case Times:
2331                 rv = interp_exec(b->left);
2332                 right = interp_exec(b->right);
2333                 mpq_mul(rv.num, rv.num, right.num);
2334                 break;
2335         case Divide:
2336                 rv = interp_exec(b->left);
2337                 right = interp_exec(b->right);
2338                 mpq_div(rv.num, rv.num, right.num);
2339                 break;
2340         case Rem: {
2341                 mpz_t l, r, rem;
2342
2343                 left = interp_exec(b->left);
2344                 right = interp_exec(b->right);
2345                 mpz_init(l); mpz_init(r); mpz_init(rem);
2346                 mpz_tdiv_q(l, mpq_numref(left.num), mpq_denref(left.num));
2347                 mpz_tdiv_q(r, mpq_numref(right.num), mpq_denref(right.num));
2348                 mpz_tdiv_r(rem, l, r);
2349                 rv = val_init(Tnum);
2350                 mpq_set_z(rv.num, rem);
2351                 mpz_clear(r); mpz_clear(l); mpz_clear(rem);
2352                 break;
2353         }
2354         case Negate:
2355                 rv = interp_exec(b->right);
2356                 mpq_neg(rv.num, rv.num);
2357                 break;
2358         case Absolute:
2359                 rv = interp_exec(b->right);
2360                 mpq_abs(rv.num, rv.num);
2361                 break;
2362         case Bracket:
2363                 rv = interp_exec(b->right);
2364                 break;
2365         case Concat:
2366                 left = interp_exec(b->left);
2367                 right = interp_exec(b->right);
2368                 rv.type = Tstr;
2369                 rv.str = text_join(left.str, right.str);
2370                 break;
2371
2372
2373 ###### value functions
2374
2375         static struct text text_join(struct text a, struct text b)
2376         {
2377                 struct text rv;
2378                 rv.len = a.len + b.len;
2379                 rv.txt = malloc(rv.len);
2380                 memcpy(rv.txt, a.txt, a.len);
2381                 memcpy(rv.txt+a.len, b.txt, b.len);
2382                 return rv;
2383         }
2384
2385
2386 ### Blocks, Statements, and Statement lists.
2387
2388 Now that we have expressions out of the way we need to turn to
2389 statements.  There are simple statements and more complex statements.
2390 Simple statements do not contain newlines, complex statements do.
2391
2392 Statements often come in sequences and we have corresponding simple
2393 statement lists and complex statement lists.
2394 The former comprise only simple statements separated by semicolons.
2395 The later comprise complex statements and simple statement lists.  They are
2396 separated by newlines.  Thus the semicolon is only used to separate
2397 simple statements on the one line.  This may be overly restrictive,
2398 but I'm not sure I ever want a complex statement to share a line with
2399 anything else.
2400
2401 Note that a simple statement list can still use multiple lines if
2402 subsequent lines are indented, so
2403
2404 ###### Example: wrapped simple statement list
2405
2406         a = b; c = d;
2407            e = f; print g
2408
2409 is a single simple statement list.  This might allow room for
2410 confusion, so I'm not set on it yet.
2411
2412 A simple statement list needs no extra syntax.  A complex statement
2413 list has two syntactic forms.  It can be enclosed in braces (much like
2414 C blocks), or it can be introduced by a colon and continue until an
2415 unindented newline (much like Python blocks).  With this extra syntax
2416 it is referred to as a block.
2417
2418 Note that a block does not have to include any newlines if it only
2419 contains simple statements.  So both of:
2420
2421         if condition: a=b; d=f
2422
2423         if condition { a=b; print f }
2424
2425 are valid.
2426
2427 In either case the list is constructed from a `binode` list with
2428 `Block` as the operator.  When parsing the list it is most convenient
2429 to append to the end, so a list is a list and a statement.  When using
2430 the list it is more convenient to consider a list to be a statement
2431 and a list.  So we need a function to re-order a list.
2432 `reorder_bilist` serves this purpose.
2433
2434 The only stand-alone statement we introduce at this stage is `pass`
2435 which does nothing and is represented as a `NULL` pointer in a `Block`
2436 list.  Other stand-alone statements will follow once the infrastructure
2437 is in-place.
2438
2439 ###### Binode types
2440         Block,
2441
2442 ###### Grammar
2443
2444         $void
2445         OptNL -> Newlines
2446                 |
2447
2448         Newlines -> NEWLINE
2449                 | Newlines NEWLINE
2450
2451         $*binode
2452         Open -> {
2453                 | NEWLINE {
2454         Close -> }
2455                 | NEWLINE }
2456         Block -> Open Statementlist Close ${ $0 = $<2; }$
2457                 | Open Newlines Statementlist Close ${ $0 = $<3; }$
2458                 | Open SimpleStatements } ${ $0 = reorder_bilist($<2); }$
2459                 | Open Newlines SimpleStatements } ${ $0 = reorder_bilist($<3); }$
2460                 | : Statementlist ${ $0 = $<2; }$
2461                 | : SimpleStatements ${ $0 = reorder_bilist($<2); }$
2462
2463         Statementlist -> ComplexStatements ${ $0 = reorder_bilist($<1); }$
2464
2465         ComplexStatements -> ComplexStatements ComplexStatement ${
2466                 $0 = new(binode);
2467                 $0->op = Block;
2468                 $0->left = $<1;
2469                 $0->right = $<2;
2470                 }$
2471                 | ComplexStatements NEWLINE ${ $0 = $<1; }$
2472                 | ComplexStatement ${
2473                 $0 = new(binode);
2474                 $0->op = Block;
2475                 $0->left = NULL;
2476                 $0->right = $<1;
2477                 }$
2478
2479         $*exec
2480         ComplexStatement -> SimpleStatements NEWLINE ${
2481                         $0 = reorder_bilist($<1);
2482                         }$
2483                 ## ComplexStatement Grammar
2484
2485         $*binode
2486         SimpleStatements -> SimpleStatements ; SimpleStatement ${
2487                         $0 = new(binode);
2488                         $0->op = Block;
2489                         $0->left = $<1;
2490                         $0->right = $<3;
2491                         }$
2492                 | SimpleStatement ${
2493                         $0 = new(binode);
2494                         $0->op = Block;
2495                         $0->left = NULL;
2496                         $0->right = $<1;
2497                         }$
2498                 | SimpleStatements ; ${ $0 = $<1; }$
2499
2500         SimpleStatement -> pass ${ $0 = NULL; }$
2501                 ## SimpleStatement Grammar
2502
2503 ###### print binode cases
2504         case Block:
2505                 if (indent < 0) {
2506                         // simple statement
2507                         if (b->left == NULL)
2508                                 printf("pass");
2509                         else
2510                                 print_exec(b->left, indent, 0);
2511                         if (b->right) {
2512                                 printf("; ");
2513                                 print_exec(b->right, indent, 0);
2514                         }
2515                 } else {
2516                         // block, one per line
2517                         if (b->left == NULL)
2518                                 do_indent(indent, "pass\n");
2519                         else
2520                                 print_exec(b->left, indent, bracket);
2521                         if (b->right)
2522                                 print_exec(b->right, indent, bracket);
2523                 }
2524                 break;
2525
2526 ###### propagate binode cases
2527         case Block:
2528         {
2529                 /* If any statement returns something other than Tnone
2530                  * or Tbool then all such must return same type.
2531                  * As each statement may be Tnone or something else,
2532                  * we must always pass NULL (unknown) down, otherwise an incorrect
2533                  * error might occur.  We never return Tnone unless it is
2534                  * passed in.
2535                  */
2536                 struct binode *e;
2537
2538                 for (e = b; e; e = cast(binode, e->right)) {
2539                         t = propagate_types(e->left, c, ok, NULL, rules);
2540                         if ((rules & Rboolok) && t == Tbool)
2541                                 t = NULL;
2542                         if (t && t != Tnone && t != Tbool) {
2543                                 if (!type)
2544                                         type = t;
2545                                 else if (t != type) {
2546                                         type_err(c, "error: expected %1%r, found %2",
2547                                                  e->left, type, rules, t);
2548                                         *ok = 0;
2549                                 }
2550                         }
2551                 }
2552                 return type;
2553         }
2554
2555 ###### interp binode cases
2556         case Block:
2557                 while (rv.type == Tnone &&
2558                        b) {
2559                         if (b->left)
2560                                 rv = interp_exec(b->left);
2561                         b = cast(binode, b->right);
2562                 }
2563                 break;
2564
2565 ### The Print statement
2566
2567 `print` is a simple statement that takes a comma-separated list of
2568 expressions and prints the values separated by spaces and terminated
2569 by a newline.  No control of formatting is possible.
2570
2571 `print` faces the same list-ordering issue as blocks, and uses the
2572 same solution.
2573
2574 ###### Binode types
2575         Print,
2576
2577 ###### SimpleStatement Grammar
2578
2579         | print ExpressionList ${
2580                 $0 = reorder_bilist($<2);
2581         }$
2582         | print ExpressionList , ${
2583                 $0 = new(binode);
2584                 $0->op = Print;
2585                 $0->right = NULL;
2586                 $0->left = $<2;
2587                 $0 = reorder_bilist($0);
2588         }$
2589         | print ${
2590                 $0 = new(binode);
2591                 $0->op = Print;
2592                 $0->right = NULL;
2593         }$
2594
2595 ###### Grammar
2596
2597         $*binode
2598         ExpressionList -> ExpressionList , Expression ${
2599                 $0 = new(binode);
2600                 $0->op = Print;
2601                 $0->left = $<1;
2602                 $0->right = $<3;
2603                 }$
2604                 | Expression ${
2605                         $0 = new(binode);
2606                         $0->op = Print;
2607                         $0->left = NULL;
2608                         $0->right = $<1;
2609                 }$
2610
2611 ###### print binode cases
2612
2613         case Print:
2614                 do_indent(indent, "print");
2615                 while (b) {
2616                         if (b->left) {
2617                                 printf(" ");
2618                                 print_exec(b->left, -1, 0);
2619                                 if (b->right)
2620                                         printf(",");
2621                         }
2622                         b = cast(binode, b->right);
2623                 }
2624                 if (indent >= 0)
2625                         printf("\n");
2626                 break;
2627
2628 ###### propagate binode cases
2629
2630         case Print:
2631                 /* don't care but all must be consistent */
2632                 propagate_types(b->left, c, ok, NULL, Rnolabel);
2633                 propagate_types(b->right, c, ok, NULL, Rnolabel);
2634                 break;
2635
2636 ###### interp binode cases
2637
2638         case Print:
2639         {
2640                 char sep = 0;
2641                 int eol = 1;
2642                 for ( ; b; b = cast(binode, b->right))
2643                         if (b->left) {
2644                                 if (sep)
2645                                         putchar(sep);
2646                                 left = interp_exec(b->left);
2647                                 print_value(left);
2648                                 free_value(left);
2649                                 if (b->right)
2650                                         sep = ' ';
2651                         } else if (sep)
2652                                 eol = 0;
2653                 left.type = Tnone;
2654                 if (eol)
2655                         printf("\n");
2656                 break;
2657         }
2658
2659 ###### Assignment statement
2660
2661 An assignment will assign a value to a variable, providing it hasn't
2662 be declared as a constant.  The analysis phase ensures that the type
2663 will be correct so the interpreter just needs to perform the
2664 calculation.  There is a form of assignment which declares a new
2665 variable as well as assigning a value.  If a name is assigned before
2666 it is declared, and error will be raised as the name is created as
2667 `Tlabel` and it is illegal to assign to such names.
2668
2669 ###### Binode types
2670         Assign,
2671         Declare,
2672
2673 ###### SimpleStatement Grammar
2674         | Variable = Expression ${
2675                         $0 = new(binode);
2676                         $0->op = Assign;
2677                         $0->left = $<1;
2678                         $0->right = $<3;
2679                 }$
2680         | VariableDecl = Expression ${
2681                         $0 = new(binode);
2682                         $0->op = Declare;
2683                         $0->left = $<1;
2684                         $0->right =$<3;
2685                 }$
2686
2687         | VariableDecl ${
2688                         if ($1->var->where_set == NULL) {
2689                                 type_err(config2context(config), "Variable declared with no type or value: %v",
2690                                          $1, NULL, 0, NULL);
2691                         } else {
2692                                 $0 = new(binode);
2693                                 $0->op = Declare;
2694                                 $0->left = $<1;
2695                                 $0->right = NULL;
2696                         }
2697                 }$
2698
2699 ###### print binode cases
2700
2701         case Assign:
2702                 do_indent(indent, "");
2703                 print_exec(b->left, indent, 0);
2704                 printf(" = ");
2705                 print_exec(b->right, indent, 0);
2706                 if (indent >= 0)
2707                         printf("\n");
2708                 break;
2709
2710         case Declare:
2711                 {
2712                 struct variable *v = cast(var, b->left)->var;
2713                 do_indent(indent, "");
2714                 print_exec(b->left, indent, 0);
2715                 if (cast(var, b->left)->var->constant) {
2716                         if (v->where_decl == v->where_set) {
2717                                 printf("::");
2718                                 type_print(v->val.type, stdout);
2719                                 printf(" ");
2720                         } else
2721                                 printf(" ::");
2722                 } else {
2723                         if (v->where_decl == v->where_set) {
2724                                 printf(":");
2725                                 type_print(v->val.type, stdout);
2726                                 printf(" ");
2727                         } else
2728                                 printf(" :");
2729                 }
2730                 if (b->right) {
2731                         printf("= ");
2732                         print_exec(b->right, indent, 0);
2733                 }
2734                 if (indent >= 0)
2735                         printf("\n");
2736                 }
2737                 break;
2738
2739 ###### propagate binode cases
2740
2741         case Assign:
2742         case Declare:
2743                 /* Both must match and not be labels,
2744                  * Type must support 'dup',
2745                  * For Assign, left must not be constant.
2746                  * result is Tnone
2747                  */
2748                 t = propagate_types(b->left, c, ok, NULL,
2749                                     Rnolabel | (b->op == Assign ? Rnoconstant : 0));
2750                 if (!b->right)
2751                         return Tnone;
2752
2753                 if (t) {
2754                         if (propagate_types(b->right, c, ok, t, 0) != t)
2755                                 if (b->left->type == Xvar)
2756                                         type_err(c, "info: variable '%v' was set as %1 here.",
2757                                                  cast(var, b->left)->var->where_set, t, rules, Tnone);
2758                 } else {
2759                         t = propagate_types(b->right, c, ok, NULL, Rnolabel);
2760                         if (t)
2761                                 propagate_types(b->left, c, ok, t,
2762                                                 (b->op == Assign ? Rnoconstant : 0));
2763                 }
2764                 if (t && t->dup == NULL) {
2765                         type_err(c, "error: cannot assign value of type %1", b, t, 0, NULL);
2766                         *ok = 0;
2767                 }
2768                 return Tnone;
2769
2770                 break;
2771
2772 ###### interp binode cases
2773
2774         case Assign:
2775                 lleft = linterp_exec(b->left);
2776                 right = interp_exec(b->right);
2777                 if (lleft) {
2778                         free_value(*lleft);
2779                         *lleft = right;
2780                 } else
2781                         free_value(right);
2782                 right.type = NULL;
2783                 break;
2784
2785         case Declare:
2786         {
2787                 struct variable *v = cast(var, b->left)->var;
2788                 if (v->merged)
2789                         v = v->merged;
2790                 if (b->right)
2791                         right = interp_exec(b->right);
2792                 else
2793                         right = val_init(v->val.type);
2794                 free_value(v->val);
2795                 v->val = right;
2796                 right.type = NULL;
2797                 break;
2798         }
2799
2800 ### The `use` statement
2801
2802 The `use` statement is the last "simple" statement.  It is needed when
2803 the condition in a conditional statement is a block.  `use` works much
2804 like `return` in C, but only completes the `condition`, not the whole
2805 function.
2806
2807 ###### Binode types
2808         Use,
2809
2810 ###### SimpleStatement Grammar
2811         | use Expression ${
2812                 $0 = new_pos(binode, $1);
2813                 $0->op = Use;
2814                 $0->right = $<2;
2815         }$
2816
2817 ###### print binode cases
2818
2819         case Use:
2820                 do_indent(indent, "use ");
2821                 print_exec(b->right, -1, 0);
2822                 if (indent >= 0)
2823                         printf("\n");
2824                 break;
2825
2826 ###### propagate binode cases
2827
2828         case Use:
2829                 /* result matches value */
2830                 return propagate_types(b->right, c, ok, type, 0);
2831
2832 ###### interp binode cases
2833
2834         case Use:
2835                 rv = interp_exec(b->right);
2836                 break;
2837
2838 ### The Conditional Statement
2839
2840 This is the biggy and currently the only complex statement.  This
2841 subsumes `if`, `while`, `do/while`, `switch`, and some parts of `for`.
2842 It is comprised of a number of parts, all of which are optional though
2843 set combinations apply.  Each part is (usually) a key word (`then` is
2844 sometimes optional) followed by either an expression or a code block,
2845 except the `casepart` which is a "key word and an expression" followed
2846 by a code block.  The code-block option is valid for all parts and,
2847 where an expression is also allowed, the code block can use the `use`
2848 statement to report a value.  If the code block does not report a value
2849 the effect is similar to reporting `True`.
2850
2851 The `else` and `case` parts, as well as `then` when combined with
2852 `if`, can contain a `use` statement which will apply to some
2853 containing conditional statement. `for` parts, `do` parts and `then`
2854 parts used with `for` can never contain a `use`, except in some
2855 subordinate conditional statement.
2856
2857 If there is a `forpart`, it is executed first, only once.
2858 If there is a `dopart`, then it is executed repeatedly providing
2859 always that the `condpart` or `cond`, if present, does not return a non-True
2860 value.  `condpart` can fail to return any value if it simply executes
2861 to completion.  This is treated the same as returning `True`.
2862
2863 If there is a `thenpart` it will be executed whenever the `condpart`
2864 or `cond` returns True (or does not return any value), but this will happen
2865 *after* `dopart` (when present).
2866
2867 If `elsepart` is present it will be executed at most once when the
2868 condition returns `False` or some value that isn't `True` and isn't
2869 matched by any `casepart`.  If there are any `casepart`s, they will be
2870 executed when the condition returns a matching value.
2871
2872 The particular sorts of values allowed in case parts has not yet been
2873 determined in the language design, so nothing is prohibited.
2874
2875 The various blocks in this complex statement potentially provide scope
2876 for variables as described earlier.  Each such block must include the
2877 "OpenScope" nonterminal before parsing the block, and must call
2878 `var_block_close()` when closing the block.
2879
2880 The code following "`if`", "`switch`" and "`for`" does not get its own
2881 scope, but is in a scope covering the whole statement, so names
2882 declared there cannot be redeclared elsewhere.  Similarly the
2883 condition following "`while`" is in a scope the covers the body
2884 ("`do`" part) of the loop, and which does not allow conditional scope
2885 extension.  Code following "`then`" (both looping and non-looping),
2886 "`else`" and "`case`" each get their own local scope.
2887
2888 The type requirements on the code block in a `whilepart` are quite
2889 unusal.  It is allowed to return a value of some identifiable type, in
2890 which case the loop aborts and an appropriate `casepart` is run, or it
2891 can return a Boolean, in which case the loop either continues to the
2892 `dopart` (on `True`) or aborts and runs the `elsepart` (on `False`).
2893 This is different both from the `ifpart` code block which is expected to
2894 return a Boolean, or the `switchpart` code block which is expected to
2895 return the same type as the casepart values.  The correct analysis of
2896 the type of the `whilepart` code block is the reason for the
2897 `Rboolok` flag which is passed to `propagate_types()`.
2898
2899 The `cond_statement` cannot fit into a `binode` so a new `exec` is
2900 defined.
2901
2902 ###### exec type
2903         Xcond_statement,
2904
2905 ###### ast
2906         struct casepart {
2907                 struct exec *value;
2908                 struct exec *action;
2909                 struct casepart *next;
2910         };
2911         struct cond_statement {
2912                 struct exec;
2913                 struct exec *forpart, *condpart, *dopart, *thenpart, *elsepart;
2914                 struct casepart *casepart;
2915         };
2916
2917 ###### ast functions
2918
2919         static void free_casepart(struct casepart *cp)
2920         {
2921                 while (cp) {
2922                         struct casepart *t;
2923                         free_exec(cp->value);
2924                         free_exec(cp->action);
2925                         t = cp->next;
2926                         free(cp);
2927                         cp = t;
2928                 }
2929         }
2930
2931         static void free_cond_statement(struct cond_statement *s)
2932         {
2933                 if (!s)
2934                         return;
2935                 free_exec(s->forpart);
2936                 free_exec(s->condpart);
2937                 free_exec(s->dopart);
2938                 free_exec(s->thenpart);
2939                 free_exec(s->elsepart);
2940                 free_casepart(s->casepart);
2941                 free(s);
2942         }
2943
2944 ###### free exec cases
2945         case Xcond_statement: free_cond_statement(cast(cond_statement, e)); break;
2946
2947 ###### ComplexStatement Grammar
2948         | CondStatement ${ $0 = $<1; }$
2949
2950 ###### Grammar
2951
2952         $*cond_statement
2953         // both ForThen and Whilepart open scopes, and CondSuffix only
2954         // closes one - so in the first branch here we have another to close.
2955         CondStatement -> ForThen WhilePart CondSuffix ${
2956                         $0 = $<3;
2957                         $0->forpart = $1.forpart; $1.forpart = NULL;
2958                         $0->thenpart = $1.thenpart; $1.thenpart = NULL;
2959                         $0->condpart = $2.condpart; $2.condpart = NULL;
2960                         $0->dopart = $2.dopart; $2.dopart = NULL;
2961                         var_block_close(config2context(config), CloseSequential);
2962                         }$
2963                 | WhilePart CondSuffix ${
2964                         $0 = $<2;
2965                         $0->condpart = $1.condpart; $1.condpart = NULL;
2966                         $0->dopart = $1.dopart; $1.dopart = NULL;
2967                         }$
2968                 | SwitchPart CondSuffix ${
2969                         $0 = $<2;
2970                         $0->condpart = $<1;
2971                         }$
2972                 | IfPart IfSuffix ${
2973                         $0 = $<2;
2974                         $0->condpart = $1.condpart; $1.condpart = NULL;
2975                         $0->thenpart = $1.thenpart; $1.thenpart = NULL;
2976                         // This is where we close an "if" statement
2977                         var_block_close(config2context(config), CloseSequential);
2978                         }$
2979
2980         CondSuffix -> IfSuffix ${
2981                         $0 = $<1;
2982                         // This is where we close scope of the whole
2983                         // "for" or "while" statement
2984                         var_block_close(config2context(config), CloseSequential);
2985                 }$
2986                 | CasePart CondSuffix ${
2987                         $0 = $<2;
2988                         $1->next = $0->casepart;
2989                         $0->casepart = $<1;
2990                 }$
2991
2992         $*casepart
2993         CasePart -> Newlines case Expression OpenScope Block ${
2994                         $0 = calloc(1,sizeof(struct casepart));
2995                         $0->value = $<3;
2996                         $0->action = $<5;
2997                         var_block_close(config2context(config), CloseParallel);
2998                 }$
2999                 | case Expression OpenScope Block ${
3000                         $0 = calloc(1,sizeof(struct casepart));
3001                         $0->value = $<2;
3002                         $0->action = $<4;
3003                         var_block_close(config2context(config), CloseParallel);
3004                 }$
3005
3006         $*cond_statement
3007         IfSuffix -> Newlines ${ $0 = new(cond_statement); }$
3008                 | Newlines else OpenScope Block ${
3009                         $0 = new(cond_statement);
3010                         $0->elsepart = $<4;
3011                         var_block_close(config2context(config), CloseElse);
3012                 }$
3013                 | else OpenScope Block ${
3014                         $0 = new(cond_statement);
3015                         $0->elsepart = $<3;
3016                         var_block_close(config2context(config), CloseElse);
3017                 }$
3018                 | Newlines else OpenScope CondStatement ${
3019                         $0 = new(cond_statement);
3020                         $0->elsepart = $<4;
3021                         var_block_close(config2context(config), CloseElse);
3022                 }$
3023                 | else OpenScope CondStatement ${
3024                         $0 = new(cond_statement);
3025                         $0->elsepart = $<3;
3026                         var_block_close(config2context(config), CloseElse);
3027                 }$
3028
3029
3030         $*exec
3031         // These scopes are closed in CondSuffix
3032         ForPart -> for OpenScope SimpleStatements ${
3033                         $0 = reorder_bilist($<3);
3034                 }$
3035                 |  for OpenScope Block ${
3036                         $0 = $<3;
3037                 }$
3038
3039         ThenPart -> then OpenScope SimpleStatements ${
3040                         $0 = reorder_bilist($<3);
3041                         var_block_close(config2context(config), CloseSequential);
3042                 }$
3043                 |  then OpenScope Block ${
3044                         $0 = $<3;
3045                         var_block_close(config2context(config), CloseSequential);
3046                 }$
3047
3048         ThenPartNL -> ThenPart OptNL ${
3049                         $0 = $<1;
3050                 }$
3051
3052         // This scope is closed in CondSuffix
3053         WhileHead -> while OpenScope Block ${
3054                 $0 = $<3;
3055                 }$
3056
3057         $cond_statement
3058         ForThen -> ForPart OptNL ThenPartNL ${
3059                         $0.forpart = $<1;
3060                         $0.thenpart = $<3;
3061                 }$
3062                 | ForPart OptNL ${
3063                         $0.forpart = $<1;
3064                 }$
3065
3066         // This scope is closed in CondSuffix
3067         WhilePart -> while OpenScope Expression Block ${
3068                         $0.type = Xcond_statement;
3069                         $0.condpart = $<3;
3070                         $0.dopart = $<4;
3071                 }$
3072                 | WhileHead OptNL do Block ${
3073                         $0.type = Xcond_statement;
3074                         $0.condpart = $<1;
3075                         $0.dopart = $<4;
3076                 }$
3077
3078         IfPart -> if OpenScope Expression OpenScope Block ${
3079                         $0.type = Xcond_statement;
3080                         $0.condpart = $<3;
3081                         $0.thenpart = $<5;
3082                         var_block_close(config2context(config), CloseParallel);
3083                 }$
3084                 | if OpenScope Block OptNL then OpenScope Block ${
3085                         $0.type = Xcond_statement;
3086                         $0.condpart = $<3;
3087                         $0.thenpart = $<7;
3088                         var_block_close(config2context(config), CloseParallel);
3089                 }$
3090
3091         $*exec
3092         // This scope is closed in CondSuffix
3093         SwitchPart -> switch OpenScope Expression ${
3094                         $0 = $<3;
3095                 }$
3096                 | switch OpenScope Block ${
3097                         $0 = $<3;
3098                 }$
3099
3100 ###### print exec cases
3101
3102         case Xcond_statement:
3103         {
3104                 struct cond_statement *cs = cast(cond_statement, e);
3105                 struct casepart *cp;
3106                 if (cs->forpart) {
3107                         do_indent(indent, "for");
3108                         if (bracket) printf(" {\n"); else printf(":\n");
3109                         print_exec(cs->forpart, indent+1, bracket);
3110                         if (cs->thenpart) {
3111                                 if (bracket)
3112                                         do_indent(indent, "} then {\n");
3113                                 else
3114                                         do_indent(indent, "then:\n");
3115                                 print_exec(cs->thenpart, indent+1, bracket);
3116                         }
3117                         if (bracket) do_indent(indent, "}\n");
3118                 }
3119                 if (cs->dopart) {
3120                         // a loop
3121                         if (cs->condpart && cs->condpart->type == Xbinode &&
3122                             cast(binode, cs->condpart)->op == Block) {
3123                                 if (bracket)
3124                                         do_indent(indent, "while {\n");
3125                                 else
3126                                         do_indent(indent, "while:\n");
3127                                 print_exec(cs->condpart, indent+1, bracket);
3128                                 if (bracket)
3129                                         do_indent(indent, "} do {\n");
3130                                 else
3131                                         do_indent(indent, "do:\n");
3132                                 print_exec(cs->dopart, indent+1, bracket);
3133                                 if (bracket)
3134                                         do_indent(indent, "}\n");
3135                         } else {
3136                                 do_indent(indent, "while ");
3137                                 print_exec(cs->condpart, 0, bracket);
3138                                 if (bracket)
3139                                         printf(" {\n");
3140                                 else
3141                                         printf(":\n");
3142                                 print_exec(cs->dopart, indent+1, bracket);
3143                                 if (bracket)
3144                                         do_indent(indent, "}\n");
3145                         }
3146                 } else {
3147                         // a condition
3148                         if (cs->casepart)
3149                                 do_indent(indent, "switch");
3150                         else
3151                                 do_indent(indent, "if");
3152                         if (cs->condpart && cs->condpart->type == Xbinode &&
3153                             cast(binode, cs->condpart)->op == Block) {
3154                                 if (bracket)
3155                                         printf(" {\n");
3156                                 else
3157                                         printf(":\n");
3158                                 print_exec(cs->condpart, indent+1, bracket);
3159                                 if (bracket)
3160                                         do_indent(indent, "}\n");
3161                                 if (cs->thenpart) {
3162                                         do_indent(indent, "then:\n");
3163                                         print_exec(cs->thenpart, indent+1, bracket);
3164                                 }
3165                         } else {
3166                                 printf(" ");
3167                                 print_exec(cs->condpart, 0, bracket);
3168                                 if (cs->thenpart) {
3169                                         if (bracket)
3170                                                 printf(" {\n");
3171                                         else
3172                                                 printf(":\n");
3173                                         print_exec(cs->thenpart, indent+1, bracket);
3174                                         if (bracket)
3175                                                 do_indent(indent, "}\n");
3176                                 } else
3177                                         printf("\n");
3178                         }
3179                 }
3180                 for (cp = cs->casepart; cp; cp = cp->next) {
3181                         do_indent(indent, "case ");
3182                         print_exec(cp->value, -1, 0);
3183                         if (bracket)
3184                                 printf(" {\n");
3185                         else
3186                                 printf(":\n");
3187                         print_exec(cp->action, indent+1, bracket);
3188                         if (bracket)
3189                                 do_indent(indent, "}\n");
3190                 }
3191                 if (cs->elsepart) {
3192                         do_indent(indent, "else");
3193                         if (bracket)
3194                                 printf(" {\n");
3195                         else
3196                                 printf(":\n");
3197                         print_exec(cs->elsepart, indent+1, bracket);
3198                         if (bracket)
3199                                 do_indent(indent, "}\n");
3200                 }
3201                 break;
3202         }
3203
3204 ###### propagate exec cases
3205         case Xcond_statement:
3206         {
3207                 // forpart and dopart must return Tnone
3208                 // thenpart must return Tnone if there is a dopart,
3209                 // otherwise it is like elsepart.
3210                 // condpart must:
3211                 //    be bool if there is no casepart
3212                 //    match casepart->values if there is a switchpart
3213                 //    either be bool or match casepart->value if there
3214                 //             is a whilepart
3215                 // elsepart and casepart->action must match the return type
3216                 //   expected of this statement.
3217                 struct cond_statement *cs = cast(cond_statement, prog);
3218                 struct casepart *cp;
3219
3220                 t = propagate_types(cs->forpart, c, ok, Tnone, 0);
3221                 if (!type_compat(Tnone, t, 0))
3222                         *ok = 0;
3223                 t = propagate_types(cs->dopart, c, ok, Tnone, 0);
3224                 if (!type_compat(Tnone, t, 0))
3225                         *ok = 0;
3226                 if (cs->dopart) {
3227                         t = propagate_types(cs->thenpart, c, ok, Tnone, 0);
3228                         if (!type_compat(Tnone, t, 0))
3229                                 *ok = 0;
3230                 }
3231                 if (cs->casepart == NULL)
3232                         propagate_types(cs->condpart, c, ok, Tbool, 0);
3233                 else {
3234                         /* Condpart must match case values, with bool permitted */
3235                         t = NULL;
3236                         for (cp = cs->casepart;
3237                              cp && !t; cp = cp->next)
3238                                 t = propagate_types(cp->value, c, ok, NULL, 0);
3239                         if (!t && cs->condpart)
3240                                 t = propagate_types(cs->condpart, c, ok, NULL, Rboolok);
3241                         // Now we have a type (I hope) push it down
3242                         if (t) {
3243                                 for (cp = cs->casepart; cp; cp = cp->next)
3244                                         propagate_types(cp->value, c, ok, t, 0);
3245                                 propagate_types(cs->condpart, c, ok, t, Rboolok);
3246                         }
3247                 }
3248                 // (if)then, else, and case parts must return expected type.
3249                 if (!cs->dopart && !type)
3250                         type = propagate_types(cs->thenpart, c, ok, NULL, rules);
3251                 if (!type)
3252                         type = propagate_types(cs->elsepart, c, ok, NULL, rules);
3253                 for (cp = cs->casepart;
3254                      cp && !type;
3255                      cp = cp->next)
3256                         type = propagate_types(cp->action, c, ok, NULL, rules);
3257                 if (type) {
3258                         if (!cs->dopart)
3259                                 propagate_types(cs->thenpart, c, ok, type, rules);
3260                         propagate_types(cs->elsepart, c, ok, type, rules);
3261                         for (cp = cs->casepart; cp ; cp = cp->next)
3262                                 propagate_types(cp->action, c, ok, type, rules);
3263                         return type;
3264                 } else
3265                         return NULL;
3266         }
3267
3268 ###### interp exec cases
3269         case Xcond_statement:
3270         {
3271                 struct value v, cnd;
3272                 struct casepart *cp;
3273                 struct cond_statement *c = cast(cond_statement, e);
3274
3275                 if (c->forpart)
3276                         interp_exec(c->forpart);
3277                 do {
3278                         if (c->condpart)
3279                                 cnd = interp_exec(c->condpart);
3280                         else
3281                                 cnd.type = Tnone;
3282                         if (!(cnd.type == Tnone ||
3283                               (cnd.type == Tbool && cnd.bool != 0)))
3284                                 break;
3285                         // cnd is Tnone or Tbool, doesn't need to be freed
3286                         if (c->dopart)
3287                                 interp_exec(c->dopart);
3288
3289                         if (c->thenpart) {
3290                                 rv = interp_exec(c->thenpart);
3291                                 if (rv.type != Tnone || !c->dopart)
3292                                         goto Xcond_done;
3293                                 free_value(rv);
3294                         }
3295                 } while (c->dopart);
3296
3297                 for (cp = c->casepart; cp; cp = cp->next) {
3298                         v = interp_exec(cp->value);
3299                         if (value_cmp(v, cnd) == 0) {
3300                                 free_value(v);
3301                                 free_value(cnd);
3302                                 rv = interp_exec(cp->action);
3303                                 goto Xcond_done;
3304                         }
3305                         free_value(v);
3306                 }
3307                 free_value(cnd);
3308                 if (c->elsepart)
3309                         rv = interp_exec(c->elsepart);
3310                 else
3311                         rv.type = Tnone;
3312         Xcond_done:
3313                 break;
3314         }
3315
3316 ## Complex types
3317
3318 Now that we have the shape of the interpreter in place we can add some
3319 complex types and connected them in to the data structures and the
3320 different phases of parse, analyse, print, interpret.
3321
3322 For now, just arrays.
3323
3324 ### Arrays
3325
3326 Arrays can be declared by giving a size and a type, as `[size]type' so
3327 `freq:[26]number` declares `freq` to be an array of 26 numbers.  The
3328 size can be an arbitrary expression which is evaluated when the name
3329 comes into scope.
3330
3331 Arrays cannot be assigned.  When pointers are introduced we will also
3332 introduce array slices which can refer to part or all of an array -
3333 the assignment syntax will create a slice.  For now, an array can only
3334 ever be referenced by the name it is declared with.  It is likely that
3335 a "`copy`" primitive will eventually be define which can be used to
3336 make a copy of an array with controllable depth.
3337
3338 ###### type union fields
3339
3340         struct {
3341                 int size;
3342                 struct variable *vsize;
3343                 struct type *member;
3344         } array;
3345
3346 ###### value union fields
3347         struct {
3348                 struct value *elmnts;
3349         } array;
3350
3351 ###### value functions
3352
3353         static struct value array_prepare(struct type *type)
3354         {
3355                 struct value ret;
3356
3357                 ret.type = type;
3358                 ret.array.elmnts = NULL;
3359                 return ret;
3360         }
3361
3362         static struct value array_init(struct type *type)
3363         {
3364                 struct value ret;
3365                 int i;
3366
3367                 ret.type = type;
3368                 if (type->array.vsize) {
3369                         mpz_t q;
3370                         mpz_init(q);
3371                         mpz_tdiv_q(q, mpq_numref(type->array.vsize->val.num),
3372                                    mpq_denref(type->array.vsize->val.num));
3373                         type->array.size = mpz_get_si(q);
3374                         mpz_clear(q);
3375                 }
3376                 ret.array.elmnts = calloc(type->array.size,
3377                                           sizeof(ret.array.elmnts[0]));
3378                 for (i = 0; ret.array.elmnts && i < type->array.size; i++)
3379                         ret.array.elmnts[i] = val_init(type->array.member);
3380                 return ret;
3381         }
3382
3383         static void array_free(struct value val)
3384         {
3385                 int i;
3386
3387                 if (val.array.elmnts)
3388                         for (i = 0; i < val.type->array.size; i++)
3389                                 free_value(val.array.elmnts[i]);
3390                 free(val.array.elmnts);
3391         }
3392
3393         static int array_compat(struct type *require, struct type *have)
3394         {
3395                 if (have->compat != require->compat)
3396                         return 0;
3397                 /* Both are arrays, so we can look at details */
3398                 if (!type_compat(require->array.member, have->array.member, 0))
3399                         return 0;
3400                 if (require->array.vsize == NULL && have->array.vsize == NULL)
3401                         return require->array.size == have->array.size;
3402
3403                 return require->array.vsize == have->array.vsize;
3404         }
3405
3406         static void array_print_type(struct type *type, FILE *f)
3407         {
3408                 fputs("[", f);
3409                 if (type->array.vsize) {
3410                         struct binding *b = type->array.vsize->name;
3411                         fprintf(f, "%.*s]", b->name.len, b->name.txt);
3412                 } else
3413                         fprintf(f, "%d]", type->array.size);
3414                 type_print(type->array.member, f);
3415         }
3416
3417         static struct type array_prototype = {
3418                 .prepare = array_prepare,
3419                 .init = array_init,
3420                 .print_type = array_print_type,
3421                 .compat = array_compat,
3422                 .free = array_free,
3423         };
3424
3425 ###### type grammar
3426
3427         | [ NUMBER ] Type ${
3428                 $0 = calloc(1, sizeof(struct type));
3429                 *($0) = array_prototype;
3430                 $0->array.member = $<4;
3431                 $0->array.vsize = NULL;
3432                 {
3433                 char tail[3];
3434                 mpq_t num;
3435                 if (number_parse(num, tail, $2.txt) == 0)
3436                         tok_err(config2context(config), "error: unrecognised number", &$2);
3437                 else if (tail[0])
3438                         tok_err(config2context(config), "error: unsupported number suffix", &$2);
3439                 else {
3440                         $0->array.size = mpz_get_ui(mpq_numref(num));
3441                         if (mpz_cmp_ui(mpq_denref(num), 1) != 0) {
3442                                 tok_err(config2context(config), "error: array size must be an integer",
3443                                         &$2);
3444                         } else if (mpz_cmp_ui(mpq_numref(num), 1UL << 30) >= 0)
3445                                 tok_err(config2context(config), "error: array size is too large",
3446                                         &$2);
3447                 }
3448                 }
3449         }$
3450
3451         | [ IDENTIFIER ] Type ${ {
3452                 struct variable *v = var_ref(config2context(config), $2.txt);
3453
3454                 if (!v)
3455                         tok_err(config2context(config), "error: name undeclared", &$2);
3456                 else if (!v->constant)
3457                         tok_err(config2context(config), "error: array size must be a constant", &$2);
3458
3459                 $0 = calloc(1, sizeof(struct type));
3460                 *($0) = array_prototype;
3461                 $0->array.member = $<4;
3462                 $0->array.size = 0;
3463                 $0->array.vsize = v;
3464         } }$
3465
3466 ###### Binode types
3467         Index,
3468
3469 ###### variable grammar
3470
3471         | Variable [ Expression ] ${ {
3472                 struct binode *b = new(binode);
3473                 b->op = Index;
3474                 b->left = $<1;
3475                 b->right = $<3;
3476                 $0 = b;
3477         } }$
3478
3479 ###### print binode cases
3480         case Index:
3481                 print_exec(b->left, -1, 0);
3482                 printf("[");
3483                 print_exec(b->right, -1, 0);
3484                 printf("]");
3485                 break;
3486
3487 ###### propagate binode cases
3488         case Index:
3489                 /* left must be an array, right must be a number,
3490                  * result is the member type of the array
3491                  */
3492                 propagate_types(b->right, c, ok, Tnum, 0);
3493                 t = propagate_types(b->left, c, ok, NULL, rules & Rnoconstant);
3494                 if (!t || t->compat != array_compat) {
3495                         type_err(c, "error: %1 cannot be indexed", prog, t, 0, NULL);
3496                         *ok = 0;
3497                         return NULL;
3498                 } else {
3499                         if (!type_compat(type, t->array.member, rules)) {
3500                                 type_err(c, "error: have %1 but need %2", prog,
3501                                          t->array.member, rules, type);
3502                                 *ok = 0;
3503                         }
3504                         return t->array.member;
3505                 }
3506                 break;
3507
3508 ###### interp binode cases
3509         case Index: {
3510                 mpz_t q;
3511                 long i;
3512
3513                 lleft = linterp_exec(b->left);
3514                 right = interp_exec(b->right);
3515                 mpz_init(q);
3516                 mpz_tdiv_q(q, mpq_numref(right.num), mpq_denref(right.num));
3517                 i = mpz_get_si(q);
3518                 mpz_clear(q);
3519
3520                 if (i >= 0 && i < lleft->type->array.size)
3521                         lrv = &lleft->array.elmnts[i];
3522                 else
3523                         rv = val_init(lleft->type->array.member);
3524                 break;
3525         }
3526
3527 ### Finally the whole program.
3528
3529 Somewhat reminiscent of Pascal a (current) Ocean program starts with
3530 the keyword "program" and a list of variable names which are assigned
3531 values from command line arguments.  Following this is a `block` which
3532 is the code to execute.
3533
3534 As this is the top level, several things are handled a bit
3535 differently.
3536 The whole program is not interpreted by `interp_exec` as that isn't
3537 passed the argument list which the program requires.  Similarly type
3538 analysis is a bit more interesting at this level.
3539
3540 ###### Binode types
3541         Program,
3542
3543 ###### Parser: grammar
3544
3545         $*binode
3546         Program -> program OpenScope Varlist Block OptNL ${
3547                 $0 = new(binode);
3548                 $0->op = Program;
3549                 $0->left = reorder_bilist($<3);
3550                 $0->right = $<4;
3551                 var_block_close(config2context(config), CloseSequential);
3552                 if (config2context(config)->scope_stack) abort();
3553                 }$
3554                 | ERROR ${
3555                         tok_err(config2context(config),
3556                                 "error: unhandled parse error", &$1);
3557                 }$
3558
3559         Varlist -> Varlist ArgDecl ${
3560                         $0 = new(binode);
3561                         $0->op = Program;
3562                         $0->left = $<1;
3563                         $0->right = $<2;
3564                 }$
3565                 | ${ $0 = NULL; }$
3566
3567         $*var
3568         ArgDecl -> IDENTIFIER ${ {
3569                 struct variable *v = var_decl(config2context(config), $1.txt);
3570                 $0 = new(var);
3571                 $0->var = v;
3572         } }$
3573
3574         ## Grammar
3575
3576 ###### print binode cases
3577         case Program:
3578                 do_indent(indent, "program");
3579                 for (b2 = cast(binode, b->left); b2; b2 = cast(binode, b2->right)) {
3580                         printf(" ");
3581                         print_exec(b2->left, 0, 0);
3582                 }
3583                 if (bracket)
3584                         printf(" {\n");
3585                 else
3586                         printf(":\n");
3587                 print_exec(b->right, indent+1, bracket);
3588                 if (bracket)
3589                         do_indent(indent, "}\n");
3590                 break;
3591
3592 ###### propagate binode cases
3593         case Program: abort();
3594
3595 ###### core functions
3596
3597         static int analyse_prog(struct exec *prog, struct parse_context *c)
3598         {
3599                 struct binode *b = cast(binode, prog);
3600                 int ok = 1;
3601
3602                 if (!b)
3603                         return 0;
3604                 do {
3605                         ok = 1;
3606                         propagate_types(b->right, c, &ok, Tnone, 0);
3607                 } while (ok == 2);
3608                 if (!ok)
3609                         return 0;
3610
3611                 for (b = cast(binode, b->left); b; b = cast(binode, b->right)) {
3612                         struct var *v = cast(var, b->left);
3613                         if (!v->var->val.type) {
3614                                 v->var->where_set = b;
3615                                 v->var->val = val_prepare(Tstr);
3616                         }
3617                 }
3618                 b = cast(binode, prog);
3619                 do {
3620                         ok = 1;
3621                         propagate_types(b->right, c, &ok, Tnone, 0);
3622                 } while (ok == 2);
3623                 if (!ok)
3624                         return 0;
3625
3626                 /* Make sure everything is still consistent */
3627                 propagate_types(b->right, c, &ok, Tnone, 0);
3628                 return !!ok;
3629         }
3630
3631         static void interp_prog(struct exec *prog, char **argv)
3632         {
3633                 struct binode *p = cast(binode, prog);
3634                 struct binode *al;
3635                 struct value v;
3636
3637                 if (!prog)
3638                         return;
3639                 al = cast(binode, p->left);
3640                 while (al) {
3641                         struct var *v = cast(var, al->left);
3642                         struct value *vl = &v->var->val;
3643
3644                         if (argv[0] == NULL) {
3645                                 printf("Not enough args\n");
3646                                 exit(1);
3647                         }
3648                         al = cast(binode, al->right);
3649                         free_value(*vl);
3650                         *vl = parse_value(vl->type, argv[0]);
3651                         if (vl->type == NULL)
3652                                 exit(1);
3653                         argv++;
3654                 }
3655                 v = interp_exec(p->right);
3656                 free_value(v);
3657         }
3658
3659 ###### interp binode cases
3660         case Program: abort();
3661
3662 ## And now to test it out.
3663
3664 Having a language requires having a "hello world" program. I'll
3665 provide a little more than that: a program that prints "Hello world"
3666 finds the GCD of two numbers, prints the first few elements of
3667 Fibonacci, and performs a binary search for a number.
3668
3669 ###### File: oceani.mk
3670         tests :: sayhello
3671         sayhello : oceani
3672                 @echo "===== TEST ====="
3673                 ./oceani --section "test: hello" oceani.mdc 55 33
3674
3675 ###### test: hello
3676
3677         program A B:
3678                 print "Hello World, what lovely oceans you have!"
3679                 /* When a variable is defined in both branches of an 'if',
3680                  * and used afterwards, the variables are merged.
3681                  */
3682                 if A > B:
3683                         bigger := "yes"
3684                 else:
3685                         bigger := "no"
3686                 print "Is", A, "bigger than", B,"? ", bigger
3687                 /* If a variable is not used after the 'if', no
3688                  * merge happens, so types can be different
3689                  */
3690                 if A > B * 2:
3691                         double:string = "yes"
3692                         print A, "is more than twice", B, "?", double
3693                 else:
3694                         double := B*2
3695                         print "double", B, "is", double
3696
3697                 a : number
3698                 a = A;
3699                 b:number = B
3700                 if a > 0 and then b > 0:
3701                         while a != b:
3702                                 if a < b:
3703                                         b = b - a
3704                                 else:
3705                                         a = a - b
3706                         print "GCD of", A, "and", B,"is", a
3707                 else if a <= 0:
3708                         print a, "is not positive, cannot calculate GCD"
3709                 else:
3710                         print b, "is not positive, cannot calculate GCD"
3711
3712                 for:
3713                         togo := 10
3714                         f1 := 1; f2 := 1
3715                         print "Fibonacci:", f1,f2,
3716                 then togo = togo - 1
3717                 while togo > 0:
3718                         f3 := f1 + f2
3719                         print "", f3,
3720                         f1 = f2
3721                         f2 = f3
3722                 print ""
3723
3724                 /* Binary search... */
3725                 for:
3726                         lo:= 0; hi := 100
3727                         target := 77
3728                 while:
3729                         mid := (lo + hi) / 2
3730                         if mid == target:
3731                                 use Found
3732                         if mid < target:
3733                                 lo = mid
3734                         else:
3735                                 hi = mid
3736                         if hi - lo < 1:
3737                                 use GiveUp
3738                         use True
3739                 do: pass
3740                 case Found:
3741                         print "Yay, I found", target
3742                 case GiveUp:
3743                         print "Closest I found was", mid
3744
3745                 size::=55
3746                 list:[size]number
3747                 list[0] = 1234
3748                 for i:=1; then i = i + 1; while i < size:
3749                         n := list[i-1] * list[i-1]
3750                         list[i] = (n / 100) % 10000
3751
3752                 print "Before sort:"
3753                 for i:=0; then i = i + 1; while i < size:
3754                         print "list[",i,"]=",list[i]
3755
3756                 for i := 1; then i=i+1; while i < size:
3757                         for j:=i-1; then j=j-1; while j >= 0:
3758                                 if list[j] > list[j+1]:
3759                                         t:= list[j]
3760                                         list[j] = list[j+1]
3761                                         list[j+1] = t
3762                 print "After sort:"
3763                 for i:=0; then i = i + 1; while i < size:
3764                         print "list[",i,"]=",list[i]