ae2d4189eb
gcc always generates large code for accesses to globals. For locals it only generates large code if there are more than 128 bytes of locals. It sorts scalar locals after array locals to pessimize for space in the usual case when there are more (static) references to scalars than to arrays. Saved another 16 bytes (13 before padding) by adding a `continue'. Fall-through tests normally save space, but here one of them made gcc do space-unoptimal register allocation (it allocates ch in %bl because preserving this register across function calls is "free", but comparisions with %bl take one byte fewer than comparsions with %bl).